                         SEQUENCE LISTING

<110>  Janssen Vaccines & Prevention B.V.
       Rutten, Lucy
       Langedijk, Johannes P.M.
       Juraszek, Jarek
 
<120>  Trimer Stabilizing HIV Envelope Protein Mutation

<130>  CRU6054

<150>  EP21158800.9
<151>  2021-02-23

<160>  16    

<170>  PatentIn version 3.5

<210>  1
<211>  856
<212>  PRT
<213>  Human immunodeficiency virus


<220>
<221>  MISC_FEATURE
<223>  gp160 of HIV-1 isolate HXB2

<400>  1

Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg 
1               5                   10                  15      


Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu 
            20                  25                  30          


Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
        35                  40                  45              


Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
    50                  55                  60                  


Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65                  70                  75                  80  


Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
                85                  90                  95      


Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 
            100                 105                 110         


Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
        115                 120                 125             


Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
    130                 135                 140                 


Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn 
145                 150                 155                 160 


Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe 
                165                 170                 175     


Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Lys 
            180                 185                 190         


Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val 
        195                 200                 205             


Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala 
    210                 215                 220                 


Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
225                 230                 235                 240 


Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser 
                245                 250                 255     


Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile 
            260                 265                 270         


Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu 
        275                 280                 285             


Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
    290                 295                 300                 


Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile 
305                 310                 315                 320 


Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala 
                325                 330                 335     


Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln 
            340                 345                 350         


Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp 
        355                 360                 365             


Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
    370                 375                 380                 


Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp 
385                 390                 395                 400 


Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu 
                405                 410                 415     


Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys 
            420                 425                 430         


Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn 
        435                 440                 445             


Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu 
    450                 455                 460                 


Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg 
465                 470                 475                 480 


Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala 
            500                 505                 510         


Val Gly Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 
        515                 520                 525             


Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu 
    530                 535                 540                 


Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu 
545                 550                 555                 560 


Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu 
                565                 570                 575     


Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu 
            580                 585                 590         


Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val 
        595                 600                 605             


Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gln Ile Trp Asn 
    610                 615                 620                 


His Thr Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser 
625                 630                 635                 640 


Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 
                645                 650                 655     


Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 
            660                 665                 670         


Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu Phe Ile Met Ile 
        675                 680                 685             


Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile 
    690                 695                 700                 


Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His 
705                 710                 715                 720 


Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu 
                725                 730                 735     


Gly Gly Glu Arg Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser 
            740                 745                 750         


Leu Ala Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr 
        755                 760                 765             


His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu 
    770                 775                 780                 


Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu 
785                 790                 795                 800 


Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn 
                805                 810                 815     


Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val 
            820                 825                 830         


Val Gln Gly Ala Cys Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg 
        835                 840                 845             


Gln Gly Leu Glu Arg Ile Leu Leu 
    850                 855     


<210>  2
<211>  615
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HIV Env consensus clade C

<400>  2

Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
1               5                   10                  15      


Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu 
            20                  25                  30          


Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
        35                  40                  45              


Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp 
    50                  55                  60                  


Lys Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp 
65                  70                  75                  80  


Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr 
                85                  90                  95      


Leu Asn Cys Thr Asn Val Asn Val Thr Asn Thr Asn Asn Asn Asn Met 
            100                 105                 110         


Lys Glu Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Glu Ile Arg 
        115                 120                 125             


Asp Lys Lys Gln Lys Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val 
    130                 135                 140                 


Pro Leu Asn Glu Asn Ser Ser Glu Tyr Arg Leu Ile Asn Cys Asn Thr 
145                 150                 155                 160 


Ser Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro 
                165                 170                 175     


Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn 
            180                 185                 190         


Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln 
        195                 200                 205             


Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn 
    210                 215                 220                 


Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr 
225                 230                 235                 240 


Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile 
                245                 250                 255     


Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Arg Ile Gly 
            260                 265                 270         


Pro Gly Gln Thr Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg 
        275                 280                 285             


Gln Ala His Cys Asn Ile Ser Glu Ala Lys Trp Asn Lys Thr Leu Gln 
    290                 295                 300                 


Arg Val Lys Lys Lys Leu Lys Glu His Phe Pro Asn Lys Thr Ile Lys 
305                 310                 315                 320 


Phe Ala Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe 
                325                 330                 335     


Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn 
            340                 345                 350         


Ser Thr Tyr Asn Asn Thr Thr Ser Asn Ser Thr Ile Thr Leu Pro Cys 
        355                 360                 365             


Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met 
    370                 375                 380                 


Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Lys Ser Asn Ile Thr 
385                 390                 395                 400 


Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Asn Thr Glu 
                405                 410                 415     


Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
            420                 425                 430         


Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly Ile Ala Pro 
        435                 440                 445             


Thr Lys Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Arg Ala Val 
    450                 455                 460                 


Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
465                 470                 475                 480 


Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu 
                485                 490                 495     


Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala 
            500                 505                 510         


Gln Gln His Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln 
        515                 520                 525             


Ala Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu 
    530                 535                 540                 


Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro 
545                 550                 555                 560 


Trp Asn Ser Ser Trp Ser Asn Lys Ser Gln Glu Asp Ile Trp Asp Asn 
                565                 570                 575     


Met Thr Trp Met Gln Trp Asp Arg Glu Ile Ser Asn Tyr Thr Asp Thr 
            580                 585                 590         


Ile Tyr Arg Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu 
        595                 600                 605             


Lys Asp Leu Leu Ala Leu Asp 
    610                 615 


<210>  3
<211>  677
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ConC_SOSIP sequence

<400>  3

Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
1               5                   10                  15      


Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu 
            20                  25                  30          


Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
        35                  40                  45              


Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp 
    50                  55                  60                  


Lys Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp 
65                  70                  75                  80  


Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr 
                85                  90                  95      


Leu Asn Cys Thr Asn Val Asn Val Thr Asn Thr Asn Asn Asn Asn Met 
            100                 105                 110         


Lys Glu Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Glu Ile Arg 
        115                 120                 125             


Asp Lys Lys Gln Lys Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val 
    130                 135                 140                 


Pro Leu Asn Glu Asn Ser Ser Glu Tyr Arg Leu Ile Asn Cys Asn Thr 
145                 150                 155                 160 


Ser Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro 
                165                 170                 175     


Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn 
            180                 185                 190         


Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln 
        195                 200                 205             


Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn 
    210                 215                 220                 


Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr 
225                 230                 235                 240 


Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile 
                245                 250                 255     


Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Arg Ile Gly 
            260                 265                 270         


Pro Gly Gln Thr Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg 
        275                 280                 285             


Gln Ala His Cys Asn Ile Ser Glu Ala Lys Trp Asn Lys Thr Leu Gln 
    290                 295                 300                 


Arg Val Lys Lys Lys Leu Lys Glu His Phe Pro Asn Lys Thr Ile Lys 
305                 310                 315                 320 


Phe Ala Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe 
                325                 330                 335     


Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn 
            340                 345                 350         


Ser Thr Tyr Asn Asn Thr Thr Ser Asn Ser Thr Ile Thr Leu Pro Cys 
        355                 360                 365             


Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met 
    370                 375                 380                 


Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Lys Ser Asn Ile Thr 
385                 390                 395                 400 


Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Asn Thr Glu 
                405                 410                 415     


Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
            420                 425                 430         


Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly Ile Ala Pro 
        435                 440                 445             


Thr Lys Cys Lys Arg Arg Val Val Glu Arg Arg Arg Arg Arg Arg Ala 
    450                 455                 460                 


Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 
465                 470                 475                 480 


Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 
                485                 490                 495     


Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu 
            500                 505                 510         


Ala Gln Gln His Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu 
        515                 520                 525             


Gln Ala Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu 
    530                 535                 540                 


Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Ala Val 
545                 550                 555                 560 


Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Gln Glu Asp Ile Trp Asp 
                565                 570                 575     


Asn Met Thr Trp Met Gln Trp Asp Arg Glu Ile Ser Asn Tyr Thr Asp 
            580                 585                 590         


Thr Ile Tyr Arg Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 
        595                 600                 605             


Glu Lys Asp Leu Leu Ala Leu Asp Ala Ala Ala Leu Pro Glu Thr Gly 
    610                 615                 620                 


Gly Gly Ser Asp Tyr Lys Asp Asp Asp Asp Lys Pro Gly Gly Gly Gly 
625                 630                 635                 640 


Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
                645                 650                 655     


Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser His 
            660                 665                 670         


His His His His His 
        675         


<210>  4
<211>  630
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HIV Env consensus clade B

<400>  4

Ala Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys 
1               5                   10                  15      


Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp 
            20                  25                  30          


Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp 
        35                  40                  45              


Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe Asn 
    50                  55                  60                  


Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser 
65                  70                  75                  80  


Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys 
                85                  90                  95      


Val Thr Leu Asn Cys Thr Asp Leu Asn Asn Asn Thr Thr Asn Asn Asn 
            100                 105                 110         


Ser Ser Ser Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe 
        115                 120                 125             


Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr Ala Leu 
    130                 135                 140                 


Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr 
145                 150                 155                 160 


Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys 
                165                 170                 175     


Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe 
            180                 185                 190         


Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys 
        195                 200                 205             


Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val 
    210                 215                 220                 


Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225                 230                 235                 240 


Ile Arg Ser Glu Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln 
                245                 250                 255     


Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr 
            260                 265                 270         


Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly 
        275                 280                 285             


Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr 
    290                 295                 300                 


Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu Arg Glu Gln 
305                 310                 315                 320 


Phe Gly Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro 
                325                 330                 335     


Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
            340                 345                 350         


Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Asn Gly Thr Trp 
        355                 360                 365             


Asn Asn Thr Thr Gly Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys 
    370                 375                 380                 


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro 
385                 390                 395                 400 


Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu 
                405                 410                 415     


Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Asn Thr Thr Glu Thr Phe 
            420                 425                 430         


Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
        435                 440                 445             


Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys 
    450                 455                 460                 


Cys Lys Arg Arg Val Val Gln Arg Arg Arg Arg Arg Arg Ala Val Gly 
465                 470                 475                 480 


Ile Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
                485                 490                 495     


Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser 
            500                 505                 510         


Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Pro Glu Ala Gln 
        515                 520                 525             


Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala 
    530                 535                 540                 


Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly 
545                 550                 555                 560 


Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Ala Val Pro Trp 
                565                 570                 575     


Asn Thr Ser Trp Ser Asn Lys Ser Leu Asp Glu Ile Trp Asp Asn Met 
            580                 585                 590         


Thr Trp Met Gln Trp Glu Arg Glu Ile Asp Asn Tyr Thr Gly Leu Ile 
        595                 600                 605             


Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln 
    610                 615                 620                 


Glu Leu Leu Glu Leu Asp 
625                 630 


<210>  5
<211>  691
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ConB_SOSIP sequence

<400>  5

Ala Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys 
1               5                   10                  15      


Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp 
            20                  25                  30          


Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp 
        35                  40                  45              


Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe Asn 
    50                  55                  60                  


Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser 
65                  70                  75                  80  


Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys 
                85                  90                  95      


Val Thr Leu Asn Cys Thr Asp Leu Asn Asn Asn Thr Thr Asn Asn Asn 
            100                 105                 110         


Ser Ser Ser Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe 
        115                 120                 125             


Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr Ala Leu 
    130                 135                 140                 


Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr 
145                 150                 155                 160 


Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys 
                165                 170                 175     


Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe 
            180                 185                 190         


Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys 
        195                 200                 205             


Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val 
    210                 215                 220                 


Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225                 230                 235                 240 


Ile Arg Ser Glu Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln 
                245                 250                 255     


Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr 
            260                 265                 270         


Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly 
        275                 280                 285             


Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Thr 
    290                 295                 300                 


Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu Arg Glu Gln 
305                 310                 315                 320 


Phe Gly Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro 
                325                 330                 335     


Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
            340                 345                 350         


Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Asn Gly Thr Trp 
        355                 360                 365             


Asn Asn Thr Thr Gly Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys 
    370                 375                 380                 


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro 
385                 390                 395                 400 


Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu 
                405                 410                 415     


Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Asn Thr Thr Glu Thr Phe 
            420                 425                 430         


Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
        435                 440                 445             


Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys 
    450                 455                 460                 


Cys Lys Arg Arg Val Val Gln Arg Arg Arg Arg Arg Arg Ala Val Gly 
465                 470                 475                 480 


Ile Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
                485                 490                 495     


Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser 
            500                 505                 510         


Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Pro Glu Ala Gln 
        515                 520                 525             


Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala 
    530                 535                 540                 


Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly 
545                 550                 555                 560 


Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Ala Val Pro Trp 
                565                 570                 575     


Asn Thr Ser Trp Ser Asn Lys Ser Leu Asp Glu Ile Trp Asp Asn Met 
            580                 585                 590         


Thr Trp Met Gln Trp Glu Arg Glu Ile Asp Asn Tyr Thr Gly Leu Ile 
        595                 600                 605             


Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln 
    610                 615                 620                 


Glu Leu Leu Glu Leu Asp Ala Ala Ala Leu Pro Glu Thr Gly Gly Gly 
625                 630                 635                 640 


Ser Asp Tyr Lys Asp Asp Asp Asp Lys Pro Gly Gly Gly Gly Ser Gly 
                645                 650                 655     


Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            660                 665                 670         


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser His His His 
        675                 680                 685             


His His His 
    690     


<210>  6
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  furin cleavage site mutant sequence

<400>  6

Arg Arg Arg Arg Arg Arg 
1               5       


<210>  7
<211>  31
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  signal sequence

<400>  7

Met Arg Val Arg Gly Ile Leu Arg Asn Trp Gln Gln Trp Trp Ile Trp 
1               5                   10                  15      


Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Val Val Gly 
            20                  25                  30      


<210>  8
<211>  29
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  signal sequence

<400>  8

Met Arg Val Lys Gly Ile Arg Lys Asn Tyr Gln His Leu Trp Arg Trp 
1               5                   10                  15      


Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala 
            20                  25                  


<210>  9
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  9

Asn Pro Asp Trp Leu Pro Asp Met 
1               5               


<210>  10
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  10

Gly Ser Gly Ser Gly Ser Gly Ser 
1               5               


<210>  11
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  11

Asp Asp Val His Pro Asp Trp Asp 
1               5               


<210>  12
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  12

Arg Asp Thr Phe Ala Leu Met Met 
1               5               


<210>  13
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  13

Asp Glu Glu Lys Val Met Asp Phe 
1               5               


<210>  14
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example of 8 amino acid sequence that can replace HR1 loop

<400>  14

Asp Glu Asp Pro His Trp Asp Pro 
1               5               


<210>  15
<211>  61
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sortase A-Flag-His tag

<400>  15

Ala Ala Ala Leu Pro Glu Thr Gly Gly Gly Ser Asp Tyr Lys Asp Asp 
1               5                   10                  15      


Asp Asp Lys Pro Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
            20                  25                  30          


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        35                  40                  45              


Gly Ser Gly Gly Gly Gly Ser His His His His His His 
    50                  55                  60      


<210>  16
<211>  844
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  full length ConC_SOSIP

<400>  16

Met Arg Val Arg Gly Ile Leu Arg Asn Trp Gln Gln Trp Trp Ile Trp 
1               5                   10                  15      


Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Val Val Gly Asn 
            20                  25                  30          


Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys 
        35                  40                  45              


Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val 
    50                  55                  60                  


His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65                  70                  75                  80  


Gln Glu Met Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
                85                  90                  95      


Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 
            100                 105                 110         


Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
        115                 120                 125             


Asn Cys Thr Asn Val Asn Val Thr Asn Thr Asn Asn Asn Asn Met Lys 
    130                 135                 140                 


Glu Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Glu Ile Arg Asp 
145                 150                 155                 160 


Lys Lys Gln Lys Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val Pro 
                165                 170                 175     


Leu Asn Glu Asn Ser Ser Glu Tyr Arg Leu Ile Asn Cys Asn Thr Ser 
            180                 185                 190         


Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile 
        195                 200                 205             


His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys 
    210                 215                 220                 


Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln Cys 
225                 230                 235                 240 


Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly 
                245                 250                 255     


Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr Asp 
            260                 265                 270         


Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu Ile Asn 
        275                 280                 285             


Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Arg Ile Gly Pro 
    290                 295                 300                 


Gly Gln Thr Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln 
305                 310                 315                 320 


Ala His Cys Asn Ile Ser Glu Ala Lys Trp Asn Lys Thr Leu Gln Arg 
                325                 330                 335     


Val Lys Lys Lys Leu Lys Glu His Phe Pro Asn Lys Thr Ile Lys Phe 
            340                 345                 350         


Ala Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe Asn 
        355                 360                 365             


Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn Ser 
    370                 375                 380                 


Thr Tyr Asn Asn Thr Thr Ser Asn Ser Thr Ile Thr Leu Pro Cys Arg 
385                 390                 395                 400 


Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr 
                405                 410                 415     


Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Lys Ser Asn Ile Thr Gly 
            420                 425                 430         


Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Asn Thr Glu Thr 
        435                 440                 445             


Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
    450                 455                 460                 


Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly Ile Ala Pro Thr 
465                 470                 475                 480 


Lys Cys Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile 
                485                 490                 495     


Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 
            500                 505                 510         


Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly 
        515                 520                 525             


Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln 
    530                 535                 540                 


His Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg 
545                 550                 555                 560 


Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile 
                565                 570                 575     


Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Ala Val Pro Trp Asn 
            580                 585                 590         


Ser Ser Trp Ser Asn Lys Ser Gln Glu Asp Ile Trp Asp Asn Met Thr 
        595                 600                 605             


Trp Met Gln Trp Asp Arg Glu Ile Ser Asn Tyr Thr Asp Thr Ile Tyr 
    610                 615                 620                 


Arg Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp 
625                 630                 635                 640 


Leu Leu Ala Leu Asp Ser Trp Asn Asn Leu Trp Asn Trp Phe Asp Ile 
                645                 650                 655     


Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly 
            660                 665                 670         


Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg 
        675                 680                 685             


Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn 
    690                 695                 700                 


Pro Arg Gly Pro Asp Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu 
705                 710                 715                 720 


Gln Asp Arg Asp Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala Leu 
                725                 730                 735     


Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu 
            740                 745                 750         


Arg Asp Phe Ile Leu Ile Ala Ala Arg Ala Val Glu Leu Leu Gly Arg 
        755                 760                 765             


Ser Ser Leu Arg Gly Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu 
    770                 775                 780                 


Gly Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Ile 
785                 790                 795                 800 


Ser Leu Leu Asp Thr Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg 
                805                 810                 815     


Ile Ile Glu Leu Ile Gln Arg Ile Cys Arg Ala Ile Arg Asn Ile Pro 
            820                 825                 830         


Arg Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu Leu 
        835                 840                 


