                               SEQUENCE LISTING

<110> PRESIDENT AND FELLOWS OF HARVARD COLLEGE
 
<120> METHOD AND SYSTEM OF NANOPORE-BASED INFORMATION ENCODING

<130> 010498.00941/WO

<140>
<141>

<150> 62/325,669
<151> 2016-04-21

<160> 10    

<170> PatentIn version 3.5

<210> 1
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      6xHis tag

<400> 1
His His His His His His 
1               5       


<210> 2
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 2
tttttttttt tttttttttt tttttttttt                                        30


<210> 3
<211> 62
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (5)..(5)
<223> Inosine

<220>
<221> modified_base
<222> (10)..(10)
<223> Inosine

<220>
<221> modified_base
<222> (15)..(15)
<223> Inosine

<220>
<221> modified_base
<222> (20)..(20)
<223> Inosine

<220>
<221> modified_base
<222> (25)..(25)
<223> Inosine

<220>
<221> modified_base
<222> (30)..(30)
<223> Inosine

<220>
<221> modified_base
<222> (35)..(35)
<223> Inosine

<220>
<221> modified_base
<222> (40)..(40)
<223> Inosine

<220>
<221> modified_base
<222> (45)..(45)
<223> Inosine

<220>
<221> modified_base
<222> (50)..(50)
<223> Inosine

<220>
<221> modified_base
<222> (55)..(55)
<223> Inosine

<220>
<221> modified_base
<222> (60)..(60)
<223> Inosine

<400> 3
ctacnacttn caacnataan atctnccaan ctacnatcan taacncctan acttntcacn       60

aa                                                                      62


<210> 4
<211> 62
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 4
ttcgtgacaa gtctaggcgt tactgatcgt agcttggcag atcttatcgt tgcaagtcgt       60

ag                                                                      62


<210> 5
<211> 77
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 5
gaacatttcg aagcttagcg atgcaagtcg ttgctataca gtgcttgcga tgatagattg       60

aagggatttg aaaggta                                                      77


<210> 6
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 6
tatgatgatc agtagtagtc gcgctcgag                                         29


<210> 7
<211> 13
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 7
tgatgatcag tag                                                          13


<210> 8
<211> 12
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 8
agatgatcgt ag                                                           12


<210> 9
<211> 720
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 9
Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Gly Ala Glu Thr His 
1               5                   10                  15      


Met Pro Arg Lys Met Tyr Ser Cys Asp Phe Glu Thr Thr Thr Lys Val 
            20                  25                  30          


Glu Asp Cys Arg Val Trp Ala Tyr Gly Tyr Met Asn Ile Glu Asp His 
        35                  40                  45              


Ser Glu Tyr Lys Ile Gly Asn Ser Leu Asp Glu Phe Met Ala Trp Val 
    50                  55                  60                  


Leu Lys Val Gln Ala Asp Leu Tyr Phe His Asn Leu Lys Phe Asp Gly 
65                  70                  75                  80  


Ala Phe Ile Ile Asn Trp Leu Glu Arg Asn Gly Phe Lys Trp Ser Ala 
                85                  90                  95      


Asp Gly Leu Pro Asn Thr Tyr Asn Thr Ile Ile Ser Arg Met Gly Gln 
            100                 105                 110         


Trp Tyr Met Ile Asp Ile Cys Leu Gly Tyr Lys Gly Lys Arg Lys Ile 
        115                 120                 125             


His Thr Val Ile Tyr Asp Ser Leu Lys Lys Leu Pro Phe Pro Val Lys 
    130                 135                 140                 


Lys Ile Ala Lys Asp Phe Lys Leu Thr Val Leu Lys Gly Asp Ile Asp 
145                 150                 155                 160 


Tyr His Lys Glu Arg Pro Val Gly Tyr Lys Ile Thr Pro Glu Glu Tyr 
                165                 170                 175     


Ala Tyr Ile Lys Asn Asp Ile Gln Ile Ile Ala Glu Ala Leu Leu Ile 
            180                 185                 190         


Gln Phe Lys Gln Gly Leu Asp Arg Met Thr Ala Gly Ser Asp Ser Leu 
        195                 200                 205             


Lys Gly Phe Lys Asp Ile Ile Thr Thr Lys Lys Phe Lys Lys Val Phe 
    210                 215                 220                 


Pro Thr Leu Ser Leu Gly Leu Asp Lys Glu Val Arg Tyr Ala Tyr Arg 
225                 230                 235                 240 


Gly Gly Phe Thr Trp Leu Asn Asp Arg Phe Lys Glu Lys Glu Ile Gly 
                245                 250                 255     


Glu Gly Met Val Phe Asp Val Asn Ser Leu Tyr Pro Ala Gln Met Tyr 
            260                 265                 270         


Ser Arg Leu Leu Pro Tyr Gly Glu Pro Ile Val Phe Glu Gly Lys Tyr 
        275                 280                 285             


Val Trp Asp Glu Asp Tyr Pro Leu His Ile Gln His Ile Arg Cys Glu 
    290                 295                 300                 


Phe Glu Leu Lys Glu Gly Tyr Ile Pro Thr Ile Gln Ile Lys Arg Ser 
305                 310                 315                 320 


Arg Phe Tyr Lys Gly Asn Glu Tyr Leu Lys Ser Ser Gly Gly Glu Ile 
                325                 330                 335     


Ala Asp Leu Trp Leu Ser Asn Val Asp Leu Glu Leu Met Lys Glu His 
            340                 345                 350         


Tyr Asp Leu Tyr Asn Val Glu Tyr Ile Ser Gly Leu Lys Phe Lys Ala 
        355                 360                 365             


Thr Thr Gly Leu Phe Lys Asp Phe Ile Asp Lys Trp Thr Tyr Ile Lys 
    370                 375                 380                 


Thr Thr Ser Glu Gly Ala Ile Lys Gln Leu Ala Lys Leu Met Leu Asn 
385                 390                 395                 400 


Ser Leu Tyr Gly Lys Phe Ala Ser Asn Pro Asp Val Thr Gly Lys Val 
                405                 410                 415     


Pro Tyr Leu Lys Glu Asn Gly Ala Leu Gly Phe Arg Leu Gly Glu Glu 
            420                 425                 430         


Glu Thr Lys Asp Pro Val Tyr Thr Pro Met Gly Val Phe Ile Thr Ala 
        435                 440                 445             


Trp Ala Arg Tyr Thr Thr Ile Thr Ala Ala Gln Ala Cys Tyr Asp Arg 
    450                 455                 460                 


Ile Ile Tyr Cys Asp Thr Asp Ser Ile His Leu Thr Gly Thr Glu Ile 
465                 470                 475                 480 


Pro Asp Val Ile Lys Asp Ile Val Asp Pro Lys Lys Leu Gly Tyr Trp 
                485                 490                 495     


Ala His Glu Ser Thr Phe Lys Arg Ala Lys Tyr Leu Arg Gln Lys Thr 
            500                 505                 510         


Tyr Ile Gln Asp Ile Tyr Met Lys Glu Val Asp Gly Lys Leu Val Glu 
        515                 520                 525             


Gly Ser Pro Asp Asp Tyr Thr Asp Ile Lys Phe Ser Val Lys Cys Ala 
    530                 535                 540                 


Gly Met Thr Asp Lys Ile Lys Lys Glu Val Thr Phe Glu Asn Phe Lys 
545                 550                 555                 560 


Val Gly Phe Ser Arg Lys Met Lys Pro Lys Pro Val Gln Val Pro Gly 
                565                 570                 575     


Gly Val Val Leu Val Asp Asp Thr Phe Thr Ile Lys Gly Ser Gly Asp 
            580                 585                 590         


Tyr Asp Ile Pro Thr Thr Glu Asn Leu Tyr Phe Gln Gly Ala Met Val 
        595                 600                 605             


Asp Thr Leu Ser Gly Leu Ser Ser Glu Gln Gly Gln Ser Gly Asp Met 
    610                 615                 620                 


Thr Ile Glu Glu Asp Ser Ala Thr His Ile Lys Phe Ser Lys Arg Asp 
625                 630                 635                 640 


Glu Asp Gly Lys Glu Leu Ala Gly Ala Thr Met Glu Leu Arg Asp Ser 
                645                 650                 655     


Ser Gly Lys Thr Ile Ser Thr Trp Ile Ser Asp Gly Gln Val Lys Asp 
            660                 665                 670         


Phe Tyr Leu Tyr Pro Gly Lys Tyr Thr Phe Val Glu Thr Ala Ala Pro 
        675                 680                 685             


Asp Gly Tyr Glu Val Ala Thr Ala Ile Thr Phe Thr Val Asn Glu Gln 
    690                 695                 700                 


Gly Gln Val Thr Val Asn Gly Lys Ala Thr Lys Gly Asp Ala His Ile 
705                 710                 715                 720 


<210> 10
<211> 325
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 10
Met Ala Asp Ser Asp Ile Asn Ile Lys Thr Gly Thr Thr Asp Ile Gly 
1               5                   10                  15      


Ser Asn Thr Thr Val Lys Thr Gly Asp Leu Val Thr Tyr Asp Lys Glu 
            20                  25                  30          


Asn Gly Met His Lys Lys Val Phe Tyr Ser Phe Ile Asp Asp Lys Asn 
        35                  40                  45              


His Asn Lys Lys Leu Leu Val Ile Arg Thr Lys Gly Thr Ile Ala Gly 
    50                  55                  60                  


Gln Tyr Arg Val Tyr Ser Glu Glu Gly Ala Asn Lys Ser Gly Leu Ala 
65                  70                  75                  80  


Trp Pro Ser Ala Phe Lys Val Gln Leu Gln Leu Pro Asp Asn Glu Val 
                85                  90                  95      


Ala Gln Ile Ser Asp Tyr Tyr Pro Arg Asn Ser Ile Asp Thr Lys Glu 
            100                 105                 110         


Tyr Met Ser Thr Leu Thr Tyr Gly Phe Asn Gly Asn Val Thr Gly Asp 
        115                 120                 125             


Asp Thr Gly Lys Ile Gly Gly Leu Ile Gly Ala Asn Val Ser Ile Gly 
    130                 135                 140                 


His Thr Leu Lys Tyr Val Gln Pro Asp Phe Lys Thr Ile Leu Glu Ser 
145                 150                 155                 160 


Pro Thr Asp Lys Lys Val Gly Trp Lys Val Ile Phe Asn Asn Met Val 
                165                 170                 175     


Asn Gln Asn Trp Gly Pro Tyr Asp Arg Asp Ser Trp Asn Pro Val Tyr 
            180                 185                 190         


Gly Asn Gln Leu Phe Met Lys Thr Arg Asn Gly Ser Met Lys Ala Ala 
        195                 200                 205             


Glu Asn Phe Leu Asp Pro Asn Lys Ala Ser Ser Leu Leu Ser Ser Gly 
    210                 215                 220                 


Phe Ser Pro Asp Phe Ala Thr Val Ile Thr Met Asp Arg Lys Ala Ser 
225                 230                 235                 240 


Lys Gln Gln Thr Asn Ile Asp Val Ile Tyr Glu Arg Val Arg Asp Asp 
                245                 250                 255     


Tyr Gln Leu His Trp Thr Ser Thr Asn Trp Lys Gly Thr Asn Thr Lys 
            260                 265                 270         


Asp Lys Trp Thr Asp Arg Ser Ser Glu Arg Tyr Lys Ile Asp Trp Glu 
        275                 280                 285             


Lys Glu Glu Met Thr Asn Gly Gly Ser Ser Gly Gly Ser Ser Gly Gly 
    290                 295                 300                 


Ala His Ile Val Met Val Asp Ala Tyr Lys Pro Thr Lys Lys Gly His 
305                 310                 315                 320 


His His His His His 
                325 


