                         SEQUENCE LISTING



<110>  TURKEWITZ, AARON

       BRIGUGLIO, JOSEPH

       KUMAR, SANTOSH


<120>  EXPRESSION PROFILING REVEALS CATHEPSINS INVOLVED IN 
       SECRETORY VESICLE MATURATION IN TETRAHYMENA THERMOPHILIA



<130>  ARCD.P0566WO

<140>  UNKNOWN

<141>  2014-07-03




<150>  61/843,206
<151>  2013-07-05



<160>  12


<170>  PatentIn version 3.5



<210>  1
<211>  893
<212>  PRT
<213>  Tetrahymena thermophila

<400>  1

Met Lys Ala Tyr Lys His Leu Gln Leu Ile Gly Ile Val Leu Leu Ile 
1               5                   10                  15      


Ser Ala Leu Gln Phe Thr Ala Thr Ala Lys Gln Gln Asp Ile Ser Phe 
            20                  25                  30          


Ser Lys Asn Phe Leu Asp Ser Glu Ile Val Asp Val Ile Trp Cys Gly 
        35                  40                  45              


Thr Asp Thr Gln Asn Asp Gln Asn Val Leu Val Gln Thr Asp Ser Gly 
    50                  55                  60                  


Thr Ile Tyr Arg Ser Gln Asn Lys Met Val His Phe Glu Asn Ile Ser 
65                  70                  75                  80  


Asp Asn Leu Val Asn Ala Gly Ile Lys Tyr Val Ala Asp Asn Ser Gln 
                85                  90                  95      


Ile Val Glu Ser Glu Val Ile Arg Met Ile Arg Ser Gln Ala Asn Pro 
            100                 105                 110         


Asn Val Ile Val Leu Gln Gly Lys Asn Glu Val Asn Trp Val Thr Arg 
        115                 120                 125             


Asp Cys Gly Asn Thr Phe Arg Ala Phe Ser Arg Lys Lys Asp Arg Ile 
    130                 135                 140                 


Asn Thr Phe Lys Leu His Pro Ser Gln Glu Ala Trp Met Leu Ala Ser 
145                 150                 155                 160 


Thr Asn Asn Val Cys Ala Lys Ser Gln Lys Ala Pro Cys Phe Ser Phe 
                165                 170                 175     


Ala Ile Leu Trp Leu Ser Lys Asp Leu Gly Asn Ser Trp Glu Lys Leu 
            180                 185                 190         


Thr Gln Tyr Val Tyr Lys Phe Glu Trp Gly Asn Leu Asn Phe Thr Asn 
        195                 200                 205             


Ser Gln Val Pro Gln Gln Arg Ile Phe Trp Val Gln Glu Asp Gly Asn 
    210                 215                 220                 


Lys Gln Asn Gln Asn Arg Tyr Gly Leu His Glu Lys Arg Asn Phe Tyr 
225                 230                 235                 240 


Tyr Ser Asp Asp Phe Leu Ala Ser Lys Lys Leu Leu Met Thr Lys Gly 
                245                 250                 255     


Asn Val Phe Tyr Ile Asp Tyr Asn Tyr Leu Tyr Val Val Gln Leu Leu 
            260                 265                 270         


Glu Gln Asn Ser Gln Gln Val Asn Leu Lys Val Ala Asn Pro Gln Asp 
        275                 280                 285             


Leu Asp Ile Lys Leu Arg Asp Val Gln Leu Gly Glu Lys Leu Gln Asn 
    290                 295                 300                 


His Lys Phe Thr Ile Leu Asp Thr Arg Glu Gly Gln Val Phe Leu Asn 
305                 310                 315                 320 


Val Asn His Leu Gly Ser Thr Ser Pro Met Gly Thr Leu Tyr Ile Ser 
                325                 330                 335     


Asp Ser Leu Gly Ala Arg Phe Ser Ser Ser Leu Gln Gly His Leu Arg 
            340                 345                 350         


Ser Glu Asn Gly Asp Thr Asp Phe Glu Arg Leu His Gly Ile Tyr Gly 
        355                 360                 365             


Ile Tyr Ile Ala Asn Val Tyr Glu Gln Lys Arg Arg Glu Glu Phe Glu 
    370                 375                 380                 


Asn Met Tyr Ala Ser Glu Gln Asn Asp Asp Asp Glu Asn Gln Gly Gln 
385                 390                 395                 400 


Asp Ser Lys Asn Lys Lys Ser Asn Thr Ser Ile Lys Gln Asp Lys Lys 
                405                 410                 415     


Ala Val Lys Met Lys Asp Leu Val Thr Gln Lys Ile Gln Thr Met Ile 
            420                 425                 430         


Thr Phe Asp Lys Gly Gly Met Trp Ser Arg Ile Asn Ala Pro Thr Thr 
        435                 440                 445             


Asp Gln Glu Asn Lys Glu Ile Lys Cys Gly Asp Asn Cys Phe Leu Asn 
    450                 455                 460                 


Ile His Ser Asn Ser Asn Asp Leu Tyr Asn Ser Phe Tyr Ser Ser Lys 
465                 470                 475                 480 


Asn Ala Val Gly Leu Val Leu Ala Asn Gly Asn Val Gly Lys Tyr Leu 
                485                 490                 495     


Ser His Ser Pro Thr Gln Val Asn Thr Tyr Leu Ser Arg Asp Ala Gly 
            500                 505                 510         


Leu Thr Trp Lys Gln Val Ile Gln Asn Gln Asp Leu Thr Ser Tyr Leu 
        515                 520                 525             


Phe Ile Leu Ser Met Ile Gln Lys Ile Lys Arg Gly Ala Tyr Val Phe 
    530                 535                 540                 


Glu Ile Gly Asp His Gly Ser Ile Ile Val Met Ala Lys Asp Lys Asp 
545                 550                 555                 560 


Tyr Gly Thr Thr Lys Phe Ile Glu Tyr Thr Leu Asp Glu Gly Ile Thr 
                565                 570                 575     


Trp Asn Gln Val Gln Ile Ser Asp Thr Asp Ile Glu Ile Asp Asn Ile 
            580                 585                 590         


Ile Thr Glu Pro Ser Asn Thr Gly Thr Ser Phe Met Val Leu Ala Lys 
        595                 600                 605             


Thr Leu Ser Thr Asp Lys Lys Gln Tyr Gly Leu Ala Ile Thr Ile Asp 
    610                 615                 620                 


Phe Ala Asn Gln Phe Asn Arg Asn Cys Ser Gly Ala Thr Ser Pro Asp 
625                 630                 635                 640 


Asp Pro Asp Ser Asp Tyr Glu Lys Trp Ile Pro His Ser Tyr Lys Ser 
                645                 650                 655     


Ser Gln Cys Leu Leu Gly Gln Lys Val Thr Tyr Ser Arg Lys Lys Gln 
            660                 665                 670         


Glu Ser Val Cys Leu Asn Gly Glu Asp Tyr Glu Arg Gln Ile Glu Leu 
        675                 680                 685             


Gln Ala Cys Val Cys Ser Glu Glu Asp Trp Glu Cys Asp Ile Gly Tyr 
    690                 695                 700                 


Ile Arg Asn Gly Gln Asn Gly Pro Cys Val Lys Asp Gly Thr Leu Ser 
705                 710                 715                 720 


Asp Glu Glu Tyr Glu Gly Val Ile Pro Glu Ile Cys Thr Asp Tyr Tyr 
                725                 730                 735     


Gln Val Ser Arg Gly Tyr Arg Lys Ile Pro Tyr Asn Thr Cys Gln Gly 
            740                 745                 750         


Gly Val Asn Tyr Ser Ala Glu Thr Arg Arg Cys Pro Gly Asn Ser Ile 
        755                 760                 765             


Phe Ser Phe Asn Thr Leu Lys Asn Leu Ile Leu Leu Ile Leu Ala Ile 
    770                 775                 780                 


Ala Ala Ile Tyr Tyr Gly Ile Gln Tyr Lys Ser Gln Leu Ser Ser Leu 
785                 790                 795                 800 


Leu Ile Tyr Leu Ser Ser Leu Ile Pro Leu Ile Tyr Ser His Arg Lys 
                805                 810                 815     


Asp Tyr Ile Asp Phe Ser Lys Ala Lys Ser Asp His Glu Glu Lys Glu 
            820                 825                 830         


Asn Lys Phe Met Asn Leu Phe Ser Phe Ser Asn Lys Lys Asn Val Asn 
        835                 840                 845             


His Tyr Ser Asn Val Asn Glu Ser Glu Asp Tyr Glu Asp Ser Glu Asp 
    850                 855                 860                 


His Gln His Leu Asn Asn Gln Asn Tyr Asn His Leu Asn Gln His Asn 
865                 870                 875                 880 


Tyr Phe Thr Asp Asn Gln Asp Glu Glu Ser His Tyr Asp 
                885                 890             


<210>  2
<211>  2784
<212>  DNA
<213>  Tetrahymena thermophila

<400>  2
aaaagtatta atatttagac ttaatgaaag cgtacaagca tttatagctc ataggtattg       60

tattgcttat ttcagctcta cagttcactg caacagctaa atagcaagat atttctttta      120

gcaagaactt tcttgattct gaaatagttg atgtcatttg gtgtggcacc gacacataga      180

atgactaaaa cgttttggtt caaactgata gtggaaccat ttatagatct taaaacaaaa      240

tggtacactt cgaaaatatt agcgataatc ttgttaatgc tggtatcaaa tatgtagctg      300

ataatagtta aatagtagaa agtgaagtta ttagaatgat aagaagtcaa gctaatccta      360

atgttattgt tctttaaggc aaaaatgaag tcaactgggt aacaagagac tgcggtaata      420

catttagagc cttttcaaga aagaaagata gaataaacac atttaagttg catcccagtt      480

aagaagcatg gatgttagca agcactaata acgtttgtgc caaaagctaa aaagctccat      540

gcttttcttt tgctatacta tggttaagta aagatttagg aaatagttgg gagaagctta      600

ctcaatatgt ttacaaattc gaatggggta atttaaactt tactaatagc taagttcctc      660

aacaaagaat attttgggtt taagaagatg gaaataagca aaaccaaaat agatatggat      720

tgcatgaaaa aagaaacttt tattacagcg atgacttttt agcttcaaag aagctactca      780

tgaccaaagg aaatgtcttt tacattgatt ataattacct ttatgttgtt caacttttag      840

aataaaattc atagcaagtt aacctaaaag ttgctaatcc ttaagactta gatattaaat      900

taagggatgt ttagttaggc gagaagctgt aaaaccataa gtttacaatt ttagatactc      960

gtgaaggata ggttttctta aatgtaaatc atttaggttc aacatctcct atgggtactc     1020

tttatatatc agactcatta ggtgctcgct tttcctcaag cttgtagggt catcttagaa     1080

gtgaaaatgg tgatacagat tttgagcgct tacatggaat ttatggaatt tatatagcaa     1140

atgtttatga ataaaaaaga agagaagagt ttgagaatat gtatgcaagc gaataaaatg     1200

atgatgatga aaattaagga taagactcca aaaataaaaa aagcaataca tcaattaaat     1260

aggataaaaa agcagtaaag atgaaagatt tggtcaccca aaaaatatag acaatgatta     1320

ctttcgataa aggtggtatg tggagtagaa ttaatgctcc aaccacagac taggaaaaca     1380

aagaaattaa atgtggtgac aactgcttct taaatataca ttctaattct aatgacttat     1440

ataattcatt ctactcatca aaaaatgctg taggtttagt tttagcaaat ggaaatgttg     1500

gtaagtatct ttcacatagt ccaactcaag ttaatactta cctttcaaga gatgcaggtt     1560

taacttggaa ataagtaatt taaaattaag atttaacttc atatttattt attctttcaa     1620

tgatttaaaa gattaaaaga ggagcttatg ttttcgaaat aggtgatcat ggttcaataa     1680

tagttatggc taaagataag gattatggaa ccactaaatt tatcgaatat actttagatg     1740

aaggtattac ttggaaccaa gtttaaatat cagatactga tatcgaaata gataatataa     1800

taacagagcc atcaaatact ggaacctcat tcatggttct tgcaaaaaca ctatcaacag     1860

ataaaaaata atatggatta gctataacaa tagattttgc taatcagttt aatagaaact     1920

gttctggtgc aacaagtcca gatgatcctg attctgatta tgaaaaatgg atacctcata     1980

gctataaatc atctcaatgt ctcttaggtt agaaagtgac ttactcacgt aaaaaacaag     2040

aatctgtttg cttaaatgga gaagattatg aaagacaaat agaactttaa gcatgtgtct     2100

gctctgaaga agactgggag tgtgatatcg gctatattag aaatggataa aatggtccat     2160

gtgtaaagga tggaacactt agtgatgaag aatatgaagg agtgatccca gaaatatgta     2220

ctgattatta ttaagtaagt agaggttata gaaaaattcc ttacaataca tgctaaggag     2280

gtgtaaatta ttcagcggaa actagaagat gccctggaaa ttcaattttt agctttaata     2340

ctttaaaaaa tttgattctt ttgattttag ctattgcagc tatttattat ggaattcagt     2400

ataagagtta actctctagc ttgttaattt acttaagttc tttgatccct ctaatttatt     2460

ctcatcgtaa agattatatt gacttttcca aagcaaagtc agaccatgaa gaaaaggaaa     2520

ataaatttat gaatctattt tcatttagca acaaaaaaaa tgttaatcat tacagcaacg     2580

taaatgaaag tgaagattat gaagatagtg aagatcatta acatcttaat aaccaaaatt     2640

acaatcattt aaattaacat aactatttta ctgataacca agatgaagag agtcattatg     2700

attgaattta attaactaat tgattttttg ttttttcata aatttctttg tagattaatt     2760

taatttaaaa ataattttaa tagt                                            2784


<210>  3
<211>  893
<212>  PRT
<213>  Tetrahymena thermophila

<400>  3

Met Lys Ile Lys Arg Asn Gln Gln Ile Ala Ile Ile Phe Ala Ile Phe 
1               5                   10                  15      


Ile Leu Thr Ala Ile Gln Ala Ala Asp Asp Val Ala Asp Asp Lys Val 
            20                  25                  30          


Gln Gln Ala Ile Lys Ser Tyr Gln Lys Gln Val Asp Gly Gly Ile Leu 
        35                  40                  45              


Glu Phe Glu Trp Cys Gly Thr Asn Glu Ile Tyr Asn Asp Glu Thr Asp 
    50                  55                  60                  


Arg Val Val Val Asp Gln Glu Val Glu Glu Ser Phe Asp Thr Arg Ile 
65                  70                  75                  80  


Phe Val Leu Thr Asp Glu Gly Gln Val Phe Lys Ser Thr Asn Tyr Gly 
                85                  90                  95      


Lys Ser Trp Val His Val Thr Lys Ser Phe Tyr Gly Ser Asn Asn Gln 
            100                 105                 110         


Pro Phe Phe Ser Thr Glu Val Ser Ile Ser Pro Val Asp Gly Lys Thr 
        115                 120                 125             


Val Tyr Ile Trp Gly His Lys Asp Thr Ser Tyr Val Ser Glu Glu Cys 
    130                 135                 140                 


Gly Lys Thr Trp Lys Lys Leu Asn His Pro Ala Gly Leu Phe Asp Phe 
145                 150                 155                 160 


Arg Phe His Arg Lys Asn Lys Asn Trp Val Leu Ala Phe Thr Asn Ile 
                165                 170                 175     


Glu Cys Lys Arg Phe Asp Glu Asp Cys Glu Ser Asn Met Arg Asn Leu 
            180                 185                 190         


Tyr Val Ser Gln Asp Ala Gly Val Thr Phe Thr Phe Leu Ala Thr Lys 
        195                 200                 205             


Val Leu Glu Ala Ser Trp Asn Arg Met Asn Asn Phe Tyr Asn Val Asp 
    210                 215                 220                 


Ser Pro Gly Ile Leu Met Ala Val Gln Gln Glu Ser Gln Ser Asn Val 
225                 230                 235                 240 


Val Tyr Thr Glu Asp Phe Gly Lys Thr Met His Thr Val Gln Glu Gly 
                245                 250                 255     


Gly Asp Asn Phe Phe Gln Ala Glu Tyr Phe Leu Phe Leu Thr Val Lys 
            260                 265                 270         


Pro Lys Asn Ser Lys Arg Thr Tyr Asp Met Lys Ile Ala Thr Met Phe 
        275                 280                 285             


Asp Asp Phe Asn Tyr Tyr Val Glu Pro Lys Ser Leu Lys Leu Pro Phe 
    290                 295                 300                 


Glu Asn Thr Asp Gln Leu Ser Phe Thr Ile Leu Lys Ser Asp Gly Ala 
305                 310                 315                 320 


Met Val Phe Leu Ala Ile His His Glu Thr Gln Asn Met Trp Gln Ser 
                325                 330                 335     


Asn Ile Tyr Val Ser Asp Trp Arg Gly Tyr Asp Leu Thr Leu Ala Leu 
            340                 345                 350         


Leu Tyr Asn Val Arg Ala Pro Asn Gly Asp Cys Asp Phe Glu Lys Ile 
        355                 360                 365             


Glu Ser Asn Glu Gly Val Tyr Ile Ala Asn Thr Tyr Asp Val Glu Lys 
    370                 375                 380                 


Val Glu Lys Leu Arg Asn Glu Val Lys Lys Met Asp Ile Ser Thr Ala 
385                 390                 395                 400 


Lys Asn Lys Leu Gln Thr Lys Asp Lys Lys Asn Leu His Lys Glu Leu 
                405                 410                 415     


Thr Asn Tyr Arg Lys Ser Val Ile Ser Phe Asp Ser Gly Ser Ser Trp 
            420                 425                 430         


His Pro Ile Arg Ala Pro Ser Gln Arg Trp Asn Gly Lys Thr Val Val 
        435                 440                 445             


Cys Ser Gly Glu Cys Ser Leu His Leu Ala Gly Arg Thr Tyr Tyr Lys 
    450                 455                 460                 


Lys Ser Gln Met Tyr Ser Ser Ser Asn Ala Pro Gly Leu Ile Val Ala 
465                 470                 475                 480 


Leu Gly Ser Ile Gly Thr His Leu Glu Asn Asn Phe Asn Leu Leu Asn 
                485                 490                 495     


Thr Tyr Leu Ser Asn Asp Gly Gly His Gln Trp Arg Glu Ile Leu Lys 
            500                 505                 510         


Gly Pro His Ile Phe Glu Ile Gly Asp His Gly Gly Ile Ile Val Ala 
        515                 520                 525             


Ala Ser Val Ala Asn Lys Thr Asn Ile Ile Lys Tyr Ser Trp Asp Glu 
    530                 535                 540                 


Gly Lys Thr Trp Ser Glu Tyr Lys Leu Ser Ala Leu Pro Phe Glu Ile 
545                 550                 555                 560 


Asp Gln Ile Ile Thr Glu Pro Ser Asn Met Glu Gln Arg Phe Val Val 
                565                 570                 575     


Tyr Gly Lys Gly Arg Asn Gly Thr Glu Thr Ser Met Ile Val Ser Val 
            580                 585                 590         


Asp Leu Gln Asp Leu His Ile Arg Gly Cys Val Gly Ala Glu His Pro 
        595                 600                 605             


Asn Arg Pro Asn Ser Asp Tyr Glu Ile Trp Ile Pro Thr Asn Phe Lys 
    610                 615                 620                 


Gly Glu Gln Cys Ile Phe Gly Arg Lys Val Lys Tyr Val Arg Arg Lys 
625                 630                 635                 640 


Pro Asp Ala Lys Cys Phe Asn Ser Ile Thr Thr Asp Gln Lys Thr Val 
                645                 650                 655     


Ile Glu Glu Cys Pro Cys Thr Gln Glu Asp Trp Glu Cys Asp Phe Gly 
            660                 665                 670         


Phe Tyr Arg Lys Glu Asn Glu Leu Glu Cys Ile Pro Met Asn Glu His 
        675                 680                 685             


Tyr Ser Pro Asp Asn Leu Ala Lys Pro Pro Ala Asp Cys Ser Trp Ser 
    690                 695                 700                 


Tyr Leu Val Ser Lys Gly Tyr Arg Lys Ile Pro Gly Val Phe Cys Gln 
705                 710                 715                 720 


Gly Gly Val Asp Leu Ser Pro Glu Tyr Lys Glu Cys Pro Pro Lys Ile 
                725                 730                 735     


Ser Val Pro Arg Thr Glu Glu Glu Thr Asp Gln Tyr Lys Ser Phe Lys 
            740                 745                 750         


Glu Ala Gln Lys Glu Ile Ile Ser Gln Tyr Gln Gln Gln Gln Gln Gln 
        755                 760                 765             


Ser Asn Ser Gln Asn Gly Lys Thr Asp Ser Ser Ser Ser Ile Asn Trp 
    770                 775                 780                 


Gly Val Ile Phe Thr Gln Ile Phe Tyr Ala Gly Leu Ile Leu Thr Ala 
785                 790                 795                 800 


Leu Ala Leu Ala Phe Ile Phe Arg Glu Asn Ile Lys Gln Val Val Lys 
                805                 810                 815     


Ser Ile Gly Glu Ile Gly His Asn Lys Glu Arg Lys Gln Tyr Gln Gln 
            820                 825                 830         


Leu Gln Ser Ser Gln Asn Lys Gln Ser Ser Tyr Thr Gln Gln Lys Asn 
        835                 840                 845             


Thr Gln Asn Val Arg Ile Gln Glu Thr Glu Glu Arg Asn Tyr Asp Leu 
    850                 855                 860                 


Glu Glu Gln Asp Met His Tyr Pro Glu Asp Glu Lys Pro Val Leu Gln 
865                 870                 875                 880 


Arg Asp Gln Glu Asp Tyr Tyr Tyr Gln Glu Asp Tyr Asp 
                885                 890             


<210>  4
<211>  2682
<212>  DNA
<213>  Tetrahymena thermophila

<400>  4
atgaaaataa aaaggaatta gcaaattgca attatatttg ctattttcat cttgactgct       60

atttaggcag cagatgatgt tgcagatgat aaggtttagt aagctataaa aagttattaa      120

aagtaagtag atggaggtat tttagaattc gagtggtgtg gtacaaatga aatttataac      180

gatgaaactg accgtgttgt tgttgattaa gaagttgaag aatcattcga tactcgtata      240

tttgttctta cagatgaagg ttaagttttt aaaagtacaa actatggtaa aagttgggtc      300

catgtcacta aatcctttta tggttcaaat aattagccat ttttctctac tgaagtttcc      360

atttctcctg ttgatggtaa aacagtctat atttggggac acaaggatac cagctatgtt      420

tctgaggaat gtggtaagac ttggaaaaag ttaaaccatc ctgctggttt gtttgatttt      480

agatttcacc gtaaaaataa aaattgggta ttagctttca ctaatataga atgtaagaga      540

tttgatgaag attgtgaatc taatatgaga aatctttacg tttcttaaga tgcgggtgtt      600

actttcacat tcttagctac taaagtttta gaagcttcat ggaatagaat gaataacttt      660

tacaacgttg acagtcctgg tattttaatg gccgttcaat aagaatcata aagtaatgta      720

gtttacactg aagacttcgg taaaactatg cacacagttt aagaaggtgg tgataatttc      780

ttttaagcag agtacttcct ctttttaaca gttaagccta aaaacagtaa aagaacctat      840

gatatgaaaa tcgcaactat gtttgacgat tttaattact atgttgaacc caaaagctta      900

aagcttccct ttgaaaacac tgattaactt tcgtttacaa ttctaaagag cgatggtgcc      960

atggttttcc ttgccataca ccacgaaact caaaatatgt ggtaaagcaa tatctatgtt     1020

tctgattgga gaggttatga tttgacttta gctttacttt acaatgttag agctccaaac     1080

ggagattgcg actttgaaaa gatagaaagc aatgaaggtg tttatatagc aaatacatat     1140

gatgttgaaa aagttgaaaa attaagaaac gaagttaaaa aaatggatat cagcactgca     1200

aagaataaat tataaacaaa agataaaaag aatttgcaca aagaactaac taattatagg     1260

aaatcagtca tttcatttga cagcggttct agttggcatc caattagagc tccttcatag     1320

agatggaatg gaaagactgt tgtttgcagt ggagaatgca gtttgcattt agctggtaga     1380

acatattata aaaaatctta gatgtattct tcctctaacg ctcctggttt aattgttgca     1440

ttaggaagca ttggaactca tcttgaaaac aacttcaatc ttcttaacac atatctttca     1500

aacgatggtg gtcactaatg gcgtgaaatt cttaagggtc ctcatatttt tgaaattggt     1560

gatcatggtg gtatcatcgt agctgcttct gttgccaata aaacaaatat catcaaatac     1620

agttgggatg aaggaaaaac atggagcgaa tataaattga gtgctttacc atttgaaata     1680

gattaaataa ttactgagcc tagcaatatg gaacagagat ttgttgttta tggaaaagga     1740

agaaatggaa cagaaacttc tatgattgtt tctgtagatt tataagattt gcacattaga     1800

ggttgtgtag gagctgaaca tcctaataga cctaatagtg attatgaaat ctggattcct     1860

actaatttta aaggtgaaca atgtattttc ggtcgtaaag ttaaatatgt tagaagaaag     1920

cctgatgcaa aatgctttaa ttctatcaca acagattaaa aaacagttat tgaagaatgc     1980

ccatgcacat aagaagattg ggaatgtgac ttcggtttct acagaaaaga aaacgaatta     2040

gaatgtattc caatgaatga gcattattct cctgataatc ttgctaaacc tcctgcagat     2100

tgtagttggt cttacttagt ctcaaaggga tatagaaaaa taccaggagt attttgttaa     2160

ggaggtgttg atttaagtcc agaatataaa gaatgtcctc caaaaatatc agtgcctaga     2220

actgaagaag aaacagatta atataaaagc ttcaaagaag cataaaaaga gattattagc     2280

taatattaat agtaatagta gtaatcaaat agttaaaatg gaaaaactga ttcatcatct     2340

tcaataaact ggggtgttat ttttacataa attttctatg ctggattaat tttaacagct     2400

ttagctttag ctttcatatt tagagagaat atcaaataag tagtaaaaag cattggtgaa     2460

ataggacata ataaagaacg caaataatat taataactct aatcatctta gaataaataa     2520

tcatcataca cttaatagaa aaatactcaa aatgtccgca tttaagaaac tgaagaaaga     2580

aattatgatt tagaagaata agacatgcat tatccagaag atgaaaagcc tgtcttgtaa     2640

agagatcaag aagattacta ttattaagaa gattacgatt ga                        2682


<210>  5
<211>  936
<212>  PRT
<213>  Tetrahymena thermophila

<400>  5

Met Lys Lys Glu Ile Arg Ile Ala Leu Ile Ala Leu Phe Cys Cys Ile 
1               5                   10                  15      


Leu Thr Val Asn Cys Arg Asn Glu Tyr Ser Ser Ser Val Ile Gly Asn 
            20                  25                  30          


Pro Ser Ser Leu Asp Ser Pro Leu Gln Asp Ile Gln Trp Cys Gly Glu 
        35                  40                  45              


Asn Ser Ser Asn Asp Asn Leu Val Val Leu Leu Thr Gln Lys Gly Ser 
    50                  55                  60                  


Val Tyr Arg Ser Glu Asp Arg Gly Ala Ser Trp Ile Lys Met Val Asp 
65                  70                  75                  80  


Ser Phe Ala Arg Val Gly Val Asn Val Lys Met Asp Leu Ser Ser Asn 
                85                  90                  95      


Val Gly Ile Val Thr Gln Met Ile Ala Ser Pro Ile Asp Ser Asn Glu 
            100                 105                 110         


Ile Val Phe Met Gly Ser Asp Gly Ile Asn Trp Ile Thr Thr Asp Cys 
        115                 120                 125             


Gly Val Thr Ile Gln Ala Leu Gly Ile Asn Leu Asn Leu Arg Glu Phe 
    130                 135                 140                 


Met Tyr His Pro Thr Glu Lys Asn Trp Met Leu Ala Ser Ser Phe Asn 
145                 150                 155                 160 


Asn Cys Glu Lys Gln Asn Asn Gln Lys Asp Lys Arg Lys Lys Asp Thr 
                165                 170                 175     


Glu Cys Phe Lys Thr Lys Asp Leu Phe Phe Ser Glu Asn Lys Gly Lys 
            180                 185                 190         


Ser Trp Arg Val Leu Leu Lys Tyr Val Val Gln Phe Gly Trp Ala His 
        195                 200                 205             


Lys Val Asn Ser Lys Leu Thr Asn Val Pro Thr Ser Arg Ile Ile Tyr 
    210                 215                 220                 


Ser Lys Glu Val Gly Ser Asn Ser Phe Phe Phe Asn Glu Ala Ser Gln 
225                 230                 235                 240 


Gln Thr Asn Ile Ile Ile Lys Asp Ser Gly His Gln Val Met Lys Gly 
                245                 250                 255     


Trp Ser Met Lys Thr His Leu Phe Tyr Thr Asp Asp Phe Met Lys Asn 
            260                 265                 270         


Gln Asn Met Ile Val Asn Gln Gly Asn Lys Phe Leu Ile Thr Glu Asn 
        275                 280                 285             


Tyr Leu Phe Ala Ala Gln Val His Ser Ser Asp Asn Gln Leu Val Lys 
    290                 295                 300                 


Leu Met Val Ser Gln Ser Asn Gln Lys Glu Tyr Ser Phe Thr Tyr Ala 
305                 310                 315                 320 


Glu Ile Pro Glu Asp Ile His Gln His Ser Phe Thr Ile Leu Asp Thr 
                325                 330                 335     


Lys Glu Gly Gln Val Phe Leu Asn Ile Asn His Leu Gly Ser Asn Ser 
            340                 345                 350         


Pro Met Gly Asn Ile Tyr Gln Ser Asp Ser Thr Gly Thr Arg Phe Ser 
        355                 360                 365             


Leu Ser Leu Glu Asp Asn Val Arg Gly Arg Asp Gly Gln Cys Asp Phe 
    370                 375                 380                 


Glu Ser Val Asn Gly Val Glu Gly Ile Phe Ile Ser Asn Ile Phe Ala 
385                 390                 395                 400 


Pro Ser Lys Lys Leu Lys Gly Ile Lys Gln Met Leu Lys Ser Lys Asn 
                405                 410                 415     


Pro Asp Thr Ser Asp Glu Asp Ile Pro Thr Glu Asn Thr Arg Lys Lys 
            420                 425                 430         


Gly Gln Ala Gln Asn Ser Glu Asp Val Leu Lys Glu Ser Leu Lys Ser 
        435                 440                 445             


Leu Arg Asp Asn Met Val Thr Arg Ile Thr Phe Asp Lys Gly Gly Met 
    450                 455                 460                 


Trp Ser Leu Leu Arg Ala Pro Ala Lys Asp Ser Asn Gly Lys Gln Ile 
465                 470                 475                 480 


Asn Cys Asp Ile Asn Lys Lys Cys Ser Leu His Leu His Ser Val Ser 
                485                 490                 495     


Ser Gln Leu Ser Phe Gly Pro Ala Tyr Ser Ser Glu Asn Ser Leu Gly 
            500                 505                 510         


Leu Ile Ile Ala Thr Gly Asn Thr Gly Gln Phe Leu Ser His Lys Ala 
        515                 520                 525             


Gly Ser Val Asn Thr Tyr Leu Ser Arg Asp Gly Gly Leu Val Trp Glu 
    530                 535                 540                 


Glu Ile Arg Lys Gly Ser His Ile Tyr Glu Val Ala Asp His Gly Ser 
545                 550                 555                 560 


Ile Ile Val Met Ala Thr Asp Gln Glu Pro Thr Lys Asn Ile Ile Phe 
                565                 570                 575     


Ser Trp Asp Glu Gly Arg Thr Trp Asn Thr Lys Gln Ile Ser Asp Thr 
            580                 585                 590         


Pro Val Met Ile Ser Asn Ile Ile Thr Glu Pro Gly Asn Thr Ser Asp 
        595                 600                 605             


Lys Phe Leu Val Tyr Gly Ser Ile Glu Gly Glu Ser Asp Ile Ser Gly 
    610                 615                 620                 


Ile Ile Val Leu Leu Asp Phe Ala Ser Leu His Pro Arg Asp Cys Gln 
625                 630                 635                 640 


Gly Tyr Glu Asn Pro Asp Thr Ser Asp Ser Asp Tyr Glu Tyr Trp Thr 
                645                 650                 655     


Pro His Asn Pro Ser Glu Phe Cys Leu Leu Gly Arg Glu Ile Lys Tyr 
            660                 665                 670         


Val Arg Arg Lys Arg Asp Ala Ala Cys Phe Asn Pro Glu Thr Phe Glu 
        675                 680                 685             


Arg Ser Tyr Val Val Arg Lys Cys Glu Cys Thr Glu Leu Asp Trp Glu 
    690                 695                 700                 


Cys Asp Val Gly Phe Ala Arg Ala Lys Asp Asp Ser Lys Glu Arg Thr 
705                 710                 715                 720 


Gly Pro Cys Val Pro Leu Lys Asp Phe Lys Val Asp Tyr Asn Pro Pro 
                725                 730                 735     


Gln Thr Cys Ser Gly Ser Tyr Gln Val Thr Gln Gly Tyr Arg Arg Val 
            740                 745                 750         


Ala Gly Asn Gln Cys Ile Gly Gly Ile Asp His Ala Pro Ile Gln Tyr 
        755                 760                 765             


Pro Cys Pro Met Phe Gly Phe Leu Ser Tyr Asn Asn Leu Phe Thr Asn 
    770                 775                 780                 


Val Leu Ile Leu Gly Ala Met Ala Gly Val Phe Tyr Leu Ile Ile Gln 
785                 790                 795                 800 


Asn Lys Glu Val Val Ile Thr Phe Val Ala Thr Ser Asn Leu Asp Ala 
                805                 810                 815     


Tyr Ile Asn Leu Gly Lys Thr Tyr Leu Lys Lys Gly Tyr Thr Phe Val 
            820                 825                 830         


Thr Ser Ile Val Leu Pro Gln Ala Ser Asn Gln Gln Gln Gly Tyr Phe 
        835                 840                 845             


Gln Ala Asn Gln Asp Glu Glu Asn Arg Lys Ser His Ser Leu Lys Asp 
    850                 855                 860                 


Gln His His Gln Phe His Asp Asn Leu Ile Glu Ser His Asp His Asp 
865                 870                 875                 880 


Asp Glu Glu Glu Gln Ser Asp Ala Val Gln Gln Gln Leu Thr Ser Ser 
                885                 890                 895     


Gln Val Pro Gln Asn Asn Ser Asn Lys Asn Asn Asn Asn Ser Asn Thr 
            900                 905                 910         


Pro Asn Gln Ala Gln His Lys Asp Leu Leu Asp Glu His Asp Gly Glu 
        915                 920                 925             


Glu Asp Pro Phe Asp Pro Arg Asn 
    930                 935     


<210>  6
<211>  3010
<212>  DNA
<213>  Tetrahymena thermophila

<400>  6
atgaaaaaag aaataagaat agctcttata gctttatttt gctgcatttt gacagtaaat       60

tgtagaaatg aatactcaag cagtgtcatt ggaaacccct caagtttgga ttcacctctt      120

taggacattt aatggtgtgg tgaaaattca tcaaatgata atttggttgt cctcttaact      180

taaaagggta gcgtttacag atcagaagat agaggagcat cttggataaa gatggttgac      240

tcttttgcga gagttggtgt aaatgtaaag atggatctga gctcaaacgt aggtattgtt      300

acttaaatga ttgcaagtcc tattgattct aatgaaatag tctttatggg ctctgatggt      360

attaactgga tcactactga ttgtggtgtt accatttaag cccttggaat caacttaaat      420

ttgagagaat ttatgtatca cccaactgaa aagaattgga tgcttgcttc ttcctttaac      480

aactgtgaaa agcaaaacaa ccaaaaagat aagagaaaaa aggacactga atgctttaag      540

actaaagatt tgtttttctc tgaaaataag ggtaaaagct ggagagtttt acttaaatat      600

gttgtacaat tcggatgggc tcacaaagtt aattctaagc taacaaatgt cccaacttca      660

agaattatat actctaagga agtcggaagt aattcgtttt tctttaatga agcatctcaa      720

taaactaata taataataaa agatagtggt caccaagtga tgaagggttg gagcatgaaa      780

actcatttat tctatactga tgatttcatg aaaaactaga atatgattgt taactaagga      840

aataagtttt tgattactga aaactacttg ttcgctgcat aagttcacag tagtgataat      900

taactagtca agttaatggt ttcttaatct aattaaaaag aatactcttt cacttatgct      960

gaaattcctg aagatataca ctagcactca ttcactattt tagatactaa ggaaggttag     1020

gtattcttaa atattaatca cttgggcagt aactctccta tgggtaatat ttactaatct     1080

gactcaactg gtactcgttt ctctctttct cttgaagata atgtaagagg aagagatggt     1140

taatgcgatt ttgaatcagt taatggtgtt gaaggtattt ttatctcaaa tatattcgct     1200

cctagcaaaa agttaaaggg tatcaagcaa atgttgaaat ccaaaaatcc tgatacaagc     1260

gatgaagata ttccaactga aaacacaaga aagaaaggtc aagcataaaa ttctgaagat     1320

gtcttaaaag aatccttaaa aagtcttaga gataacatgg taactcgtat cactttcgac     1380

aagggtggta tgtggagttt gcttagggct cctgctaaag attctaatgg aaaataaatt     1440

aattgtgata ttaataaaaa gtgttctctt caccttcact cagtttcttc ataactaagt     1500

tttggacctg cttactcaag tgaaaattca ttaggtttaa ttattgctac tggtaacaca     1560

ggataattct taagtcataa agcaggtagc gtcaacactt atctttctcg tgatggtggt     1620

cttgtttggg aagaaatccg taaggggtct cacatatatg aagttgctga tcatggctct     1680

atcatagtta tggctactga ttaagaacct actaagaaca ttattttctc ttgggatgaa     1740

ggccgcacat ggaacaccaa gtaaattagc gatactcctg tcatgatttc aaatattatc     1800

actgaacctg gcaatacttc tgacaagttc ttagtttatg gatctattga aggtgaatct     1860

gatatttcag gaataattgt ccttcttgac tttgcttctc ttcatcctcg cgattgctaa     1920

ggttatgaaa accctgacac ttctgattct gattatgaat actggactcc tcataatccc     1980

agtgaattct gtttattagg acgtgaaatt aaatatgtca gaagaaaaag agatgctgct     2040

tgctttaatc ccgaaacttt tgaaagatct tatgttgtta gaaaatgtga atgtactgaa     2100

cttgattggg aatgtgatgt cggatttgct cgtgctaaag acgatagcaa agaaagaact     2160

ggcccttgcg ttcctttaaa agacttcaaa gtggattaca atcctccata aacttgcagt     2220

ggctcttacc aagttacata aggttacaga agagtagctg gtaattaatg tataggcggt     2280

attgatcatg ctccaattta atacccttgt cctatgtttg gcttcttgag ctataacaac     2340

cttttcacca atgttcttat tttaggagct atggctggtg ttttctactt aattatataa     2400

aataaagaag tagtaataac atttgtagct acatcaaatc ttgatgccta cattaactta     2460

ggtaaaactt acctaaagaa gggttatact tttgttacat caattgtcct tccacaagct     2520

tcaaattaat aataaggata tttccaagct aaccaagatg aggaaaatag aaaatctcat     2580

tccttaaagg atcaacatca ttaattccat gataatttaa ttgaaagcca tgatcatgat     2640

gatgaggaag agtaaagtga tgcagtataa taataattaa cttcttctta agtcccttaa     2700

aataatagta acaaaaacaa taataatagt aatacaccaa actaagctca gcacaaagat     2760

cttcttgatg aacatgatgg tgaagaagat ccttttgatc ctagaaattg aaaaataatt     2820

gactgaataa tattgctaat ttattttttt acttaaataa taaataaata aaaataaata     2880

aattaatttt tgtctttcat taatattatt tagaaagttt ttctaagtaa tttaatatag     2940

tgtgtcaagt atctttttct cttaacttat gtattttatc aaatcctttt ttactttatt     3000

attcctagtt                                                            3010


<210>  7
<211>  872
<212>  PRT
<213>  Tetrahymena thermophila

<400>  7

Met Lys Lys Gln Asp Leu Thr Val Tyr Val Ala Ala Phe Leu Leu Leu 
1               5                   10                  15      


Phe Ser Cys Val Ile His Phe Ala Asn Ala Gln Asp Lys Val Ser Glu 
            20                  25                  30          


Ile Phe Lys Asp Lys Tyr Asp Val Lys Tyr Arg Val Thr Glu Leu Asp 
        35                  40                  45              


Ser Pro Val Gln Glu Ile Leu Trp Cys Gly Ser Ser Gln Ala Thr Ser 
    50                  55                  60                  


Glu Asp Gly Asp Ile Ile Thr Tyr Asp Gln Thr Ala Lys Val Arg Lys 
65                  70                  75                  80  


Leu Tyr Val Leu Thr Asp Lys Gly Lys Leu Tyr Tyr Ser Glu Asp Tyr 
                85                  90                  95      


Gly Ile Thr Leu Lys Leu Ile Asn Asp Asp Ile Arg Gln Ser Thr Asn 
            100                 105                 110         


Ser Lys Gln Thr Gln Val Glu Val Asp Asp Ile Met Ile Ser Pro Val 
        115                 120                 125             


Lys Asn Arg Lys Val Phe Ile Phe Thr Lys Ser Gly Glu Ser Tyr Tyr 
    130                 135                 140                 


Thr Glu Asn Cys Gly Ala Thr Tyr Thr Ser Phe Lys His Glu Ile Leu 
145                 150                 155                 160 


Leu Tyr Asp Ile Gln Pro Asn Pro Ser Asp His Lys Ser Leu Ile Gly 
                165                 170                 175     


Leu Val Pro Val Gln Cys Gln Lys Gly Asp Pro Glu Cys Gln Gly Gly 
            180                 185                 190         


Asp Ser Asp Leu Tyr Leu Thr Val Asp Ser Gly Met Thr Trp Arg Lys 
        195                 200                 205             


Ile Val Ser Asn Val Asn Gln Ala Gln Trp Asp Lys Thr Lys Gln Thr 
    210                 215                 220                 


Leu Met Asn Thr Gln Asn Arg Ile Ile Leu Ser His Gln Glu Gln Glu 
225                 230                 235                 240 


Lys Asn Glu Lys Gly Glu Asn Val Phe Leu Asn Lys Val Ser Tyr Thr 
                245                 250                 255     


Asp Asn Tyr Gly Lys Asp Leu Lys Val Val Glu Lys Asn Gly Val Arg 
            260                 265                 270         


Phe Tyr Gln Thr Glu Glu Tyr Ile Phe Val Leu Ile Gln Gly Lys Glu 
        275                 280                 285             


Phe Gly Lys Tyr Lys Leu Asn Ile Gly Pro Ser Phe Val Thr Gln Ser 
    290                 295                 300                 


Ser Ser Arg Lys Glu Ile Asp Leu Pro Leu Gln Arg Val Lys Asp Glu 
305                 310                 315                 320 


Ser Phe Thr Val Leu Asp Ile Asp Ala Gly Gln Ile Leu Ile Ala Ile 
                325                 330                 335     


Asn His Glu Gly Asp Ser Ala Gly Tyr Thr Asn Val Tyr Ile Ser Asn 
            340                 345                 350         


Ser Gln Gly Glu Gln Phe Thr Leu Ser Leu Gln Tyr Thr Val Gly Asp 
        355                 360                 365             


Asp Asp Ser Asn Ile Asp Phe Glu Pro Ile Asn Ser Asn Glu Gly Val 
    370                 375                 380                 


Tyr Ile Ala Asn Thr Tyr Thr Ala Ala Ser Ile Ser Lys Tyr Gln Lys 
385                 390                 395                 400 


Leu Leu Gln Arg Lys Glu Gly Gln Lys Ser Ser Gly Ser Ser Leu Thr 
                405                 410                 415     


Leu Asp Ser Phe Lys Ile Glu Asn Met Lys Lys Thr Lys Ile Thr Phe 
            420                 425                 430         


Asn Lys Gly Gly Asp Trp His Ala Ile Lys Ala Pro Glu Phe Asn Tyr 
        435                 440                 445             


Ala Gly Asn Pro Ile Arg Cys Ser Gly Asp Cys Ser Leu Asn Phe Lys 
    450                 455                 460                 


Gly Arg Thr Glu Ser Gln Gly Thr Pro Val Tyr Ser Thr Asp Asn Ala 
465                 470                 475                 480 


Pro Gly Ile Ile Leu Ala Thr Gly Asn Val Gly Ser Tyr Leu Thr Asn 
                485                 490                 495     


Asn Gln Asp Glu Leu Arg Thr Tyr Leu Ser Ile Asp Gly Gly His Thr 
            500                 505                 510         


Trp Lys Glu Ile Gln Val Gly Ser His Glu Tyr Glu Ile Gly Asp Gln 
        515                 520                 525             


Gly Gly Ile Ile Ala Met Ala Arg Asp Asp Lys Leu Thr Asn Glu Val 
    530                 535                 540                 


Ile Tyr Ser Val Asp Glu Gly Glu Thr Trp Arg Lys Leu Asn Phe Lys 
545                 550                 555                 560 


Asp Glu Asn Lys Phe Lys Val Asp Ser Phe Val Thr Glu Glu Gly Asn 
                565                 570                 575     


Asp Glu Arg Thr Phe Leu Phe Tyr Gly Thr Lys Thr Gly Ala Asp Gly 
            580                 585                 590         


Asn Thr Lys Gly Val Ile Gly Ala Ile Asn Phe Ser Asn Leu Phe Lys 
        595                 600                 605             


Lys Glu Cys Thr Gly Phe Glu Asn Pro Gly Glu Asp Gly Ser Asp Tyr 
    610                 615                 620                 


Glu Arg Trp Val Pro Leu Asn Phe Glu Gly Lys Lys Cys Leu Phe Gly 
625                 630                 635                 640 


Ser Lys Ile Ser Tyr Ile Arg Lys Lys Thr Asp Ser Ser Cys Phe Asn 
                645                 650                 655     


Asn Arg Lys Val Gly Asp Leu Arg Met Val Gln Gly Ser Cys Glu Cys 
            660                 665                 670         


Thr Glu Glu Asp Phe Glu Cys Asp Tyr Gly Phe Thr Lys Asp Leu Ile 
        675                 680                 685             


Asp Glu Thr Lys Cys Val Pro Ile Asn Ala Lys Phe Ala Lys Lys Arg 
    690                 695                 700                 


Asp Gln Pro Pro Leu Asn Cys Lys Asp Phe Tyr Phe Val Ser Ser Gly 
705                 710                 715                 720 


Lys Arg Lys Ile Ala Asn Asn Gln Cys Gln Gly Gly Ile Glu Glu Leu 
                725                 730                 735     


Tyr Thr Lys Lys Lys Val Arg Cys Pro Gly Asn Glu Glu Ala Gln Gln 
            740                 745                 750         


Thr Gln Gln Gln Thr Gln Asn Thr Gln Ala Asn Thr Ala Gln Asn Asn 
        755                 760                 765             


Gln Gln Asp Leu Phe Ser Arg Lys Pro Glu Asp Ile Lys Lys Glu Ile 
    770                 775                 780                 


Lys Glu Gln Tyr Gly Asn Gln Thr Asp Gln Thr Ser Gly Ile Ser Phe 
785                 790                 795                 800 


Leu Gly Val Leu Ala Ala Phe Leu Val Leu Phe Leu Leu Tyr Thr Tyr 
                805                 810                 815     


Arg Val Glu Ile Leu Ser Lys Ile Lys Glu Tyr Gln Gln Asn Gln Lys 
            820                 825                 830         


Asn Lys Lys Gly Asp Asn Asn Lys Tyr Gly Tyr Lys Gln Lys Ser Tyr 
        835                 840                 845             


Gly Asn Asn Ala Glu Gln Tyr Ser Leu Phe Gln Asn Asp Gln Asp Asn 
    850                 855                 860                 


Asp Glu Tyr Asp Ala Asp Met Leu 
865                 870         


<210>  8
<211>  2689
<212>  DNA
<213>  Tetrahymena thermophila

<400>  8
gaaattacaa aaagcaatct ttttagagta gcatttaaaa taaattataa attaggtatt       60

tgtttagatt atgaaaaaat aagatctgac agtatatgtt gcagctttcc tgcttctctt      120

ttcttgtgtt attcactttg ctaatgctca agataaagtt agtgaaattt ttaaagacaa      180

atatgatgtc aaatatagag taactgaatt agattcacct gtttaggaaa ttctatggtg      240

cggtagttct taagcaacat ctgaagacgg agatattatc acctatgatt aaacagcaaa      300

agttagaaaa ctttatgtct taactgataa aggtaaattg tattactcag aagactatgg      360

cattacattg aagttgatta atgatgatat ccgtcaatca accaattcca aataaactta      420

ggtcgaagtc gatgatatca tgatctcacc tgttaaaaat agaaaagtgt tcatcttcac      480

taaaagcggt gaaagctatt atacagaaaa ctgtggtgcc acttatactt ctttcaagca      540

cgagattctc ctatacgata tctagcccaa tccttctgat cacaagtctt tgataggact      600

tgtacccgtt tagtgctaaa aaggagatcc tgagtgctaa ggtggtgatt ctgatttata      660

cttaacagta gatagcggta tgacttggag aaaaatagtc tctaacgtaa atcaagcata      720

gtgggataag accaaataaa ctctcatgaa cacataaaat agaattattt tgtctcatta      780

agagtaagaa aagaatgaaa aaggagaaaa tgtattcctc aataaagtaa gctacactga      840

taactatggt aaagatttaa aagtggtaga aaagaatgga gttagattct attaaacaga      900

agaatatatt tttgttttaa tctaaggaaa ggaatttggc aaatataaac ttaatattgg      960

accttctttt gttactcaat cttctagcag aaaagagatc gatttacctc tttaaagagt     1020

taaagatgaa tcttttactg tcttggacat agatgcaggc taaattctta tcgctattaa     1080

tcatgaaggt gacagtgctg gatacactaa tgtttacatt tcaaactcct aaggagaata     1140

gttcactctt tcacttcaat atacagtagg tgatgatgat tctaacattg attttgaacc     1200

cattaacagc aacgaaggag tttatattgc aaacacatac actgcagctt caatttcaaa     1260

atatcaaaag cttttgcaaa gaaaagaagg acaaaaatct tctggatctt cactcacttt     1320

ggattcattt aaaattgaaa atatgaaaaa aactaaaatt acatttaaca agggtggtga     1380

ctggcacgca atcaaggctc ccgaattcaa ttatgctgga aatcctattc gttgctctgg     1440

tgactgttct cttaacttta aaggaagaac tgagtctcaa ggtactccag tctattctac     1500

tgataatgct cctggtatta ttttggctac aggtaatgtt ggctcttatc tcactaataa     1560

tcaagatgaa ttaagaactt atctttctat tgatggtgga cacacatgga aagagattca     1620

agttggatct catgaatacg aaattggtga ttaaggcggt atcatcgcta tggctagaga     1680

cgataagctt acaaacgaag ttatttactc tgttgatgaa ggagaaacat ggagaaaatt     1740

gaatttcaag gatgaaaata aatttaaagt agatagtttt gttacagaag aaggcaacga     1800

tgaaagaact ttcttgttct atggaaccaa gactggtgca gatggaaata ctaaaggtgt     1860

aattggtgct atcaactttt caaatttatt caaaaaggaa tgcacaggat ttgaaaaccc     1920

tggcgaagat ggcagtgatt atgagagatg ggtcccatta aactttgaag gaaaaaaatg     1980

cttatttggt tcaaaaattt catacataag aaaaaaaact gattctagtt gctttaacaa     2040

cagaaaagtt ggtgatttaa gaatggtcta aggatcttgt gaatgtacag aagaagattt     2100

cgaatgtgat tatggtttca ctaaagattt aattgatgaa acaaaatgtg ttccaataaa     2160

tgcaaaattt gcaaagaaaa gagactaacc acctttgaac tgcaaagatt tttactttgt     2220

ttcttcagga aaaagaaaaa ttgcaaacaa ctaatgttaa ggcggtattg aagaattata     2280

tacaaagaaa aaagtaagat gcccaggaaa tgaagaagct cagcaaactt agcaataaac     2340

tcaaaatact taagctaata cagcttaaaa taactagtaa gacttattta gcagaaagcc     2400

agaagatata aaaaaagaaa taaaagaata atatggcaat taaacagatt agacatcagg     2460

aatatccttc ctcggtgttt tggcagcttt cttagtatta ttcttattat atacttacag     2520

ggtagaaata cttagcaaga taaaagaata tcaataaaac caaaagaaca aaaagggtga     2580

taacaataaa tatggctata agcaaaaatc ctatggaaat aatgctgaac agtattcact     2640

tttctaaaat gatcaagaca atgatgaata cgatgcagat atgctttga                 2689


<210>  9
<211>  6277
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA Construct

<400>  9
ttgtaagcgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt       60

ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag      120

ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg      180

tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat      240

caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc      300

gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga      360

aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac      420

ccgccgcgct taatgcgccg ctacagggcg cgtcccattc gccattcagg ctgcgcaact      480

gttgggaagg gcgatcggtg cgggcctctt cgctattacg ccagctggcg aaagggggat      540

gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa      600

cgacggccag tgaattgtaa tacgactcac tatagggcga attgggtacc gggccccccc      660

tcgaggtcga cggtatcgat aagctctgat tgttaaatgt tgaaagagta tttttatgag      720

aagtattttt tgttttgaaa tcagaaattt tattcctctt ttttagtaaa aatacttaat      780

tgttatttat gtaacaaatg taataaaatc gcaaatgaaa tattcttttt aaccaattaa      840

ataaataata ctatttttaa ttaaaatgat gagcatatta attttaaaat ggatcttttt      900

aattaatgtt aaattataat atttaacaat aaaaaatatg ctgttgatat tttaataaat      960

tcgcaatcaa gaaattattg attttattat ttctattaat aatatttatt aattaattat     1020

ttaatgtaga aataaataaa taaattatga aagaaaataa aaatattaga caagatagat     1080

tgatagaaaa caaaaaataa ttagtgaaaa cattttagtt ttaaaacaaa ttaataagac     1140

tgtttattta acaaatattc agtagttagt ttgttagtta gtattgtatt cattttattt     1200

tgtaaaatga ttgattacat taaaattaat aatcattaat taattaattg cttatgctct     1260

caagtaattt tttaatgata agcttgatat cgaattcaga tcccccgggc tgcatttttc     1320

cagtaaaaat ttgaaaattt aatggcaaaa aaaaatatta ttattggatt tgcagacaaa     1380

tttttaagag ctaacatgta tgtgaagagg aatttttttt tttagaaagt taaaaaaaat     1440

aattgacata aaatatatat acaaatgagt tgtaaaataa tgattttagt caatttggaa     1500

taaattatat tttatagtag tatattaaca cgtttttttg gtgctttaat gttaatataa     1560

tacactaaaa attaatttta tataatatat ttattttata tgaaagtttg taaatatata     1620

ttgaattttt aatttaagga tctcagaaga attcgtcaag aagacgatag aaggcgatac     1680

gttgagaatc gggagcggcg ataccgtaaa ggacaaggaa acggtcagcc cattcaccac     1740

caagttcttc agcaatatca cgggtagcta aggcaatatc ttgataacgg tcggcgacac     1800

caagacgacc acagtcgatg aaaccagaaa aacgaccatt ttcaaccatg atattgggta     1860

agcaggcatc accatgggtg acgacaagat cttcaccgtc gggcatacgg gccttaagtc     1920

tggcgaaaag ttcggcaggg gcaagacctt gatgttcttc gtcaagatca tcttgatcga     1980

caagaccggc ttccatacga gtacgagcac gttcgatacg atgtttggct tggtggtcga     2040

aagggcaggt agcgggatca agggtatgaa gacgacgcat agcatcagcc atgatagaaa     2100

ctttttcggc aggagcaagg tgagaagaaa gaagatcttg accggggact tcacctaaaa     2160

gaagccagtc tctaccggct tcagtgacaa cgtcaaggac agcagcgcaa ggaacaccgg     2220

tggtggcaag ccaagaaaga cgggcagctt catcttgaag ttcattaagg gcaccagaaa     2280

ggtcggtctt gacgaaaaga acaggacgac cttgagcaga aagacggaag acggcggcat     2340

cagagcaacc gatggtttgt tgagcccagt cataaccgaa aagtctttcg acccaagcgg     2400

cgggagaacc agcgtgtaaa ccatcttgtt caatcattat tttaagttta gtattattat     2460

ttattttatt agagctttat taaatttttt taattttttt aaattatata aagaataaaa     2520

aagacgaata tatatatata cactatttac attattttat atggatcatt gtataaatcg     2580

tgaatcacgt agctaagaat tatatcagaa atataaaaaa ttactttata ttcaagagag     2640

attcaagaat cacatctata ttttagaata gaagaatttt gaaaattagt taggttgact     2700

catgatttaa atcatgagtc aatcaattta tattttttat cagaaataaa aagatttaca     2760

aataattcat gacacaaaat tcaagaatca caacttaata ttaaaatata atagaaacgg     2820

ataattgaaa ataaaaaata aatgatagcc taaataatga gtaatatttt gaaaattaat     2880

gattcacata ttataattga tgaatgagct atgttttgag cagcttatat atttaataaa     2940

taaaataatt gatatttatc tattttatat ttcatgtttt ctttaaaaaa catgtcatct     3000

tttttatcaa tatatttgaa atttaaagaa aataattgaa taaacgatac aatatatttt     3060

aagatatata aaaaagtttt gctttcaaga tattaaaaat agtgatataa aaaataagta     3120

ctctattatg ttttttctta ttcagtatta tacctttaat cattattatc tttttattta     3180

tttttagtta gttatttttt atttttatga atatttaaag agctaaaaaa aatttaaaaa     3240

tgtgtattta aattaaagga gttattcaaa acccttatta ttttttattt ttaaatattt     3300

tttagaaata aattgtatat cgaattcctg cagctaaaaa gattgctttt tgtaatttct     3360

atgattttaa gagtattttt taatttaata tttttaatat ttaattaatt atgatttttt     3420

tttttttttg atgagaagat acactttatt aaataacata gttcgaaata tcataatcaa     3480

cttcttttaa aaaatttttt ttaagaatca aacaaactct ctaacataca cgcattcgct     3540

catttattaa aatttttacg ttttgcaaat ttaatttgtt ggcacttttg tatcttcact     3600

caattaccaa aattttctct caattttcct tccttttata aaataaccaa tgataatttt     3660

tgatccaata cgttttaaaa tttagtcttt cttttaaaat aataacaaag aaagataaat     3720

acatgagtaa aaataaaaaa agcagatagc tatgcaattt attaattttt ttgaaattta     3780

taaatatttt tggagatatt ttttcattat agtgattaaa attaatttta tttagaaaaa     3840

tcaagtttta tttatgaata aacagttatt tacttaagat tttgttttca ctattagtat     3900

tctgttttaa atctttaagt attttcttag ttaacaatct tacaatcctt attttgattg     3960

ctatttaaaa ttaaaatatt ttaaatagaa catttaacat aacagatatg aaaataaaca     4020

gcgttttacg ctagcgcatg ctctagagcg gccgccaccg cggtggagct ccagcttttg     4080

ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc tgtttcctgt     4140

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa     4200

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc     4260

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag     4320

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt     4380

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga     4440

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg     4500

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa     4560

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt     4620

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct     4680

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct     4740

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc     4800

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt     4860

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc     4920

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat     4980

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa     5040

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa     5100

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga     5160

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct     5220

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga     5280

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc     5340

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg     5400

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat     5460

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat     5520

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg     5580

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc     5640

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa     5700

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc     5760

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt     5820

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag     5880

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt     5940

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag     6000

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac     6060

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc     6120

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca     6180

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg     6240

ggttccgcgc acatttcccc gaaaagtgcc acctaaa                              6277


<210>  10
<211>  6648
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA Construct

<400>  10
cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag       60

ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac      120

cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga      180

ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc      240

accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg      300

gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa      360

gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac      420

caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct      480

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa      540

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg      600

ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgattta cgacaaattc aatatgccat      720

ttcaaaagta atctgagttt ctggaagttt aatagaataa aaattacaga attgatattt      780

aaatcaaaat tttcaagcta aattagattg atttttaagt tactcaaata ataaaggtag      840

taaataaaat caaatttctc aaacaataat tatcttctac ataaaactgc tttcttaata      900

cccctaatta aaaaagatat atttttgaaa atttaaacaa aatttggaag aaaaattaat      960

ttcatttgat aaattttatt taaagtaggt tctccataac taacccctcc cctaatcaaa     1020

tatttgtaaa agctttgggt ttttttctaa aaaattttca aaatttattt ttttcaaaac     1080

atatttatta tttcaagtta aacattttgt gaatttaatg atttataaaa actcaaaaaa     1140

atagttgttt agaattatta tttagcttta tttgttatat tattaagata tattacattt     1200

tgccttttat aaattaaata cgcatttcac aaaagactgt tcatttataa gattattcat     1260

caatatatta atatattatt ttttgctatt tttttaattt ggttatttaa aattctagaa     1320

tcatgattaa gtatttattt ttaattatct tgattataaa taatctaaat ttttatgtta     1380

actaaaaatc tttggtagta aataaataaa ttattattat tattattaaa aaatcaatac     1440

tattaaaatt atttttaaat taaattaatc tacaaagaaa tttatgaaaa aacaaaaaat     1500

caattagtta attaagcttg atatcgaatt cagatccccc gggctgcatt tttccagtaa     1560

aaatttgaaa atttaatggc aaaaaaaaat attattattg gatttgcaga caaattttta     1620

agagctaaca tgtatgtgaa gaggaatttt tttttttaga aagttaaaaa aaataattga     1680

cataaaatat atatacaaat gagttgtaaa ataatgattt tagtcaattt ggaataaatt     1740

atattttata gtagtatatt aacacgtttt tttggtgctt taatgttaat ataatacact     1800

aaaaattaat tttatataat atatttattt tatatgaaag tttgtaaata tatattgaat     1860

ttttaattta aggatctcag aagaattcgt caagaagacg atagaaggcg atacgttgag     1920

aatcgggagc ggcgataccg taaaggacaa ggaaacggtc agcccattca ccaccaagtt     1980

cttcagcaat atcacgggta gctaaggcaa tatcttgata acggtcggcg acaccaagac     2040

gaccacagtc gatgaaacca gaaaaacgac cattttcaac catgatattg ggtaagcagg     2100

catcaccatg ggtgacgaca agatcttcac cgtcgggcat acgggcctta agtctggcga     2160

aaagttcggc aggggcaaga ccttgatgtt cttcgtcaag atcatcttga tcgacaagac     2220

cggcttccat acgagtacga gcacgttcga tacgatgttt ggcttggtgg tcgaaagggc     2280

aggtagcggg atcaagggta tgaagacgac gcatagcatc agccatgata gaaacttttt     2340

cggcaggagc aaggtgagaa gaaagaagat cttgaccggg gacttcacct aaaagaagcc     2400

agtctctacc ggcttcagtg acaacgtcaa ggacagcagc gcaaggaaca ccggtggtgg     2460

caagccaaga aagacgggca gcttcatctt gaagttcatt aagggcacca gaaaggtcgg     2520

tcttgacgaa aagaacagga cgaccttgag cagaaagacg gaagacggcg gcatcagagc     2580

aaccgatggt ttgttgagcc cagtcataac cgaaaagtct ttcgacccaa gcggcgggag     2640

aaccagcgtg taaaccatct tgttcaatca ttattttaag tttagtatta ttatttattt     2700

tattagagct ttattaaatt tttttaattt ttttaaatta tataaagaat aaaaaagacg     2760

aatatatata tatacactat ttacattatt ttatatggat cattgtataa atcgtgaatc     2820

acgtagctaa gaattatatc agaaatataa aaaattactt tatattcaag agagattcaa     2880

gaatcacatc tatattttag aatagaagaa ttttgaaaat tagttaggtt gactcatgat     2940

ttaaatcatg agtcaatcaa tttatatttt ttatcagaaa taaaaagatt tacaaataat     3000

tcatgacaca aaattcaaga atcacaactt aatattaaaa tataatagaa acggataatt     3060

gaaaataaaa aataaatgat agcctaaata atgagtaata ttttgaaaat taatgattca     3120

catattataa ttgatgaatg agctatgttt tgagcagctt atatatttaa taaataaaat     3180

aattgatatt tatctatttt atatttcatg ttttctttaa aaaacatgtc atctttttta     3240

tcaatatatt tgaaatttaa agaaaataat tgaataaacg atacaatata ttttaagata     3300

tataaaaaag ttttgctttc aagatattaa aaatagtgat ataaaaaata agtactctat     3360

tatgtttttt cttattcagt attatacctt taatcattat tatcttttta tttattttta     3420

gttagttatt ttttattttt atgaatattt aaagagctaa aaaaaattta aaaatgtgta     3480

tttaaattaa aggagttatt caaaaccctt attatttttt atttttaaat attttttaga     3540

aataaattgt atatcgaatt cctgcagaaa gatatttaat cacttaataa ctaagtctgt     3600

ttctcatgcc aagaaaaatt caactaacaa taagtttatc aaaaattttc tatttagatg     3660

tagaaaagaa aaagaaaaaa caaatctaaa ggttattagc atattttttc ttcttaaaca     3720

aggattaatt tttacgtttt taaatttcag accaatcaat caatcatgaa tgataataga     3780

tatttttaaa atatagcttt aaaaaaatac aatatttaac gagattataa tatttttttt     3840

taatactaaa attcttcgct ttgctgagca atttgatttg aaaaagctaa tcaactttat     3900

taattttttt cggattaggt ttttaaaatt ttataaggaa taaacgtttt ttaatgatat     3960

gctatttagg atactgcttt tttaaagtaa ttttttaatt tagatttaag tttactctaa     4020

caataaggat ttaaatataa acaatttaca aataatttta tatagattag aattttaatt     4080

tatttattta tttacttatt aatttaagtt aaattattta atttgattta actaaaattt     4140

attttgaagt tatattacaa aattttcatt ttatgttaaa ctcagttagt ttgatcattg     4200

tttcatacat ctgattaaat attttaatat gatgaggaac caaacttgtg actttaatta     4260

tttgaattaa taaaaaaatt ctgcatatcg ttgctgtctt attttaagtt tagctttaca     4320

ttatataaaa gactatctat tggttggtat tactattatt tattatttaa taatgatgtt     4380

atttactagc tgcctaatcc agactgaggc tagcgcatgc tctagagcgg ccgccaccgc     4440

ggtggagctc cagcttttgt tccctttagt gagggttaat ttcgagcttg gcgtaatcat     4500

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag     4560

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg     4620

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa     4680

tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca     4740

ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg     4800

taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc     4860

agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc     4920

cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac     4980

tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc     5040

tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata     5100

gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc     5160

acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca     5220

acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag     5280

cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta     5340

gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg     5400

gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc     5460

agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt     5520

ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa     5580

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat     5640

atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga     5700

tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac     5760

gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg     5820

ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg     5880

caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt     5940

cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct     6000

cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat     6060

cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta     6120

agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca     6180

tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat     6240

agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac     6300

atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa     6360

ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt     6420

cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg     6480

caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat     6540

attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt     6600

agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgc                  6648


<210>  11
<211>  6681
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA Construct

<400>  11
cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag       60

ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac      120

cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga      180

ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc      240

accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg      300

gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa      360

gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac      420

caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct      480

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa      540

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg      600

ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gctgaagata aattattgct tcaatcattt      720

gctcagctaa ttatattagc taatttctat caaagcattt gtaggaagac agagaaaaat      780

cacagtcttt aaataaaata acaaaaaatt tattaatttt aaaggcatca tagatttttg      840

atatatataa attatcacat cctctaacga gatcaacata attttaggcc tgttacagct      900

tatcaaagca gtaagagtta ttcccatcat ctattgatat aagatatatt agataagatt      960

cttttctatt attaactgca taatatttcc catatttttc gttttttatc acctaaaagt     1020

attcattttt attttataaa agctagtcta ttatttaagc tatttatttt tcatttaaaa     1080

tatcatttta tacatttgtt gaaaaccctt aatccatatt tttataattt ttttttttca     1140

aaatttctta tcaaaaattt tttaattaaa acaaattcct ttaaaaacat ttaacttaaa     1200

tttggaagta aattattaga aggtttgtta gaatattttc aaactaggaa taataaagta     1260

aaaaaggatt tgataaaata cataagttaa gagaaaaaga tacttgacac actatattaa     1320

attacttaga aaaactttct aaataatatt aatgaaagac aaaaattaat ttatttattt     1380

ttatttattt attatttaag taaaaaaata aattagcaat attattcagt caattatttt     1440

aagcttgata tcgaattcag atcccccggg ctgcattttt ccagtaaaaa tttgaaaatt     1500

taatggcaaa aaaaaatatt attattggat ttgcagacaa atttttaaga gctaacatgt     1560

atgtgaagag gaattttttt ttttagaaag ttaaaaaaaa taattgacat aaaatatata     1620

tacaaatgag ttgtaaaata atgattttag tcaatttgga ataaattata ttttatagta     1680

gtatattaac acgttttttt ggtgctttaa tgttaatata atacactaaa aattaatttt     1740

atataatata tttattttat atgaaagttt gtaaatatat attgaatttt taatttaagg     1800

atctcagaag aattcgtcaa gaagacgata gaaggcgata cgttgagaat cgggagcggc     1860

gataccgtaa aggacaagga aacggtcagc ccattcacca ccaagttctt cagcaatatc     1920

acgggtagct aaggcaatat cttgataacg gtcggcgaca ccaagacgac cacagtcgat     1980

gaaaccagaa aaacgaccat tttcaaccat gatattgggt aagcaggcat caccatgggt     2040

gacgacaaga tcttcaccgt cgggcatacg ggccttaagt ctggcgaaaa gttcggcagg     2100

ggcaagacct tgatgttctt cgtcaagatc atcttgatcg acaagaccgg cttccatacg     2160

agtacgagca cgttcgatac gatgtttggc ttggtggtcg aaagggcagg tagcgggatc     2220

aagggtatga agacgacgca tagcatcagc catgatagaa actttttcgg caggagcaag     2280

gtgagaagaa agaagatctt gaccggggac ttcacctaaa agaagccagt ctctaccggc     2340

ttcagtgaca acgtcaagga cagcagcgca aggaacaccg gtggtggcaa gccaagaaag     2400

acgggcagct tcatcttgaa gttcattaag ggcaccagaa aggtcggtct tgacgaaaag     2460

aacaggacga ccttgagcag aaagacggaa gacggcggca tcagagcaac cgatggtttg     2520

ttgagcccag tcataaccga aaagtctttc gacccaagcg gcgggagaac cagcgtgtaa     2580

accatcttgt tcaatcatta ttttaagttt agtattatta tttattttat tagagcttta     2640

ttaaattttt ttaatttttt taaattatat aaagaataaa aaagacgaat atatatatat     2700

acactattta cattatttta tatggatcat tgtataaatc gtgaatcacg tagctaagaa     2760

ttatatcaga aatataaaaa attactttat attcaagaga gattcaagaa tcacatctat     2820

attttagaat agaagaattt tgaaaattag ttaggttgac tcatgattta aatcatgagt     2880

caatcaattt atatttttta tcagaaataa aaagatttac aaataattca tgacacaaaa     2940

ttcaagaatc acaacttaat attaaaatat aatagaaacg gataattgaa aataaaaaat     3000

aaatgatagc ctaaataatg agtaatattt tgaaaattaa tgattcacat attataattg     3060

atgaatgagc tatgttttga gcagcttata tatttaataa ataaaataat tgatatttat     3120

ctattttata tttcatgttt tctttaaaaa acatgtcatc ttttttatca atatatttga     3180

aatttaaaga aaataattga ataaacgata caatatattt taagatatat aaaaaagttt     3240

tgctttcaag atattaaaaa tagtgatata aaaaataagt actctattat gttttttctt     3300

attcagtatt atacctttaa tcattattat ctttttattt atttttagtt agttattttt     3360

tatttttatg aatatttaaa gagctaaaaa aaatttaaaa atgtgtattt aaattaaagg     3420

agttattcaa aacccttatt attttttatt tttaaatatt ttttagaaat aaattgtata     3480

tcgaattcct gcagatctat tttttctctc aattattttc ttttcaagga tttgtttttt     3540

ttttgttggt tattctatta attaaggcaa gatgaatgct tctatcaaaa aaaatacgtt     3600

ttttgatttg taattttttc ctattgattt caattatgtt tttaaaatta agtcatattc     3660

ttgtttatca agtttcatca gtaattcaag ctcattaaaa tctttaaaaa aatctttcta     3720

aatagtttca attatactga agtgattcaa tcatttttta atcaaaaata tatttcagtc     3780

aattataatt tcattcatca aaataaaatg agatatttct aaaactgatt cataatttta     3840

gaaaattctt taatataaaa agatacattt tttaacttaa taatattttg gcattacata     3900

gctaatacaa aaatatgatt aatacaataa tgtaaatcat aagattaata tattagtaaa     3960

acaaaacata aaatcaagta ctgaattgtt ttattaatat attattttag taaaaatact     4020

ttcaaaatat tttttgaact aaagttgtaa ctaattatta ttttaacacc gtaaaaaata     4080

aaaaagttta aaagatttta aatattaaat aaactaacaa accatattca aatatattta     4140

aaaatagtaa aaactaaata ataaatattt cttaaattta tgcttcaaat aaaatttttc     4200

aatcagttaa ctatttttat attcaattta ttagatgtga taaattatat aaattaattc     4260

tttgtttttc atttgttaat tttttatttt gtttcagtaa atgatatctt ttaatttctt     4320

cattcaaatt ccttaaaact atataataag gacaaattaa actcataaat atattctcaa     4380

atagttatta attttatata tcataattct tctatacaat tatccaatca taaaagtgga     4440

agctagcgca tgctctagag cggccgccac cgcggtggag ctccagcttt tgttcccttt     4500

agtgagggtt aatttcgagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt     4560

gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg     4620

gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt     4680

cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt     4740

tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc     4800

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg     4860

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     4920

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     4980

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     5040

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     5100

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     5160

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     5220

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac     5280

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt     5340

tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc     5400

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca     5460

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat     5520

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac     5580

gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt     5640

aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc     5700

aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg     5760

cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg     5820

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc     5880

cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta     5940

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg     6000

ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct     6060

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta     6120

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg     6180

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga     6240

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt     6300

gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca     6360

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt     6420

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt     6480

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga     6540

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt     6600

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc     6660

gcacatttcc ccgaaaagtg c                                               6681


<210>  12
<211>  6564
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic DNA Construct

<400>  12
cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag       60

ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac      120

cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga      180

ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc      240

accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg      300

gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa      360

gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac      420

caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct      480

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa      540

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg      600

ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gctaactaga atattaattg ctaaagtcaa      720

aaatctctaa tataagaaag aaaatattga aatgagaatt taatttaaaa ttaaatgata      780

atgcctatga gttaattaat ttgattaaag aaaggtattt gtttctttga gttcatattt      840

aatcaccagc acttaaaata tgttgatatt tgaattattt aaatactttc ttaaatatta      900

ttaccgtaat agtgtttgaa gtttaagaaa taatgtactt tggttataaa acaatttttt      960

tactattaat agaagggagt gactgtatat ttttcagccg aattattttt tttaaatatt     1020

cgaaattaaa aaataaaaag tttaaaatca taaaaattaa atgacatatc accacctgta     1080

cctacatagt tcgtgatatt ttaattacag gattagcaat atttatacat ataaaaatat     1140

taaatgcttt ttttaagatt ttagttattt ataaaatata cttataagat aaacaatttc     1200

aatttataat aatatattta aattcaatta aaatcctctt ttaaacttta ataattaaca     1260

aaacatttag cataatctgt tttgattgta gattttaaaa tagattaaat taaattatta     1320

agaaattttt tagaaaacta aaaaattaat ttacaaaaaa taaaatatta aacttacaaa     1380

ttaattaaaa tgaagaaatt tgttatttta ataagcttga tatcgaattc agatcccccg     1440

ggctgcattt ttccagtaaa aatttgaaaa tttaatggca aaaaaaaata ttattattgg     1500

atttgcagac aaatttttaa gagctaacat gtatgtgaag aggaattttt ttttttagaa     1560

agttaaaaaa aataattgac ataaaatata tatacaaatg agttgtaaaa taatgatttt     1620

agtcaatttg gaataaatta tattttatag tagtatatta acacgttttt ttggtgcttt     1680

aatgttaata taatacacta aaaattaatt ttatataata tatttatttt atatgaaagt     1740

ttgtaaatat atattgaatt tttaatttaa ggatctcaga agaattcgtc aagaagacga     1800

tagaaggcga tacgttgaga atcgggagcg gcgataccgt aaaggacaag gaaacggtca     1860

gcccattcac caccaagttc ttcagcaata tcacgggtag ctaaggcaat atcttgataa     1920

cggtcggcga caccaagacg accacagtcg atgaaaccag aaaaacgacc attttcaacc     1980

atgatattgg gtaagcaggc atcaccatgg gtgacgacaa gatcttcacc gtcgggcata     2040

cgggccttaa gtctggcgaa aagttcggca ggggcaagac cttgatgttc ttcgtcaaga     2100

tcatcttgat cgacaagacc ggcttccata cgagtacgag cacgttcgat acgatgtttg     2160

gcttggtggt cgaaagggca ggtagcggga tcaagggtat gaagacgacg catagcatca     2220

gccatgatag aaactttttc ggcaggagca aggtgagaag aaagaagatc ttgaccgggg     2280

acttcaccta aaagaagcca gtctctaccg gcttcagtga caacgtcaag gacagcagcg     2340

caaggaacac cggtggtggc aagccaagaa agacgggcag cttcatcttg aagttcatta     2400

agggcaccag aaaggtcggt cttgacgaaa agaacaggac gaccttgagc agaaagacgg     2460

aagacggcgg catcagagca accgatggtt tgttgagccc agtcataacc gaaaagtctt     2520

tcgacccaag cggcgggaga accagcgtgt aaaccatctt gttcaatcat tattttaagt     2580

ttagtattat tatttatttt attagagctt tattaaattt ttttaatttt tttaaattat     2640

ataaagaata aaaaagacga atatatatat atacactatt tacattattt tatatggatc     2700

attgtataaa tcgtgaatca cgtagctaag aattatatca gaaatataaa aaattacttt     2760

atattcaaga gagattcaag aatcacatct atattttaga atagaagaat tttgaaaatt     2820

agttaggttg actcatgatt taaatcatga gtcaatcaat ttatattttt tatcagaaat     2880

aaaaagattt acaaataatt catgacacaa aattcaagaa tcacaactta atattaaaat     2940

ataatagaaa cggataattg aaaataaaaa ataaatgata gcctaaataa tgagtaatat     3000

tttgaaaatt aatgattcac atattataat tgatgaatga gctatgtttt gagcagctta     3060

tatatttaat aaataaaata attgatattt atctatttta tatttcatgt tttctttaaa     3120

aaacatgtca tcttttttat caatatattt gaaatttaaa gaaaataatt gaataaacga     3180

tacaatatat tttaagatat ataaaaaagt tttgctttca agatattaaa aatagtgata     3240

taaaaaataa gtactctatt atgttttttc ttattcagta ttataccttt aatcattatt     3300

atctttttat ttatttttag ttagttattt tttattttta tgaatattta aagagctaaa     3360

aaaaatttaa aaatgtgtat ttaaattaaa ggagttattc aaaaccctta ttatttttta     3420

tttttaaata ttttttagaa ataaattgta tatcgaattc ctgcagcaga cttcctctga     3480

tttcctaaca aagtaattct tacttttttg ttaaaacatt taaaaaaaaa caaaaatttt     3540

taataatttc taaaacgtgt tttatttaat ataaattcat gcatatggct aatatcttac     3600

gacttttcta aatatttaat tttataaatc tagattcaac ataaatagcg atcaactttt     3660

ttttatgagt cttaaaaatc tctacattta aaaacgaaaa attataagtt cagtcaactc     3720

caagctatta taagataatc atccatctaa aatcaataca gtcaattttt attttctata     3780

ttttcatagt ataaattttt atattttaaa ttgaaatttt tttaattttt cattttatta     3840

taaaaataga attagccaat ttgtaattta agatattaaa atttaatatt taaaattaaa     3900

tttacttaaa ggaaataaca taaaagtaaa acttacctta aaaattatga ttacctgtcc     3960

agaacatttt ttagcaatat aaaatatata tatccaaaat tttaaataaa attaaaaaaa     4020

actttgaata tattctgata taaaaatagt aaatacacaa tacatataaa aaaaaattca     4080

aatatttggt caaatttgat attatttaag ttcattatta attttaatta aataaataaa     4140

atatttacaa tcaatagtgt aaacttttaa atgataattt tacttttaga tataaatata     4200

aacaaacaaa atagagttat ataaaattta taaatagttt tagaaataat tcattttatt     4260

ttattttatt tgatgaaatt gtgttatgat aaaaaggaat ttacttattc ttcaattaga     4320

atacgctagc gcatgctcta gagcggccgc caccgcggtg gagctccagc ttttgttccc     4380

tttagtgagg gttaatttcg agcttggcgt aatcatggtc atagctgttt cctgtgtgaa     4440

attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct     4500

ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc     4560

agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg     4620

gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc     4680

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag     4740

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa     4800

aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc     4860

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc     4920

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg     4980

cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt     5040

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc     5100

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc     5160

cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag     5220

agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg     5280

ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa     5340

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag     5400

gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact     5460

cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa     5520

attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt     5580

accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag     5640

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca     5700

gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc     5760

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt     5820

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg     5880

ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca     5940

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg     6000

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca     6060

tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg     6120

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct     6180

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca     6240

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca     6300

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg     6360

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac     6420

ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt     6480

attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc     6540

cgcgcacatt tccccgaaaa gtgc                                            6564


