                         SEQUENCE LISTING

<110>  MALCISBO AG
 
<120>  VACCINES COMPRISING GLYCOENGINEERED BACTERIA

<130>  50797PCT

<160>  72    

<170>  PatentIn version 3.5

<210>  1
<211>  12928
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Identified APP2 O-antigen biosynthesis cluster of sequenced APP2 
       strain

<400>  1
atgcttgaac aagcatcaaa ccaaactaac gaggaaatcg atctgattga attaattcgc       60

gtgctttgga agaaaaaatt attaattgct attgttacct ttatttttac tgcattggca      120

gcagtttatg cttttaccgc gaaagagaaa tggacatctc aagcagaggt aattgcgcct      180

agagtgacag atatttccga atatttatcg ttacgaaaag aatataattt aattatgggt      240

tctgagttca aagaaaatga tattcgtaat gaactaaatg agcttttttc tcgttatgtg      300

ctttcttatg acgaagcaat ctctttcttt aagaccacag atacttataa aaaacttgca      360

gaaaaagaaa atgaagcagg tttacaaaga gcagcttctg aatttacaac ggaatcattg      420

aaggtgataa aaccggatgc gaaaaaagac cttaatgctt taggcagtaa gattgctatt      480

tcatctgaga ctgctttatc tgcacaaaca gagttaaatg attttattcg tcatattagt      540

gatatttctt ttaattttag taaaaatgaa tttatttatt gggtaaaaga gagcatatct      600

agcctaaatt acgagaaaga agtgatagag caagatcaga gtattcaacg aaaagttcag      660

attcaaaact tagaagccgc acttgatatg gcgaaaaaag caggaattaa agagtatagt      720

tctgcattgt catcaaatag ttctgtggct aatctcgcgg taagtgatac aaaaattccg      780

ttatcagatt ctaaattagc agatggaacc tatttattta tgcttggtga gaaaaatcta      840

caagcacaat tagatattgc gaaaaccaag gaaattgttt attcgccaag atattatcag      900

attcaagagc aacttttaaa attaaatact ttattaccta aagtagagaa agtaactggg      960

caaacttata gttatgtatc ttcgccgaca tatcctgtta tcaaagatgc acctaaaaaa     1020

ggaattattt tagttatagg tttcttggta ggattgttac tgagttcatt cactgttttg     1080

atttctgtat tagtgagaaa taagaagagc taattatgag gtatgaagag gaatattaac     1140

ctctttatgt gccttttgat gatccaattt actgaagtgc ttagactatt taggtaataa     1200

aatataggga taataatgaa gcataagata ttacatttta gccaagtgct tggcggtgtc     1260

ggtcgatatt tagagctata tgataagtac atcaataaag attcatttga aaatatatat     1320

atattaccta taggtgattg ggaggcggca gaagctcagg ataaacggta tatattaaat     1380

attgaacaat ccttttcacc aattaaattg atctctaatg ttataaaaat tagaaatatc     1440

ttaaaaaaag aaaagccaga tatcttttat ctacatagta cttttgctgg cgttattggg     1500

cggttagctg ctattggtat gaggtgtaaa gtaatttata accctcatgg ttggtcattt     1560

aagatgaatg tatctaggct aaaacaaact ttttataaga ttatcgaagg tggtttagtt     1620

ttcttaactg ataagtttgt tttaatttcg aaatcggagt atgaggcggc acgttcaatt     1680

ggtgtttcag agaagaaatg ttgtctcata tataacggta ttgaaacgat aaaaaaaaca     1740

gatatagcaa tcattcctaa attagatgat aaatatatca ttggaatgat aggacgtatt     1800

agtgagcaaa aaaatccaat gttttttgcc cagtttgcca aagagattat taaacaatac     1860

cctaatactt atttcatttt ggttggtgac ggtgagcaac gagaatcgtt agaagactat     1920

ttagaacgta ataacttgaa tgatgttttt tatattacgg gttgggtaac taatcccgaa     1980

agctacttaa atttatttga tcaagcagta ttattctcaa aatgggaagg gctatgtctt     2040

tcagtctgtg agtatatgtt atatgaaaag cctatattag taagtaatat tggtggtatt     2100

aacgatctta ttcagaatga agttaatggt tttactattg ttgaaggtga tcttaaggat     2160

gcggttaaca aatctaatag attaagaaat gagcctaaaa ctgtagctaa gtttattgaa     2220

gcctcaaaca tacttattca agagaaattt aatgctcaaa aaatggtaaa tagtttagaa     2280

aaacttttta tcaaattatc agagaataaa taatgaaaga aaaatttagt tgcattgttg     2340

tttgttataa ccccgataat tcggtgcttg acaatctcaa aaactatatt agttatgtgg     2400

gaaaagtaat cgtagttgat aattcagatg tggataattc tcaattattt tcctcacttt     2460

cagaatactt aatttatata ccattgtata aaaatgtggg tattgcctat gcactaaata     2520

taggagtaga aaagtccaaa gaattaggat atgaatatat cattactatg gatcaagata     2580

gttcttttgc tactaatcta gtggatgtat attcacatta tataagtaat tatcctatag     2640

atcagatagg agcattatcc ccagtttata ttacggacag gggatttaat cgaacaagta     2700

aagaagaatt taaacaaata aaaattacta tgcaatcagg ttctatgttc tttactgata     2760

aatttgatgt aatcggtcgc tttgataatg atctgttctt agatgtagtt gattgggaat     2820

atttctttag aatttatacg ttaggatata aaacgattca atgtaataaa gcaatgctga     2880

aacatgctcc agcggaaacg ctaacgttat ttaaaataaa aggaaaaaca attggtgttg     2940

gagtcgcttc tccattaagg tattactatc aaattagaaa tctactttgg tgtgttttac     3000

ataaaaagag tttttttatg ataaaaacaa tagcttataa atttattaag attctatttt     3060

tgtttaataa taaaaagcaa tatttatcat ttgcttatat ggctattaaa gacgccttta     3120

ataatcgttt aggggcatat gatacacttt atttagagaa atctcgtaat gaaaaatgat     3180

ttaccattaa ttagtattat tattcctatc tataacgtga agccttatct tgaaaaatgt     3240

gtaaatagtg tattatcaca atcatatcct aatcttgaaa ttattctagt tgatgacggt     3300

gcaactgatg gttctgctca agtgtgtgat gatttttctg aaaagtatgc aaatattcag     3360

gtaattcata agaaaaatgg tgggctatcc tcagcaagaa atgctggaat tgaagctatg     3420

aagggagaat acgtattttt cttagatagt gatgactgga ttgctaatga tgcaatttct     3480

caattatatg atgatatggt ggaatataat gcagatataa cagggattag tttttatcaa     3540

gcatattcag acggtaattt agtattaaat acacatctta ttgaaaaaca aatgctttca     3600

aagaaagagg ctttacgtac tttcctattt aataattacc ttactccttg ttcctgtgga     3660

aaactttata aagcaagtct atggaaagat ataagatttc cggagggacg attatttgaa     3720

gatcagctta ctacttataa agttatcgag ttagcaaata caattatttt taatcctgct     3780

gcaaagtatt tttattttaa aagaatagga tctatcggtc attctgcttt ttctgaaaaa     3840

acatatgacc tttatgaggc tgttaatgaa caatataatg aaataactaa gcatcatcct     3900

gatattgaat ctgatttggc ggttgctaaa attacttggg aaattgtatt tattaatatg     3960

atgctcaatt caaattattc agatcaagcg atagttgata aaacacgagt ttttgcaaga     4020

aaacgtattt tagatgtagt gaaatgtgag tttatcccta atttacgaaa atttcagatt     4080

actttatttg catataattt tagtttatat aaagttttat atgcaagata taaaaagaaa     4140

aatccattat cttaattatt gattttatcg gttgttacaa tgatacctaa aaaaattcat     4200

tattgttggt ttggtggaaa tccattacct aaaagtgtga aaaaatgtat taaatcttgg     4260

aaaaaatact gtccagatta cgagattatt gagtggaatg aatctaatta caatgtgcat     4320

aagaaccttt ttataaaaga agcttatgag aaaaaaaagt ttgcatttgt ttcagattat     4380

gctcgtttag atgtggttca ttctgaaggt gggatttatt tagatactga tgttgagttg     4440

ataaaaccta tagatgattt attagctcat agttgttttt tagcatctga atctattgat     4500

gatgttaata cagggctagg ttttggggct gaaaaaggac attggtttat cgcagaaaat     4560

atgagtgtct atgaaaatat gtactttaat atggaaaata ttattacctg tgtagagatt     4620

actactaaat tattaataga aagaggtttt tctgctagtg ataaaattca aaatatagat     4680

gatattttca tttatccaac tgagtatttt tgcccattaa attataaaac ccacgagttg     4740

catataacac agaatactta ttctatacat cactatgatg caacttggca aagccctctt     4800

atgaaattta aaacaaaaat taagtatata ttgtgtttag ccggaataat aaaatgaact     4860

ccttagtata tagaatagat attagaacac ttattttttc tattttttat tttacttttt     4920

tagtatcgga ttttttatta ttagctcaag atggcactat tacaaaagat atcatcaaat     4980

gggttaaatt attctcatta ttgccattgc tcttattaat atttaaattg cctttgaatc     5040

tcttgatttt aggttttttt actataatga taagtgcttt ttattctatt tatacgggag     5100

attcgttttt attatatata tgtttgctga tgtctttttc ttataaagtt aattttaact     5160

ttttattcaa gataggatta tatcttactt caattctagt tgttctaata ctaacttatt     5220

tcttttttga atattttctg attggtgaca gtcattttgt atatgatgcg acctattggt     5280

ttaaacgtta tacatttaat tttgataatc ctaatgcatt tcctatgaga atattcgttt     5340

tttttatatt ttatatattg catgtaggta agctgcgact ttttgataca tttctatttg     5400

ttatactatt tggaatagtt ttctattttt caaattctag aactgcattt tatattttta     5460

ttttgtgtgt ccttactatt cattttaacc aagtttttaa tgtgctaaat aatacttttg     5520

ttaaattact aattaataat tcaattatat ttataactat tttttcaatt tggtcggcta     5580

tatattatca agattattat tcctatttag aaccgattaa caaaatttta tctaaaagaa     5640

tatactttgc taatgaggct tataagagtt taggatttga attttaccct aggaatatta     5700

aatggtggat agaagaatct gattggcata ttatagataa tggatatgta tatttattta     5760

tttctggtgg tcttttagta ggaaatttat ttatattttc tataacttgg cttatgtata     5820

gactaaataa atttaaccta agtaatgagg caatattatt aatgttttct atgttatatc     5880

ttttatctga gagtcatttt ataaatatat tttacaatat acctatttta ttattagcta     5940

ttttcattaa taaaactaat attgtacgct atttggaatg taaaaaatga ataaaaacct     6000

tgtaaataat agtattatga gttttttgct tacaatatct aactttattt ttccattaat     6060

tacttttact tatgcggcaa gaattttgca acctgataat atgggaaagt ttgcattttc     6120

tctatcggtt gtagattatc tatctctatt tgctacattt ggtgttgtag gttatggtgt     6180

tagagcttgt gcagaagtaa gaaacaataa agaagaacta actaaaacgg tacaagaaat     6240

tttatttatt aatatttttt tagctattat tgcctatctt gtgatatttc ttctaattag     6300

ctatcagcat gcatttagag aagatacttt gttattctta attatgtctt cttgtattat     6360

ctttaatgtg ataggaatag aatggttata taaaagtctc gatgaatata gatacattac     6420

agtaagaagt attctattaa aaataatttc attaataatg attttatgtt ttgttaaaga     6480

aaaggatgat tatccacttt ttgcattgtt ttttgttcta ccaatttgtc tatcttcgtt     6540

gttaaatatt ataaattcaa gaaaaatatt gctttttaaa ttatttaaac ttgatttatc     6600

aaagcatata aaaccaatgt ttgttttatt tttagtgaca ttatcttata cattatatgc     6660

taatgttaat gatgtgctat tagctactgt aactaataca gaacaagttg gttactatag     6720

tgttgctttc aaaataaaag ctgcattatt agctttcatt actagtacaa gtatggtttt     6780

tttacctcga ttaacagagt atattaaaaa taatcaagat attgaattta ttgacttatt     6840

aagaaagtct tttgatctgg ttttttttct agctgtgcca ataacattat ttttcttttt     6900

atacgctaaa gaaacaatat ttttattgtt tggtgagaaa tataataagt caagtttatt     6960

attgcaaacc atgatatggt ctgttttttt tggtggttta aataatatat taagtgtaca     7020

aatgttattg cctttaaaaa aagataatca gttcttaatt tctattttaa gtggtggatg     7080

tatatcttta gttgtgaatt ttatcttctt gagggagctt caatcattaa gtacatcaat     7140

ttcagttcta gttgcagaag ttgttatact gattattcaa ttagttattc taagaaaata     7200

tattgtaaga atttttaata atttaaatcc tttaaaggtg ataatgtcgg tttttttttc     7260

tatatggttt gttaatttaa tttatgccaa ttttattgct ctaggtaata gtttcttaga     7320

gtatattatt tctattttta tattttcatt attttatgtg tttttacttt tttttagtaa     7380

agaaagattt gttcatgatg tgttttttta tataaggagt aaatttgatt aatttattaa     7440

ttagtattct agctaaaatt ctttctagga tttctaaact gattttgaat ataaaaaaac     7500

ggaaggaata caaacgagtt ggctctatag ttgattcaaa gaatatagat ttgagtttta     7560

tttgtggtaa ctattgtaga gtagggagag atactgtaat tgagaaaaat gttattatgg     7620

ggagattatc ttacattaat tcagatatgg gaaaaacata tattggtagt aatgtaaaga     7680

ttggtagttt atgctcaatt tcctcaggtg taataattgc tcctgtaaat cattacctaa     7740

attatgtgac aacgcaccca ttactttata attcctatta tagtagcatt ttaaatatta     7800

attctaatct gttatctcaa caagaattag atgcaaatgt atcaacagtg attggtaatg     7860

atgtatggat tggagctaat gtgattataa agagaggagt aactatagga gatggagcgg     7920

ttattggtgc aggtagtatt ataacaaaag atattccttc ttatgcagta gtagcaggag     7980

ttccagctaa aattattaaa tatcgttttt caaaagatgt aatagaaagc ctgaaagata     8040

gtaagaatgt ttgggaatta tctacctcag aattagaaga gaatttttct catttatatg     8100

atgttgagaa atatcttaat agatttaagt tgtaggatta atttttagtc taggatttta     8160

gtatgagtaa gaaaaatata gttgcacaaa ctttattact ttgcttagat ttattactaa     8220

ttagtatggc aatcttttta gctgtattta ttagaaataa tattttaccg aatattatgt     8280

tatttgagcc tgtatcatat atagagtatc tagtataccc atttccttat gtaatcattg     8340

ttacattgtt tatgtggttt gggctatata caagaagata tgatttatgg caggagtcat     8400

tatttattat aaaagtatgt tttatttctt ttattattat ctttgcaaca ttagcattgg     8460

gtaagaatat agaatattat tctagagctg ttttattatt atctcttttc ttatcagtga     8520

tatttttacc aataggtcgt tattttttga aaaaaagctt gtttagactg ggtctttggg     8580

aaaggaaagt aaagtttatt ggcaatttaa ataagaatga aattgggatt tttaattctc     8640

ctcatgtagg atatgtgtta tctaaagatg atacatatga tgttatattt atatctagtg     8700

gtgataagag tgtatcagaa ttaaatgatt taattgaaag taataaatta ttgaatcgtg     8760

aggttctatt tatccctgtg ttaaatcaat atgattttac tcaatctgtt ttgtacaata     8820

attttagtac aaggctaaat ctatttacgt tagaaaataa attacttgga aagcaaaata     8880

aaattttgaa gtatttacta gattatgtac tagtattatc tactttacct ttttgggggg     8940

ggctgatttt acttattagt ataaaattaa aattagaaga tcctaaaggg aaaatatttt     9000

tcttacaaaa gagattaggt caagagggta agatattcta ttgttataaa tttagaacaa     9060

tggtttcaga ccagagcttt atgcaacaat ggcttattga taatccagaa gaaagagatt     9120

attacgctgt gtatcataag tatattaatg atcctagaat tactaaattc ggacattttt     9180

tgcgaagaac atctttagat gagttacccc aattatttaa tgtacttaaa ggggatatga     9240

gtttagttgg aaatagacct tatatggttg aggaacaaca aaaaatgaaa gatgctgcca     9300

gtattatttt gatgtcaaaa ccaggagtaa caggtttatg gcaagtaagt gggcggagtg     9360

acgtttcatt tgaagaacgt ttacaaattg attcttggta tattaaaaat tggtctattt     9420

ggaatgatat tgttatttta ttcaaaacag ttggtgttgt attaagaaaa gatggagcat     9480

cttagtaata atgtaattac attaaattat tatagatagg gattattatg aaaaaaattt     9540

tagtcaccgg tggtgcaggt tttattggct ctgcggttgt acgtcatatt ataaatgata     9600

cacaagatag tgttgtaaat gttgataaac ttacctatgc gggtaattta gaatcgttat     9660

taatggtaga aaatagccct cgttacgtat ttgagcaagt agatatttgt aatcgtgcgg     9720

aacttgatcg cgtatttgcc caacatcagc ctgatgcagt tatgcactta gccgcagaaa     9780

gccatgttga ccgttcaatc gatgggccgg ctgcttttat cgaaacaaat attgtcggta     9840

cttacacttt gctcgaagct gctcgctatt attggaatag tttagatgct gataaaaaat     9900

cattattccg ttttcatcat atttctacgg atgaggtata tggtgatttg gaaggtacag     9960

aagatttgtt tacggaaacg acgccgtatt ctccgtctag cccatattcg gcttctaaag    10020

cgtcaagtga tcatttagtc cgtgcttggc ttcgtactta tggattacct acgattgtga    10080

ccaattgttc gaataactat ggtccgttcc attttcctga aaaattaatt cctttaatga    10140

ttttaaatgc tttagagggt aaaccattac cagtttatgg taatgggcaa caaatccgtg    10200

actggttatt tgtagaagat catgctagag cattatacaa agtggtaacg gaaggtaagg    10260

tgggagaaac ttataatata ggtggacata atgaaaaagc taatattgat gttgttcgta    10320

ctatttgtag tttattagaa gagcttgtac caaataaacc ggcgggtgtg cataaatatg    10380

aggatttaat tacctacgtt acagatcgtc cagggcatga tgttcgttat gcaattgatg    10440

caacaaaaat tggacgagaa ttaggttgga agccacaaga aacatttgaa acaggtattc    10500

gtaaaacagt cgaatggtat ttaaataata cagagtggtg gagtcgtgta ttagacggtt    10560

cttacaatcg tgagcgttta ggttcaaatt aatattatta caagcgatcc aatttttaat    10620

aaggtttaca atatgaaagg tattattctt gcaggtggct caggtactcg tctttacccg    10680

attactcgtg gcgtgtcaaa acagctctta ccggtatacg ataaaccaat gatttattat    10740

cctttatcag tacttatgct tgcaggtatc cgagaagtct taattattac aacaccggag    10800

gataatgaga gctttaaacg tttattaggc gacggttctg atttcggtat ccaactttcc    10860

tatgctattc aacctagtcc agatggctta gctcaagcat ttttaattgg tgaagagttt    10920

atcggtcagg acagtgtatg tttggttcta ggtgataata tcttctacgg tcagcatttt    10980

actcaatctt tacaagaggc tgtaaaatcg gtagaaacga aaggtgcgac tgtatttggt    11040

tatcaagtga aagatccgga acgttttggt gtggtagagt ttgatgacaa tttccgtgca    11100

ttgtctattg aggaaaaacc gattcaaccc aaatctaatt gggcggtaac cgggttatat    11160

ttctatgata accgagtagt agaatttgca aaacaagtaa aaccctctgc acgtggcgaa    11220

ttagagatta ccactcttaa tgagatgtat cttaatgatg gttcacttaa tgtacaatta    11280

ttagggcgag gctttgcttg gttagatacc ggcacacatg atagcttaca tgatgcggca    11340

gcatttgtga aaacagtaca aaatctacag aatttacagg tagcatgctt agaggaaatt    11400

gcctatcgta acggttggtt atcacttgag caacttgaag cattaacaaa accgatggcg    11460

aaaaatgaat acggtcaata tttgttacgt ttaacaaaag gaacaaaata atggcacgtt    11520

tcttaattac gggagcgaag ggacaggttg gatattgtct tactaagcaa ttacagagca    11580

aagcagatgt cttagcagta gatcgtgatg agcttgatat aacaaatcgt gatgctgtat    11640

ttaaagttgt cagagagttt catcctgatg ttattattaa tgctgccgca catactgctg    11700

tagatcgggc tgagagtgaa atcgaactat cggaagcgat taacgtgaaa ggcccacaat    11760

atcttgcaga agcagccaat gagattgatg caatcatttt acatatttca acggattatg    11820

tctttgaagg gacaggttct ggagaatata aagaaaatga tgaacctaat ccacaaggcg    11880

tatacggcaa aacaaaactt gccggagaga tagcagttca acaggcaaat aaaaggcata    11940

tcattttgcg tactgcttgg gtatttggtg aacatggtaa taactttgtt aaaacgatgc    12000

tccgtttagc aaaagaaaga gaatctttgg gaattgtgag tgatcaattt ggcggaccta    12060

cctatgcagg ggatattgcg agtagcctga ttcatatagc aaatatcatt cttaatagta    12120

agatagatgt atttggtgtt taccatttta ctggcaagcc ttatgtaagt tgggccgatt    12180

ttgctaagaa aatttttgat gaagctgttt cgcaaaaggt attagaaaaa gcaccgcttg    12240

ttaattttat tgctacaagt aattatccaa catcagcaaa acgaccggca aattctcgct    12300

tagatttaac taaaattgat gaggtttttg gtattaaacc gagtaattgg caacaagcat    12360

taaaaaatat taaggcatat gcgtaatgaa gattattgaa acaaatattc cggatgtaaa    12420

gcttttagaa cctcaagtat ttggtgatga acgcggtttt tttatggaaa tttttcgaga    12480

tgaatggttc agacaatatg tcgctgatcg tactttcgtt caagaaaatc attcaaaatc    12540

tattaaggga gttttgagag gcttacatta tcaaactgaa aatacacaag gcaagttagt    12600

gcgtgtagtg caggggtctg tgtttgatgt agcggtagat ttacgtaaaa gttctccgac    12660

ttttggacaa tgggttgggg aagtattatc cgctgaaaat aaacgtcaac tttgggtccc    12720

tgaaggattt gctcacggtt tttatgtatt gacagaaacc gctgaattta cctataaatg    12780

cacagattac tataatccaa aagcggaaca ttcattgatt tggaatgatc cgacagtagc    12840

gattaattgg aatcttggtg gcgcgcctag tttatcagca aaggatttag ctggtaaggt    12900

gttaaatgaa gctgttttat ttgaatag                                       12928


<210>  2
<211>  370
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene wzzB

<400>  2

Met Leu Glu Gln Ala Ser Asn Gln Thr Asn Glu Glu Ile Asp Leu Ile 
1               5                   10                  15      


Glu Leu Ile Arg Val Leu Trp Lys Lys Lys Leu Leu Ile Ala Ile Val 
            20                  25                  30          


Thr Phe Ile Phe Thr Ala Leu Ala Ala Val Tyr Ala Phe Thr Ala Lys 
        35                  40                  45              


Glu Lys Trp Thr Ser Gln Ala Glu Val Ile Ala Pro Arg Val Thr Asp 
    50                  55                  60                  


Ile Ser Glu Tyr Leu Ser Leu Arg Lys Glu Tyr Asn Leu Ile Met Gly 
65                  70                  75                  80  


Ser Glu Phe Lys Glu Asn Asp Ile Arg Asn Glu Leu Asn Glu Leu Phe 
                85                  90                  95      


Ser Arg Tyr Val Leu Ser Tyr Asp Glu Ala Ile Ser Phe Phe Lys Thr 
            100                 105                 110         


Thr Asp Thr Tyr Lys Lys Leu Ala Glu Lys Glu Asn Glu Ala Gly Leu 
        115                 120                 125             


Gln Arg Ala Ala Ser Glu Phe Thr Thr Glu Ser Leu Lys Val Ile Lys 
    130                 135                 140                 


Pro Asp Ala Lys Lys Asp Leu Asn Ala Leu Gly Ser Lys Ile Ala Ile 
145                 150                 155                 160 


Ser Ser Glu Thr Ala Leu Ser Ala Gln Thr Glu Leu Asn Asp Phe Ile 
                165                 170                 175     


Arg His Ile Ser Asp Ile Ser Phe Asn Phe Ser Lys Asn Glu Phe Ile 
            180                 185                 190         


Tyr Trp Val Lys Glu Ser Ile Ser Ser Leu Asn Tyr Glu Lys Glu Val 
        195                 200                 205             


Ile Glu Gln Asp Gln Ser Ile Gln Arg Lys Val Gln Ile Gln Asn Leu 
    210                 215                 220                 


Glu Ala Ala Leu Asp Met Ala Lys Lys Ala Gly Ile Lys Glu Tyr Ser 
225                 230                 235                 240 


Ser Ala Leu Ser Ser Asn Ser Ser Val Ala Asn Leu Ala Val Ser Asp 
                245                 250                 255     


Thr Lys Ile Pro Leu Ser Asp Ser Lys Leu Ala Asp Gly Thr Tyr Leu 
            260                 265                 270         


Phe Met Leu Gly Glu Lys Asn Leu Gln Ala Gln Leu Asp Ile Ala Lys 
        275                 280                 285             


Thr Lys Glu Ile Val Tyr Ser Pro Arg Tyr Tyr Gln Ile Gln Glu Gln 
    290                 295                 300                 


Leu Leu Lys Leu Asn Thr Leu Leu Pro Lys Val Glu Lys Val Thr Gly 
305                 310                 315                 320 


Gln Thr Tyr Ser Tyr Val Ser Ser Pro Thr Tyr Pro Val Ile Lys Asp 
                325                 330                 335     


Ala Pro Lys Lys Gly Ile Ile Leu Val Ile Gly Phe Leu Val Gly Leu 
            340                 345                 350         


Leu Leu Ser Ser Phe Thr Val Leu Ile Ser Val Leu Val Arg Asn Lys 
        355                 360                 365             


Lys Ser 
    370 


<210>  3
<211>  12928
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence of codon-optimized rfb cluster

<400>  3
atgctggaac aagctagcaa tcaaaccaac gaagaaattg atctgattga gctgattcgc       60

gtgctgtgga agaagaagct gctgattgcc atcgttacct tcatctttac agctttagcc      120

gcagtttatg cctttacagc aaaagagaaa tggaccagtc aagctgaagt gattgccccg      180

cgcgtgaccg acattagtga atatttatct ttacgtaaag aatacaatct gattatgggc      240

agtgaattta aagaaaatga tattcgcaat gaattaaacg aattattcag ccgctacgtg      300

ctgagctatg atgaagccat ctcctttttt aaaaccaccg atacctacaa gaagctggcc      360

gagaaagaaa acgaggctgg tctgcagcgc gcagccagtg aattcaccac cgaatcttta      420

aaggtgatta agccggacgc caagaaagat ttaaatgctt taggcagcaa gatcgccatt      480

agcagcgaaa ccgctttaag tgcccagacc gagctgaacg actttattcg ccacatcagc      540

gatattagct tcaattttag caaaaatgaa tttatttatt gggtgaagga gagcatcagc      600

tctttaaact atgaaaaaga agtgatcgaa caagatcaga gcatccagcg caaagttcag      660

atccagaatc tggaggccgc actggacatg gccaaaaagg ccggcatcaa agagtacagc      720

agcgcactga gcagcaacag cagcgtggca aatttagcag tgagcgacac caagattccg      780

ctgagcgaca gtaaactggc agatggcacc tatttattca tgctgggcga gaagaattta      840

caagcccaac tggatatcgc caagaccaaa gagatcgtgt acagcccgcg ctattaccag      900

atccaagaac agctgctgaa gctgaatact ttactgccga aagttgagaa ggtgaccggc      960

cagacatata gttacgtgag cagcccgacc tatccggtga tcaaagacgc cccgaagaaa     1020

ggcatcattc tggttatcgg ctttttagtg ggtttactgc tgagcagctt taccgtgctg     1080

atcagcgtgc tggtgcgcaa caaaaagagt taattatgag gtatgaagag gaatattaac     1140

ctctttatgt gccttttgat gatccaattt actgaagtgc ttagactatt taggtaataa     1200

aatataggga taataatgaa acataagatt ttacatttta gtcaagttct gggcggtgtg     1260

ggccgctatc tggaactgta tgacaaatac atcaacaaag atagctttga aaacatttat     1320

attttaccga ttggtgattg ggaagccgca gaagcccaag ataaacgtta cattctgaat     1380

attgaacaga gctttagccc gattaaactg attagtaatg ttattaagat tcgtaacatt     1440

ctgaaaaaag aaaaaccgga tatcttttat ttacacagca ccttcgccgg tgttattggc     1500

cgtttagcag ccattggcat gcgctgcaaa gtgatctaca acccgcatgg ctggagcttc     1560

aagatgaatg tgagccgttt aaagcagacc ttctataaga tcattgaggg cggtttagtg     1620

tttttaacag ataaattcgt gctgatcagc aagagcgaat acgaagcagc ccgcagcatc     1680

ggcgttagcg agaaaaaatg ctgtttaatt tacaatggca ttgaaaccat taaaaagacc     1740

gatattgcaa ttattccgaa gctggatgat aaatatatca ttggcatgat tggccgcatc     1800

agcgagcaga aaaacccgat gtttttcgcc cagttcgcca aggagatcat taagcagtac     1860

ccgaacacct actttatttt agttggcgat ggtgagcagc gcgagagtct ggaggattat     1920

ttagaacgca ataacttaaa cgacgtgttt tatatcaccg gctgggtgac caacccggag     1980

agctatctga atctgttcga ccaagctgtt ctgttcagta aatgggaagg tttatgttta     2040

agcgtgtgcg agtatatgct gtacgagaag ccgattttag tgagcaacat tggtggcatc     2100

aacgatttaa tccagaacga ggttaacggc ttcaccatcg ttgagggcga tttaaaggat     2160

gccgtgaaca agagtaaccg tttacgcaat gaaccgaaaa ccgtggccaa gttcatcgaa     2220

gccagcaaca ttttaattca agaaaagttc aacgcacaga aaatggtgaa tagcttagaa     2280

aaactgttca tcaaactgag cgaaaataaa taatgaagga aaaatttagt tgcatcgtgg     2340

tgtgttacaa cccggacaat agcgtgctgg ataatttaaa aaactatatt agttatgtgg     2400

gtaaagtgat tgtggtggat aacagtgatg tggacaacag ccagctgttc agctctttaa     2460

gcgagtatct gatctacatc ccgctgtaca aaaacgtggg catcgcctat gctttaaaca     2520

tcggtgtgga gaagagcaag gaactgggtt atgagtatat tattaccatg gaccaagata     2580

gcagcttcgc cacaaatctg gtggatgtgt acagccatta catcagcaac tacccgatcg     2640

atcagattgg cgctttaagc ccggtgtata ttaccgaccg cggtttcaac cgtaccagca     2700

aagaagaatt taaacagatt aagatcacca tgcagagcgg cagcatgttc ttcaccgaca     2760

aattcgatgt gatcggccgc tttgacaacg atttattttt agacgtggtg gactgggaat     2820

actttttccg catttatact ttaggttata aaacaattca gtgcaataaa gccatgctga     2880

aacacgcccc ggccgaaact ttaactttat ttaaaattaa aggtaaaacc attggtgtgg     2940

gcgtggcaag cccgctgcgc tattactatc agattcgcaa tctgctgtgg tgcgtgctgc     3000

acaagaaaag cttcttcatg attaagacca ttgcctataa gtttatcaag attctgtttc     3060

tgtttaataa taaaaaacag tatttaagct tcgcatacat ggccatcaag gacgccttca     3120

ataaccgttt aggcgcctat gatacactgt atctggagaa aagccgtaat gaaaaatgat     3180

ctgccgctga tcagcatcat catcccgatc tataacgtga aaccgtattt agaaaagtgc     3240

gtgaacagcg tgctgagcca gagctatccg aatctggaga ttattctggt ggatgacggt     3300

gccaccgatg gcagcgcaca agtttgcgat gattttagcg aaaaatatgc aaatattcaa     3360

gttattcata agaaaaatgg tggtttaagc agcgcacgta atgccggtat tgaggccatg     3420

aaaggcgagt acgtgttctt tctggatagc gacgactgga tcgcaaatga cgccatcagc     3480

cagctgtacg atgatatggt ggagtacaac gccgacatca ccggcatcag cttttaccaa     3540

gcttatagcg acggtaattt agtgctgaac acccatctga tcgagaagca gatgctgagc     3600

aagaaagagg cactgcgtac ctttttattc aataattatt taaccccgtg tagctgcggc     3660

aagctgtata aagcctcttt atggaaggac atccgctttc cggaaggtcg tttatttgaa     3720

gatcagctga ccacctataa agttatcgaa ctggccaaca ccatcatctt caatccggcc     3780

gccaaatact tttatttcaa acgtatcggc agcatcggcc acagcgcctt cagcgagaaa     3840

acctatgatt tatatgaggc agtgaatgaa cagtacaacg agatcaccaa acaccacccg     3900

gatatcgaga gtgatctggc cgtggccaaa attacttggg aaattgtgtt tattaatatg     3960

atgctgaaca gtaactacag cgatcaagct atcgtggaca aaacccgcgt gtttgcacgc     4020

aaacgtattt tagatgtggt gaagtgcgag ttcatcccga atttacgcaa gtttcagatc     4080

actttatttg cctacaattt cagtctgtat aaagttctgt atgcccgcta taagaagaaa     4140

aatccgctga gttaattatt gattttatcg gttgttacaa tgattccgaa gaaaattcat     4200

tattgctggt tcggcggcaa tccgctgccg aaaagtgtga agaagtgcat taaaagttgg     4260

aaaaaatatt gtccggatta tgaaattatt gaatggaatg agagcaatta caatgtgcat     4320

aaaaatttat ttattaaaga ggcctacgag aagaagaagt tcgccttcgt gagcgattac     4380

gcccgtttag atgtggtgca cagtgaaggt ggcatctatc tggacaccga tgtggagctg     4440

atcaaaccga tcgatgattt actggcccat agctgctttc tggccagcga aagcatcgat     4500

gacgtgaata ccggtttagg ctttggtgcc gaaaaaggcc actggttcat cgccgagaac     4560

atgagcgtgt atgaaaatat gtactttaat atggaaaata ttatcacttg tgtggagatc     4620

accaccaaac tgctgatcga acgcggcttt agcgccagcg ataaaattca gaatattgat     4680

gatattttta tttatccgac cgaatatttt tgcccgctga actacaaaac ccacgaactg     4740

cacatcaccc agaacaccta cagcatccac cactatgatg ccacttggca gagcccgctg     4800

atgaaattca aaaccaagat caagtacatt ctgtgtttag ccggcattat taaatgaatt     4860

ctttagtgta tcgcattgac atccgcactt taatttttag catcttttat tttacctttc     4920

tggtgagcga ttttttactg ctggcccaag atggcacaat caccaaggac atcatcaagt     4980

gggtgaagct gttttcttta ctgccgctgc tgctgctgat cttcaagctg ccgctgaatt     5040

tactgattct gggctttttt accattatga ttagtgcctt ctacagcatc tataccggtg     5100

acagcttttt actgtacatc tgtttactga tgagctttag ctacaaagtt aattttaatt     5160

ttttatttaa gattggttta tatctgacca gtattttagt ggtgctgatt ctgacatatt     5220

ttttctttga gtactttctg atcggcgaca gccactttgt gtacgacgcc acctactggt     5280

tcaaacgcta cacctttaac tttgataacc cgaacgcatt tccgatgcgc atctttgtct     5340

tttttatttt ttacattctg catgtgggca aactgcgttt attcgacacc tttctgttcg     5400

ttattctgtt tggtatcgtt ttttacttta gcaacagccg tacagccttt tacattttta     5460

ttctgtgtgt tctgaccatt cattttaatc aagtgtttaa tgttctgaat aatacctttg     5520

ttaaactgct gattaacaat agcattatct ttatcaccat ttttagcatt tggagcgcaa     5580

tctattatca agattactat tcttatctgg aaccgattaa taaaatttta agcaaacgta     5640

tttattttgc aaacgaagcc tataagtctt taggcttcga gttctacccg cgcaatatca     5700

agtggtggat cgaggagagc gactggcata ttatcgacaa tggctatgtt tatttattta     5760

tcagcggcgg tttactggtg ggcaacttat ttatcttttc tattacttgg ctgatgtatc     5820

gtctgaataa atttaattta agcaacgagg ccattttact gatgtttagc atgctgtatt     5880

tactgagcga gagccatttt atcaatattt tttataacat cccgatttta ctgctggcaa     5940

tttttattaa taaaaccaat attgtgcgtt atttagagtg taaaaaatga ataagaatct     6000

ggtgaataat agcattatga gctttttact gaccatcagc aacttcatct tcccgctgat     6060

caccttcacc tacgccgcac gtattctgca gccggacaac atgggtaagt tcgcctttag     6120

cttaagcgtg gttgattatt tatctttatt tgccaccttt ggtgtggtgg gttatggcgt     6180

gcgtgcttgt gccgaagttc gcaataataa ggaagaactg accaaaacag tgcaagaaat     6240

tctgtttatt aatatctttt tagcaattat tgcatatctg gttatttttc tgctgatcag     6300

ttatcagcat gcctttcgcg aggacacact gctgtttctg atcatgagta gctgcatcat     6360

tttcaacgtt atcggcattg agtggctgta taaatcttta gatgagtacc gctacatcac     6420

cgtgcgcagc attttactga agattattag cttaatcatg attctgtgct ttgtgaagga     6480

gaaagacgat tatccgctgt tcgctttatt ctttgtgctg ccgatctgtt taagcagctt     6540

actgaacatt attaacagcc gcaaaatttt actgtttaaa ctgtttaagc tggacttaag     6600

caaacatatt aaaccgatgt ttgtgctgtt tctggttact ttaagttaca ctttatacgc     6660

caacgtgaat gatgtgctgc tggccacagt gaccaacacc gagcaagttg gctactatag     6720

tgtggcattt aaaatcaagg ccgctttact ggcatttatc accagcacca gcatggtgtt     6780

tctgccgcgc ttaaccgagt atatcaaaaa caatcaagat atcgaattta ttgatctgtt     6840

acgtaaaagc tttgatctgg tgttcttttt agccgtgcct attactttat tcttttttct     6900

gtatgccaaa gagaccatct ttttactgtt tggtgaaaaa tataacaaga gctctttact     6960

gctgcagacc atgatctgga gcgttttctt cggcggttta aataacattt taagcgttca     7020

gatgctgctg ccgctgaaaa aggacaatca gtttttaatc agtattttaa gcggcggctg     7080

catttcttta gtggtgaatt tcatcttttt acgcgaatta cagagtctga gtaccagtat     7140

cagcgttctg gtggccgaag tggtgatttt aatcatccag ctggtgattt tacgcaagta     7200

catcgttcgt atcttcaata atttaaatcc gctgaaagtt attatgagcg tcttttttag     7260

catttggttt gtgaatttaa tctatgccaa ctttatcgct ttaggcaaca gctttctgga     7320

gtatattatt agtattttta tcttctcttt attctatgtg tttctgctgt ttttcagcaa     7380

agaacgcttt gtgcatgatg tgttctttta tattcgcagc aaatttgatt aatctgctga     7440

tcagcattct ggccaaaatt ttaagccgca ttagcaaact gattttaaat atcaagaagc     7500

gtaaagagta caagcgcgtg ggcagtattg tggatagcaa aaatattgat ttaagcttta     7560

tctgcggcaa ttattgccgc gtgggccgtg ataccgtgat tgagaagaac gtgatcatgg     7620

gccgtttaag ctacatcaac agcgacatgg gcaagaccta tatcggcagc aatgtgaaga     7680

tcggctcttt atgtagcatc agcagcggcg tgatcattgc cccggtgaac cactatttaa     7740

actatgtgac cacccacccg ctgctgtata acagctacta cagcagcatt ctgaacatca     7800

acagcaattt actgagccag caagaactgg acgcaaacgt gagcaccgtg attggcaatg     7860

atgtgtggat cggtgccaac gtgatcatca aacgcggcgt gaccattggc gatggtgcag     7920

tgatcggcgc tggtagcatt atcaccaaag acatcccgag ttacgcagtg gttgccggcg     7980

tgccggccaa aatcatcaaa tatcgcttta gcaaagatgt gatcgagtct ttaaaggaca     8040

gcaagaatgt gtgggaactg agcaccagcg aactggagga aaactttagc catttatacg     8100

acgtggaaaa atatctgaac cgttttaaac tgtaagatta atttttagtc taggatttta     8160

gtatgagtaa aaagaacatc gtggcacaga ctttactgct gtgtctggat ttactgctga     8220

tcagcatggc catctttctg gcagtgttta ttcgtaataa tattttaccg aacatcatgc     8280

tgttcgagcc ggtgagctat atcgagtatt tagtttatcc tttcccgtat gttattattg     8340

tgactttatt catgtggttt ggtttataca cacgtcgcta cgatctgtgg caagaatctt     8400

tatttatcat caaagtgtgc tttattagtt ttattatcat ttttgcaact ttagccttag     8460

gtaaaaatat tgaatactac agccgtgccg tgctgctgct gtctttattt ttaagcgtga     8520

tttttctgcc gatcggccgt tattttctga agaaatctct gtttcgttta ggtttatggg     8580

aacgcaaagt gaaattcatt ggcaatctga ataaaaacga aattggcatc ttcaacagcc     8640

cgcacgtggg ttacgtgctg agcaaggacg acacctacga cgtgatcttc atcagcagcg     8700

gcgataagag cgttagcgaa ctgaacgatt taatcgagag caataaactg ctgaaccgcg     8760

aggttctgtt catcccggtt ctgaaccagt atgacttcac ccagagtgtt ctgtacaata     8820

atttcagcac ccgtttaaat ttatttacac tggagaacaa attactgggc aaacagaata     8880

aaattttaaa gtatttactg gactacgttc tggtgctgag tacactgccg ttctggggcg     8940

gtctgatttt actgattagc attaagctga aactggaaga tccgaaaggc aaaattttct     9000

ttttacagaa gcgtctgggc caagaaggca aaatcttcta ttgttataag tttcgtacca     9060

tggtgagcga ccaaagcttc atgcagcagt ggctgatcga taacccggag gaacgcgact     9120

actatgccgt gtaccataag tatattaatg acccgcgcat cacaaaattc ggccattttc     9180

tgcgccgtac cagtctggat gaactgccgc agctgtttaa tgtgctgaag ggcgatatgt     9240

ctttagttgg caatcgcccg tatatggttg aagagcagca gaagatgaag gacgccgcca     9300

gcatcattct gatgagtaaa ccgggcgtta ccggtttatg gcaagttagt ggtcgtagcg     9360

atgtgagctt tgaagagcgt ttacagattg acagctggta tatcaaaaat tggagcattt     9420

ggaacgatat tgttattctg ttcaagaccg tgggcgtggt gctgcgtaaa gatggcgcca     9480

gttaataata atgtaattac attaaattat tatagatagg gattattatg aagaaaattc     9540

tggttactgg tggcgctggt tttattggca gtgccgtggt tcgccatatc atcaacgata     9600

cccaagatag cgtggtgaac gtggataaac tgacatacgc cggcaatctg gagtctttac     9660

tgatggtgga aaatagcccg cgctacgtgt tcgaacaagt tgacatttgc aatcgcgccg     9720

aactggaccg tgtttttgcc cagcatcagc cggatgccgt tatgcatctg gccgcagaaa     9780

gccacgttga tcgcagcatc gatggcccgg ccgccttcat cgagaccaat atcgtgggca     9840

catatacttt actggaggcc gcccgctatt attggaatag tctggacgcc gacaaaaagt     9900

ctttatttcg cttccaccac attagcaccg atgaggtgta tggcgattta gaaggcaccg     9960

aggatttatt taccgaaacc accccgtata gcccgagcag cccgtacagc gcaagcaaag    10020

caagcagcga tcatctggtg cgcgcttggc tgcgcacata tggtttaccg accatcgtga    10080

ccaactgcag caacaactac ggcccgttcc attttccgga gaaactgatc ccgctgatga    10140

ttttaaatgc tttagaaggt aaaccgctgc cggtgtatgg taacggccag cagattcgtg    10200

attggctgtt cgtggaggat cacgcccgtg ctttatataa ggttgtgacc gaaggcaaag    10260

tgggcgagac ctacaatatt ggtggccaca acgagaaggc caacatcgac gttgtgcgca    10320

caatttgctc tttactggag gaactggttc cgaataaacc ggccggcgtg cataagtatg    10380

aagatttaat cacatatgtg accgaccgcc ccggtcacga tgttcgttac gccattgatg    10440

ccaccaagat cggtcgcgaa ctgggttgga aacctcaaga aaccttcgaa accggcatcc    10500

gtaaaaccgt ggaatggtat ttaaacaata ccgagtggtg gagccgtgtg ctggatggta    10560

gctacaatcg cgaacgttta ggcagcaact aatattatta caagcgatcc aatttttaat    10620

aaggtttaca atatgaaagg cattattctg gccggcggta gcggtacccg tttatatccg    10680

attacacgcg gtgtgagcaa acagctgctg ccggtgtatg ataagccgat gatctattat    10740

ccgttaagcg tgctgatgct ggccggcatc cgtgaggtgc tgattattac caccccggag    10800

gacaacgaga gctttaaacg tctgctgggc gatggcagcg atttcggcat tcagctgagt    10860

tacgccattc aaccgagccc ggatggtctg gcacaagctt ttctgatcgg tgaagagttc    10920

atcggccaag atagcgtgtg tttagtgctg ggcgacaaca ttttttacgg tcagcatttc    10980

acccagagtc tgcaagaggc cgttaagagc gttgagacca aaggtgccac cgtgtttggc    11040

taccaagtta aagatccgga acgctttggc gtggtggagt ttgacgataa cttccgcgct    11100

ttaagtatcg aggagaaacc gatccagcct aaaagcaact gggccgtgac cggtctgtac    11160

ttctacgaca accgtgtggt ggaattcgcc aaacaagtta agccgagtgc acgcggcgag    11220

ttagagatta ccactttaaa cgaaatgtat ttaaacgatg gcagtctgaa cgtgcagctg    11280

ctgggccgcg gttttgcatg gctggatacc ggtacccacg atagtctgca cgacgccgca    11340

gcctttgtga aaaccgttca gaatttacag aatctgcaag ttgcttgttt agaagaaatc    11400

gcctatcgta acggctggct gagcttagag cagctggagg ctttaaccaa accgatggca    11460

aagaacgagt atggccagta tctgctgcgt ttaaccaaag gcaccaaata atggcacgtt    11520

ttttaatcac cggcgcaaaa ggccaagttg gttattgttt aaccaagcag ctgcagagca    11580

aagccgatgt tctggccgtg gatcgcgatg aactggacat cacaaaccgc gatgccgtgt    11640

ttaaagtggt gcgcgaattc cacccggacg tgattatcaa tgccgccgcc cataccgcag    11700

tggatcgtgc agaaagcgag atcgaactga gcgaagccat caacgttaag ggtccgcagt    11760

atctggccga ggcagcaaac gagatcgacg ccatcatttt acacattagc acagactacg    11820

tgttcgaggg caccggcagc ggcgaatata aagagaatga tgaaccgaac ccgcaaggtg    11880

tgtacggcaa aaccaaactg gccggcgaaa tcgcagttca gcaagctaac aagcgccata    11940

tcattctgcg caccgcttgg gttttcggcg aacacggcaa caacttcgtg aaaacaatgc    12000

tgcgtttagc caaagaacgc gagagcttag gcattgtgag cgatcagttc ggtggtccga    12060

cctatgccgg tgacatcgcc agctctttaa ttcatattgc caacatcatc ttaaacagta    12120

aaattgatgt gttcggcgtg taccatttca ccggtaagcc gtatgtgagc tgggccgatt    12180

tcgccaaaaa gatcttcgac gaggccgtta gccagaaggt tctggaaaaa gccccgctgg    12240

tgaatttcat cgccaccagc aactatccga ccagcgccaa acgcccggca aacagccgtt    12300

tagatttaac caaaatcgac gaggtgtttg gcatcaagcc gagcaattgg cagcaagctt    12360

taaagaatat caaagcctat gcctaatgaa aattatcgaa accaacatcc cggatgtgaa    12420

actgctggaa ccgcaagttt ttggcgacga gcgcggcttt ttcatggaga tcttccgcga    12480

cgagtggttt cgccagtacg tggcagatcg cacctttgtt caagaaaacc acagcaagag    12540

catcaagggt gtgctgcgcg gtctgcacta tcagaccgaa aacacccaag gtaaactggt    12600

gcgtgtggtg caaggtagcg tgtttgacgt ggccgtggat ctgcgcaaaa gcagcccgac    12660

ctttggtcag tgggtgggtg aagtgctgag cgccgaaaat aaacgtcagc tgtgggtgcc    12720

ggaaggcttc gcccatggtt tctatgtgct gaccgagacc gcagagttta cctacaagtg    12780

caccgactac tataacccga aagccgagca ttctttaatc tggaacgatc cgaccgtggc    12840

cattaactgg aatctgggtg gtgccccgtc tttaagtgcc aaagatctgg ccggcaaagt    12900

gctgaacgaa gcagtgctgt ttgaataa                                       12928


<210>  4
<211>  13598
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence codon optimized rfb APP8

<400>  4
atggcgacgc cagagcaagt gagcaaccag accaacgagg agattgacct catcgagctg       60

gttcgcgtgc tgtggaagaa aaagctcctc atcgccatcg tgacgtgcat cttcaccgcc      120

ctcgcggccg tttatgcctt caccgccaaa gagaagtgga cgagccagac cgaagttatc      180

gcgccacgtg tgaccgacat tagcgaatat ctgagtctgc gcaaggagta caacctcatc      240

atcggcagcg agttcaaaga aaacgaaatc cgcaacgagc tcagcgagct cttcagtcgc      300

tatgtgctga gttacgacga agccatcgcc ttcttcaaga cgacggacac gtacaagaag      360

ctggccgaaa ccgagaatga agtgggcctc cagcgcgcgg ttgccgaatt cacgacggag      420

agtctgaagg tgatcaagcc agacgccaag aaagacccga atgcgctggg cagtaagatt      480

gcgatcagct tcgataccgc gctgagtgcc cagaccacgc tgaacgactt catctgccac      540

atcagcgaca ccagtttcaa cttcagtaag aatgagttca tttattggat taaggagagc      600

attagcagtc tgaattacga gaaagaggtt atcgaacaag atcagagcat ccaacgcaag      660

gtgcagatcc agaatctgga gacggccctc gatatggcga agaaggcggg catcaaagag      720

tacagcagcg ccctcagcag taatagtagc gtggccaatc tggcggtgag cgataccaaa      780

atcccgctga gcgatagcaa gctggcggac ggtacctatc tgttcatgct gggcgagaag      840

aatctgcaag cgcagctcga catcgcgaaa acgaaggaga tcgtgtacag cccgcgctac      900

taccagatcc aagaacagct gctgaagctg aataccctcc tcccgaaagt ggaaaaggtt      960

accggccaga gcttcagcta tatcagcagc ccagagctcc cgattaagcg cgattggcca     1020

aagcgcttta ttctgctcct cattggcgcg gtgattggtg gcgttctgag cagtctgtgg     1080

gtgatcggca aacaaatctt cggccagaaa taattatgaa caaggacatc aagattctga     1140

tcgcgacgca taagcagcac ttcatgccga gcgacgaaat gtatctgccg ctccacgtgg     1200

gcaagctggg taaagccgat ctgggttacc aaggcgacga cagcggcgac aacatcagca     1260

tcaaaaaccc aaatttttgc gagctgaccg gtctgtactg ggcgtggaaa aatctgccga     1320

acgattatct gggtctgatc cattatcgcc gcttcttcag cgtgaagaac cgcgcggaac     1380

gcaaaaacaa tccgctggag acgctgtatc tgaccaacga agaagccaac cagctgctga     1440

gtcagtacga tgtgatcgtt ccgagcaagc gcaactacta catcgagacg ctgtacagcc     1500

attacgccaa tacgctgcac gccgaacatc tggacgttac ccgcgaaatc atcgcggaaa     1560

agtgcagcga gtacctcgcc agctttgacg cggtgatcaa acagcgcagc ggctacatgt     1620

tcaacatgtt catcatgagc aaagcgctgg tgaacgacta ctgcagctgg ctgttcccga     1680

ttctgtttga actggagaag cgtatcccaa cggaccagta cagcgccttc catgcccgtt     1740

tctacggccg cgtgagcgaa ctgctgttca acgtgtggct gaaacagtac agccagagca     1800

acccactgaa ggtgaaggcc atcccgtttg tgtatggcga gaagatcaac tggctgaaga     1860

agggtaccgc gtttctggtt gcgaaattct tcggcaaaaa atatgagaaa agcttctaag     1920

ttattatagg gacaaataaa tgaagcgcat tctggtttac ggcatgaccg acaacttcgg     1980

tggcatggag gcctacattc ataacatcta tcagcatctg gataaaaccc aaatccagtt     2040

tgacttcgtg tgtgacttcc cgaaaatgac gctgagcgac tactatctgg acaacggttg     2100

caagatccac ttcatcccgc cgaaaaacca aggtctgttt aagagtctgt gggcgatgtg     2160

gaaagtgatc aaggagaaca actatgatgt tatctatttc aacatcatga acgcgggcta     2220

cgtgctcaac atgctgccgg cctttctgct gggtaagaag atcatcgcgc acagccacaa     2280

tgcggacacc gacaaaaaga agctgcacta cggtctgcgt ctgctgctga acatcgtgac     2340

gaagatcaag ctcgcgtgca gtaaggaggc cggtttcttc atgttcggca aggaagaaaa     2400

tttcagcatc attaataatg cgatcaacct cgaccgctat ctgtatagcg aggagaaata     2460

ccgcgacctc cgccacaaac tgggctgggg cgataagaag gttattctgt acgtggcccg     2520

catgaatcac cagaagaatc cgctgttcgc gctgtatatc atgcgcgaac tgaagcagag     2580

catgccgaat gccgttctgg tgtacgtggg tacgggtgag ctgaaggaac aagttcagca     2640

gtacattctg gacaacaacc tcgacaacgt gattctgctg ggtctgcgca acgatgtgaa     2700

cgagctcatg atcgcggccg atctgtttat tctgccgagt ctgtttgagg gtctgccgat     2760

tgttgccgtt gaggcgcaag ccgcgggtct cccaatcatt ctgagcgaaa acatcagcat     2820

cgaggcgaaa ctcgtgaaca gcacctactt cctcccgatt aacgacgttt ttctgtgggt     2880

taataaaatc aaaaagattc tggagatcag tggcaacaag cgctttagtg accagctggc     2940

gctgagcaaa gcgggttaca acatcgagag cgtggtgaag aacatccaga agattctcgt     3000

gaattaaggt ttggtatgaa taacaccaag atcagtctga tcttcgcgtg ctataacgtg     3060

agccagtatc tggacaatct gttccagctg ctgacgaacc agccgtacca aaatattgaa     3120

atcattttcg tggaggactg tgccacggac gatacgaagg cgaaactcca gagcttcaac     3180

gatccgcgcg tgaagctgct gtgcaacgag aaaaatatcg gtgcggccga aagccgtaac     3240

cgtggcatcc agatcgttac cggcgaatac atctggttcc cggatccaga cgatctgttt     3300

gacgaactgc tgctgaccaa ggtgaacacc atcatccaga aaaaccgccc ggatgtgatc     3360

agcatcggca tgcaagaacg ctacgagatc aacggcaaga cggactacac gaaggacatc     3420

atcagccgct acgacggcct cattaccggc gacttcaccg atgttttcgt ggatctggag     3480

gaaagctttc tgtttggcta cacgaataac aagttttaca aggccaacat catccataag     3540

taccgcattc tgaacgagca ccaagcgctg aaggaagatt tcgaattcaa tatcaaggtg     3600

tttaaacaag ttagtaactt ctatctgctg aatgaaccgc tctatttcta catgaagcgc     3660

aacaacggca gtctgaccag caaattcgtg ccggactatt tccgcatcca catgcagacc     3720

ctcgccagct tcaaaagtct gatcgaggtg aaggccacca tcaacgacaa cgtgaaccgt     3780

ctgctggtga accgcttcgt tcgctactgc ctcagcgcga tcgagcgcaa cagcagtctg     3840

aaaagcggca tgagctttct ggagcagaac caatggatta aggaaaatat ctttaatcaa     3900

gaaaaataca acgagtatct gctgctgagc gatctggtga acaagaaaca gaagctgttt     3960

tactttctga tcaagtatcg catcggtttt ctgctcgtga cggccgcgaa catcgtgaaa     4020

ctggtgaagg cgaagttccc gattctgttc gtgaagctga agggttaatt aactggattt     4080

taaaatgaag aagtaccaga tcgtggagct gagtaccgaa cacaaccatg cgggcagcaa     4140

ggccgtgcaa gatgtgtatg agatcgcgct cagcatgggt tacaaggcga atgtggttcg     4200

cacggccacc agtgtggata gtctgctggc caaaattctg cgccaagtta tcttcttcat     4260

cgactggctg aagatctact tcagcatcga gagtaacagc atcgtgctga tccagaaccc     4320

gtactaccac aaacagctca tccgtaactg gattctgaat cgtctgaagc gcattaaaaa     4380

agtgaagttt atcagtctgg ttcacgacgt ggaagagctg cgcaagagtc tgtacaacaa     4440

ctactataaa aacgagttcg agaccatgct gagtctggcg gacagcatca tcgtgcacaa     4500

tgataagatg aaaagctttt tcatcaaaaa gggctacagc gaggacaaac tcatcagtct     4560

gggcatcttc gactatctgc agaagagcgt ggacaaaaag cgcgtgagct tcgaacgtgc     4620

gatcagcgtg gcgggcaacc tcgatatcaa gaagagcagc tatattgcgc agctcggcag     4680

cctcccggcg atcaaagcgc atctgtacgg tccgaacttc gaacatagtc tggaggcgtt     4740

cccgaacatc gaataccacg gtagcttccc ggccacggaa atcccgcaga aactcgtgag     4800

cggttttggt ctggtgtggg acggccagag cattgaaacg tgcaccggcg acttcggcga     4860

gtacctccag tacaataacc cgcacaagct gagcctctat ctgagcagtg gcatgccggt     4920

tgtgatctgg gacaaagccg ccgaggccga tttcgtgaag aaacacaacg tgggtctgtg     4980

cgtgagcagt ctgagcgagc tccaagacaa gctcaacgtg atgaccgagc aagaatttga     5040

agaaatggtg aacaacgtgg aaaaacagac cgcgtgcctc atcagcggcg agtacaccaa     5100

aaaggcgatc agcgaggcgg aacgtgtgat ctaagaatgt tcctctatct gctggtgttc     5160

agtctgctgc tgattctgat cttcaatctg ctcatcgtga atctggacta catgcacccg     5220

agcatcctct ttgttgtgcc atttctggtg tttggcgtga cgagcattct gggcgaggag     5280

gcgtataaga tcatcttcca cgaggagacg ctgctggtga tcgttagcag cgcgctgatc     5340

ttcaccttca tcacgctgct gagccagacc gtgtacaaaa gcaaagagaa tctgaacttc     5400

ccgctgaccg agatcatcat cagtaagaaa gtgacgctgt tttttattgt gttcttcatc     5460

gtgacccagc tggcgttcat caagtatctg gaggccatta gtctggccca cttcggttac     5520

agcggcagtc tgggtgagat gatcagtctg tacgacgtga tgacgaagtt ctggaccgag     5580

atcttcagcg aactcaacgt gccgatcccg ctgctctacc gtatcggcaa tccaatcacg     5640

caaggcttcg gctatctgat tgtgtatatt ttcatccaca actacgttgc caccaagcgc     5700

atcgataagc tgcatctgct gatcattctg ctgctgtgtc tgaacatcat tctcaacggc     5760

agccgcagtc cgatcttccg catcgttacg atgatgctga tcacctttta tgtgctgtat     5820

aacaagcaga acaacgtgcg tcgcggcaac atcaagtttc tgctgaagag tctgctgatc     5880

gtgatcttca gcggcacctt cttcattgcg ctgctgagtc tgatgggccg tgaaaacgat     5940

ctggacatgt tccattacat ttttatctac gttggtgcgc cgctggtgaa cctcgataac     6000

tatctggcgt ttcgtccgga tggtagctac gccaccatct ttggcgagca aacgtttcgc     6060

ggtctgtacg cctatatcgc gaagatcatc agcgatgaga gtctgatctt cccgacgatc     6120

gatcagttca cgttcagcaa caacggtctg gagatcggta acgtgtatac caccttctat     6180

agcttcatct acgatttcga gtacgtgggc ttcatcccgc tgattctgat tatcgcgctg     6240

tactacgtgt tcacgtatca gcgcctcaag acgcgcgcca tcaagaccaa taaagtgcat     6300

ttcagtctgt tcatctatgc ctacctcttc aacgacctca tcatgctggc cttcagtaat     6360

cgcttctaca ccacggtgct ggacatcggc ttcatcaaga ttgttatctt cagctatatc     6420

tgccacctcc tctttgtgca ccgcagcaag atcaaaggca ccgtatgaac gttaaaagtg     6480

tgaaatttaa tttcattatg aatctgattc tgaccgttag caactttctg ttcccgctgg     6540

tgacgttccc atacgttagt cgcattctgc agccagaagg taccggtaaa gtggcctttg     6600

cgattagcgt ggttagctac ttcagcatct tcgcgagtct gggtgtggcc acctatggcg     6660

ttcgtgcgtg tgcgcaagtt cgcgacaata aagatctgct gagtcgtacg gtgcatgagc     6720

tgctgttcat caacatcatc gccacgatca ttgtgtacgt ttgctttctg ctggtggtgg     6780

cgtttacccc acgctttagc gcggaaaaag agctgttctg ggcgacgagc atctttattc     6840

tgttcaccat cattggcatc gagtggctct acaagggtct ggagaagtac cagtacatca     6900

cgatccgcac gatcatcttc aagctcattg cgctggtgct cgtgtttgtg ttcatcaaga     6960

cgaaggatga ctacgtgatc ttcgccgtga tcagtgtgtt tgcgatcgtt ggcagcggca     7020

tcttcaacct ctttaacagt cgcaagctga ttaactacca tctgtacgag gattacgagt     7080

tccgcaagca tttcaagcca atgtttctgc tgtttctcac gacgctcagc atcgccatct     7140

acaccagtgt ggatgaagcg attctgggtc tgctgacgag tccgcaagat gtgggctact     7200

ataacgcggc catgaaggtt aagggcattc tgtttacgct gatcaccagt ctgggcattg     7260

tgctgctgcc gcgtctgagc tattatgttg agaacaatat gacggatgaa ttccatgccg     7320

ccctcaagaa gagcatgaac ttcatcatcg tgatcgccgt tccagtggtg atcttcttca     7380

tgctgttcgc caaggagatt attctgctgc tggccggcga aagttatatc aacgccattc     7440

tgccgctgca gattattgtg tgggcgctgc tgctcagcgc cattaccaac attctgggca     7500

tccagattct gctgccgctc aagaaggata aagagctgct gatcagcgtg ctgctcgcgg     7560

ccattgtgga cattgtggcc aatctgattc tggttccgca actcgccagc gttggtaccg     7620

ccatcagcgt tgtgatggcc gaactcaccg tgctggtggt gcagctggtt atcctccgca     7680

agtacatctg gatcctcttc agcaatctcc agttcgtgcg catcggtctg agcatcgttt     7740

tcagcatcgt gctgagcctc agcatctatc agtggaacat cacgaacagc atcatgctca     7800

cgtttctgat catgggcttc atcttcttca cgacctactt cattctgctg ctgattctga     7860

aggagaactt catgatgtac gtgtaccaga ccatccagca caagattctg aaataaatta     7920

tatagtgtta tcacataacg tatccttgga gaatagaaat gaaatatgat tatctgatcg     7980

tgggcgccgg tctgtttggc agcatctttg cgcgcgaggc caccaagcgt ggcaagaaat     8040

gtctggttat cgagaagcgc gatcacatcg gtggcaactg ctacacgcag aacgtggaag     8100

gcatcaacgt tcacaaatac ggtgcgcaca tcttccacac cagcaacaag gtggtttggg     8160

actacatcca gcagttcgcc gagttcaatc gctttaccaa cagcccggtg gcccgctata     8220

aggacgaact gtacagcctc ccgttcaaca tgctcacctt caacaagatg tggggcgtta     8280

tcacgccgca agaagccgaa gcgaaaatca aggagcagat cgcgaaggag aacatcacgg     8340

atccgaagaa tctcgaggag caagccatca gtctggttgg tcgcgatatc tacgagaagc     8400

tcatcaaggg ctataccgag aagcagtggg gccgtaagtg tacggagctg ccagccttca     8460

tcatcaagcg tctgccagtt cgctacacgt acgacaacaa ctacttctac gacacctatc     8520

aaggcatccc gatcggtggc tacaccggca tctttgaacg catgctcgag ggcatcgagg     8580

tgaaactggg cgttgacttc ttcgcggaac gcgaacatta cgagagtctg gccgagaaga     8640

tcgtgttcac cggtatgatt gacgaatatt ttggttacca gttcggcaaa ctggaatacc     8700

gcagtctgcg cttcgacaac gaagtgctga acatcccgaa ctaccaaggc aatgcggtgg     8760

tgaactatac ggaagccgag gtgccatata cgcgcatcat cgagcataag catttcgagt     8820

acggcaccca gccgaaaacc gtgatcacgc gcgaacacag caaggagtac gaagaaggcg     8880

acgagccgta ttacccgatc aacgacgccc gcaacaacga actgtacgcc aagtacaagg     8940

cgctggccga cgcgacccca aacgttattt tcggtggccg tctggcccag tataagtact     9000

tcgacatgca caatatcatc gccgaggcgc tggagtgcgt taaggtgcac ttttaatata     9060

agggagtaac gctatgaata agatcatcgc gaagatcagt ctgatcctcg tggatatcgt     9120

ggccatcttc gttagcattc tgatcgccgt gagtctgcgt aaaattctgg gtctgctctt     9180

cacgctgccg gagatcgact acagctacat cttcttcgcg tatgtgtatc tgattctgat     9240

tctgatgatg acgtacctcg gcgcgtatac caaacgctac gacttttggc acgaaagccg     9300

tctgatcgtg cgcggcagct ttctcagtct gctgattctg ctgagtgccc tcgcgctggg     9360

ccaaaacgcg gaatactata gccgcagcac gctcgtgctg atctttctct gctgcgccat     9420

cgtgctgccg atcgccaaga ttttcaccaa aaaaattctg ttcaaactgg gtatctggca     9480

gctgccggcg aaggtgatca gcgagaacga ccagttcaaa aacgagctct tcgaagacca     9540

gtatctgggc tatgtgaagg cgaaacacag cgagcacaag attatcttca tcgacggcgc     9600

gaatctgggc aaagatcgtc tgaaccagat catcgaggac aacatcaaga atagccgtga     9660

gatcatcttc accccggttc tgaatggcta cgacttcagc catagctaca tttataacat     9720

cttcaacacg cgcaccaaca ttttcacgct ggagaacgag ctgctgagca aaagcaaccg     9780

catcttcaaa ctgctgatgg actatattct ggtgctgggt agtgccgtgt tctgggtgcc     9840

ggtgctggtg ctcatcgcgt tctggatcaa gaaggaggat ccgaaaggcg aggtgttctt     9900

tctgcagcgt cgcctcggcg tgaatggcaa ggaattcatg tgctacaaat tccgcagcat     9960

gtacagcgac cagagcttca tgcaagaatg gctggagaaa aatccggagg aggccgcgta    10020

ctaccgcatc taccataagt atatgaacga tccgcgcatc accaaaatcg gcgcgttcct    10080

ccgcaaaacc agtctggacg aactgccgca gctgatcaac gtgctgcgtg gtgagatgag    10140

tctcgttggt ccgcgcccgt acatggttat cgagaagaag gacatcggca aaaaagcccc    10200

actggtgctc gcggttaagc cgggcattac gggcatgtgg caagttagcg gccgcagtga    10260

tgtgaacttc gacagccgcg tggagatgga tgtgtggtat atgaaaaatt ggagtctgtg    10320

gaatgacatc gtgattctga tcaaaacggt gcaagccgtg ttcaagcgcg acggtgccta    10380

ttaaagtatg atcaccagca tccagtacct ccgtggcatc gccgcgctgt tcgtggtgct    10440

gttccacatg aagtggatgc tcaacaatgt gtacgtggag aagaacctcg gcgacatctt    10500

cttcatcagc ggcaacttcg gcgtggatct gttcttcgtg atcagcggct tcgtgatctg    10560

tctgagcacg gaacgcgaaa cgctgcaccc ggtgaaggag tttttcatcc gccgcttctt    10620

ccgcatctac ccactgctgc tgctgagcgt ttgcaccatc tacattctgg gcgacttcaa    10680

gatccacgag ctgatcctca gcatgatccc aatccatctg gactacagca gcccgagccc    10740

ggtgttcggc tacaacattc tggttagcgc gtggaccatc acctacgaga ttagcttcta    10800

catcatcctc gtgctgagtc tgatgatcaa ccatcgcttc cgctgcgaac tgaccattct    10860

gttctaatta tcattaatat agtttcaaac tattattatt ttggtgaata tagcctatca    10920

ctagatagag agatacccct tgataaaagg ggacattttt ttgttatgtt ctcatcatca    10980

atgttattaa catttattta tgggatttta atatatataa aattacaaat tttatgaaaa    11040

gcattatcat cctcgacaag tacttcctct acagcattct gctggtggtg atcagcttcg    11100

tgttcatcaa acacccgatc ttcgacggcc acggtgtgct gaaatggggc tttctgagct    11160

tcatcattct gctgattctg ctcatcatcg agaacaccta cggcatcgcc aaaagcaact    11220

ttctgttctg gctgggcgaa atcagctaca gtctgtatct gacgcacatc attatcctcg    11280

aattcattct gaagcacatc accccggaga tctggaacaa cccgaatctg ggcatgagca    11340

agatcctctt ctacctcgcc atcagcatca gcttcagcta tctggtgtat ctgctggtgg    11400

agaagccgtt catcaacctc ggcaagaagc tgatcacgaa gctgtaaata ttaatggatg    11460

attttatgaa gtcacgaaat ctcgaaccta caaaaacgca tctgatctat ttagatatac    11520

taaatatttt tgcttgcatt gctgtacttt tttacatcac aatggtattg tacattggta    11580

taacgtaaat gaattggctt ggaaacaagc cttatttttt gaagtggctt tttattgggc    11640

tgttcctatt ttctttatgc tcaccggcgc cacgctgttc gaataccgca accgctacag    11700

cacgaagcag tttttcatca agcgcatcca gcgcgccgtg ttcccgtttc tgagctgcag    11760

cctcattctg ctgggctata gcttttacag cggcatgatc gaggccttta gcatccgcga    11820

cagcatcagt gccatcttca acaccaagga catcccgttc attgaaatct attggttttt    11880

tatccatctc tttagtctgt acatggtgat cccggtgctc agtctgctga aagataacta    11940

ccgcattctg tgctatattg tgggcgccat gtttctgacc cacagtctgt ttccggtgat    12000

ctttgacttc ttcaagctgc actacaactg gagcatcatt ttcccgatgg cgggctacag    12060

catctatctg gttctgggct atctgctgag taaggtgaaa ctggaaaaga aatatcagat    12120

catcatttac attctgggca ttctgagcgt gctgctccgc tacttttata cctacgtgag    12180

cagtctggag gccaaccagc tcgatcgcac gctgttcagc tacatgcaat tccacaccgt    12240

gtttctggcg gtggcgatct tcattttcgt gaaggaattc ttcagcggtg tgaaactgtt    12300

caacgccaag gtgctggcgg tgttcagcag ctgtagtctg ggcatctatc tgatccacaa    12360

gctcgtgatg gactacgaac tcaagtttct gggcatcagc gaggacaatc tctactggcg    12420

ctttttcggc gccttcatga cgtacggcgc gtgcctcgtg atcgtgctgt ttgttaagcg    12480

catcccgtat ctgcgcgcca tctttccgta aagatattat aaatatgaaa attctgatca    12540

ccggtggcgc cggttttatc ggcagcgccg tgatccgcta tatcatccag catacccaag    12600

atagcgtggt gaatgtggac aaactgacct acgccggcaa tctggcgagt ctggaaagcg    12660

tgagcaatag cagccgctac cactttgagc aagcggatat ttgcgacagc acccgcatca    12720

gtcagatctt ctgcaagtac cagccggatg ttgtgatgca tctggccgcc gagagccacg    12780

ttgatcgcag cattgatggt ccggcggcgt tcatgcagac gaacatcatc ggcacctata    12840

ccctcctcga agccagccgc cagtattggc tcagtctgcc gctggaacgc aagcaaacct    12900

tccgcttcca gcacatcagt acggacgagg tgtatggcga tctcaacgat agcaacgagc    12960

tgttcagcga gaacacggcc tatagcccga gcagcccata tagcgccagc aaggccgcca    13020

gcgatcatct cgttcgtgcg tggtttcgta cctatggtct gccgacgctg gtgaccaact    13080

gcagcaataa ctatggcccg ttccagttcc cggagaaact gatcccgctg atgattctga    13140

acgccattag tggcaaaccg ctgccgatct atggcaatgg tctgcagatc cgcgactggc    13200

tgttcgttga agaccacgcc atcgcgctgt atcaagttct ctgtcgcggc aaagtgggcg    13260

aaacgtacaa catcggtggc cacaatgaga agaccaatat cgaggtggtg caagcgatct    13320

gccgtctgct ggacgaactg gtgccgaata aaccgagcgg catcgagcag tatgaagaac    13380

tcgtgaccta cgtggccgat cgcccgggcc atgatgttcg ctacgccatc gacgcgagca    13440

aaatcgagaa tcagctgggt tggacgccga aagaaacctt cgaaagcggt ctccgcaaga    13500

ccgtggagtg gtatctgaat aaccagaagt ggtggcagag cgttctggat ggcagttact    13560

gcggtgagcg tctgggtctg agtctgaaaa gctactaa                            13598


<210>  5
<211>  370
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Chain length determination protein

<400>  5

Met Ala Thr Pro Glu Gln Val Ser Asn Gln Thr Asn Glu Glu Ile Asp 
1               5                   10                  15      


Leu Ile Glu Leu Val Arg Val Leu Trp Lys Lys Lys Leu Leu Ile Ala 
            20                  25                  30          


Ile Val Thr Cys Ile Phe Thr Ala Leu Ala Ala Val Tyr Ala Phe Thr 
        35                  40                  45              


Ala Lys Glu Lys Trp Thr Ser Gln Thr Glu Val Ile Ala Pro Arg Val 
    50                  55                  60                  


Thr Asp Ile Ser Glu Tyr Leu Ser Leu Arg Lys Glu Tyr Asn Leu Ile 
65                  70                  75                  80  


Ile Gly Ser Glu Phe Lys Glu Asn Glu Ile Arg Asn Glu Leu Ser Glu 
                85                  90                  95      


Leu Phe Ser Arg Tyr Val Leu Ser Tyr Asp Glu Ala Ile Ala Phe Phe 
            100                 105                 110         


Lys Thr Thr Asp Thr Tyr Lys Lys Leu Ala Glu Thr Glu Asn Glu Val 
        115                 120                 125             


Gly Leu Gln Arg Ala Val Ala Glu Phe Thr Thr Glu Ser Leu Lys Val 
    130                 135                 140                 


Ile Lys Pro Asp Ala Lys Lys Asp Pro Asn Ala Leu Gly Ser Lys Ile 
145                 150                 155                 160 


Ala Ile Ser Phe Asp Thr Ala Leu Ser Ala Gln Thr Thr Leu Asn Asp 
                165                 170                 175     


Phe Ile Cys His Ile Ser Asp Thr Ser Phe Asn Phe Ser Lys Asn Glu 
            180                 185                 190         


Phe Ile Tyr Trp Ile Lys Glu Ser Ile Ser Ser Leu Asn Tyr Glu Lys 
        195                 200                 205             


Glu Val Ile Glu Gln Asp Gln Ser Ile Gln Arg Lys Val Gln Ile Gln 
    210                 215                 220                 


Asn Leu Glu Thr Ala Leu Asp Met Ala Lys Lys Ala Gly Ile Lys Glu 
225                 230                 235                 240 


Tyr Ser Ser Ala Leu Ser Ser Asn Ser Ser Val Ala Asn Leu Ala Val 
                245                 250                 255     


Ser Asp Thr Lys Ile Pro Leu Ser Asp Ser Lys Leu Ala Asp Gly Thr 
            260                 265                 270         


Tyr Leu Phe Met Leu Gly Glu Lys Asn Leu Gln Ala Gln Leu Asp Ile 
        275                 280                 285             


Ala Lys Thr Lys Glu Ile Val Tyr Ser Pro Arg Tyr Tyr Gln Ile Gln 
    290                 295                 300                 


Glu Gln Leu Leu Lys Leu Asn Thr Leu Leu Pro Lys Val Glu Lys Val 
305                 310                 315                 320 


Thr Gly Gln Ser Phe Ser Tyr Ile Ser Ser Pro Glu Leu Pro Ile Lys 
                325                 330                 335     


Arg Asp Trp Pro Lys Arg Phe Ile Leu Leu Leu Ile Gly Ala Val Ile 
            340                 345                 350         


Gly Gly Val Leu Ser Ser Leu Trp Val Ile Gly Lys Gln Ile Phe Gly 
        355                 360                 365             


Gln Lys 
    370 


<210>  6
<211>  1014
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence of gne-HA

<400>  6
atgaaaattc ttattagcgg tggtgcaggt tatataggtt ctcatacttt aagacaattt       60

ttaaaaacag atcatgaaat ttgtgtttta gataatcttt ctaagggttc taaaatcgca      120

atagaagatt tgcaaaaaat aagaactttt aaattttttg aacaagattt aagtgatttt      180

caaggcgtaa aagcattgtt tgagagagaa aaatttgacg ctattgtgca ttttgcagcg      240

agcattgaag tttttgaaag tatgcaaaac cctttaaagt attatatgaa taacactgtt      300

aatacgacaa atctcatcga aacttgtttg caaactggag tgaataaatt tatattttct      360

tcaacggcag ccacttatgg cgaaccacaa actcccgttg tgagcgaaac aagtccttta      420

gcacctatta atccttatgg gcgtagtaag cttatgagcg aagaggtttt gcgtgatgca      480

agtatggcaa atcctgaatt taagcattgt attttaagat attttaatgt tgcaggtgct      540

tgcatggatt atactttagg acaacgctat ccaaaagcga ctttgcttat aaaagttgca      600

gctgaatgtg ccgcaggaaa acgtaataaa cttttcatat ttggcgatga ttatgataca      660

aaagatggca cttgcataag agattttatc catgtggatg atatttcaag tgcgcattta      720

tcggctttgg attatttaaa agagaatgaa agcaatgttt ttaatgtagg ttatggacat      780

ggttttagcg taaaagaagt gattgaagcg atgaaaaaag ttagcggagt ggattttaaa      840

gtagaacttg ccccacgccg tgcgggtgat cctagtgtat tgatttctga tgcaagtaaa      900

atcagaaatc ttacttcttg gcagcctaaa tatgatgatt tagggcttat ttgtaaatct      960

gcttttgatt gggaaaaaca gtgctaccca tacgatgttc cagattacgc ttaa           1014


<210>  7
<211>  1137
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence of wzy (APP2)

<400>  7
atgaactcct tagtatatag aatagatatt agaacactta ttttttctat tttttatttt       60

acttttttag tatcggattt tttattatta gctcaagatg gcactattac aaaagatatc      120

atcaaatggg ttaaattatt ctcattattg ccattgctct tattaatatt taaattgcct      180

ttgaatctct tgattttagg tttttttact ataatgataa gtgcttttta ttctatttat      240

acgggagatt cgtttttatt atatatatgt ttgctgatgt ctttttctta taaagttaat      300

tttaactttt tattcaagat aggattatat cttacttcaa ttctagttgt tctaatacta      360

acttatttct tttttgaata ttttctgatt ggtgacagtc attttgtata tgatgcgacc      420

tattggttta aacgttatac atttaatttt gataatccta atgcatttcc tatgagaata      480

ttcgtttttt ttatatttta tatattgcat gtaggtaagc tgcgactttt tgatacattt      540

ctatttgtta tactatttgg aatagttttc tatttttcaa attctagaac tgcattttat      600

atttttattt tgtgtgtcct tactattcat tttaaccaag tttttaatgt gctaaataat      660

acttttgtta aattactaat taataattca attatattta taactatttt ttcaatttgg      720

tcggctatat attatcaaga ttattattcc tatttagaac cgattaacaa aattttatct      780

aaaagaatat actttgctaa tgaggcttat aagagtttag gatttgaatt ttaccctagg      840

aatattaaat ggtggataga agaatctgat tggcatatta tagataatgg atatgtatat      900

ttatttattt ctggtggtct tttagtagga aatttattta tattttctat aacttggctt      960

atgtatagac taaataaatt taacctaagt aatgaggcaa tattattaat gttttctatg     1020

ttatatcttt tatctgagag tcattttata aatatatttt acaatatacc tattttatta     1080

ttagctattt tcattaataa aactaatatt gtacgctatt tggaatgtaa aaaatga        1137


<210>  8
<211>  1137
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence of codon optimized wzy (APP2)

<400>  8
atgaattctt tagtgtatcg cattgacatc cgcactttaa tttttagcat cttttatttt       60

acctttctgg tgagcgattt tttactgctg gcccaagatg gcacaatcac caaggacatc      120

atcaagtggg tgaagctgtt ttctttactg ccgctgctgc tgctgatctt caagctgccg      180

ctgaatttac tgattctggg cttttttacc attatgatta gtgccttcta cagcatctat      240

accggtgaca gctttttact gtacatctgt ttactgatga gctttagcta caaagttaat      300

tttaattttt tatttaagat tggtttatat ctgaccagta ttttagtggt gctgattctg      360

acatattttt tctttgagta ctttctgatc ggcgacagcc actttgtgta cgacgccacc      420

tactggttca aacgctacac ctttaacttt gataacccga acgcatttcc gatgcgcatc      480

tttgtctttt ttatttttta cattctgcat gtgggcaaac tgcgtttatt cgacaccttt      540

ctgttcgtta ttctgtttgg tatcgttttt tactttagca acagccgtac agccttttac      600

atttttattc tgtgtgttct gaccattcat tttaatcaag tgtttaatgt tctgaataat      660

acctttgtta aactgctgat taacaatagc attatcttta tcaccatttt tagcatttgg      720

agcgcaatct attatcaaga ttactattct tatctggaac cgattaataa aattttaagc      780

aaacgtattt attttgcaaa cgaagcctat aagtctttag gcttcgagtt ctacccgcgc      840

aatatcaagt ggtggatcga ggagagcgac tggcatatta tcgacaatgg ctatgtttat      900

ttatttatca gcggcggttt actggtgggc aacttattta tcttttctat tacttggctg      960

atgtatcgtc tgaataaatt taatttaagc aacgaggcca ttttactgat gtttagcatg     1020

ctgtatttac tgagcgagag ccattttatc aatatttttt ataacatccc gattttactg     1080

ctggcaattt ttattaataa aaccaatatt gtgcgttatt tagagtgtaa aaaatga        1137


<210>  9
<211>  97
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ec_SL/Kan_fw

<400>  9
caaattccgg ttaaaaaaag accgcttgtt tgagagtgat aatcgcaaac aagcggtctt       60

ttttgatcaa aatattatta cacgtcttga gcgattg                                97


<210>  10
<211>  91
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ec_SL/Kan_rev

<400>  10
gatgaagagc aaagattggg agataatgtg agaaatcttt agattcaaac taagctgaga       60

agaaaaaggt ccatatgaat atcctcctta g                                      91


<210>  11
<211>  82
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E.c._5_DELTArfb fw

<400>  11
gtaatgttaa tgaaagcata taagaaattt tcaaatgaat aaagaaactg tttcagttat       60

tattacacgt cttgagcgat tg                                                82


<210>  12
<211>  85
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E.c._5_DELTArfb rev

<400>  12
gagcatgtaa tcttctgata aaaatcattt gtacgatatt ttcagttaca tactatgcgt       60

aggtccatat gaatatcctc cttag                                             85


<210>  13
<211>  83
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SL1344_DELTArfb fw

<400>  13
gagcaattaa tttttattgg caaattaaat accacattaa atacgcctta tggaatagaa       60

aaattacacg tcttgagcga ttg                                               83


<210>  14
<211>  83
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SL1344_DELTArfb rev

<400>  14
gcgttcagat tttacgcagg ctaatttata caattattat tcagtacttc tcggtaagcg       60

gtccatatga atatcctcct tag                                               83


<210>  15
<211>  92
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ec_SL/Kan_fw_elo

<400>  15
cagggctagc gctaattacc aatttattgt ttagcttagg aattttttta ggttagttgc       60

aaattccggt taaaaaaaga ccgcttgttt ga                                     92


<210>  16
<211>  92
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ec_SL/Kan_rev_elo

<400>  16
caatattagc ttatgtatta tattagaagg cctacagata agcaaaaaat attattgatg       60

aagagcaaag attgggagat aatgtgagaa at                                     92


<210>  17
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BamHI-Fw KanR-Fw

<400>  17
cgggaattca agcttggatc cc                                                22


<210>  18
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  XhoI 3'rspU- Rev

<400>  18
gacgctagca tatgagctcg ag                                                22


<210>  19
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BamHI-SL-gnd Fw ext

<400>  19
gtttcatcag taatgggaca gaaaggtacc                                        30


<210>  20
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  XhoI-SL-GalF Rev ext

<400>  20
cacactcgag caattgaccg gtttttctat tccataaggc                             40


<210>  21
<211>  46
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' NdeI_wzy

<400>  21
caggtaccat atgaactcct tagtatatag aatagatatt agaaca                      46


<210>  22
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' EcoRI_wzy

<400>  22
cttatcagaa ttcatttttt acattccaaa tagcgtacaa                             40


<210>  23
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' EcoRI_pEC415fw

<400>  23
gtaccgagct cgaattcttg aagacgaaag g                                      31


<210>  24
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' NdeI_pEC415rev

<400>  24
cactgcaatc gcgatagctg tctttttcat atgt                                   34


<210>  25
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3'-gne-cat_overlap

<400>  25
gaataggaac taaggaggat attcatatgg accttaagcg taatctggaa catcgtatgg       60

g                                                                       61


<210>  26
<211>  80
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5'-gne_Sl1344rfaL

<400>  26
attgctcaaa ttggtatcat taccggtttt ctgctggcgc taagaaatag ataatgaaaa       60

ttcttattag cggtggtgca                                                   80


<210>  27
<211>  80
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3'-cat_Sl1344rfaL

<400>  27
aaaaactggt ttgataagtg attgagtcct gatgatggaa aacgcgctga taccgtaatt       60

gtgtaggctg gagctgcttc                                                   80


<210>  28
<211>  80
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5'-elo-gne_Sl1344rfaL

<400>  28
ttttatcttt cgtcggtttt tatatcgttc gtggcaattt tgaacaggtc gatattgctc       60

aaattggtat cattaccggt                                                   80


<210>  29
<211>  80
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3'-elo-cat_Sl1344rfaL

<400>  29
tttcaaaata cagttgggaa aatgtagcgc agcgtttcga ggaacaaatg aaaaactggt       60

ttgataagtg attgagtcct                                                   80


<210>  30
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5'-cat-P2new

<400>  30
ggtccatatg aatatcctcc ttagttccta ttc                                    33


<210>  31
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' gne overlap_wzy (co)

<400>  31
cccatacgat gttccagatt acgcttaatg aattctttag tgtatcgcat tgacatcc         58


<210>  32
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' wzy (co)-cat_overlap

<400>  32
gaataggaac taaggaggat attcatatgg acctcatttt ttacactcta aataacgcac       60

aatattgg                                                                68


<210>  33
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' gne_wzy (co) overlap

<400>  33
ggatgtcaat gcgatacact aaagaattca ttaagcgtaa tctggaacat cgtatggg         58


<210>  34
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' XmaI-XhoI-APXIIIne-HIS-fw

<400>  34
aaaaaacccg ggctcgagat ggatgtaact aaaaatggtt tgcaatatgg g                51


<210>  35
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' APXIIIne-HIS-HindIII-rv

<400>  35
aaaaaaaagc ttttagtggt gatgatgatg gtgatggt                               38


<210>  36
<211>  72
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FW_pliCint

<400>  36
ctaattagta accactttta agcatggtta atcctatttt gaaaaagcaa aatccctggt       60

gttttcaaaa ta                                                           72


<210>  37
<211>  71
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  REV_pagCint

<400>  37
gattcactct gaaaaatttt cctggaatta atcacaatgt caggtcgata ttgctcaaat       60

tggtatcatt a                                                            71


<210>  38
<211>  76
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  eloFW_pliCint

<400>  38
cgtaacgtta aagaatatgt gaatcactac cgtagtataa tggctaatta gtaaccactt       60

ttaagcatgg ttaatc                                                       76


<210>  39
<211>  79
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  eloREV_pagCint

<400>  39
gataagcagg aaggaaaatc tggtgtaaat aacgccagat ctcacaagat tcactctgaa       60

aaattttcct ggaattaat                                                    79


<210>  40
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FW_rfaK_rfaL

<400>  40
ctatttatat ggcgctatca tcagggaaac ag                                     32


<210>  41
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  REV_rfaL_rfaK

<400>  41
gacagtataa ttaatgatat taaccgtgcg cttg                                   34


<210>  42
<211>  13750
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Integrated APP2 O-antigen biosynthesis cluster located on pDOC 
       plasmid

<400>  42
tttctgtccc attactgatg aaacacggct tatttgttta tcgccacaaa gacccgattg       60

aattacgccc aattttagga caaatggcag ggctagcgct aattaccaat ttattgttta      120

gcttaggaat ttttttaggt tagttgcaaa ttccggttaa aaaaagaccg cttgtttgag      180

agtgataatc gcaaacaagc ggtctttttt gatcaaaata tttcaaatgc taaaggatga      240

accgcaaccg caggttgagc tagcattcgg gttttgcacc acaaaacgag agccgtccaa      300

accttcggta taatctaccg taccaccgat taaatattgc aggctcatag gatcaaccac      360

taaaccgacg ttttggtttt caatggttaa atcgccttca ttgatttggt cgtcgaaggt      420

aaaaccgtat tggaaaccgc tacaaccgcc acccgtaata tacactcgta aacgaagatt      480

cgggttatcc tcgccctcaa ttaaactctt aactttcttc gccgctgcat cggtaaagat      540

aagaggaatt tgaatatcgt ccattttttc ttctcagctt agtttgaatc taaagatttc      600

tcacattatc tcccaatctt tgctcttcat caataatatt ttttgcttat ctgtaggcct      660

tctaatataa tacataagct aatattggct attattttga gagtaatatt atgcttgaac      720

aagcatcaaa ccaaactaac gaggaaatcg atctgattga attaattcgc gtgctttgga      780

agaaaaaatt attaattgct attgttacct ttatttttac tgcattggca gcagtttatg      840

cttttaccgc gaaagagaaa tggacatctc aagcagaggt aattgcgcct agagtgacag      900

atatttccga atatttatcg ttacgaaaag aatataattt aattatgggt tctgagttca      960

aagaaaatga tattcgtaat gaactaaatg agcttttttc tcgttatgtg ctttcttatg     1020

acgaagcaat ctctttcttt aagaccacag atacttataa aaaacttgca gaaaaagaaa     1080

atgaagcagg tttacaaaga gcagcttctg aatttacaac ggaatcattg aaggtgataa     1140

aaccggatgc gaaaaaagac cttaatgctt taggcagtaa gattgctatt tcatctgaga     1200

ctgctttatc tgcacaaaca gagttaaatg attttattcg tcatattagt gatatttctt     1260

ttaattttag taaaaatgaa tttatttatt gggtaaaaga gagcatatct agcctaaatt     1320

acgagaaaga agtgatagag caagatcaga gtattcaacg aaaagttcag attcaaaact     1380

tagaagccgc acttgatatg gcgaaaaaag caggaattaa agagtatagt tctgcattgt     1440

catcaaatag ttctgtggct aatctcgcgg taagtgatac aaaaattccg ttatcagatt     1500

ctaaattagc agatggaacc tatttattta tgcttggtga gaaaaatcta caagcacaat     1560

tagatattgc gaaaaccaag gaaattgttt attcgccaag atattatcag attcaagagc     1620

aacttttaaa attaaatact ttattaccta aagtagagaa agtaactggg caaacttata     1680

gttatgtatc ttcgccgaca tatcctgtta tcaaagatgc acctaaaaaa ggaattattt     1740

tagttatagg tttcttggta ggattgttac tgagttcatt cactgttttg atttctgtat     1800

tagtgagaaa taagaagagc taattatgag gtatgaagag gaatattaac ctctttatgt     1860

gccttttgat gatccaattt actgaagtgc ttagactatt taggtaataa aatataggga     1920

taataatgaa gcataagata ttacatttta gccaagtgct tggcggtgtc ggtcgatatt     1980

tagagctata tgataagtac atcaataaag attcatttga aaatatatat atattaccta     2040

taggtgattg ggaggcggca gaagctcagg ataaacggta tatattaaat attgaacaat     2100

ccttttcacc aattaaattg atctctaatg ttataaaaat tagaaatatc ttaaaaaaag     2160

aaaagccaga tatcttttat ctacatagta cttttgctgg cgttattggg cggttagctg     2220

ctattggtat gaggtgtaaa gtaatttata accctcatgg ttggtcattt aagatgaatg     2280

tatctaggct aaaacaaact ttttataaga ttatcgaagg tggtttagtt ttcttaactg     2340

ataagtttgt tttaatttcg aaatcggagt atgaggcggc acgttcaatt ggtgtttcag     2400

agaagaaatg ttgtctcata tataacggta ttgaaacgat aaaaaaaaca gatatagcaa     2460

tcattcctaa attagatgat aaatatatca ttggaatgat aggacgtatt agtgagcaaa     2520

aaaatccaat gttttttgcc cagtttgcca aagagattat taaacaatac cctaatactt     2580

atttcatttt ggttggtgac ggtgagcaac gagaatcgtt agaagactat ttagaacgta     2640

ataacttgaa tgatgttttt tatattacgg gttgggtaac taatcccgaa agctacttaa     2700

atttatttga tcaagcagta ttattctcaa aatgggaagg gctatgtctt tcagtctgtg     2760

agtatatgtt atatgaaaag cctatattag taagtaatat tggtggtatt aacgatctta     2820

ttcagaatga agttaatggt tttactattg ttgaaggtga tcttaaggat gcggttaaca     2880

aatctaatag attaagaaat gagcctaaaa ctgtagctaa gtttattgaa gcctcaaaca     2940

tacttattca agagaaattt aatgctcaaa aaatggtaaa tagtttagaa aaacttttta     3000

tcaaattatc agagaataaa taatgaaaga aaaatttagt tgcattgttg tttgttataa     3060

ccccgataat tcggtgcttg acaatctcaa aaactatatt agttatgtgg gaaaagtaat     3120

cgtagttgat aattcagatg tggataattc tcaattattt tcctcacttt cagaatactt     3180

aatttatata ccattgtata aaaatgtggg tattgcctat gcactaaata taggagtaga     3240

aaagtccaaa gaattaggat atgaatatat cattactatg gatcaagata gttcttttgc     3300

tactaatcta gtggatgtat attcacatta tataagtaat tatcctatag atcagatagg     3360

agcattatcc ccagtttata ttacggacag gggatttaat cgaacaagta aagaagaatt     3420

taaacaaata aaaattacta tgcaatcagg ttctatgttc tttactgata aatttgatgt     3480

aatcggtcgc tttgataatg atctgttctt agatgtagtt gattgggaat atttctttag     3540

aatttatacg ttaggatata aaacgattca atgtaataaa gcaatgctga aacatgctcc     3600

agcggaaacg ctaacgttat ttaaaataaa aggaaaaaca attggtgttg gagtcgcttc     3660

tccattaagg tattactatc aaattagaaa tctactttgg tgtgttttac ataaaaagag     3720

tttttttatg ataaaaacaa tagcttataa atttattaag attctatttt tgtttaataa     3780

taaaaagcaa tatttatcat ttgcttatat ggctattaaa gacgccttta ataatcgttt     3840

aggggcatat gatacacttt atttagagaa atctcgtaat gaaaaatgat ttaccattaa     3900

ttagtattat tattcctatc tataacgtga agccttatct tgaaaaatgt gtaaatagtg     3960

tattatcaca atcatatcct aatcttgaaa ttattctagt tgatgacggt gcaactgatg     4020

gttctgctca agtgtgtgat gatttttctg aaaagtatgc aaatattcag gtaattcata     4080

agaaaaatgg tgggctatcc tcagcaagaa atgctggaat tgaagctatg aagggagaat     4140

acgtattttt cttagatagt gatgactgga ttgctaatga tgcaatttct caattatatg     4200

atgatatggt ggaatataat gcagatataa cagggattag tttttatcaa gcatattcag     4260

acggtaattt agtattaaat acacatctta ttgaaaaaca aatgctttca aagaaagagg     4320

ctttacgtac tttcctattt aataattacc ttactccttg ttcctgtgga aaactttata     4380

aagcaagtct atggaaagat ataagatttc cggagggacg attatttgaa gatcagctta     4440

ctacttataa agttatcgag ttagcaaata caattatttt taatcctgct gcaaagtatt     4500

tttattttaa aagaatagga tctatcggtc attctgcttt ttctgaaaaa acatatgacc     4560

tttatgaggc tgttaatgaa caatataatg aaataactaa gcatcatcct gatattgaat     4620

ctgatttggc ggttgctaaa attacttggg aaattgtatt tattaatatg atgctcaatt     4680

caaattattc agatcaagcg atagttgata aaacacgagt ttttgcaaga aaacgtattt     4740

tagatgtagt gaaatgtgag tttatcccta atttacgaaa atttcagatt actttatttg     4800

catataattt tagtttatat aaagttttat atgcaagata taaaaagaaa aatccattat     4860

cttaattatt gattttatcg gttgttacaa tgatacctaa aaaaattcat tattgttggt     4920

ttggtggaaa tccattacct aaaagtgtga aaaaatgtat taaatcttgg aaaaaatact     4980

gtccagatta cgagattatt gagtggaatg aatctaatta caatgtgcat aagaaccttt     5040

ttataaaaga agcttatgag aaaaaaaagt ttgcatttgt ttcagattat gctcgtttag     5100

atgtggttca ttctgaaggt gggatttatt tagatactga tgttgagttg ataaaaccta     5160

tagatgattt attagctcat agttgttttt tagcatctga atctattgat gatgttaata     5220

cagggctagg ttttggggct gaaaaaggac attggtttat cgcagaaaat atgagtgtct     5280

atgaaaatat gtactttaat atggaaaata ttattacctg tgtagagatt actactaaat     5340

tattaataga aagaggtttt tctgctagtg ataaaattca aaatatagat gatattttca     5400

tttatccaac tgagtatttt tgcccattaa attataaaac ccacgagttg catataacac     5460

agaatactta ttctatacat cactatgatg caacttggca aagccctctt atgaaattta     5520

aaacaaaaat taagtatata ttgtgtttag ccggaataat aaaatgaact ccttagtata     5580

tagaatagat attagaacac ttattttttc tattttttat tttacttttt tagtatcgga     5640

ttttttatta ttagctcaag atggcactat tacaaaagat atcatcaaat gggttaaatt     5700

attctcatta ttgccattgc tcttattaat atttaaattg cctttgaatc tcttgatttt     5760

aggttttttt actataatga taagtgcttt ttattctatt tatacgggag attcgttttt     5820

attatatata tgtttgctga tgtctttttc ttataaagtt aattttaact ttttattcaa     5880

gataggatta tatcttactt caattctagt tgttctaata ctaacttatt tcttttttga     5940

atattttctg attggtgaca gtcattttgt atatgatgcg acctattggt ttaaacgtta     6000

tacatttaat tttgataatc ctaatgcatt tcctatgaga atattcgttt tttttatatt     6060

ttatatattg catgtaggta agctgcgact ttttgataca tttctatttg ttatactatt     6120

tggaatagtt ttctattttt caaattctag aactgcattt tatattttta ttttgtgtgt     6180

ccttactatt cattttaacc aagtttttaa tgtgctaaat aatacttttg ttaaattact     6240

aattaataat tcaattatat ttataactat tttttcaatt tggtcggcta tatattatca     6300

agattattat tcctatttag aaccgattaa caaaatttta tctaaaagaa tatactttgc     6360

taatgaggct tataagagtt taggatttga attttaccct aggaatatta aatggtggat     6420

agaagaatct gattggcata ttatagataa tggatatgta tatttattta tttctggtgg     6480

tcttttagta ggaaatttat ttatattttc tataacttgg cttatgtata gactaaataa     6540

atttaaccta agtaatgagg caatattatt aatgttttct atgttatatc ttttatctga     6600

gagtcatttt ataaatatat tttacaatat acctatttta ttattagcta ttttcattaa     6660

taaaactaat attgtacgct atttggaatg taaaaaatga ataaaaacct tgtaaataat     6720

agtattatga gttttttgct tacaatatct aactttattt ttccattaat tacttttact     6780

tatgcggcaa gaattttgca acctgataat atgggaaagt ttgcattttc tctatcggtt     6840

gtagattatc tatctctatt tgctacattt ggtgttgtag gttatggtgt tagagcttgt     6900

gcagaagtaa gaaacaataa agaagaacta actaaaacgg tacaagaaat tttatttatt     6960

aatatttttt tagctattat tgcctatctt gtgatatttc ttctaattag ctatcagcat     7020

gcatttagag aagatacttt gttattctta attatgtctt cttgtattat ctttaatgtg     7080

ataggaatag aatggttata taaaagtctc gatgaatata gatacattac agtaagaagt     7140

attctattaa aaataatttc attaataatg attttatgtt ttgttaaaga aaaggatgat     7200

tatccacttt ttgcattgtt ttttgttcta ccaatttgtc tatcttcgtt gttaaatatt     7260

ataaattcaa gaaaaatatt gctttttaaa ttatttaaac ttgatttatc aaagcatata     7320

aaaccaatgt ttgttttatt tttagtgaca ttatcttata cattatatgc taatgttaat     7380

gatgtgctat tagctactgt aactaataca gaacaagttg gttactatag tgttgctttc     7440

aaaataaaag ctgcattatt agctttcatt actagtacaa gtatggtttt tttacctcga     7500

ttaacagagt atattaaaaa taatcaagat attgaattta ttgacttatt aagaaagtct     7560

tttgatctgg ttttttttct agctgtgcca ataacattat ttttcttttt atacgctaaa     7620

gaaacaatat ttttattgtt tggtgagaaa tataataagt caagtttatt attgcaaacc     7680

atgatatggt ctgttttttt tggtggttta aataatatat taagtgtaca aatgttattg     7740

cctttaaaaa aagataatca gttcttaatt tctattttaa gtggtggatg tatatcttta     7800

gttgtgaatt ttatcttctt gagggagctt caatcattaa gtacatcaat ttcagttcta     7860

gttgcagaag ttgttatact gattattcaa ttagttattc taagaaaata tattgtaaga     7920

atttttaata atttaaatcc tttaaaggtg ataatgtcgg tttttttttc tatatggttt     7980

gttaatttaa tttatgccaa ttttattgct ctaggtaata gtttcttaga gtatattatt     8040

tctattttta tattttcatt attttatgtg tttttacttt tttttagtaa agaaagattt     8100

gttcatgatg tgttttttta tataaggagt aaatttgatt aatttattaa ttagtattct     8160

agctaaaatt ctttctagga tttctaaact gattttgaat ataaaaaaac ggaaggaata     8220

caaacgagtt ggctctatag ttgattcaaa gaatatagat ttgagtttta tttgtggtaa     8280

ctattgtaga gtagggagag atactgtaat tgagaaaaat gttattatgg ggagattatc     8340

ttacattaat tcagatatgg gaaaaacata tattggtagt aatgtaaaga ttggtagttt     8400

atgctcaatt tcctcaggtg taataattgc tcctgtaaat cattacctaa attatgtgac     8460

aacgcaccca ttactttata attcctatta tagtagcatt ttaaatatta attctaatct     8520

gttatctcaa caagaattag atgcaaatgt atcaacagtg attggtaatg atgtatggat     8580

tggagctaat gtgattataa agagaggagt aactatagga gatggagcgg ttattggtgc     8640

aggtagtatt ataacaaaag atattccttc ttatgcagta gtagcaggag ttccagctaa     8700

aattattaaa tatcgttttt caaaagatgt aatagaaagc ctgaaagata gtaagaatgt     8760

ttgggaatta tctacctcag aattagaaga gaatttttct catttatatg atgttgagaa     8820

atatcttaat agatttaagt tgtaggatta atttttagtc taggatttta gtatgagtaa     8880

gaaaaatata gttgcacaaa ctttattact ttgcttagat ttattactaa ttagtatggc     8940

aatcttttta gctgtattta ttagaaataa tattttaccg aatattatgt tatttgagcc     9000

tgtatcatat atagagtatc tagtataccc atttccttat gtaatcattg ttacattgtt     9060

tatgtggttt gggctatata caagaagata tgatttatgg caggagtcat tatttattat     9120

aaaagtatgt tttatttctt ttattattat ctttgcaaca ttagcattgg gtaagaatat     9180

agaatattat tctagagctg ttttattatt atctcttttc ttatcagtga tatttttacc     9240

aataggtcgt tattttttga aaaaaagctt gtttagactg ggtctttggg aaaggaaagt     9300

aaagtttatt ggcaatttaa ataagaatga aattgggatt tttaattctc ctcatgtagg     9360

atatgtgtta tctaaagatg atacatatga tgttatattt atatctagtg gtgataagag     9420

tgtatcagaa ttaaatgatt taattgaaag taataaatta ttgaatcgtg aggttctatt     9480

tatccctgtg ttaaatcaat atgattttac tcaatctgtt ttgtacaata attttagtac     9540

aaggctaaat ctatttacgt tagaaaataa attacttgga aagcaaaata aaattttgaa     9600

gtatttacta gattatgtac tagtattatc tactttacct ttttgggggg ggctgatttt     9660

acttattagt ataaaattaa aattagaaga tcctaaaggg aaaatatttt tcttacaaaa     9720

gagattaggt caagagggta agatattcta ttgttataaa tttagaacaa tggtttcaga     9780

ccagagcttt atgcaacaat ggcttattga taatccagaa gaaagagatt attacgctgt     9840

gtatcataag tatattaatg atcctagaat tactaaattc ggacattttt tgcgaagaac     9900

atctttagat gagttacccc aattatttaa tgtacttaaa ggggatatga gtttagttgg     9960

aaatagacct tatatggttg aggaacaaca aaaaatgaaa gatgctgcca gtattatttt    10020

gatgtcaaaa ccaggagtaa caggtttatg gcaagtaagt gggcggagtg acgtttcatt    10080

tgaagaacgt ttacaaattg attcttggta tattaaaaat tggtctattt ggaatgatat    10140

tgttatttta ttcaaaacag ttggtgttgt attaagaaaa gatggagcat cttagtaata    10200

atgtaattac attaaattat tatagatagg gattattatg aaaaaaattt tagtcaccgg    10260

tggtgcaggt tttattggct ctgcggttgt acgtcatatt ataaatgata cacaagatag    10320

tgttgtaaat gttgataaac ttacctatgc gggtaattta gaatcgttat taatggtaga    10380

aaatagccct cgttacgtat ttgagcaagt agatatttgt aatcgtgcgg aacttgatcg    10440

cgtatttgcc caacatcagc ctgatgcagt tatgcactta gccgcagaaa gccatgttga    10500

ccgttcaatc gatgggccgg ctgcttttat cgaaacaaat attgtcggta cttacacttt    10560

gctcgaagct gctcgctatt attggaatag tttagatgct gataaaaaat cattattccg    10620

ttttcatcat atttctacgg atgaggtata tggtgatttg gaaggtacag aagatttgtt    10680

tacggaaacg acgccgtatt ctccgtctag cccatattcg gcttctaaag cgtcaagtga    10740

tcatttagtc cgtgcttggc ttcgtactta tggattacct acgattgtga ccaattgttc    10800

gaataactat ggtccgttcc attttcctga aaaattaatt cctttaatga ttttaaatgc    10860

tttagagggt aaaccattac cagtttatgg taatgggcaa caaatccgtg actggttatt    10920

tgtagaagat catgctagag cattatacaa agtggtaacg gaaggtaagg tgggagaaac    10980

ttataatata ggtggacata atgaaaaagc taatattgat gttgttcgta ctatttgtag    11040

tttattagaa gagcttgtac caaataaacc ggcgggtgtg cataaatatg aggatttaat    11100

tacctacgtt acagatcgtc cagggcatga tgttcgttat gcaattgatg caacaaaaat    11160

tggacgagaa ttaggttgga agccacaaga aacatttgaa acaggtattc gtaaaacagt    11220

cgaatggtat ttaaataata cagagtggtg gagtcgtgta ttagacggtt cttacaatcg    11280

tgagcgttta ggttcaaatt aatattatta caagcgatcc aatttttaat aaggtttaca    11340

atatgaaagg tattattctt gcaggtggct caggtactcg tctttacccg attactcgtg    11400

gcgtgtcaaa acagctctta ccggtatacg ataaaccaat gatttattat cctttatcag    11460

tacttatgct tgcaggtatc cgagaagtct taattattac aacaccggag gataatgaga    11520

gctttaaacg tttattaggc gacggttctg atttcggtat ccaactttcc tatgctattc    11580

aacctagtcc agatggctta gctcaagcat ttttaattgg tgaagagttt atcggtcagg    11640

acagtgtatg tttggttcta ggtgataata tcttctacgg tcagcatttt actcaatctt    11700

tacaagaggc tgtaaaatcg gtagaaacga aaggtgcgac tgtatttggt tatcaagtga    11760

aagatccgga acgttttggt gtggtagagt ttgatgacaa tttccgtgca ttgtctattg    11820

aggaaaaacc gattcaaccc aaatctaatt gggcggtaac cgggttatat ttctatgata    11880

accgagtagt agaatttgca aaacaagtaa aaccctctgc acgtggcgaa ttagagatta    11940

ccactcttaa tgagatgtat cttaatgatg gttcacttaa tgtacaatta ttagggcgag    12000

gctttgcttg gttagatacc ggcacacatg atagcttaca tgatgcggca gcatttgtga    12060

aaacagtaca aaatctacag aatttacagg tagcatgctt agaggaaatt gcctatcgta    12120

acggttggtt atcacttgag caacttgaag cattaacaaa accgatggcg aaaaatgaat    12180

acggtcaata tttgttacgt ttaacaaaag gaacaaaata atggcacgtt tcttaattac    12240

gggagcgaag ggacaggttg gatattgtct tactaagcaa ttacagagca aagcagatgt    12300

cttagcagta gatcgtgatg agcttgatat aacaaatcgt gatgctgtat ttaaagttgt    12360

cagagagttt catcctgatg ttattattaa tgctgccgca catactgctg tagatcgggc    12420

tgagagtgaa atcgaactat cggaagcgat taacgtgaaa ggcccacaat atcttgcaga    12480

agcagccaat gagattgatg caatcatttt acatatttca acggattatg tctttgaagg    12540

gacaggttct ggagaatata aagaaaatga tgaacctaat ccacaaggcg tatacggcaa    12600

aacaaaactt gccggagaga tagcagttca acaggcaaat aaaaggcata tcattttgcg    12660

tactgcttgg gtatttggtg aacatggtaa taactttgtt aaaacgatgc tccgtttagc    12720

aaaagaaaga gaatctttgg gaattgtgag tgatcaattt ggcggaccta cctatgcagg    12780

ggatattgcg agtagcctga ttcatatagc aaatatcatt cttaatagta agatagatgt    12840

atttggtgtt taccatttta ctggcaagcc ttatgtaagt tgggccgatt ttgctaagaa    12900

aatttttgat gaagctgttt cgcaaaaggt attagaaaaa gcaccgcttg ttaattttat    12960

tgctacaagt aattatccaa catcagcaaa acgaccggca aattctcgct tagatttaac    13020

taaaattgat gaggtttttg gtattaaacc gagtaattgg caacaagcat taaaaaatat    13080

taaggcatat gcgtaatgaa gattattgaa acaaatattc cggatgtaaa gcttttagaa    13140

cctcaagtat ttggtgatga acgcggtttt tttatggaaa tttttcgaga tgaatggttc    13200

agacaatatg tcgctgatcg tactttcgtt caagaaaatc attcaaaatc tattaaggga    13260

gttttgagag gcttacatta tcaaactgaa aatacacaag gcaagttagt gcgtgtagtg    13320

caggggtctg tgtttgatgt agcggtagat ttacgtaaaa gttctccgac ttttggacaa    13380

tgggttgggg aagtattatc cgctgaaaat aaacgtcaac tttgggtccc tgaaggattt    13440

gctcacggtt tttatgtatt gacagaaacc gctgaattta cctataaatg cacagattac    13500

tataatccaa aagcggaaca ttcattgatt tggaatgatc cgacagtagc gattaattgg    13560

aatcttggtg gcgcgcctag tttatcagca aaggatttag ctggtaaggt gttaaatgaa    13620

gctgttttat ttgaatagta aattctctat ttacttttta tcttgactac gatataattg    13680

gatacctttt tttagttcta tgtcgccaaa aattgtgtgc gactttattt aaacatatat    13740

ttcctgaggt                                                           13750


<210>  43
<211>  22933
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pDOC_E.c.5_Drfb::KanR_APP2 LPS(cod.opt.)

<400>  43
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accataatcg gcattttctt ttgcgttttt atttgttaac tgttaattgt ccttgttcaa      240

ggatgctgtc tttgacaaca gatgttttct tgcctttgat gttcagcagg aagctaggcg      300

caaacgttga ttgtttgtct gcgtagaatc ctctgtttgt catatagctt gtaatcacga      360

cattgtttcc tttcgcttga ggtacagcga agtgtgagta agtaaaggtt acatcgttag      420

gatcaagatc catttttaac acaaggccag ttttgttcag cggcttgtat gggccagtta      480

aagaattaga aacataacca agcatgtaaa tatcgttaga cgtaatgccg tcaatcgtca      540

tttttgatcc gcgggagtca gtgaacagat accatttgcc gttcatttta aagacgttcg      600

cgcgttcaat ttcatctgtt actgtgttag atgcaatcag cggtttcatc acttttttca      660

gtgtgtaatc atcgtttagc tcaatcatac cgagagcgcc gtttgctaac tcagccgtgc      720

gttttttatc gctttgcaga agtttttgac tttcttgacg gaagaatgat gtgcttttgc      780

catagtatgc tttgttaaat aaagattctt cgccttggta gccatcttca gttccagtgt      840

ttgcttcaaa tactaagtat ttgtggcctt tatcttctac gtagtgagga tctctcagcg      900

tatggttgtc gcctgagctg tagttgcctt catcgatgaa ctgctgtaca ttttgatacg      960

tttttccgtc accgtcaaag attgatttat aatcctctac accgttgatg ttcaaagagc     1020

tgtctgatgc tgatacgtta acttgtgcag ttgtcagtgt ttgtttgccg taatgtttac     1080

cggagaaatc agtgtagaat aaacggattt ttccgtcaga tgtaaatgtg gctgaacctg     1140

accattcttg tgtttggtct tttaggatag aatcatttgc atcgaatttg tcgctgtctt     1200

taaagacgcg gccagcgttt ttccagctgt caatagaagt ttcgccgact ttttgataga     1260

acatgtaaat cgatgtgtca tccgcatttt taggatctcc ggctaatgca aagacgatgt     1320

ggtagccgtg atagtttgcg acagtgccgt cagcgttttg taatggccag ctgtcccaaa     1380

cgtccaggcc ttttgcagaa gagatatttt taattgtgga cgaatcgaac tcaggaactt     1440

gatatttttc atttttttgc tgttcaggga tttgcagcat atcatggcgt gtaatatggg     1500

aaatgccgta tgtttcctta tatggctttt ggttcgtttc tttcgcaaac gcttgagttg     1560

cgcctcctgc cagcagtgcg gtagtaaagg ttaatactgt tgcttgtttt gcaaactttt     1620

tgatgttcat cgttcatgtc tcctttttta tgtactgtgt tagcggtctg cttcttccag     1680

ccctcctgtt tgaagatggc aagttagtta cgcacaataa aaaaagacct aaaatatgta     1740

aggggtgacg ccaaagtata cactttgccc tttacacatt ttaggtcttg cctgctttat     1800

cagtaacaaa cccgcgcgat ttacttttcg acctcattct attagactct cgtttggatt     1860

gcaactggtc tattttcctc ttttgtttga tagaaaatca taaaaggatt tgcagactac     1920

gggcctaaag aactaaaaaa tctatctgtt tcttttcatt ctctgtattt tttatagttt     1980

ctgttgcatg ggcataaagt tgccttttta atcacaattc agaaaatatc ataatatctc     2040

atttcactaa ataatagtga acggcaggta tatgtgatgg gttaaaaagg atcgatcctc     2100

tagctagagt cgatcttcgc cagcagggcg aggatcgtgg catcaccgaa ccgcgccgtg     2160

cgcgggtcgt cggtgagcca gagtttcagc aggccgccca ggcggcccag gtcgccattg     2220

atgcgggcca gctcgcggac gtgctcatag tccacgacgc ccgtgatttt gtagccctgg     2280

ccgacggcca gcaggtaggc cgacaggctc atgccggccg ccgccgcctt ttcctcaatc     2340

gctcttcgtt cgtctggaag gcagtacacc ttgataggtg ggctgccctt cctggttggc     2400

ttggtttcat cagccatccg cttgccctca tctgttacgc cggcggtagc cggccagcct     2460

cgcagagcag gattcccgtt gagcaccgcc aggtgcgaat aagggacagt gaagaaggaa     2520

cacccgctcg cgggtgggcc tacttcacct atcctgcccg gctgacgccg ttggatacac     2580

caaggaaagt ctacacgaac cctttggcaa aatcctgtat atcgtgcgaa aaaggatgga     2640

tataccgaaa aaatcgctat aatgaccccg aagcagggtt atgcagcgga aaagcgctgc     2700

ttccctgctg ttttgtggaa tatctaccga ctggaaacag gcaaatgcag gaaattactg     2760

aactgagggg acaggcgaga gacgatgcca aagagctaca ccgacgagct ggccgagtgg     2820

gttgaatccc gcgcggccaa gaagcgccgg cgtgatgagg ctgcggttgc gttcctggcg     2880

gtgagggcgg atgtcgatat gcgtaaggag aaaataccgc atcaggcgca tgcatatttg     2940

aatgtattta gaaaaataaa caaaaagagt ttgtagaaac gcaaaaaggc catccgtcag     3000

gatggccttc tgcttaattt gatgcctggc agtttatggc gggcgtcctg cccgccaccc     3060

tccgggccgt tgcttcgcaa cgttcaaatc cgctcccggc ggatttgtcc tactcaggag     3120

agcgttcacc gacaaacaac agataaaacg aaaggcccag tctttcgact gagcctttcg     3180

ttttatttga tgcctggcag ttccctactc tcgcatgggg agaccccaca ctaccatcgg     3240

cgctacggcg tttcacttct gagttcggca tggggtcagg tgggaccacc gcgctactgc     3300

cgccaggcaa attctgtttt atcagaccgc ttctgcgttc tgatttaatc tgtatcaggc     3360

tgaaaatctt ctctcatccg ccaaaacagc caagctcgcc attcgccatt caggctgcgc     3420

aactgttggg aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg     3480

ggatgtgctg caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt     3540

aaaacgacgg ccagtgccaa gctcattacc ctgttatccc tactagcagg tagctgctta     3600

gttcaccgtt attccactcg gtaaaggtct gcgcaagttc ttcgttggtg aggttcagac     3660

cacctttaag cagagaatag gcttcagcaa tcagctgcat atcgccgtat tcaataccgt     3720

tatgaaccat cttcacatag tgacctgcgc catcggcacc aatataggta acgcatggct     3780

caccgtcttc agccactgcg gcaattttag tcaggatcgg cgcaacaagt tcataggctt     3840

ctttctgccc accaggcata atggaaggac ctttcagcgc accttcttca ccaccggaga     3900

caccggtacc gataaagtta aagccttcgg cagaaagctc acggttacga cgaatggtgt     3960

cctggaagaa ggtattacca ccatcaatga tgatgtcacc tttatcgagg tatggcttga     4020

gggaatcaat agcagaatca gtgcctgcac ctgctttcac cattaacagg atacgacgag     4080

gcatttccag agattcaaca aactctttca ccgtataata aggaaccagt ttcttgcctg     4140

gattttcggc aatcacttct tccgtctttt cgcgggaacg gttgaaaata gagacggtat     4200

aaccacggct ttcgatgttg agcgcaaggt tacgccccat cactgccata ccaactacgc     4260

cgatctgttg ctttgacatt gtttactcct gtcagagatt tgtaacttac ttaattatat     4320

taattgagca tgtaatcttc tgataaaaat catttgtacg atattttcag ttacatacta     4380

tgcgtatcgg gaattcaagc ttggatcccg ggtacctttc tgtcccatta ctgatgaaac     4440

acggcttatt tgtttatcgc cacaaagacc cgattgaatt acgcccaatt ttaggacaaa     4500

tggcagggct agcgctaatt accaatttat tgtttagctt aggaattttt ttaggttagt     4560

tgcaaattcc ggttaaaaaa agaccgcttg tttgagagtg ataatcgcaa acaagcggtc     4620

ttttttgatc aaaatattat tacacgtctt gagcgattgt gtaggctgga gctgcttcga     4680

agttcctata ctttctagag aataggaact tcggaatagg aacttcaaga tcccctcacg     4740

ctgccgcaag cactcagggc gcaagggctg ctaaaggaag cggaacacgt agaaagccag     4800

tccgcagaaa cggtgctgac cccggatgaa tgtcagctac tgggctatct ggacaaggga     4860

aaacgcaagc gcaaagagaa agcaggtagc ttgcagtggg cttacatggc gatagctaga     4920

ctgggcggtt ttatggacag caagcgaacc ggaattgcca gctggggcgc cctctggtaa     4980

ggttgggaag ccctgcaaag taaactggat ggctttcttg ccgccaagga tctgatggcg     5040

caggggatca agatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga     5100

tggattgcac gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc     5160

acaacagaca atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc     5220

ggttcttttt gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc     5280

gcggctatcg tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac     5340

tgaagcggga agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc     5400

tcaccttgct cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac     5460

gcttgatccg gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg     5520

tactcggatg gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct     5580

cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt     5640

cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg     5700

attcatcgac tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac     5760

ccgtgatatt gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg     5820

tatcgccgct cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg     5880

agcgggactc tggggttcga aatgaccgac caagcgacgc ccaacctgcc atcacgagat     5940

ttcgattcca ccgccgcctt ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc     6000

ggctggatga tcctccagcg cggggatctc atgctggagt tcttcgccca ccccagcttc     6060

aaaagcgctc tgaagttcct atactttcta gagaatagga acttcggaat aggaactaag     6120

gaggatattc atatggacct ttttcttctc agcttagttt gaatctaaag atttctcaca     6180

ttatctccca atctttgctc ttcatcaata atattttttg cttatctgta ggccttctaa     6240

tataatacat aagctaatat tggctattat tttgagagta atattatgct ggaacaagct     6300

agcaatcaaa ccaacgaaga aattgatctg attgagctga ttcgcgtgct gtggaagaag     6360

aagctgctga ttgccatcgt taccttcatc tttacagctt tagccgcagt ttatgccttt     6420

acagcaaaag agaaatggac cagtcaagct gaagtgattg ccccgcgcgt gaccgacatt     6480

agtgaatatt tatctttacg taaagaatac aatctgatta tgggcagtga atttaaagaa     6540

aatgatattc gcaatgaatt aaacgaatta ttcagccgct acgtgctgag ctatgatgaa     6600

gccatctcct tttttaaaac caccgatacc tacaagaagc tggccgagaa agaaaacgag     6660

gctggtctgc agcgcgcagc cagtgaattc accaccgaat ctttaaaggt gattaagccg     6720

gacgccaaga aagatttaaa tgctttaggc agcaagatcg ccattagcag cgaaaccgct     6780

ttaagtgccc agaccgagct gaacgacttt attcgccaca tcagcgatat tagcttcaat     6840

tttagcaaaa atgaatttat ttattgggtg aaggagagca tcagctcttt aaactatgaa     6900

aaagaagtga tcgaacaaga tcagagcatc cagcgcaaag ttcagatcca gaatctggag     6960

gccgcactgg acatggccaa aaaggccggc atcaaagagt acagcagcgc actgagcagc     7020

aacagcagcg tggcaaattt agcagtgagc gacaccaaga ttccgctgag cgacagtaaa     7080

ctggcagatg gcacctattt attcatgctg ggcgagaaga atttacaagc ccaactggat     7140

atcgccaaga ccaaagagat cgtgtacagc ccgcgctatt accagatcca agaacagctg     7200

ctgaagctga atactttact gccgaaagtt gagaaggtga ccggccagac atatagttac     7260

gtgagcagcc cgacctatcc ggtgatcaaa gacgccccga agaaaggcat cattctggtt     7320

atcggctttt tagtgggttt actgctgagc agctttaccg tgctgatcag cgtgctggtg     7380

cgcaacaaaa agagttaatt atgaggtatg aagaggaata ttaacctctt tatgtgcctt     7440

ttgatgatcc aatttactga agtgcttaga ctatttaggt aataaaatat agggataata     7500

atgaaacata agattttaca ttttagtcaa gttctgggcg gtgtgggccg ctatctggaa     7560

ctgtatgaca aatacatcaa caaagatagc tttgaaaaca tttatatttt accgattggt     7620

gattgggaag ccgcagaagc ccaagataaa cgttacattc tgaatattga acagagcttt     7680

agcccgatta aactgattag taatgttatt aagattcgta acattctgaa aaaagaaaaa     7740

ccggatatct tttatttaca cagcaccttc gccggtgtta ttggccgttt agcagccatt     7800

ggcatgcgct gcaaagtgat ctacaacccg catggctgga gcttcaagat gaatgtgagc     7860

cgtttaaagc agaccttcta taagatcatt gagggcggtt tagtgttttt aacagataaa     7920

ttcgtgctga tcagcaagag cgaatacgaa gcagcccgca gcatcggcgt tagcgagaaa     7980

aaatgctgtt taatttacaa tggcattgaa accattaaaa agaccgatat tgcaattatt     8040

ccgaagctgg atgataaata tatcattggc atgattggcc gcatcagcga gcagaaaaac     8100

ccgatgtttt tcgcccagtt cgccaaggag atcattaagc agtacccgaa cacctacttt     8160

attttagttg gcgatggtga gcagcgcgag agtctggagg attatttaga acgcaataac     8220

ttaaacgacg tgttttatat caccggctgg gtgaccaacc cggagagcta tctgaatctg     8280

ttcgaccaag ctgttctgtt cagtaaatgg gaaggtttat gtttaagcgt gtgcgagtat     8340

atgctgtacg agaagccgat tttagtgagc aacattggtg gcatcaacga tttaatccag     8400

aacgaggtta acggcttcac catcgttgag ggcgatttaa aggatgccgt gaacaagagt     8460

aaccgtttac gcaatgaacc gaaaaccgtg gccaagttca tcgaagccag caacatttta     8520

attcaagaaa agttcaacgc acagaaaatg gtgaatagct tagaaaaact gttcatcaaa     8580

ctgagcgaaa ataaataatg aaggaaaaat ttagttgcat cgtggtgtgt tacaacccgg     8640

acaatagcgt gctggataat ttaaaaaact atattagtta tgtgggtaaa gtgattgtgg     8700

tggataacag tgatgtggac aacagccagc tgttcagctc tttaagcgag tatctgatct     8760

acatcccgct gtacaaaaac gtgggcatcg cctatgcttt aaacatcggt gtggagaaga     8820

gcaaggaact gggttatgag tatattatta ccatggacca agatagcagc ttcgccacaa     8880

atctggtgga tgtgtacagc cattacatca gcaactaccc gatcgatcag attggcgctt     8940

taagcccggt gtatattacc gaccgcggtt tcaaccgtac cagcaaagaa gaatttaaac     9000

agattaagat caccatgcag agcggcagca tgttcttcac cgacaaattc gatgtgatcg     9060

gccgctttga caacgattta tttttagacg tggtggactg ggaatacttt ttccgcattt     9120

atactttagg ttataaaaca attcagtgca ataaagccat gctgaaacac gccccggccg     9180

aaactttaac tttatttaaa attaaaggta aaaccattgg tgtgggcgtg gcaagcccgc     9240

tgcgctatta ctatcagatt cgcaatctgc tgtggtgcgt gctgcacaag aaaagcttct     9300

tcatgattaa gaccattgcc tataagttta tcaagattct gtttctgttt aataataaaa     9360

aacagtattt aagcttcgca tacatggcca tcaaggacgc cttcaataac cgtttaggcg     9420

cctatgatac actgtatctg gagaaaagcc gtaatgaaaa atgatctgcc gctgatcagc     9480

atcatcatcc cgatctataa cgtgaaaccg tatttagaaa agtgcgtgaa cagcgtgctg     9540

agccagagct atccgaatct ggagattatt ctggtggatg acggtgccac cgatggcagc     9600

gcacaagttt gcgatgattt tagcgaaaaa tatgcaaata ttcaagttat tcataagaaa     9660

aatggtggtt taagcagcgc acgtaatgcc ggtattgagg ccatgaaagg cgagtacgtg     9720

ttctttctgg atagcgacga ctggatcgca aatgacgcca tcagccagct gtacgatgat     9780

atggtggagt acaacgccga catcaccggc atcagctttt accaagctta tagcgacggt     9840

aatttagtgc tgaacaccca tctgatcgag aagcagatgc tgagcaagaa agaggcactg     9900

cgtacctttt tattcaataa ttatttaacc ccgtgtagct gcggcaagct gtataaagcc     9960

tctttatgga aggacatccg ctttccggaa ggtcgtttat ttgaagatca gctgaccacc    10020

tataaagtta tcgaactggc caacaccatc atcttcaatc cggccgccaa atacttttat    10080

ttcaaacgta tcggcagcat cggccacagc gccttcagcg agaaaaccta tgatttatat    10140

gaggcagtga atgaacagta caacgagatc accaaacacc acccggatat cgagagtgat    10200

ctggccgtgg ccaaaattac ttgggaaatt gtgtttatta atatgatgct gaacagtaac    10260

tacagcgatc aagctatcgt ggacaaaacc cgcgtgtttg cacgcaaacg tattttagat    10320

gtggtgaagt gcgagttcat cccgaattta cgcaagtttc agatcacttt atttgcctac    10380

aatttcagtc tgtataaagt tctgtatgcc cgctataaga agaaaaatcc gctgagttaa    10440

ttattgattt tatcggttgt tacaatgatt ccgaagaaaa ttcattattg ctggttcggc    10500

ggcaatccgc tgccgaaaag tgtgaagaag tgcattaaaa gttggaaaaa atattgtccg    10560

gattatgaaa ttattgaatg gaatgagagc aattacaatg tgcataaaaa tttatttatt    10620

aaagaggcct acgagaagaa gaagttcgcc ttcgtgagcg attacgcccg tttagatgtg    10680

gtgcacagtg aaggtggcat ctatctggac accgatgtgg agctgatcaa accgatcgat    10740

gatttactgg cccatagctg ctttctggcc agcgaaagca tcgatgacgt gaataccggt    10800

ttaggctttg gtgccgaaaa aggccactgg ttcatcgccg agaacatgag cgtgtatgaa    10860

aatatgtact ttaatatgga aaatattatc acttgtgtgg agatcaccac caaactgctg    10920

atcgaacgcg gctttagcgc cagcgataaa attcagaata ttgatgatat ttttatttat    10980

ccgaccgaat atttttgccc gctgaactac aaaacccacg aactgcacat cacccagaac    11040

acctacagca tccaccacta tgatgccact tggcagagcc cgctgatgaa attcaaaacc    11100

aagatcaagt acattctgtg tttagccggc attattaaat gaattcttta gtgtatcgca    11160

ttgacatccg cactttaatt tttagcatct tttattttac ctttctggtg agcgattttt    11220

tactgctggc ccaagatggc acaatcacca aggacatcat caagtgggtg aagctgtttt    11280

ctttactgcc gctgctgctg ctgatcttca agctgccgct gaatttactg attctgggct    11340

tttttaccat tatgattagt gccttctaca gcatctatac cggtgacagc tttttactgt    11400

acatctgttt actgatgagc tttagctaca aagttaattt taatttttta tttaagattg    11460

gtttatatct gaccagtatt ttagtggtgc tgattctgac atattttttc tttgagtact    11520

ttctgatcgg cgacagccac tttgtgtacg acgccaccta ctggttcaaa cgctacacct    11580

ttaactttga taacccgaac gcatttccga tgcgcatctt tgtctttttt attttttaca    11640

ttctgcatgt gggcaaactg cgtttattcg acacctttct gttcgttatt ctgtttggta    11700

tcgtttttta ctttagcaac agccgtacag ccttttacat ttttattctg tgtgttctga    11760

ccattcattt taatcaagtg tttaatgttc tgaataatac ctttgttaaa ctgctgatta    11820

acaatagcat tatctttatc accattttta gcatttggag cgcaatctat tatcaagatt    11880

actattctta tctggaaccg attaataaaa ttttaagcaa acgtatttat tttgcaaacg    11940

aagcctataa gtctttaggc ttcgagttct acccgcgcaa tatcaagtgg tggatcgagg    12000

agagcgactg gcatattatc gacaatggct atgtttattt atttatcagc ggcggtttac    12060

tggtgggcaa cttatttatc ttttctatta cttggctgat gtatcgtctg aataaattta    12120

atttaagcaa cgaggccatt ttactgatgt ttagcatgct gtatttactg agcgagagcc    12180

attttatcaa tattttttat aacatcccga ttttactgct ggcaattttt attaataaaa    12240

ccaatattgt gcgttattta gagtgtaaaa aatgaataag aatctggtga ataatagcat    12300

tatgagcttt ttactgacca tcagcaactt catcttcccg ctgatcacct tcacctacgc    12360

cgcacgtatt ctgcagccgg acaacatggg taagttcgcc tttagcttaa gcgtggttga    12420

ttatttatct ttatttgcca cctttggtgt ggtgggttat ggcgtgcgtg cttgtgccga    12480

agttcgcaat aataaggaag aactgaccaa aacagtgcaa gaaattctgt ttattaatat    12540

ctttttagca attattgcat atctggttat ttttctgctg atcagttatc agcatgcctt    12600

tcgcgaggac acactgctgt ttctgatcat gagtagctgc atcattttca acgttatcgg    12660

cattgagtgg ctgtataaat ctttagatga gtaccgctac atcaccgtgc gcagcatttt    12720

actgaagatt attagcttaa tcatgattct gtgctttgtg aaggagaaag acgattatcc    12780

gctgttcgct ttattctttg tgctgccgat ctgtttaagc agcttactga acattattaa    12840

cagccgcaaa attttactgt ttaaactgtt taagctggac ttaagcaaac atattaaacc    12900

gatgtttgtg ctgtttctgg ttactttaag ttacacttta tacgccaacg tgaatgatgt    12960

gctgctggcc acagtgacca acaccgagca agttggctac tatagtgtgg catttaaaat    13020

caaggccgct ttactggcat ttatcaccag caccagcatg gtgtttctgc cgcgcttaac    13080

cgagtatatc aaaaacaatc aagatatcga atttattgat ctgttacgta aaagctttga    13140

tctggtgttc tttttagccg tgcctattac tttattcttt tttctgtatg ccaaagagac    13200

catcttttta ctgtttggtg aaaaatataa caagagctct ttactgctgc agaccatgat    13260

ctggagcgtt ttcttcggcg gtttaaataa cattttaagc gttcagatgc tgctgccgct    13320

gaaaaaggac aatcagtttt taatcagtat tttaagcggc ggctgcattt ctttagtggt    13380

gaatttcatc tttttacgcg aattacagag tctgagtacc agtatcagcg ttctggtggc    13440

cgaagtggtg attttaatca tccagctggt gattttacgc aagtacatcg ttcgtatctt    13500

caataattta aatccgctga aagttattat gagcgtcttt tttagcattt ggtttgtgaa    13560

tttaatctat gccaacttta tcgctttagg caacagcttt ctggagtata ttattagtat    13620

ttttatcttc tctttattct atgtgtttct gctgtttttc agcaaagaac gctttgtgca    13680

tgatgtgttc ttttatattc gcagcaaatt tgattaatct gctgatcagc attctggcca    13740

aaattttaag ccgcattagc aaactgattt taaatatcaa gaagcgtaaa gagtacaagc    13800

gcgtgggcag tattgtggat agcaaaaata ttgatttaag ctttatctgc ggcaattatt    13860

gccgcgtggg ccgtgatacc gtgattgaga agaacgtgat catgggccgt ttaagctaca    13920

tcaacagcga catgggcaag acctatatcg gcagcaatgt gaagatcggc tctttatgta    13980

gcatcagcag cggcgtgatc attgccccgg tgaaccacta tttaaactat gtgaccaccc    14040

acccgctgct gtataacagc tactacagca gcattctgaa catcaacagc aatttactga    14100

gccagcaaga actggacgca aacgtgagca ccgtgattgg caatgatgtg tggatcggtg    14160

ccaacgtgat catcaaacgc ggcgtgacca ttggcgatgg tgcagtgatc ggcgctggta    14220

gcattatcac caaagacatc ccgagttacg cagtggttgc cggcgtgccg gccaaaatca    14280

tcaaatatcg ctttagcaaa gatgtgatcg agtctttaaa ggacagcaag aatgtgtggg    14340

aactgagcac cagcgaactg gaggaaaact ttagccattt atacgacgtg gaaaaatatc    14400

tgaaccgttt taaactgtaa gattaatttt tagtctagga ttttagtatg agtaaaaaga    14460

acatcgtggc acagacttta ctgctgtgtc tggatttact gctgatcagc atggccatct    14520

ttctggcagt gtttattcgt aataatattt taccgaacat catgctgttc gagccggtga    14580

gctatatcga gtatttagtt tatcctttcc cgtatgttat tattgtgact ttattcatgt    14640

ggtttggttt atacacacgt cgctacgatc tgtggcaaga atctttattt atcatcaaag    14700

tgtgctttat tagttttatt atcatttttg caactttagc cttaggtaaa aatattgaat    14760

actacagccg tgccgtgctg ctgctgtctt tatttttaag cgtgattttt ctgccgatcg    14820

gccgttattt tctgaagaaa tctctgtttc gtttaggttt atgggaacgc aaagtgaaat    14880

tcattggcaa tctgaataaa aacgaaattg gcatcttcaa cagcccgcac gtgggttacg    14940

tgctgagcaa ggacgacacc tacgacgtga tcttcatcag cagcggcgat aagagcgtta    15000

gcgaactgaa cgatttaatc gagagcaata aactgctgaa ccgcgaggtt ctgttcatcc    15060

cggttctgaa ccagtatgac ttcacccaga gtgttctgta caataatttc agcacccgtt    15120

taaatttatt tacactggag aacaaattac tgggcaaaca gaataaaatt ttaaagtatt    15180

tactggacta cgttctggtg ctgagtacac tgccgttctg gggcggtctg attttactga    15240

ttagcattaa gctgaaactg gaagatccga aaggcaaaat tttcttttta cagaagcgtc    15300

tgggccaaga aggcaaaatc ttctattgtt ataagtttcg taccatggtg agcgaccaaa    15360

gcttcatgca gcagtggctg atcgataacc cggaggaacg cgactactat gccgtgtacc    15420

ataagtatat taatgacccg cgcatcacaa aattcggcca ttttctgcgc cgtaccagtc    15480

tggatgaact gccgcagctg tttaatgtgc tgaagggcga tatgtcttta gttggcaatc    15540

gcccgtatat ggttgaagag cagcagaaga tgaaggacgc cgccagcatc attctgatga    15600

gtaaaccggg cgttaccggt ttatggcaag ttagtggtcg tagcgatgtg agctttgaag    15660

agcgtttaca gattgacagc tggtatatca aaaattggag catttggaac gatattgtta    15720

ttctgttcaa gaccgtgggc gtggtgctgc gtaaagatgg cgccagttaa taataatgta    15780

attacattaa attattatag atagggatta ttatgaagaa aattctggtt actggtggcg    15840

ctggttttat tggcagtgcc gtggttcgcc atatcatcaa cgatacccaa gatagcgtgg    15900

tgaacgtgga taaactgaca tacgccggca atctggagtc tttactgatg gtggaaaata    15960

gcccgcgcta cgtgttcgaa caagttgaca tttgcaatcg cgccgaactg gaccgtgttt    16020

ttgcccagca tcagccggat gccgttatgc atctggccgc agaaagccac gttgatcgca    16080

gcatcgatgg cccggccgcc ttcatcgaga ccaatatcgt gggcacatat actttactgg    16140

aggccgcccg ctattattgg aatagtctgg acgccgacaa aaagtcttta tttcgcttcc    16200

accacattag caccgatgag gtgtatggcg atttagaagg caccgaggat ttatttaccg    16260

aaaccacccc gtatagcccg agcagcccgt acagcgcaag caaagcaagc agcgatcatc    16320

tggtgcgcgc ttggctgcgc acatatggtt taccgaccat cgtgaccaac tgcagcaaca    16380

actacggccc gttccatttt ccggagaaac tgatcccgct gatgatttta aatgctttag    16440

aaggtaaacc gctgccggtg tatggtaacg gccagcagat tcgtgattgg ctgttcgtgg    16500

aggatcacgc ccgtgcttta tataaggttg tgaccgaagg caaagtgggc gagacctaca    16560

atattggtgg ccacaacgag aaggccaaca tcgacgttgt gcgcacaatt tgctctttac    16620

tggaggaact ggttccgaat aaaccggccg gcgtgcataa gtatgaagat ttaatcacat    16680

atgtgaccga ccgccccggt cacgatgttc gttacgccat tgatgccacc aagatcggtc    16740

gcgaactggg ttggaaacct caagaaacct tcgaaaccgg catccgtaaa accgtggaat    16800

ggtatttaaa caataccgag tggtggagcc gtgtgctgga tggtagctac aatcgcgaac    16860

gtttaggcag caactaatat tattacaagc gatccaattt ttaataaggt ttacaatatg    16920

aaaggcatta ttctggccgg cggtagcggt acccgtttat atccgattac acgcggtgtg    16980

agcaaacagc tgctgccggt gtatgataag ccgatgatct attatccgtt aagcgtgctg    17040

atgctggccg gcatccgtga ggtgctgatt attaccaccc cggaggacaa cgagagcttt    17100

aaacgtctgc tgggcgatgg cagcgatttc ggcattcagc tgagttacgc cattcaaccg    17160

agcccggatg gtctggcaca agcttttctg atcggtgaag agttcatcgg ccaagatagc    17220

gtgtgtttag tgctgggcga caacattttt tacggtcagc atttcaccca gagtctgcaa    17280

gaggccgtta agagcgttga gaccaaaggt gccaccgtgt ttggctacca agttaaagat    17340

ccggaacgct ttggcgtggt ggagtttgac gataacttcc gcgctttaag tatcgaggag    17400

aaaccgatcc agcctaaaag caactgggcc gtgaccggtc tgtacttcta cgacaaccgt    17460

gtggtggaat tcgccaaaca agttaagccg agtgcacgcg gcgagttaga gattaccact    17520

ttaaacgaaa tgtatttaaa cgatggcagt ctgaacgtgc agctgctggg ccgcggtttt    17580

gcatggctgg ataccggtac ccacgatagt ctgcacgacg ccgcagcctt tgtgaaaacc    17640

gttcagaatt tacagaatct gcaagttgct tgtttagaag aaatcgccta tcgtaacggc    17700

tggctgagct tagagcagct ggaggcttta accaaaccga tggcaaagaa cgagtatggc    17760

cagtatctgc tgcgtttaac caaaggcacc aaataatggc acgtttttta atcaccggcg    17820

caaaaggcca agttggttat tgtttaacca agcagctgca gagcaaagcc gatgttctgg    17880

ccgtggatcg cgatgaactg gacatcacaa accgcgatgc cgtgtttaaa gtggtgcgcg    17940

aattccaccc ggacgtgatt atcaatgccg ccgcccatac cgcagtggat cgtgcagaaa    18000

gcgagatcga actgagcgaa gccatcaacg ttaagggtcc gcagtatctg gccgaggcag    18060

caaacgagat cgacgccatc attttacaca ttagcacaga ctacgtgttc gagggcaccg    18120

gcagcggcga atataaagag aatgatgaac cgaacccgca aggtgtgtac ggcaaaacca    18180

aactggccgg cgaaatcgca gttcagcaag ctaacaagcg ccatatcatt ctgcgcaccg    18240

cttgggtttt cggcgaacac ggcaacaact tcgtgaaaac aatgctgcgt ttagccaaag    18300

aacgcgagag cttaggcatt gtgagcgatc agttcggtgg tccgacctat gccggtgaca    18360

tcgccagctc tttaattcat attgccaaca tcatcttaaa cagtaaaatt gatgtgttcg    18420

gcgtgtacca tttcaccggt aagccgtatg tgagctgggc cgatttcgcc aaaaagatct    18480

tcgacgaggc cgttagccag aaggttctgg aaaaagcccc gctggtgaat ttcatcgcca    18540

ccagcaacta tccgaccagc gccaaacgcc cggcaaacag ccgtttagat ttaaccaaaa    18600

tcgacgaggt gtttggcatc aagccgagca attggcagca agctttaaag aatatcaaag    18660

cctatgccta atgaaaatta tcgaaaccaa catcccggat gtgaaactgc tggaaccgca    18720

agtttttggc gacgagcgcg gctttttcat ggagatcttc cgcgacgagt ggtttcgcca    18780

gtacgtggca gatcgcacct ttgttcaaga aaaccacagc aagagcatca agggtgtgct    18840

gcgcggtctg cactatcaga ccgaaaacac ccaaggtaaa ctggtgcgtg tggtgcaagg    18900

tagcgtgttt gacgtggccg tggatctgcg caaaagcagc ccgacctttg gtcagtgggt    18960

gggtgaagtg ctgagcgccg aaaataaacg tcagctgtgg gtgccggaag gcttcgccca    19020

tggtttctat gtgctgaccg agaccgcaga gtttacctac aagtgcaccg actactataa    19080

cccgaaagcc gagcattctt taatctggaa cgatccgacc gtggccatta actggaatct    19140

gggtggtgcc ccgtctttaa gtgccaaaga tctggccggc aaagtgctga acgaagcagt    19200

gctgtttgaa taataaattc tctatttact ttttatcttg actacgatat aattggatac    19260

ctttttttag ttctatgtcg ccaaaaattg tgtgcgactt tatttaaaca tatatttcct    19320

gaggtgatgg catttcatat ctcgagctca tatgctagcg tcggatatga atatcctcct    19380

tagttcctat tccgaagcag ctccagccaa ttgaccggta ataactgaaa cagtttcttt    19440

attcatttga aaatttctta tatgctttca ttaacattac ttaagcacgc taccgcccct    19500

ggcttaacag ctaccagtgc actaattaaa aagttatgtt gcaaagagca tactcagctc    19560

atgtaaaaac attatatcca gtattcatta tctgattatt caaaaggaga aacaatacct    19620

ctgaaaaaaa aatcgctagc atattgattt tatgcccatt acctctatga ttaattaaaa    19680

aactcactaa cagaaatatt cattatacta tacttctaca acaagaagtt tactgctact    19740

ataaatttct cggctgccgt tacgcccaat cagattttta ttcacttaaa attatctcaa    19800

tacatttgcg gaacttcgcc ccttctttca ggttgcgcaa tccatacttc acaaacgcct    19860

gcatatagcc cattttttta ccgcagtcgt agctgtctcc agtcatcagc atggcgtcaa    19920

ctgactgttt tttcgccagc ccggcaatgg catcagtcag ctgaatacgc ccccatgcac    19980

caggctgagt gcattcaagt tccggccaaa tatcggcaga aagcacatag cgaccaacgg    20040

ccatgatgtc tgagtccagc gtctgcggct gatccggttt ttcgataaat tcaacaatgc    20100

ggctgacttt gccttcacga tccagcggct ctttggtctg aataacggag tattcagaaa    20160

ggtcacccgg catacgtttt gccagcacct ggctacggcc cgtttcattg aaacgcgcaa    20220

tcatggcagc aaggttgtag cgcagcgggt cggcactggc gtcgtcgatc acaacatctg    20280

gcagtaccac gacaaatggg ttgtcaccaa tggcgggtcg tgcacacaaa atggagtgac    20340

ctaaacctaa aggttcgccc tgacgcacgt tcataatagt cacgcccggc gggcagatag    20400

attgcacttc cgccagtagt tgacgcttca cgcgctgctc aaggagagat tctaattcat    20460

aagaggtgtc gaagtggttt tcgaccgcgt tcttggacgc gtgagttacc aggaggattt    20520

ctttgatccc tgcagccaca atctcgtcaa caatgtactg aatcattggc ttgtcgacga    20580

tcggtagcat ctctttggga atcgccttag tggcaggcaa catatgcatc ccaagacctg    20640

ctaccggtat aactgctttt aaattcgtca tatcgattac cctgttatcc ctagagcttg    20700

gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac    20760

aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc    20820

acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg    20880

cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct    20940

tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac    21000

tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga    21060

gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat    21120

aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac    21180

ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct    21240

gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg    21300

ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg    21360

ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt    21420

cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg    21480

attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac    21540

ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga    21600

aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt    21660

gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt    21720

tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga    21780

ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc    21840

taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct    21900

atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata    21960

actacgatac gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca    22020

cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga    22080

agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga    22140

gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg    22200

gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga    22260

gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt    22320

gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct    22380

cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca    22440

ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat    22500

accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga    22560

aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc    22620

aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg    22680

caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc    22740

ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt    22800

gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca    22860

cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg    22920

aggccctttc gtc                                                       22933


<210>  44
<211>  23019
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pDOC_SL1344_Drfb_KanR::APP2 LPS(cod.opt.)

<400>  44
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt       60

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt      120

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat      180

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt      240

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg      300

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga      360

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc      420

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac      480

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg      540

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca      600

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg      660

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg      720

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg      780

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag      840

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg      900

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct      960

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac     1020

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact     1080

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     1140

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     1200

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     1260

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     1320

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc     1380

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     1440

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     1500

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     1560

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     1620

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     1680

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     1740

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     1800

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     1860

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     1920

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     1980

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     2040

cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca     2100

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc     2160

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg     2220

accatgatta cgccaagctc tagggataac agggtaatcg atatgttatg gtgccgacta     2280

agacgtaatg tagagcgtgc catcattatc cctggcagca gagtaattca tgctggcgaa     2340

aacaagctaa agagctataa ttcagcaacc attttacagg tggaagaaac aatgatgaat     2400

ttgaaagcag ttataccggt agcgggtttg ggtatgcata tgttgcctgc caccaaggca     2460

atcccaaaag agatgctacc gatcgtcgac aagccaatga ttcagtacat tgtcgatgag     2520

attgtggctg cagggatcaa agaaatcgtg ctggtgactc acgcgtctaa aaacgccgtt     2580

gagaaccact tcgacacctc ttatgaactt gaatcacttc ttgagcagcg cgttaagcgt     2640

cagcttttgg cggaagtgca atctatctgc ccaccgggcg tgacgattat gaacgttcgc     2700

caggcgcagc cgttagggct ggggcattct attctgtgcg cgcgtccggt cgtgggcgat     2760

aaccctttca ttgtggtact cccggatatt attatcgatg atgctaccgc cgatccgctg     2820

cgctataacc ttgcggcgat ggtggcgcgt ttcaatgaaa cgggtcgcag ccaggtgctg     2880

gcgaagcgca tgaaaggtga tttatcggag tattccgtta tccagacgaa agaacctctg     2940

gataatgaag gcaaagtcag ccggattgtg gagtttatcg aaaaaccgga tcagccgcag     3000

acgctggatt ccgatttgat ggcggtaggc cgttatgtgc tttcagccga catctgggcg     3060

gaactggaaa gaaccgaacc gggcgcctgg ggccgcatcc agctcaccga tgccattgct     3120

gaactggcga aaaaacagtc ggttgacgcg atgctaatga cgggtgacag ctatgactgc     3180

ggtaaaaaaa tgggctacat gcaggcattt gtgaagtacg ggctgcgcaa cctgaaagaa     3240

ggagccaagt tccgtaagag catagagcag cttttgcatg aataagtatt aacaaccgtg     3300

ataaatggtt ggtgataaac ataataacgg cagtgaacat tcgaagcggc aagttggctg     3360

aaacgagtgt tgactgccgt tttagttttg tataaagggc ttaagtaaca aggggttatc     3420

tggagcattt taatgctgat tttataagat taatccttgt ttccggatgc aattaataag     3480

acaattagcg tttaagtttt agtgagcttt gccctgctgg gcgaggtttg caacaagtcg     3540

atatgtacgc agtgcactgg tagctgatga gccaggggcg gtagcgtgtg taacgacttg     3600

agcaattaat ttttattggc aaattaaata ccacattaaa tacgccttat ggaatagaaa     3660

aaccggtcaa ttgctcgaga tatgaaatgc catcacctca ggaaatatat gtttaaataa     3720

agtcgcacac aatttttggc gacatagaac taaaaaaagg tatccaatta tatcgtagtc     3780

aagataaaaa gtaaatagag aatttattat tcaaacagca ctgcttcgtt cagcactttg     3840

ccggccagat ctttggcact taaagacggg gcaccaccca gattccagtt aatggccacg     3900

gtcggatcgt tccagattaa agaatgctcg gctttcgggt tatagtagtc ggtgcacttg     3960

taggtaaact ctgcggtctc ggtcagcaca tagaaaccat gggcgaagcc ttccggcacc     4020

cacagctgac gtttattttc ggcgctcagc acttcaccca cccactgacc aaaggtcggg     4080

ctgcttttgc gcagatccac ggccacgtca aacacgctac cttgcaccac acgcaccagt     4140

ttaccttggg tgttttcggt ctgatagtgc agaccgcgca gcacaccctt gatgctcttg     4200

ctgtggtttt cttgaacaaa ggtgcgatct gccacgtact ggcgaaacca ctcgtcgcgg     4260

aagatctcca tgaaaaagcc gcgctcgtcg ccaaaaactt gcggttccag cagtttcaca     4320

tccgggatgt tggtttcgat aattttcatt aggcataggc tttgatattc tttaaagctt     4380

gctgccaatt gctcggcttg atgccaaaca cctcgtcgat tttggttaaa tctaaacggc     4440

tgtttgccgg gcgtttggcg ctggtcggat agttgctggt ggcgatgaaa ttcaccagcg     4500

gggctttttc cagaaccttc tggctaacgg cctcgtcgaa gatctttttg gcgaaatcgg     4560

cccagctcac atacggctta ccggtgaaat ggtacacgcc gaacacatca attttactgt     4620

ttaagatgat gttggcaata tgaattaaag agctggcgat gtcaccggca taggtcggac     4680

caccgaactg atcgctcaca atgcctaagc tctcgcgttc tttggctaaa cgcagcattg     4740

ttttcacgaa gttgttgccg tgttcgccga aaacccaagc ggtgcgcaga atgatatggc     4800

gcttgttagc ttgctgaact gcgatttcgc cggccagttt ggttttgccg tacacacctt     4860

gcgggttcgg ttcatcattc tctttatatt cgccgctgcc ggtgccctcg aacacgtagt     4920

ctgtgctaat gtgtaaaatg atggcgtcga tctcgtttgc tgcctcggcc agatactgcg     4980

gacccttaac gttgatggct tcgctcagtt cgatctcgct ttctgcacga tccactgcgg     5040

tatgggcggc ggcattgata atcacgtccg ggtggaattc gcgcaccact ttaaacacgg     5100

catcgcggtt tgtgatgtcc agttcatcgc gatccacggc cagaacatcg gctttgctct     5160

gcagctgctt ggttaaacaa taaccaactt ggccttttgc gccggtgatt aaaaaacgtg     5220

ccattatttg gtgcctttgg ttaaacgcag cagatactgg ccatactcgt tctttgccat     5280

cggtttggtt aaagcctcca gctgctctaa gctcagccag ccgttacgat aggcgatttc     5340

ttctaaacaa gcaacttgca gattctgtaa attctgaacg gttttcacaa aggctgcggc     5400

gtcgtgcaga ctatcgtggg taccggtatc cagccatgca aaaccgcggc ccagcagctg     5460

cacgttcaga ctgccatcgt ttaaatacat ttcgtttaaa gtggtaatct ctaactcgcc     5520

gcgtgcactc ggcttaactt gtttggcgaa ttccaccaca cggttgtcgt agaagtacag     5580

accggtcacg gcccagttgc ttttaggctg gatcggtttc tcctcgatac ttaaagcgcg     5640

gaagttatcg tcaaactcca ccacgccaaa gcgttccgga tctttaactt ggtagccaaa     5700

cacggtggca cctttggtct caacgctctt aacggcctct tgcagactct gggtgaaatg     5760

ctgaccgtaa aaaatgttgt cgcccagcac taaacacacg ctatcttggc cgatgaactc     5820

ttcaccgatc agaaaagctt gtgccagacc atccgggctc ggttgaatgg cgtaactcag     5880

ctgaatgccg aaatcgctgc catcgcccag cagacgttta aagctctcgt tgtcctccgg     5940

ggtggtaata atcagcacct cacggatgcc ggccagcatc agcacgctta acggataata     6000

gatcatcggc ttatcataca ccggcagcag ctgtttgctc acaccgcgtg taatcggata     6060

taaacgggta ccgctaccgc cggccagaat aatgcctttc atattgtaaa ccttattaaa     6120

aattggatcg cttgtaataa tattagttgc tgcctaaacg ttcgcgattg tagctaccat     6180

ccagcacacg gctccaccac tcggtattgt ttaaatacca ttccacggtt ttacggatgc     6240

cggtttcgaa ggtttcttga ggtttccaac ccagttcgcg accgatcttg gtggcatcaa     6300

tggcgtaacg aacatcgtga ccggggcggt cggtcacata tgtgattaaa tcttcatact     6360

tatgcacgcc ggccggttta ttcggaacca gttcctccag taaagagcaa attgtgcgca     6420

caacgtcgat gttggccttc tcgttgtggc caccaatatt gtaggtctcg cccactttgc     6480

cttcggtcac aaccttatat aaagcacggg cgtgatcctc cacgaacagc caatcacgaa     6540

tctgctggcc gttaccatac accggcagcg gtttaccttc taaagcattt aaaatcatca     6600

gcgggatcag tttctccgga aaatggaacg ggccgtagtt gttgctgcag ttggtcacga     6660

tggtcggtaa accatatgtg cgcagccaag cgcgcaccag atgatcgctg cttgctttgc     6720

ttgcgctgta cgggctgctc gggctatacg gggtggtttc ggtaaataaa tcctcggtgc     6780

cttctaaatc gccatacacc tcatcggtgc taatgtggtg gaagcgaaat aaagactttt     6840

tgtcggcgtc cagactattc caataatagc gggcggcctc cagtaaagta tatgtgccca     6900

cgatattggt ctcgatgaag gcggccgggc catcgatgct gcgatcaacg tggctttctg     6960

cggccagatg cataacggca tccggctgat gctgggcaaa aacacggtcc agttcggcgc     7020

gattgcaaat gtcaacttgt tcgaacacgt agcgcgggct attttccacc atcagtaaag     7080

actccagatt gccggcgtat gtcagtttat ccacgttcac cacgctatct tgggtatcgt     7140

tgatgatatg gcgaaccacg gcactgccaa taaaaccagc gccaccagta accagaattt     7200

tcttcataat aatccctatc tataataatt taatgtaatt acattattat taactggcgc     7260

catctttacg cagcaccacg cccacggtct tgaacagaat aacaatatcg ttccaaatgc     7320

tccaattttt gatataccag ctgtcaatct gtaaacgctc ttcaaagctc acatcgctac     7380

gaccactaac ttgccataaa ccggtaacgc ccggtttact catcagaatg atgctggcgg     7440

cgtccttcat cttctgctgc tcttcaacca tatacgggcg attgccaact aaagacatat     7500

cgcccttcag cacattaaac agctgcggca gttcatccag actggtacgg cgcagaaaat     7560

ggccgaattt tgtgatgcgc gggtcattaa tatacttatg gtacacggca tagtagtcgc     7620

gttcctccgg gttatcgatc agccactgct gcatgaagct ttggtcgctc accatggtac     7680

gaaacttata acaatagaag attttgcctt cttggcccag acgcttctgt aaaaagaaaa     7740

ttttgccttt cggatcttcc agtttcagct taatgctaat cagtaaaatc agaccgcccc     7800

agaacggcag tgtactcagc accagaacgt agtccagtaa atactttaaa attttattct     7860

gtttgcccag taatttgttc tccagtgtaa ataaatttaa acgggtgctg aaattattgt     7920

acagaacact ctgggtgaag tcatactggt tcagaaccgg gatgaacaga acctcgcggt     7980

tcagcagttt attgctctcg attaaatcgt tcagttcgct aacgctctta tcgccgctgc     8040

tgatgaagat cacgtcgtag gtgtcgtcct tgctcagcac gtaacccacg tgcgggctgt     8100

tgaagatgcc aatttcgttt ttattcagat tgccaatgaa tttcactttg cgttcccata     8160

aacctaaacg aaacagagat ttcttcagaa aataacggcc gatcggcaga aaaatcacgc     8220

ttaaaaataa agacagcagc agcacggcac ggctgtagta ttcaatattt ttacctaagg     8280

ctaaagttgc aaaaatgata ataaaactaa taaagcacac tttgatgata aataaagatt     8340

cttgccacag atcgtagcga cgtgtgtata aaccaaacca catgaataaa gtcacaataa     8400

taacatacgg gaaaggataa actaaatact cgatatagct caccggctcg aacagcatga     8460

tgttcggtaa aatattatta cgaataaaca ctgccagaaa gatggccatg ctgatcagca     8520

gtaaatccag acacagcagt aaagtctgtg ccacgatgtt ctttttactc atactaaaat     8580

cctagactaa aaattaatct tacagtttaa aacggttcag atatttttcc acgtcgtata     8640

aatggctaaa gttttcctcc agttcgctgg tgctcagttc ccacacattc ttgctgtcct     8700

ttaaagactc gatcacatct ttgctaaagc gatatttgat gattttggcc ggcacgccgg     8760

caaccactgc gtaactcggg atgtctttgg tgataatgct accagcgccg atcactgcac     8820

catcgccaat ggtcacgccg cgtttgatga tcacgttggc accgatccac acatcattgc     8880

caatcacggt gctcacgttt gcgtccagtt cttgctggct cagtaaattg ctgttgatgt     8940

tcagaatgct gctgtagtag ctgttataca gcagcgggtg ggtggtcaca tagtttaaat     9000

agtggttcac cggggcaatg atcacgccgc tgctgatgct acataaagag ccgatcttca     9060

cattgctgcc gatataggtc ttgcccatgt cgctgttgat gtagcttaaa cggcccatga     9120

tcacgttctt ctcaatcacg gtatcacggc ccacgcggca ataattgccg cagataaagc     9180

ttaaatcaat atttttgcta tccacaatac tgcccacgcg cttgtactct ttacgcttct     9240

tgatatttaa aatcagtttg ctaatgcggc ttaaaatttt ggccagaatg ctgatcagca     9300

gattaatcaa atttgctgcg aatataaaag aacacatcat gcacaaagcg ttctttgctg     9360

aaaaacagca gaaacacata gaataaagag aagataaaaa tactaataat atactccaga     9420

aagctgttgc ctaaagcgat aaagttggca tagattaaat tcacaaacca aatgctaaaa     9480

aagacgctca taataacttt cagcggattt aaattattga agatacgaac gatgtacttg     9540

cgtaaaatca ccagctggat gattaaaatc accacttcgg ccaccagaac gctgatactg     9600

gtactcagac tctgtaattc gcgtaaaaag atgaaattca ccactaaaga aatgcagccg     9660

ccgcttaaaa tactgattaa aaactgattg tcctttttca gcggcagcag catctgaacg     9720

cttaaaatgt tatttaaacc gccgaagaaa acgctccaga tcatggtctg cagcagtaaa     9780

gagctcttgt tatatttttc accaaacagt aaaaagatgg tctctttggc atacagaaaa     9840

aagaataaag taataggcac ggctaaaaag aacaccagat caaagctttt acgtaacaga     9900

tcaataaatt cgatatcttg attgtttttg atatactcgg ttaagcgcgg cagaaacacc     9960

atgctggtgc tggtgataaa tgccagtaaa gcggccttga ttttaaatgc cacactatag    10020

tagccaactt gctcggtgtt ggtcactgtg gccagcagca catcattcac gttggcgtat    10080

aaagtgtaac ttaaagtaac cagaaacagc acaaacatcg gtttaatatg tttgcttaag    10140

tccagcttaa acagtttaaa cagtaaaatt ttgcggctgt taataatgtt cagtaagctg    10200

cttaaacaga tcggcagcac aaagaataaa gcgaacagcg gataatcgtc tttctccttc    10260

acaaagcaca gaatcatgat taagctaata atcttcagta aaatgctgcg cacggtgatg    10320

tagcggtact catctaaaga tttatacagc cactcaatgc cgataacgtt gaaaatgatg    10380

cagctactca tgatcagaaa cagcagtgtg tcctcgcgaa aggcatgctg ataactgatc    10440

agcagaaaaa taaccagata tgcaataatt gctaaaaaga tattaataaa cagaatttct    10500

tgcactgttt tggtcagttc ttccttatta ttgcgaactt cggcacaagc acgcacgcca    10560

taacccacca caccaaaggt ggcaaataaa gataaataat caaccacgct taagctaaag    10620

gcgaacttac ccatgttgtc cggctgcaga atacgtgcgg cgtaggtgaa ggtgatcagc    10680

gggaagatga agttgctgat ggtcagtaaa aagctcataa tgctattatt caccagattc    10740

ttattcattt tttacactct aaataacgca caatattggt tttattaata aaaattgcca    10800

gcagtaaaat cgggatgtta taaaaaatat tgataaaatg gctctcgctc agtaaataca    10860

gcatgctaaa catcagtaaa atggcctcgt tgcttaaatt aaatttattc agacgataca    10920

tcagccaagt aatagaaaag ataaataagt tgcccaccag taaaccgccg ctgataaata    10980

aataaacata gccattgtcg ataatatgcc agtcgctctc ctcgatccac cacttgatat    11040

tgcgcgggta gaactcgaag cctaaagact tataggcttc gtttgcaaaa taaatacgtt    11100

tgcttaaaat tttattaatc ggttccagat aagaatagta atcttgataa tagattgcgc    11160

tccaaatgct aaaaatggtg ataaagataa tgctattgtt aatcagcagt ttaacaaagg    11220

tattattcag aacattaaac acttgattaa aatgaatggt cagaacacac agaataaaaa    11280

tgtaaaaggc tgtacggctg ttgctaaagt aaaaaacgat accaaacaga ataacgaaca    11340

gaaaggtgtc gaataaacgc agtttgccca catgcagaat gtaaaaaata aaaaagacaa    11400

agatgcgcat cggaaatgcg ttcgggttat caaagttaaa ggtgtagcgt ttgaaccagt    11460

aggtggcgtc gtacacaaag tggctgtcgc cgatcagaaa gtactcaaag aaaaaatatg    11520

tcagaatcag caccactaaa atactggtca gatataaacc aatcttaaat aaaaaattaa    11580

aattaacttt gtagctaaag ctcatcagta aacagatgta cagtaaaaag ctgtcaccgg    11640

tatagatgct gtagaaggca ctaatcataa tggtaaaaaa gcccagaatc agtaaattca    11700

gcggcagctt gaagatcagc agcagcagcg gcagtaaaga aaacagcttc acccacttga    11760

tgatgtcctt ggtgattgtg ccatcttggg ccagcagtaa aaaatcgctc accagaaagg    11820

taaaataaaa gatgctaaaa attaaagtgc ggatgtcaat gcgatacact aaagaattca    11880

tttaataatg ccggctaaac acagaatgta cttgatcttg gttttgaatt tcatcagcgg    11940

gctctgccaa gtggcatcat agtggtggat gctgtaggtg ttctgggtga tgtgcagttc    12000

gtgggttttg tagttcagcg ggcaaaaata ttcggtcgga taaataaaaa tatcatcaat    12060

attctgaatt ttatcgctgg cgctaaagcc gcgttcgatc agcagtttgg tggtgatctc    12120

cacacaagtg ataatatttt ccatattaaa gtacatattt tcatacacgc tcatgttctc    12180

ggcgatgaac cagtggcctt tttcggcacc aaagcctaaa ccggtattca cgtcatcgat    12240

gctttcgctg gccagaaagc agctatgggc cagtaaatca tcgatcggtt tgatcagctc    12300

cacatcggtg tccagataga tgccaccttc actgtgcacc acatctaaac gggcgtaatc    12360

gctcacgaag gcgaacttct tcttctcgta ggcctcttta ataaataaat ttttatgcac    12420

attgtaattg ctctcattcc attcaataat ttcataatcc ggacaatatt ttttccaact    12480

tttaatgcac ttcttcacac ttttcggcag cggattgccg ccgaaccagc aataatgaat    12540

tttcttcgga atcattgtaa caaccgataa aatcaataat taactcagcg gatttttctt    12600

cttatagcgg gcatacagaa ctttatacag actgaaattg taggcaaata aagtgatctg    12660

aaacttgcgt aaattcggga tgaactcgca cttcaccaca tctaaaatac gtttgcgtgc    12720

aaacacgcgg gttttgtcca cgatagcttg atcgctgtag ttactgttca gcatcatatt    12780

aataaacaca atttcccaag taattttggc cacggccaga tcactctcga tatccgggtg    12840

gtgtttggtg atctcgttgt actgttcatt cactgcctca tataaatcat aggttttctc    12900

gctgaaggcg ctgtggccga tgctgccgat acgtttgaaa taaaagtatt tggcggccgg    12960

attgaagatg atggtgttgg ccagttcgat aactttatag gtggtcagct gatcttcaaa    13020

taaacgacct tccggaaagc ggatgtcctt ccataaagag gctttataca gcttgccgca    13080

gctacacggg gttaaataat tattgaataa aaaggtacgc agtgcctctt tcttgctcag    13140

catctgcttc tcgatcagat gggtgttcag cactaaatta ccgtcgctat aagcttggta    13200

aaagctgatg ccggtgatgt cggcgttgta ctccaccata tcatcgtaca gctggctgat    13260

ggcgtcattt gcgatccagt cgtcgctatc cagaaagaac acgtactcgc ctttcatggc    13320

ctcaataccg gcattacgtg cgctgcttaa accaccattt ttcttatgaa taacttgaat    13380

atttgcatat ttttcgctaa aatcatcgca aacttgtgcg ctgccatcgg tggcaccgtc    13440

atccaccaga ataatctcca gattcggata gctctggctc agcacgctgt tcacgcactt    13500

ttctaaatac ggtttcacgt tatagatcgg gatgatgatg ctgatcagcg gcagatcatt    13560

tttcattacg gcttttctcc agatacagtg tatcataggc gcctaaacgg ttattgaagg    13620

cgtccttgat ggccatgtat gcgaagctta aatactgttt tttattatta aacagaaaca    13680

gaatcttgat aaacttatag gcaatggtct taatcatgaa gaagcttttc ttgtgcagca    13740

cgcaccacag cagattgcga atctgatagt aatagcgcag cgggcttgcc acgcccacac    13800

caatggtttt acctttaatt ttaaataaag ttaaagtttc ggccggggcg tgtttcagca    13860

tggctttatt gcactgaatt gttttataac ctaaagtata aatgcggaaa aagtattccc    13920

agtccaccac gtctaaaaat aaatcgttgt caaagcggcc gatcacatcg aatttgtcgg    13980

tgaagaacat gctgccgctc tgcatggtga tcttaatctg tttaaattct tctttgctgg    14040

tacggttgaa accgcggtcg gtaatataca ccgggcttaa agcgccaatc tgatcgatcg    14100

ggtagttgct gatgtaatgg ctgtacacat ccaccagatt tgtggcgaag ctgctatctt    14160

ggtccatggt aataatatac tcataaccca gttccttgct cttctccaca ccgatgttta    14220

aagcataggc gatgcccacg tttttgtaca gcgggatgta gatcagatac tcgcttaaag    14280

agctgaacag ctggctgttg tccacatcac tgttatccac cacaatcact ttacccacat    14340

aactaatata gttttttaaa ttatccagca cgctattgtc cgggttgtaa cacaccacga    14400

tgcaactaaa tttttccttc attatttatt ttcgctcagt ttgatgaaca gtttttctaa    14460

gctattcacc attttctgtg cgttgaactt ttcttgaatt aaaatgttgc tggcttcgat    14520

gaacttggcc acggttttcg gttcattgcg taaacggtta ctcttgttca cggcatcctt    14580

taaatcgccc tcaacgatgg tgaagccgtt aacctcgttc tggattaaat cgttgatgcc    14640

accaatgttg ctcactaaaa tcggcttctc gtacagcata tactcgcaca cgcttaaaca    14700

taaaccttcc catttactga acagaacagc ttggtcgaac agattcagat agctctccgg    14760

gttggtcacc cagccggtga tataaaacac gtcgtttaag ttattgcgtt ctaaataatc    14820

ctccagactc tcgcgctgct caccatcgcc aactaaaata aagtaggtgt tcgggtactg    14880

cttaatgatc tccttggcga actgggcgaa aaacatcggg tttttctgct cgctgatgcg    14940

gccaatcatg ccaatgatat atttatcatc cagcttcgga ataattgcaa tatcggtctt    15000

tttaatggtt tcaatgccat tgtaaattaa acagcatttt ttctcgctaa cgccgatgct    15060

gcgggctgct tcgtattcgc tcttgctgat cagcacgaat ttatctgtta aaaacactaa    15120

accgccctca atgatcttat agaaggtctg ctttaaacgg ctcacattca tcttgaagct    15180

ccagccatgc gggttgtaga tcactttgca gcgcatgcca atggctgcta aacggccaat    15240

aacaccggcg aaggtgctgt gtaaataaaa gatatccggt ttttcttttt tcagaatgtt    15300

acgaatctta ataacattac taatcagttt aatcgggcta aagctctgtt caatattcag    15360

aatgtaacgt ttatcttggg cttctgcggc ttcccaatca ccaatcggta aaatataaat    15420

gttttcaaag ctatctttgt tgatgtattt gtcatacagt tccagatagc ggcccacacc    15480

gcccagaact tgactaaaat gtaaaatctt atgtttcatt attatcccta tattttatta    15540

cctaaatagt ctaagcactt cagtaaattg gatcatcaaa aggcacataa agaggttaat    15600

attcctcttc atacctcata attaactctt tttgttgcgc accagcacgc tgatcagcac    15660

ggtaaagctg ctcagcagta aacccactaa aaagccgata accagaatga tgcctttctt    15720

cggggcgtct ttgatcaccg gataggtcgg gctgctcacg taactatatg tctggccggt    15780

caccttctca actttcggca gtaaagtatt cagcttcagc agctgttctt ggatctggta    15840

atagcgcggg ctgtacacga tctctttggt cttggcgata tccagttggg cttgtaaatt    15900

cttctcgccc agcatgaata aataggtgcc atctgccagt ttactgtcgc tcagcggaat    15960

cttggtgtcg ctcactgcta aatttgccac gctgctgttg ctgctcagtg cgctgctgta    16020

ctctttgatg ccggcctttt tggccatgtc cagtgcggcc tccagattct ggatctgaac    16080

tttgcgctgg atgctctgat cttgttcgat cacttctttt tcatagttta aagagctgat    16140

gctctccttc acccaataaa taaattcatt tttgctaaaa ttgaagctaa tatcgctgat    16200

gtggcgaata aagtcgttca gctcggtctg ggcacttaaa gcggtttcgc tgctaatggc    16260

gatcttgctg cctaaagcat ttaaatcttt cttggcgtcc ggcttaatca cctttaaaga    16320

ttcggtggtg aattcactgg ctgcgcgctg cagaccagcc tcgttttctt tctcggccag    16380

cttcttgtag gtatcggtgg ttttaaaaaa ggagatggct tcatcatagc tcagcacgta    16440

gcggctgaat aattcgttta attcattgcg aatatcattt tctttaaatt cactgcccat    16500

aatcagattg tattctttac gtaaagataa atattcacta atgtcggtca cgcgcggggc    16560

aatcacttca gcttgactgg tccatttctc ttttgctgta aaggcataaa ctgcggctaa    16620

agctgtaaag atgaaggtaa cgatggcaat cagcagcttc ttcttccaca gcacgcgaat    16680

cagctcaatc agatcaattt cttcgttggt ttgattgcta gcttgttcca gcataatatt    16740

actctcaaaa taatagccaa tattagctta tgtattatat tagaaggcct acagataagc    16800

aaaaaatatt attgatgaag agcaaagatt gggagataat gtgagaaatc tttagattca    16860

aactaagctg agaagaaaaa ggtccatatg aatatcctcc ttagttccta ttccgaagtt    16920

cctattctct agaaagtata ggaacttcag agcgcttttg aagctggggt gggcgaagaa    16980

ctccagcatg agatccccgc gctggaggat catccagccg gcgtcccgga aaacgattcc    17040

gaagcccaac ctttcataga aggcggcggt ggaatcgaaa tctcgtgatg gcaggttggg    17100

cgtcgcttgg tcggtcattt cgaaccccag agtcccgctc agaagaactc gtcaagaagg    17160

cgatagaagg cgatgcgctg cgaatcggga gcggcgatac cgtaaagcac gaggaagcgg    17220

tcagcccatt cgccgccaag ctcttcagca atatcacggg tagccaacgc tatgtcctga    17280

tagcggtccg ccacacccag ccggccacag tcgatgaatc cagaaaagcg gccattttcc    17340

accatgatat tcggcaagca ggcatcgcca tgggtcacga cgagatcctc gccgtcgggc    17400

atgcgcgcct tgagcctggc gaacagttcg gctggcgcga gcccctgatg ctcttcgtcc    17460

agatcatcct gatcgacaag accggcttcc atccgagtac gtgctcgctc gatgcgatgt    17520

ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg tatgcagccg ccgcattgca    17580

tcagccatga tggatacttt ctcggcagga gcaaggtgag atgacaggag atcctgcccc    17640

ggcacttcgc ccaatagcag ccagtccctt cccgcttcag tgacaacgtc gagcacagct    17700

gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg ctgcctcgtc ctgcagttca    17760

ttcagggcac cggacaggtc ggtcttgaca aaaagaaccg ggcgcccctg cgctgacagc    17820

cggaacacgg cggcatcaga gcagccgatt gtctgttgtg cccagtcata gccgaatagc    17880

ctctccaccc aagcggccgg agaacctgcg tgcaatccat cttgttcaat catgcgaaac    17940

gatcctcatc ctgtctcttg atcagatctt gatcccctgc gccatcagat ccttggcggc    18000

aagaaagcca tccagtttac tttgcagggc ttcccaacct taccagaggg cgccccagct    18060

ggcaattccg gttcgcttgc tgtccataaa accgcccagt ctagctatcg ccatgtaagc    18120

ccactgcaag ctacctgctt tctctttgcg cttgcgtttt cccttgtcca gatagcccag    18180

tagctgacat tcatccgggg tcagcaccgt ttctgcggac tggctttcta cgtgttccgc    18240

ttcctttagc agcccttgcg ccctgagtgc ttgcggcagc gtgaggggat cttgaagttc    18300

ctattccgaa gttcctattc tctagaaagt ataggaactt cgaagcagct ccagcctaca    18360

caatcgctca agacgtgtaa taatattttg atcaaaaaag accgcttgtt tgcgattatc    18420

actctcaaac aagcggtctt tttttaaccg gaatttgcaa ctaacctaaa aaaattccta    18480

agctaaacaa taaattggta attagcgcta gccctgccat ttgtcctaaa attgggcgta    18540

attcaatcgg gtctttgtgg cgataaacaa ataagccgtg tttcatcagt aatgggacag    18600

aaaggtaccc gggatccaag cttgaattcc cgagcttacc gagaagtact gaataataat    18660

tgtataaatt agcctgcgta aaatctgaac gcatcaatcg ctaccttaat atcatacctt    18720

tgagttaaca tactattcac ctttaacctg ccatgaccgt ttgtggcagg gtttccacac    18780

ctgacaggag tatgtaatgt ccaagcaaca gatcggcgtc gtcggtatgg cagtgatggg    18840

gcgcaacctc gcgctcaaca tcgaaagccg tggttatacc gtctccgttt tcaaccgctc    18900

ccgtgaaaag accgaagaag tgattgccga gaatcccggc aaaaagctgg tgccttatta    18960

cacggtgaaa gagtttgttg aatccctcga aacgcctcgt cgtatcctgt taatggtgaa    19020

agcgggcgca ggtactgatg cagctatcga ttcgctgaaa ccgtatctgg aaaaaggcga    19080

tatcattatt gatggcggta acaccttctt ccaggacaca atccgtcgca atcgcgagct    19140

gtctgcggaa ggttttaact ttatcggtac cggtgtttcc ggtggtgaag agggcgcgct    19200

gaaagggcca tctatcatgc ctggcggtca gaaagatgcc tatgaactgg tggcgccgat    19260

cctgacgaag attgctgctg tggcagaaga tggcgaaccg tgcgtgacct atatcggcgc    19320

cgatggtgct ggtcactacg tcaagatggt ccacaatggt attgaatatg gcgatatgca    19380

gcttatcgct gaagcttact ccctgctgaa aggcggcctg aatctcagca atgctagtag    19440

ggataacagg gtaatgagct tggcactggc cgtcgtttta caacgtcgtg actgggaaaa    19500

ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa    19560

tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg    19620

gcgagcttgg ctgttttggc ggatgagaga agattttcag cctgatacag attaaatcag    19680

aacgcagaag cggtctgata aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac    19740

ctgaccccat gccgaactca gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc    19800

cccatgcgag agtagggaac tgccaggcat caaataaaac gaaaggctca gtcgaaagac    19860

tgggcctttc gttttatctg ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg    19920

ccgggagcgg atttgaacgt tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg    19980

ccataaactg ccaggcatca aattaagcag aaggccatcc tgacggatgg cctttttgcg    20040

tttctacaaa ctctttttgt ttatttttct aaatacattc aaatatgcat gcgcctgatg    20100

cggtattttc tccttacgca tatcgacatc cgccctcacc gccaggaacg caaccgcagc    20160

ctcatcacgc cggcgcttct tggccgcgcg ggattcaacc cactcggcca gctcgtcggt    20220

gtagctcttt ggcatcgtct ctcgcctgtc ccctcagttc agtaatttcc tgcatttgcc    20280

tgtttccagt cggtagatat tccacaaaac agcagggaag cagcgctttt ccgctgcata    20340

accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt tcgcacgata    20400

tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac ggcgtcagcc    20460

gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca ctgtccctta    20520

ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg ctaccgccgg    20580

cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga agggcagccc    20640

acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa aggcggcggc    20700

ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca aaatcacggg    20760

cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc tgggccgcct    20820

gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt tcggtgatgc    20880

cacgatcctc gccctgctgg cgaagatcga ctctagctag aggatcgatc ctttttaacc    20940

catcacatat acctgccgtt cactattatt tagtgaaatg agatattatg atattttctg    21000

aattgtgatt aaaaaggcaa ctttatgccc atgcaacaga aactataaaa aatacagaga    21060

atgaaaagaa acagatagat tttttagttc tttaggcccg tagtctgcaa atccttttat    21120

gattttctat caaacaaaag aggaaaatag accagttgca atccaaacga gagtctaata    21180

gaatgaggtc gaaaagtaaa tcgcgcgggt ttgttactga taaagcaggc aagacctaaa    21240

atgtgtaaag ggcaaagtgt atactttggc gtcacccctt acatatttta ggtctttttt    21300

tattgtgcgt aactaacttg ccatcttcaa acaggagggc tggaagaagc agaccgctaa    21360

cacagtacat aaaaaaggag acatgaacga tgaacatcaa aaagtttgca aaacaagcaa    21420

cagtattaac ctttactacc gcactgctgg caggaggcgc aactcaagcg tttgcgaaag    21480

aaacgaacca aaagccatat aaggaaacat acggcatttc ccatattaca cgccatgata    21540

tgctgcaaat ccctgaacag caaaaaaatg aaaaatatca agttcctgag ttcgattcgt    21600

ccacaattaa aaatatctct tctgcaaaag gcctggacgt ttgggacagc tggccattac    21660

aaaacgctga cggcactgtc gcaaactatc acggctacca catcgtcttt gcattagccg    21720

gagatcctaa aaatgcggat gacacatcga tttacatgtt ctatcaaaaa gtcggcgaaa    21780

cttctattga cagctggaaa aacgctggcc gcgtctttaa agacagcgac aaattcgatg    21840

caaatgattc tatcctaaaa gaccaaacac aagaatggtc aggttcagcc acatttacat    21900

ctgacggaaa aatccgttta ttctacactg atttctccgg taaacattac ggcaaacaaa    21960

cactgacaac tgcacaagtt aacgtatcag catcagacag ctctttgaac atcaacggtg    22020

tagaggatta taaatcaatc tttgacggtg acggaaaaac gtatcaaaat gtacagcagt    22080

tcatcgatga aggcaactac agctcaggcg acaaccatac gctgagagat cctcactacg    22140

tagaagataa aggccacaaa tacttagtat ttgaagcaaa cactggaact gaagatggct    22200

accaaggcga agaatcttta tttaacaaag catactatgg caaaagcaca tcattcttcc    22260

gtcaagaaag tcaaaaactt ctgcaaagcg ataaaaaacg cacggctgag ttagcaaacg    22320

gcgctctcgg tatgattgag ctaaacgatg attacacact gaaaaaagtg atgaaaccgc    22380

tgattgcatc taacacagta acagatgaaa ttgaacgcgc gaacgtcttt aaaatgaacg    22440

gcaaatggta tctgttcact gactcccgcg gatcaaaaat gacgattgac ggcattacgt    22500

ctaacgatat ttacatgctt ggttatgttt ctaattcttt aactggccca tacaagccgc    22560

tgaacaaaac tggccttgtg ttaaaaatgg atcttgatcc taacgatgta acctttactt    22620

actcacactt cgctgtacct caagcgaaag gaaacaatgt cgtgattaca agctatatga    22680

caaacagagg attctacgca gacaaacaat caacgtttgc gcctagcttc ctgctgaaca    22740

tcaaaggcaa gaaaacatct gttgtcaaag acagcatcct tgaacaagga caattaacag    22800

ttaacaaata aaaacgcaaa agaaaatgcc gattatggtg cactctcagt acaatctgct    22860

ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac    22920

gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca    22980

tgtgtcagag gttttcaccg tcatcaccga aacgcgcga                           23019


<210>  45
<211>  1414
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthesized chloramphenicol resistance cassette flanked on the 3'
       side by kanamycin promoter

<400>  45
attacacgtc ttgagcgatt gtgtaggctg gagctgcttc gaagttccta tactttctag       60

agaataggaa cttcggaata ggaacttcat ttaaatggcg cgccttacgc cccgccctgc      120

cactcatcgc agtactgttg taattcatta agcattctgc cgacatggaa gccatcacaa      180

acggcatgat gaacctgaat cgccagcggc atcagcacct tgtcgccttg cgtataatat      240

ttgcccatgg tgaaaacggg ggcgaagaag ttgtccatat tggccacgtt taaatcaaaa      300

ctggtgaaac tcacccaggg attggctgag acgaaaaaca tattctcaat aaacccttta      360

gggaaatagg ccaggttttc accgtaacac gccacatctt gcgaatatat gtgtagaaac      420

tgccggaaat cgtcgtggta ttcactccag agcgatgaaa acgtttcagt ttgctcatgg      480

aaaacggtgt aacaagggtg aacactatcc catatcacca gctcaccgtc tttcattgcc      540

atacgtaatt ccggatgagc attcatcagg cgggcaagaa tgtgaataaa ggccggataa      600

aacttgtgct tatttttctt tacggtcttt aaaaaggccg taatatccag ctgaacggtc      660

tggttatagg tacattgagc aactgactga aatgcctcaa aatgttcttt acgatgccat      720

tgggatatat caacggtggt atatccagtg atttttttct ccattttagc ttccttagct      780

cctgaaaatc tcgacaactc aaaaaatacg cccggtagtg atcttatttc attatggtga      840

aagttggaac ctcttacgtg ccgatcaacg tctcattttc gccaaaagtt ggcccagggc      900

ttcccggtat caacagggac accaggattt atttattctg cgaagtgatc ttccgtcaca      960

ggtaggcgcg ccgaagttcc tatactttct agagaatagg aacttcggaa taggaaggaa     1020

taggaacttc aagatcccct cacgctgccg caagcactca gggcgcaagg gctgctaaag     1080

gaagcggaac acgtagaaag ccagtccgca gaaacggtgc tgaccccgga tgaatgtcag     1140

ctactgggct atctggacaa gggaaaacgc aagcgcaaag agaaagcagg tagcttgcag     1200

tgggcttaca tggcgatagc tagactgggc ggttttatgg acagcaagcg aaccggaatt     1260

gccagctggg gcgccctctg gtaaggttgg gaagccctgc aaagtaaact ggatggcttt     1320

cttgccgcca aggatctgat ggcgcagggg atcaagatct gatcaagaga caggatgagg     1380

atcgtttcgc ctaaggagga tattcatatg gacc                                 1414


<210>  46
<211>  17796
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized APP8 rfb cluster with cat/kP promoter and 
       flanking hom.rec sites for SL1344 rfb integration (for pDOC)

<400>  46
attaccctgt tatccctact agcattgctg agattcaggc cgcctttcag cagggagtaa       60

gcttcagcga taagctgcat atcgccatat tcaataccat tgtggaccat cttgacgtag      120

tgaccagcac catcggcgcc gatataggtc acgcacggtt cgccatcttc tgccacagca      180

gcaatcttcg tcaggatcgg cgccaccagt tcataggcat ctttctgacc gccaggcatg      240

atagatggcc ctttcagcgc gccctcttca ccaccggaaa caccggtacc gataaagtta      300

aaaccttccg cagacagctc gcgattgcga cggattgtgt cctggaagaa ggtgttaccg      360

ccatcaataa tgatatcgcc tttttccaga tacggtttca gcgaatcgat agctgcatca      420

gtacctgcgc ccgctttcac cattaacagg atacgacgag gcgtttcgag ggattcaaca      480

aactctttca ccgtgtaata aggcaccagc tttttgccgg gattctcggc aatcacttct      540

tcggtctttt cacgggagcg gttgaaaacg gagacggtat aaccacggct ttcgatgttg      600

agcgcgaggt tgcgccccat cactgccata ccgacgacgc cgatctgttg cttggacatt      660

acatactcct gtcaggtgtg gaaaccctgc cacaaacggt catggcaggt taaaggtgaa      720

tagtatgtta actcaaaggt atgatattaa ggtagcgatt gatgcgttca gattttacgc      780

aggctaattt atacaattat tattcagtac ttctcggtaa gctcgggaat tcaagcttgg      840

atcccgggta cctttctgtc ccattactga tgaaacacgg cttatttgtt tatcgccaca      900

aagacccgat tgaattacgc ccaattttag gacaaatggc agggctagcg ctaattacca      960

atttattgtt tagcttagga atttttttag gttagttgca aattccggtt aaaaaaagac     1020

cgcttgtttg agagtgataa tcgcaaacaa gcggtctttt ttgatcaaaa tatattacac     1080

gtcttgagcg attgtgtagg ctggagctgc ttcgaagttc ctatactttc tagagaatag     1140

gaacttcgga ataggaactt catttaaatg gcgcgcctta cgccccgccc tgccactcat     1200

cgcagtactg ttgtaattca ttaagcattc tgccgacatg gaagccatca caaacggcat     1260

gatgaacctg aatcgccagc ggcatcagca ccttgtcgcc ttgcgtataa tatttgccca     1320

tggtgaaaac gggggcgaag aagttgtcca tattggccac gtttaaatca aaactggtga     1380

aactcaccca gggattggct gagacgaaaa acatattctc aataaaccct ttagggaaat     1440

aggccaggtt ttcaccgtaa cacgccacat cttgcgaata tatgtgtaga aactgccgga     1500

aatcgtcgtg gtattcactc cagagcgatg aaaacgtttc agtttgctca tggaaaacgg     1560

tgtaacaagg gtgaacacta tcccatatca ccagctcacc gtctttcatt gccatacgta     1620

attccggatg agcattcatc aggcgggcaa gaatgtgaat aaaggccgga taaaacttgt     1680

gcttattttt ctttacggtc tttaaaaagg ccgtaatatc cagctgaacg gtctggttat     1740

aggtacattg agcaactgac tgaaatgcct caaaatgttc tttacgatgc cattgggata     1800

tatcaacggt ggtatatcca gtgatttttt tctccatttt agcttcctta gctcctgaaa     1860

atctcgacaa ctcaaaaaat acgcccggta gtgatcttat ttcattatgg tgaaagttgg     1920

aacctcttac gtgccgatca acgtctcatt ttcgccaaaa gttggcccag ggcttcccgg     1980

tatcaacagg gacaccagga tttatttatt ctgcgaagtg atcttccgtc acaggtaggc     2040

gcgccgaagt tcctatactt tctagagaat aggaacttcg gaataggaag gaataggaac     2100

ttcaagatcc cctcacgctg ccgcaagcac tcagggcgca agggctgcta aaggaagcgg     2160

aacacgtaga aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg     2220

gctatctgga caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt     2280

acatggcgat agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct     2340

ggggcgccct ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg     2400

ccaaggatct gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt     2460

cgcctaagga ggatattcat atggaccttt tttattctca gtttgaatct aaaaattcta     2520

ggtattatcc cccaatatta actctgcatc aataacattc tttgcttatc tataggcttt     2580

ctaatatact atgcagtcaa tatatttttg ttatttattt ttgagaaaac cttatggcga     2640

cgccagagca agtgagcaac cagaccaacg aggagattga cctcatcgag ctggttcgcg     2700

tgctgtggaa gaaaaagctc ctcatcgcca tcgtgacgtg catcttcacc gccctcgcgg     2760

ccgtttatgc cttcaccgcc aaagagaagt ggacgagcca gaccgaagtt atcgcgccac     2820

gtgtgaccga cattagcgaa tatctgagtc tgcgcaagga gtacaacctc atcatcggca     2880

gcgagttcaa agaaaacgaa atccgcaacg agctcagcga gctcttcagt cgctatgtgc     2940

tgagttacga cgaagccatc gccttcttca agacgacgga cacgtacaag aagctggccg     3000

aaaccgagaa tgaagtgggc ctccagcgcg cggttgccga attcacgacg gagagtctga     3060

aggtgatcaa gccagacgcc aagaaagacc cgaatgcgct gggcagtaag attgcgatca     3120

gcttcgatac cgcgctgagt gcccagacca cgctgaacga cttcatctgc cacatcagcg     3180

acaccagttt caacttcagt aagaatgagt tcatttattg gattaaggag agcattagca     3240

gtctgaatta cgagaaagag gttatcgaac aagatcagag catccaacgc aaggtgcaga     3300

tccagaatct ggagacggcc ctcgatatgg cgaagaaggc gggcatcaaa gagtacagca     3360

gcgccctcag cagtaatagt agcgtggcca atctggcggt gagcgatacc aaaatcccgc     3420

tgagcgatag caagctggcg gacggtacct atctgttcat gctgggcgag aagaatctgc     3480

aagcgcagct cgacatcgcg aaaacgaagg agatcgtgta cagcccgcgc tactaccaga     3540

tccaagaaca gctgctgaag ctgaataccc tcctcccgaa agtggaaaag gttaccggcc     3600

agagcttcag ctatatcagc agcccagagc tcccgattaa gcgcgattgg ccaaagcgct     3660

ttattctgct cctcattggc gcggtgattg gtggcgttct gagcagtctg tgggtgatcg     3720

gcaaacaaat cttcggccag aaataattat gaacaaggac atcaagattc tgatcgcgac     3780

gcataagcag cacttcatgc cgagcgacga aatgtatctg ccgctccacg tgggcaagct     3840

gggtaaagcc gatctgggtt accaaggcga cgacagcggc gacaacatca gcatcaaaaa     3900

cccaaatttt tgcgagctga ccggtctgta ctgggcgtgg aaaaatctgc cgaacgatta     3960

tctgggtctg atccattatc gccgcttctt cagcgtgaag aaccgcgcgg aacgcaaaaa     4020

caatccgctg gagacgctgt atctgaccaa cgaagaagcc aaccagctgc tgagtcagta     4080

cgatgtgatc gttccgagca agcgcaacta ctacatcgag acgctgtaca gccattacgc     4140

caatacgctg cacgccgaac atctggacgt tacccgcgaa atcatcgcgg aaaagtgcag     4200

cgagtacctc gccagctttg acgcggtgat caaacagcgc agcggctaca tgttcaacat     4260

gttcatcatg agcaaagcgc tggtgaacga ctactgcagc tggctgttcc cgattctgtt     4320

tgaactggag aagcgtatcc caacggacca gtacagcgcc ttccatgccc gtttctacgg     4380

ccgcgtgagc gaactgctgt tcaacgtgtg gctgaaacag tacagccaga gcaacccact     4440

gaaggtgaag gccatcccgt ttgtgtatgg cgagaagatc aactggctga agaagggtac     4500

cgcgtttctg gttgcgaaat tcttcggcaa aaaatatgag aaaagcttct aagttattat     4560

agggacaaat aaatgaagcg cattctggtt tacggcatga ccgacaactt cggtggcatg     4620

gaggcctaca ttcataacat ctatcagcat ctggataaaa cccaaatcca gtttgacttc     4680

gtgtgtgact tcccgaaaat gacgctgagc gactactatc tggacaacgg ttgcaagatc     4740

cacttcatcc cgccgaaaaa ccaaggtctg tttaagagtc tgtgggcgat gtggaaagtg     4800

atcaaggaga acaactatga tgttatctat ttcaacatca tgaacgcggg ctacgtgctc     4860

aacatgctgc cggcctttct gctgggtaag aagatcatcg cgcacagcca caatgcggac     4920

accgacaaaa agaagctgca ctacggtctg cgtctgctgc tgaacatcgt gacgaagatc     4980

aagctcgcgt gcagtaagga ggccggtttc ttcatgttcg gcaaggaaga aaatttcagc     5040

atcattaata atgcgatcaa cctcgaccgc tatctgtata gcgaggagaa ataccgcgac     5100

ctccgccaca aactgggctg gggcgataag aaggttattc tgtacgtggc ccgcatgaat     5160

caccagaaga atccgctgtt cgcgctgtat atcatgcgcg aactgaagca gagcatgccg     5220

aatgccgttc tggtgtacgt gggtacgggt gagctgaagg aacaagttca gcagtacatt     5280

ctggacaaca acctcgacaa cgtgattctg ctgggtctgc gcaacgatgt gaacgagctc     5340

atgatcgcgg ccgatctgtt tattctgccg agtctgtttg agggtctgcc gattgttgcc     5400

gttgaggcgc aagccgcggg tctcccaatc attctgagcg aaaacatcag catcgaggcg     5460

aaactcgtga acagcaccta cttcctcccg attaacgacg tttttctgtg ggttaataaa     5520

atcaaaaaga ttctggagat cagtggcaac aagcgcttta gtgaccagct ggcgctgagc     5580

aaagcgggtt acaacatcga gagcgtggtg aagaacatcc agaagattct cgtgaattaa     5640

ggtttggtat gaataacacc aagatcagtc tgatcttcgc gtgctataac gtgagccagt     5700

atctggacaa tctgttccag ctgctgacga accagccgta ccaaaatatt gaaatcattt     5760

tcgtggagga ctgtgccacg gacgatacga aggcgaaact ccagagcttc aacgatccgc     5820

gcgtgaagct gctgtgcaac gagaaaaata tcggtgcggc cgaaagccgt aaccgtggca     5880

tccagatcgt taccggcgaa tacatctggt tcccggatcc agacgatctg tttgacgaac     5940

tgctgctgac caaggtgaac accatcatcc agaaaaaccg cccggatgtg atcagcatcg     6000

gcatgcaaga acgctacgag atcaacggca agacggacta cacgaaggac atcatcagcc     6060

gctacgacgg cctcattacc ggcgacttca ccgatgtttt cgtggatctg gaggaaagct     6120

ttctgtttgg ctacacgaat aacaagtttt acaaggccaa catcatccat aagtaccgca     6180

ttctgaacga gcaccaagcg ctgaaggaag atttcgaatt caatatcaag gtgtttaaac     6240

aagttagtaa cttctatctg ctgaatgaac cgctctattt ctacatgaag cgcaacaacg     6300

gcagtctgac cagcaaattc gtgccggact atttccgcat ccacatgcag accctcgcca     6360

gcttcaaaag tctgatcgag gtgaaggcca ccatcaacga caacgtgaac cgtctgctgg     6420

tgaaccgctt cgttcgctac tgcctcagcg cgatcgagcg caacagcagt ctgaaaagcg     6480

gcatgagctt tctggagcag aaccaatgga ttaaggaaaa tatctttaat caagaaaaat     6540

acaacgagta tctgctgctg agcgatctgg tgaacaagaa acagaagctg ttttactttc     6600

tgatcaagta tcgcatcggt tttctgctcg tgacggccgc gaacatcgtg aaactggtga     6660

aggcgaagtt cccgattctg ttcgtgaagc tgaagggtta attaactgga ttttaaaatg     6720

aagaagtacc agatcgtgga gctgagtacc gaacacaacc atgcgggcag caaggccgtg     6780

caagatgtgt atgagatcgc gctcagcatg ggttacaagg cgaatgtggt tcgcacggcc     6840

accagtgtgg atagtctgct ggccaaaatt ctgcgccaag ttatcttctt catcgactgg     6900

ctgaagatct acttcagcat cgagagtaac agcatcgtgc tgatccagaa cccgtactac     6960

cacaaacagc tcatccgtaa ctggattctg aatcgtctga agcgcattaa aaaagtgaag     7020

tttatcagtc tggttcacga cgtggaagag ctgcgcaaga gtctgtacaa caactactat     7080

aaaaacgagt tcgagaccat gctgagtctg gcggacagca tcatcgtgca caatgataag     7140

atgaaaagct ttttcatcaa aaagggctac agcgaggaca aactcatcag tctgggcatc     7200

ttcgactatc tgcagaagag cgtggacaaa aagcgcgtga gcttcgaacg tgcgatcagc     7260

gtggcgggca acctcgatat caagaagagc agctatattg cgcagctcgg cagcctcccg     7320

gcgatcaaag cgcatctgta cggtccgaac ttcgaacata gtctggaggc gttcccgaac     7380

atcgaatacc acggtagctt cccggccacg gaaatcccgc agaaactcgt gagcggtttt     7440

ggtctggtgt gggacggcca gagcattgaa acgtgcaccg gcgacttcgg cgagtacctc     7500

cagtacaata acccgcacaa gctgagcctc tatctgagca gtggcatgcc ggttgtgatc     7560

tgggacaaag ccgccgaggc cgatttcgtg aagaaacaca acgtgggtct gtgcgtgagc     7620

agtctgagcg agctccaaga caagctcaac gtgatgaccg agcaagaatt tgaagaaatg     7680

gtgaacaacg tggaaaaaca gaccgcgtgc ctcatcagcg gcgagtacac caaaaaggcg     7740

atcagcgagg cggaacgtgt gatctaagaa tgttcctcta tctgctggtg ttcagtctgc     7800

tgctgattct gatcttcaat ctgctcatcg tgaatctgga ctacatgcac ccgagcatcc     7860

tctttgttgt gccatttctg gtgtttggcg tgacgagcat tctgggcgag gaggcgtata     7920

agatcatctt ccacgaggag acgctgctgg tgatcgttag cagcgcgctg atcttcacct     7980

tcatcacgct gctgagccag accgtgtaca aaagcaaaga gaatctgaac ttcccgctga     8040

ccgagatcat catcagtaag aaagtgacgc tgttttttat tgtgttcttc atcgtgaccc     8100

agctggcgtt catcaagtat ctggaggcca ttagtctggc ccacttcggt tacagcggca     8160

gtctgggtga gatgatcagt ctgtacgacg tgatgacgaa gttctggacc gagatcttca     8220

gcgaactcaa cgtgccgatc ccgctgctct accgtatcgg caatccaatc acgcaaggct     8280

tcggctatct gattgtgtat attttcatcc acaactacgt tgccaccaag cgcatcgata     8340

agctgcatct gctgatcatt ctgctgctgt gtctgaacat cattctcaac ggcagccgca     8400

gtccgatctt ccgcatcgtt acgatgatgc tgatcacctt ttatgtgctg tataacaagc     8460

agaacaacgt gcgtcgcggc aacatcaagt ttctgctgaa gagtctgctg atcgtgatct     8520

tcagcggcac cttcttcatt gcgctgctga gtctgatggg ccgtgaaaac gatctggaca     8580

tgttccatta catttttatc tacgttggtg cgccgctggt gaacctcgat aactatctgg     8640

cgtttcgtcc ggatggtagc tacgccacca tctttggcga gcaaacgttt cgcggtctgt     8700

acgcctatat cgcgaagatc atcagcgatg agagtctgat cttcccgacg atcgatcagt     8760

tcacgttcag caacaacggt ctggagatcg gtaacgtgta taccaccttc tatagcttca     8820

tctacgattt cgagtacgtg ggcttcatcc cgctgattct gattatcgcg ctgtactacg     8880

tgttcacgta tcagcgcctc aagacgcgcg ccatcaagac caataaagtg catttcagtc     8940

tgttcatcta tgcctacctc ttcaacgacc tcatcatgct ggccttcagt aatcgcttct     9000

acaccacggt gctggacatc ggcttcatca agattgttat cttcagctat atctgccacc     9060

tcctctttgt gcaccgcagc aagatcaaag gcaccgtatg aacgttaaaa gtgtgaaatt     9120

taatttcatt atgaatctga ttctgaccgt tagcaacttt ctgttcccgc tggtgacgtt     9180

cccatacgtt agtcgcattc tgcagccaga aggtaccggt aaagtggcct ttgcgattag     9240

cgtggttagc tacttcagca tcttcgcgag tctgggtgtg gccacctatg gcgttcgtgc     9300

gtgtgcgcaa gttcgcgaca ataaagatct gctgagtcgt acggtgcatg agctgctgtt     9360

catcaacatc atcgccacga tcattgtgta cgtttgcttt ctgctggtgg tggcgtttac     9420

cccacgcttt agcgcggaaa aagagctgtt ctgggcgacg agcatcttta ttctgttcac     9480

catcattggc atcgagtggc tctacaaggg tctggagaag taccagtaca tcacgatccg     9540

cacgatcatc ttcaagctca ttgcgctggt gctcgtgttt gtgttcatca agacgaagga     9600

tgactacgtg atcttcgccg tgatcagtgt gtttgcgatc gttggcagcg gcatcttcaa     9660

cctctttaac agtcgcaagc tgattaacta ccatctgtac gaggattacg agttccgcaa     9720

gcatttcaag ccaatgtttc tgctgtttct cacgacgctc agcatcgcca tctacaccag     9780

tgtggatgaa gcgattctgg gtctgctgac gagtccgcaa gatgtgggct actataacgc     9840

ggccatgaag gttaagggca ttctgtttac gctgatcacc agtctgggca ttgtgctgct     9900

gccgcgtctg agctattatg ttgagaacaa tatgacggat gaattccatg ccgccctcaa     9960

gaagagcatg aacttcatca tcgtgatcgc cgttccagtg gtgatcttct tcatgctgtt    10020

cgccaaggag attattctgc tgctggccgg cgaaagttat atcaacgcca ttctgccgct    10080

gcagattatt gtgtgggcgc tgctgctcag cgccattacc aacattctgg gcatccagat    10140

tctgctgccg ctcaagaagg ataaagagct gctgatcagc gtgctgctcg cggccattgt    10200

ggacattgtg gccaatctga ttctggttcc gcaactcgcc agcgttggta ccgccatcag    10260

cgttgtgatg gccgaactca ccgtgctggt ggtgcagctg gttatcctcc gcaagtacat    10320

ctggatcctc ttcagcaatc tccagttcgt gcgcatcggt ctgagcatcg ttttcagcat    10380

cgtgctgagc ctcagcatct atcagtggaa catcacgaac agcatcatgc tcacgtttct    10440

gatcatgggc ttcatcttct tcacgaccta cttcattctg ctgctgattc tgaaggagaa    10500

cttcatgatg tacgtgtacc agaccatcca gcacaagatt ctgaaataaa ttatatagtg    10560

ttatcacata acgtatcctt ggagaataga aatgaaatat gattatctga tcgtgggcgc    10620

cggtctgttt ggcagcatct ttgcgcgcga ggccaccaag cgtggcaaga aatgtctggt    10680

tatcgagaag cgcgatcaca tcggtggcaa ctgctacacg cagaacgtgg aaggcatcaa    10740

cgttcacaaa tacggtgcgc acatcttcca caccagcaac aaggtggttt gggactacat    10800

ccagcagttc gccgagttca atcgctttac caacagcccg gtggcccgct ataaggacga    10860

actgtacagc ctcccgttca acatgctcac cttcaacaag atgtggggcg ttatcacgcc    10920

gcaagaagcc gaagcgaaaa tcaaggagca gatcgcgaag gagaacatca cggatccgaa    10980

gaatctcgag gagcaagcca tcagtctggt tggtcgcgat atctacgaga agctcatcaa    11040

gggctatacc gagaagcagt ggggccgtaa gtgtacggag ctgccagcct tcatcatcaa    11100

gcgtctgcca gttcgctaca cgtacgacaa caactacttc tacgacacct atcaaggcat    11160

cccgatcggt ggctacaccg gcatctttga acgcatgctc gagggcatcg aggtgaaact    11220

gggcgttgac ttcttcgcgg aacgcgaaca ttacgagagt ctggccgaga agatcgtgtt    11280

caccggtatg attgacgaat attttggtta ccagttcggc aaactggaat accgcagtct    11340

gcgcttcgac aacgaagtgc tgaacatccc gaactaccaa ggcaatgcgg tggtgaacta    11400

tacggaagcc gaggtgccat atacgcgcat catcgagcat aagcatttcg agtacggcac    11460

ccagccgaaa accgtgatca cgcgcgaaca cagcaaggag tacgaagaag gcgacgagcc    11520

gtattacccg atcaacgacg cccgcaacaa cgaactgtac gccaagtaca aggcgctggc    11580

cgacgcgacc ccaaacgtta ttttcggtgg ccgtctggcc cagtataagt acttcgacat    11640

gcacaatatc atcgccgagg cgctggagtg cgttaaggtg cacttttaat ataagggagt    11700

aacgctatga ataagatcat cgcgaagatc agtctgatcc tcgtggatat cgtggccatc    11760

ttcgttagca ttctgatcgc cgtgagtctg cgtaaaattc tgggtctgct cttcacgctg    11820

ccggagatcg actacagcta catcttcttc gcgtatgtgt atctgattct gattctgatg    11880

atgacgtacc tcggcgcgta taccaaacgc tacgactttt ggcacgaaag ccgtctgatc    11940

gtgcgcggca gctttctcag tctgctgatt ctgctgagtg ccctcgcgct gggccaaaac    12000

gcggaatact atagccgcag cacgctcgtg ctgatctttc tctgctgcgc catcgtgctg    12060

ccgatcgcca agattttcac caaaaaaatt ctgttcaaac tgggtatctg gcagctgccg    12120

gcgaaggtga tcagcgagaa cgaccagttc aaaaacgagc tcttcgaaga ccagtatctg    12180

ggctatgtga aggcgaaaca cagcgagcac aagattatct tcatcgacgg cgcgaatctg    12240

ggcaaagatc gtctgaacca gatcatcgag gacaacatca agaatagccg tgagatcatc    12300

ttcaccccgg ttctgaatgg ctacgacttc agccatagct acatttataa catcttcaac    12360

acgcgcacca acattttcac gctggagaac gagctgctga gcaaaagcaa ccgcatcttc    12420

aaactgctga tggactatat tctggtgctg ggtagtgccg tgttctgggt gccggtgctg    12480

gtgctcatcg cgttctggat caagaaggag gatccgaaag gcgaggtgtt ctttctgcag    12540

cgtcgcctcg gcgtgaatgg caaggaattc atgtgctaca aattccgcag catgtacagc    12600

gaccagagct tcatgcaaga atggctggag aaaaatccgg aggaggccgc gtactaccgc    12660

atctaccata agtatatgaa cgatccgcgc atcaccaaaa tcggcgcgtt cctccgcaaa    12720

accagtctgg acgaactgcc gcagctgatc aacgtgctgc gtggtgagat gagtctcgtt    12780

ggtccgcgcc cgtacatggt tatcgagaag aaggacatcg gcaaaaaagc cccactggtg    12840

ctcgcggtta agccgggcat tacgggcatg tggcaagtta gcggccgcag tgatgtgaac    12900

ttcgacagcc gcgtggagat ggatgtgtgg tatatgaaaa attggagtct gtggaatgac    12960

atcgtgattc tgatcaaaac ggtgcaagcc gtgttcaagc gcgacggtgc ctattaaagt    13020

atgatcacca gcatccagta cctccgtggc atcgccgcgc tgttcgtggt gctgttccac    13080

atgaagtgga tgctcaacaa tgtgtacgtg gagaagaacc tcggcgacat cttcttcatc    13140

agcggcaact tcggcgtgga tctgttcttc gtgatcagcg gcttcgtgat ctgtctgagc    13200

acggaacgcg aaacgctgca cccggtgaag gagtttttca tccgccgctt cttccgcatc    13260

tacccactgc tgctgctgag cgtttgcacc atctacattc tgggcgactt caagatccac    13320

gagctgatcc tcagcatgat cccaatccat ctggactaca gcagcccgag cccggtgttc    13380

ggctacaaca ttctggttag cgcgtggacc atcacctacg agattagctt ctacatcatc    13440

ctcgtgctga gtctgatgat caaccatcgc ttccgctgcg aactgaccat tctgttctaa    13500

ttatcattaa tatagtttca aactattatt attttggtga atatagccta tcactagata    13560

gagagatacc ccttgataaa aggggacatt tttttgttat gttctcatca tcaatgttat    13620

taacatttat ttatgggatt ttaatatata taaaattaca aattttatga aaagcattat    13680

catcctcgac aagtacttcc tctacagcat tctgctggtg gtgatcagct tcgtgttcat    13740

caaacacccg atcttcgacg gccacggtgt gctgaaatgg ggctttctga gcttcatcat    13800

tctgctgatt ctgctcatca tcgagaacac ctacggcatc gccaaaagca actttctgtt    13860

ctggctgggc gaaatcagct acagtctgta tctgacgcac atcattatcc tcgaattcat    13920

tctgaagcac atcaccccgg agatctggaa caacccgaat ctgggcatga gcaagatcct    13980

cttctacctc gccatcagca tcagcttcag ctatctggtg tatctgctgg tggagaagcc    14040

gttcatcaac ctcggcaaga agctgatcac gaagctgtaa atattaatgg atgattttat    14100

gaagtcacga aatctcgaac ctacaaaaac gcatctgatc tatttagata tactaaatat    14160

ttttgcttgc attgctgtac ttttttacat cacaatggta ttgtacattg gtataacgta    14220

aatgaattgg cttggaaaca agccttattt tttgaagtgg ctttttattg ggctgttcct    14280

attttcttta tgctcaccgg cgccacgctg ttcgaatacc gcaaccgcta cagcacgaag    14340

cagtttttca tcaagcgcat ccagcgcgcc gtgttcccgt ttctgagctg cagcctcatt    14400

ctgctgggct atagctttta cagcggcatg atcgaggcct ttagcatccg cgacagcatc    14460

agtgccatct tcaacaccaa ggacatcccg ttcattgaaa tctattggtt ttttatccat    14520

ctctttagtc tgtacatggt gatcccggtg ctcagtctgc tgaaagataa ctaccgcatt    14580

ctgtgctata ttgtgggcgc catgtttctg acccacagtc tgtttccggt gatctttgac    14640

ttcttcaagc tgcactacaa ctggagcatc attttcccga tggcgggcta cagcatctat    14700

ctggttctgg gctatctgct gagtaaggtg aaactggaaa agaaatatca gatcatcatt    14760

tacattctgg gcattctgag cgtgctgctc cgctactttt atacctacgt gagcagtctg    14820

gaggccaacc agctcgatcg cacgctgttc agctacatgc aattccacac cgtgtttctg    14880

gcggtggcga tcttcatttt cgtgaaggaa ttcttcagcg gtgtgaaact gttcaacgcc    14940

aaggtgctgg cggtgttcag cagctgtagt ctgggcatct atctgatcca caagctcgtg    15000

atggactacg aactcaagtt tctgggcatc agcgaggaca atctctactg gcgctttttc    15060

ggcgccttca tgacgtacgg cgcgtgcctc gtgatcgtgc tgtttgttaa gcgcatcccg    15120

tatctgcgcg ccatctttcc gtaaagatat tataaatatg aaaattctga tcaccggtgg    15180

cgccggtttt atcggcagcg ccgtgatccg ctatatcatc cagcataccc aagatagcgt    15240

ggtgaatgtg gacaaactga cctacgccgg caatctggcg agtctggaaa gcgtgagcaa    15300

tagcagccgc taccactttg agcaagcgga tatttgcgac agcacccgca tcagtcagat    15360

cttctgcaag taccagccgg atgttgtgat gcatctggcc gccgagagcc acgttgatcg    15420

cagcattgat ggtccggcgg cgttcatgca gacgaacatc atcggcacct ataccctcct    15480

cgaagccagc cgccagtatt ggctcagtct gccgctggaa cgcaagcaaa ccttccgctt    15540

ccagcacatc agtacggacg aggtgtatgg cgatctcaac gatagcaacg agctgttcag    15600

cgagaacacg gcctatagcc cgagcagccc atatagcgcc agcaaggccg ccagcgatca    15660

tctcgttcgt gcgtggtttc gtacctatgg tctgccgacg ctggtgacca actgcagcaa    15720

taactatggc ccgttccagt tcccggagaa actgatcccg ctgatgattc tgaacgccat    15780

tagtggcaaa ccgctgccga tctatggcaa tggtctgcag atccgcgact ggctgttcgt    15840

tgaagaccac gccatcgcgc tgtatcaagt tctctgtcgc ggcaaagtgg gcgaaacgta    15900

caacatcggt ggccacaatg agaagaccaa tatcgaggtg gtgcaagcga tctgccgtct    15960

gctggacgaa ctggtgccga ataaaccgag cggcatcgag cagtatgaag aactcgtgac    16020

ctacgtggcc gatcgcccgg gccatgatgt tcgctacgcc atcgacgcga gcaaaatcga    16080

gaatcagctg ggttggacgc cgaaagaaac cttcgaaagc ggtctccgca agaccgtgga    16140

gtggtatctg aataaccaga agtggtggca gagcgttctg gatggcagtt actgcggtga    16200

gcgtctgggt ctgagtctga aaagctacta agcggcaaat agtatttcag tggggatcat    16260

tttggatcca tataaaatag ttggtctgtt tttgttgaaa ttttagcgaa aattgttaaa    16320

aaataagtcg atttgcctct tattctcact gaatttaccc tttactttaa ctatcatttt    16380

ctattccata aggcgtattt aatgtggtat ttaatttgcc aataaaaatt aattgctcaa    16440

gtcgttacac acgctaccgc ccctggctca tcagctacca gtgcactgcg tacatatcga    16500

cttgttgcaa acctcgccca gcagggcaaa gctcactaaa acttaaacgc taattgtctt    16560

attaattgca tccggaaaca aggattaatc ttataaaatc agcattaaaa tgctccagat    16620

aaccccttgt tacttaagcc ctttatacaa aactaaaacg gcagtcaaca ctcgtttcag    16680

ccaacttgcc gcttcgaatg ttcactgccg ttattatgtt tatcaccaac catttatcac    16740

ggttgttaat acttattcat gcaaaagctg ctctatgctc ttacggaact tggctccttc    16800

tttcaggttg cgcagcccgt acttcacaaa tgcctgcatg tagcccattt ttttaccgca    16860

gtcatagctg tcacccgtca ttagcatcgc gtcaaccgac tgttttttcg ccagttcagc    16920

aatggcatcg gtgagctgga tgcggcccca ggcgcccggt tcggttcttt ccagttccgc    16980

ccagatgtcg gctgaaagca cataacggcc taccgccatc aaatcggaat ccagcgtctg    17040

cggctgatcc ggtttttcga taaactccac aatccggctg actttgcctt cattatccag    17100

aggttctttc gtctggataa cggaatactc cgataaatca cctttcatgc gcttcgccag    17160

cacctggctg cgacccgttt cattgaaacg cgccaccatc gccgcaaggt tatagcgcag    17220

cggatcggcg gtagcatcat cgataataat atccgggagt accacaatga aagggttatc    17280

gcccacgacc ggacgcgcgc acagaataga atgccccagc cctaacggct gcgcctggcg    17340

aacgttcata atcgtcacgc ccggtgggca gatagattgc acttccgcca aaagctgacg    17400

cttaacgcgc tgctcaagaa gtgattcaag ttcataagag gtgtcgaagt ggttctcaac    17460

ggcgttttta gacgcgtgag tcaccagcac gatttctttg atccctgcag ccacaatctc    17520

atcgacaatg tactgaatca ttggcttgtc gacgatcggt agcatctctt ttgggattgc    17580

cttggtggca ggcaacatat gcatacccaa acccgctacc ggtataactg ctttcaaatt    17640

catcattgtt tcttccacct gtaaaatggt tgctgaatta tagctcttta gcttgttttc    17700

gccagcatga attactctgc tgccagggat aatgatggca cgctctacat tacgtcttag    17760

tcggcaccat aacatatcga ttaccctgtt atccct                              17796


<210>  47
<211>  1128
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  truncated ApxII (N-terminal HIS10 tag, ApxII(439-801aa))

<400>  47
atggggcacc accatcacca tcatcatcac caccatcaag gttatgattc tcgtcattta       60

gctgatttac aagacaatat gaagtttctt atcaatttaa ataaagaact tcaggctgaa      120

cgcgtagtag ctattaccca acaaagatgg gataaccaaa ttggagacct agcggcaatt      180

agccgtagaa cggataaaat ttccagtgga aaagcttatg tggatgcttt tgaggagggg      240

caacaccagt cctacgattc atccgtacag ctagataaca aaaacggtat tattaatatt      300

agtaatacaa atagaaagac acaaagtgtt ttattcagaa ctccattact aactccaggt      360

gaagagaatc gggaacgtat tcaggaaggt aaaaattctt atattacaaa attacatata      420

caaagagttg acagttggac tgtaacagat ggtgatgcta gctcaagcgt agatttcact      480

aatgtagtac aacgaatcgc tgtgaaattt gatgatgcag gtaacattat agaatctaaa      540

gatactaaaa ttatcgcaaa tttaggtgct ggtaacgata atgtatttgt tgggtcaagt      600

actaccgtta ttgatggcgg ggacggacat gatcgagttc actacagtag aggagaatat      660

ggcgcattag ttattgatgc tacagccgag acagaaaaag gctcatattc agtaaaacgc      720

tatgtcggag acagtaaagc attacatgaa acaattgcca cccacccaac aaatgttggt      780

aatcgtgaag aaaaaattga atatcgtcgt gaagatgatc gttttcatac tggttatact      840

gtgacggact cactcaaatc agttgaagag atcattggtt cacaatttaa tgatattttc      900

aaaggaagcc aatttgatga tgtgttccat ggtggtaatg gtgtagacac tattgatggt      960

aacgatggtg acgatcattt atttggtggc gcaggcgatg atgttatcga tggaggaaac     1020

ggtaacaatt tccttgttgg aggaaccggt aatgatatta tctcgggagg taaagataat     1080

gatatttatg tccataaaac aggcgatgga aatgattcta ttacataa                  1128


<210>  48
<211>  693
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ApxIII(27-245aa)-HIS10 (synthesized)

<400>  48
gatgtaacta aaaatggttt gcaatatggg gtgagtcaag caaaattaca agcattagca       60

gctggtaaag ccgttcaaaa gtacggtaat aaattagttt tagttattcc aaaagagtat      120

gacggaagtg ttggtaacgg tttctttgat ttagtaaaag cagctgagga attaggcatt      180

caagttaaat atgttaaccg taatgaattg gaagttgccc ataaaagttt aggtaccgca      240

gaccaattct tgggtttaac agaacgtgga cttactttat ttgcaccgca actagatcag      300

ttcttacaaa aacattcaaa aatttctaac gtagtgggca gttctactgg tgatgcagta      360

agtaaacttg ctaagagtca aactattatt tcaggaattc aatctgtatt aggtactgta      420

ttagcaggta ttaatcttaa tgaagctatt attagtggcg gttcagagct cgaattagct      480

gaagctggtg tttctttagc ctctgagctc gttagtaata ttgctaaagg tacaacaaca      540

atagatgctt tcactacaca aatccagaac tttgggaaat tagtggaaaa tgctaaaggg      600

ttaggtggtg ttggccgcca attacagaat atttcaggtt ctgcattaag caaaactgga      660

caccaccatc accatcatca tcaccaccat taa                                   693


<210>  49
<211>  4581
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  rfaK(SL1344)-proD-ClyA-ApxI(628-845aa)-ApxII(612-801aa)-
       ApxIII(626-860aa)-HIS6-chloramphenicol resistance cassette-
       rfaL(SL1344)

<400>  49
cgatcgcaat ttgtgttaga tggcataacg ggctatcacc tcgcagaacc tatgtcgagc       60

gacagtataa ttaatgatat taaccgtgcg cttgctgata aggaacgcca ccagattgcc      120

gaaaaagcaa aatccctggt gttttcaaaa tacagttggg aaaatgtagc gcagcgtttc      180

gaggaacaaa tgaaaaactg gtttgataag tgactcgagt tctagagcac agctaacacc      240

acgtcgtccc tatctgctgc cctaggtcta tgagtggttg ctggataact ttacgggcat      300

gcataaggct cgtataatat attcagggag accacaacgg tttccctcta caaataattt      360

tgtttaactt ttactagagt cacacaggaa agtactagat gaccgaaatc gtggcggaca      420

agaccgtgga ggtggttaaa aacgccatcg aaacggcgga tggcgccctc gatctgtata      480

ataaatatct cgaccaagtt attccgtggc agaccttcga tgagacgatc aaggaactca      540

gccgtttcaa gcaagaatac agccaagccg ccagtgtgct ggtgggcgat atcaaaacgc      600

tgctgatgga cagccaagat aagtacttcg aggcgaccca aaccgtgtat gaatggtgcg      660

gtgttgccac gcaactgctg gccgcgtaca ttctgctgtt cgatgagtat aacgagaaga      720

aggcgagcgc ccagaaggac attctgatca aggtgctcga cgacggcatc acgaagctga      780

acgaagcgca gaagagtctg ctcgttagta gccagagctt caataacgcc agtggcaagc      840

tcctcgcgct ggacagtcag ctgacgaacg actttagcga gaaaagcagt tacttccaga      900

gccaagttga caaaatccgc aaggaggcgt acgccggtgc cgccgcgggt gttgtggccg      960

gtccgtttgg cctcattatc agctacagca tcgcggccgg cgtggtggaa ggcaaactga     1020

tcccggagct gaaaaataaa ctgaaaagcg ttcagaattt cttcaccacg ctgagcaaca     1080

cggtgaagca agcgaacaag gacatcgacg cggcgaaact gaagctgacc acggagatcg     1140

cggccatcgg cgaaatcaag accgaaacgg aaaccacgcg cttctatgtg gactacgatg     1200

atctgatgct cagtctgctg aaggaagcgg cgaagaaaat gattaacacg tgcaacgagt     1260

accagaagcg ccatggtaag aagacgctct tcgaggtgcc ggaggtgtac gcgggtaatg     1320

gccacgacgt ggcgtactac gataagacgg acaccggtta cctcaccttc gatggtcaga     1380

gtgcccaaaa ggcgggcgaa tatatcgtta ccaaggaact gaaggcggac gttaaggtgc     1440

tcaaagaggt ggtgaaaacc caagatatta gcgttggcaa gcgcagcgag aaactggaat     1500

accgcgacta tgagctgagc ccattcgaac tcggcaacgg tatccgcgcc aaagatgagc     1560

tgcatagcgt ggaagaaatt atcggcagca accgcaagga taagttcttc ggcagtcgct     1620

tcacggatat cttccatggc gcgaaaggcg acgacgaaat ctatggcaat gacggtcacg     1680

acattctgta cggcgatgac ggcaacgacg ttattcatgg tggtgacggt aatgaccatc     1740

tggttggcgg caatggtaat gatcgcctca tcggcggcaa aggtaataat ttcctcaacg     1800

gcggcgatgg tgatgatgaa ctgcaagttt tcgagggtca gtacaacgtt ctgctcggtg     1860

gtgcgggtaa cgacattctg tatggcagcg atggcacgaa tctgttcgat ggcggtgtgg     1920

gtaacgacaa gatttacggc ggtctcggca aagatatcaa tctcggtgcg ggcaatgaca     1980

acgtgtttgt gggcagcagt accacggtga ttgatggtgg tgatggccat gatcgcgttc     2040

attacagccg cggcgaatac ggcgccctcg ttattgacgc caccgcggaa acggagaagg     2100

gtagctatag cgtgaagcgt tacgtgggcg atagcaaagc cctccacgaa accatcgcga     2160

cccatcagac caatgtgggc aaccgcgaag agaaaatcga gtaccgccgc gaggatgatc     2220

gcttccatac gggctacacg gtgaccgata gtctgaaaag tgtggaggaa atcatcggta     2280

gccagtttaa tgatatcttc aagggcagcc aattcgacga cgtgttccac ggcggcaacg     2340

gtgtggatac catcgatggc aatgatggtg acgaccatct cttcggcggt gcgggcgatg     2400

atgttatcga tggcggtaac ggcaataact ttctggttgg cggtaccggc aacgacatca     2460

ttagtggcgg caaggacaac gacatctacg ttcacaaaac gggcgacggt aacgacagca     2520

tcaccgatag cggtggccaa gataaactgg cgcatctggg taacggcaac gataaagtgt     2580

ttctggccgc gggtagtgcg gagattcatg cgggtgaggg tcacgatgtg gtgtactatg     2640

ataagaccga taccggtctg ctggtgatcg atggtacgaa agcgacggaa caaggccgct     2700

acagcgttac gcgcgagctg agtggcgcga ccaaaattct gcgtgaggtg attaaaaatc     2760

agaaaagcgc ggttggcaaa cgcgaagaga cgctggaata ccgtgactac gaactgaccc     2820

agagcggcaa cagcaatctg aaagcccacg atgaactgca tagtgtggag gagattattg     2880

gcagcaatca gcgtgacgag ttcaaaggca gcaagttccg cgacatcttc cacggcgccg     2940

acggtgacga tctgctgaac ggtaatgacg gcgatgacat tctgtacggc gacaaaggca     3000

atgatgagct gcgcggcgac aacggtaatg atcagctgta tggcggtgag ggtgatgata     3060

aactgctggg cggtaatggc aacaattatc tgagtggtgg cgacggcaac gatgagctgc     3120

aagttctggg caacggcttt aacgtgctgc gcggcggtaa aggcgatgat aagctgtacg     3180

gtagcagtgg tagcgatctg ctggatggcg gcgagggcaa tgattatctg gaaggcggtg     3240

atggcagtga ctttcaccat caccaccatc actaactaaa tatattttag gtcacctctc     3300

aaatcgtttg cctgataccg ctccaattac acgtcttgag cgattgtgta ggctggagct     3360

gcttcgaagt tcctatactt tctagagaat aggaacttcg gaataggaac ttcatttaaa     3420

tggcgcgcct tacgccccgc cctgccactc atcgcagtac tgttgtaatt cattaagcat     3480

tctgccgaca tggaagccat cacaaacggc atgatgaacc tgaatcgcca gcggcatcag     3540

caccttgtcg ccttgcgtat aatatttgcc catggtgaaa acgggggcga agaagttgtc     3600

catattggcc acgtttaaat caaaactggt gaaactcacc cagggattgg ctgagacgaa     3660

aaacatattc tcaataaacc ctttagggaa ataggccagg ttttcaccgt aacacgccac     3720

atcttgcgaa tatatgtgta gaaactgccg gaaatcgtcg tggtattcac tccagagcga     3780

tgaaaacgtt tcagtttgct catggaaaac ggtgtaacaa gggtgaacac tatcccatat     3840

caccagctca ccgtctttca ttgccatacg taattccgga tgagcattca tcaggcgggc     3900

aagaatgtga ataaaggccg gataaaactt gtgcttattt ttctttacgg tctttaaaaa     3960

ggccgtaata tccagctgaa cggtctggtt ataggtacat tgagcaactg actgaaatgc     4020

ctcaaaatgt tctttacgat gccattggga tatatcaacg gtggtatatc cagtgatttt     4080

tttctccatt ttagcttcct tagctcctga aaatctcgac aactcaaaaa atacgcccgg     4140

tagtgatctt atttcattat ggtgaaagtt ggaacctctt acgtgccgat caacgtctca     4200

ttttcgccaa aagttggccc agggcttccc ggtatcaaca gggacaccag gatttattta     4260

ttctgcgaag tgatcttccg tcacaggtag gcgcgccgaa gttcctatac tttctagaga     4320

ataggaactt cggaatagga actaaggagg atattcatat ggacccagcg cgttttttta     4380

tctatttctt agcgccagca gaaaaccggt aatgatacca atttgagcaa tatcgacctg     4440

ttcaaaattg ccacgaacga tataaaaacc gacgaaagat aaaaatagca agagatgagc     4500

attgtagggg cttatctcta ctttcctgag ggtagagctg gctgtttccc tgatgatagc     4560

gccatataaa taggcggccg c                                               4581


<210>  50
<211>  365
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene kanE

<400>  50

Met Lys His Lys Ile Leu His Phe Ser Gln Val Leu Gly Gly Val Gly 
1               5                   10                  15      


Arg Tyr Leu Glu Leu Tyr Asp Lys Tyr Ile Asn Lys Asp Ser Phe Glu 
            20                  25                  30          


Asn Ile Tyr Ile Leu Pro Ile Gly Asp Trp Glu Ala Ala Glu Ala Gln 
        35                  40                  45              


Asp Lys Arg Tyr Ile Leu Asn Ile Glu Gln Ser Phe Ser Pro Ile Lys 
    50                  55                  60                  


Leu Ile Ser Asn Val Ile Lys Ile Arg Asn Ile Leu Lys Lys Glu Lys 
65                  70                  75                  80  


Pro Asp Ile Phe Tyr Leu His Ser Thr Phe Ala Gly Val Ile Gly Arg 
                85                  90                  95      


Leu Ala Ala Ile Gly Met Arg Cys Lys Val Ile Tyr Asn Pro His Gly 
            100                 105                 110         


Trp Ser Phe Lys Met Asn Val Ser Arg Leu Lys Gln Thr Phe Tyr Lys 
        115                 120                 125             


Ile Ile Glu Gly Gly Leu Val Phe Leu Thr Asp Lys Phe Val Leu Ile 
    130                 135                 140                 


Ser Lys Ser Glu Tyr Glu Ala Ala Arg Ser Ile Gly Val Ser Glu Lys 
145                 150                 155                 160 


Lys Cys Cys Leu Ile Tyr Asn Gly Ile Glu Thr Ile Lys Lys Thr Asp 
                165                 170                 175     


Ile Ala Ile Ile Pro Lys Leu Asp Asp Lys Tyr Ile Ile Gly Met Ile 
            180                 185                 190         


Gly Arg Ile Ser Glu Gln Lys Asn Pro Met Phe Phe Ala Gln Phe Ala 
        195                 200                 205             


Lys Glu Ile Ile Lys Gln Tyr Pro Asn Thr Tyr Phe Ile Leu Val Gly 
    210                 215                 220                 


Asp Gly Glu Gln Arg Glu Ser Leu Glu Asp Tyr Leu Glu Arg Asn Asn 
225                 230                 235                 240 


Leu Asn Asp Val Phe Tyr Ile Thr Gly Trp Val Thr Asn Pro Glu Ser 
                245                 250                 255     


Tyr Leu Asn Leu Phe Asp Gln Ala Val Leu Phe Ser Lys Trp Glu Gly 
            260                 265                 270         


Leu Cys Leu Ser Val Cys Glu Tyr Met Leu Tyr Glu Lys Pro Ile Leu 
        275                 280                 285             


Val Ser Asn Ile Gly Gly Ile Asn Asp Leu Ile Gln Asn Glu Val Asn 
    290                 295                 300                 


Gly Phe Thr Ile Val Glu Gly Asp Leu Lys Asp Ala Val Asn Lys Ser 
305                 310                 315                 320 


Asn Arg Leu Arg Asn Glu Pro Lys Thr Val Ala Lys Phe Ile Glu Ala 
                325                 330                 335     


Ser Asn Ile Leu Ile Gln Glu Lys Phe Asn Ala Gln Lys Met Val Asn 
            340                 345                 350         


Ser Leu Glu Lys Leu Phe Ile Lys Leu Ser Glu Asn Lys 
        355                 360                 365 


<210>  51
<211>  288
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene cpsP

<400>  51

Met Lys Glu Lys Phe Ser Cys Ile Val Val Cys Tyr Asn Pro Asp Asn 
1               5                   10                  15      


Ser Val Leu Asp Asn Leu Lys Asn Tyr Ile Ser Tyr Val Gly Lys Val 
            20                  25                  30          


Ile Val Val Asp Asn Ser Asp Val Asp Asn Ser Gln Leu Phe Ser Ser 
        35                  40                  45              


Leu Ser Glu Tyr Leu Ile Tyr Ile Pro Leu Tyr Lys Asn Val Gly Ile 
    50                  55                  60                  


Ala Tyr Ala Leu Asn Ile Gly Val Glu Lys Ser Lys Glu Leu Gly Tyr 
65                  70                  75                  80  


Glu Tyr Ile Ile Thr Met Asp Gln Asp Ser Ser Phe Ala Thr Asn Leu 
                85                  90                  95      


Val Asp Val Tyr Ser His Tyr Ile Ser Asn Tyr Pro Ile Asp Gln Ile 
            100                 105                 110         


Gly Ala Leu Ser Pro Val Tyr Ile Thr Asp Arg Gly Phe Asn Arg Thr 
        115                 120                 125             


Ser Lys Glu Glu Phe Lys Gln Ile Lys Ile Thr Met Gln Ser Gly Ser 
    130                 135                 140                 


Met Phe Phe Thr Asp Lys Phe Asp Val Ile Gly Arg Phe Asp Asn Asp 
145                 150                 155                 160 


Leu Phe Leu Asp Val Val Asp Trp Glu Tyr Phe Phe Arg Ile Tyr Thr 
                165                 170                 175     


Leu Gly Tyr Lys Thr Ile Gln Cys Asn Lys Ala Met Leu Lys His Ala 
            180                 185                 190         


Pro Ala Glu Thr Leu Thr Leu Phe Lys Ile Lys Gly Lys Thr Ile Gly 
        195                 200                 205             


Val Gly Val Ala Ser Pro Leu Arg Tyr Tyr Tyr Gln Ile Arg Asn Leu 
    210                 215                 220                 


Leu Trp Cys Val Leu His Lys Lys Ser Phe Phe Met Ile Lys Thr Ile 
225                 230                 235                 240 


Ala Tyr Lys Phe Ile Lys Ile Leu Phe Leu Phe Asn Asn Lys Lys Gln 
                245                 250                 255     


Tyr Leu Ser Phe Ala Tyr Met Ala Ile Lys Asp Ala Phe Asn Asn Arg 
            260                 265                 270         


Leu Gly Ala Tyr Asp Thr Leu Tyr Leu Glu Lys Ser Arg Asn Glu Lys 
        275                 280                 285             


<210>  52
<211>  328
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene epsJ

<400>  52

Met Lys Asn Asp Leu Pro Leu Ile Ser Ile Ile Ile Pro Ile Tyr Asn 
1               5                   10                  15      


Val Lys Pro Tyr Leu Glu Lys Cys Val Asn Ser Val Leu Ser Gln Ser 
            20                  25                  30          


Tyr Pro Asn Leu Glu Ile Ile Leu Val Asp Asp Gly Ala Thr Asp Gly 
        35                  40                  45              


Ser Ala Gln Val Cys Asp Asp Phe Ser Glu Lys Tyr Ala Asn Ile Gln 
    50                  55                  60                  


Val Ile His Lys Lys Asn Gly Gly Leu Ser Ser Ala Arg Asn Ala Gly 
65                  70                  75                  80  


Ile Glu Ala Met Lys Gly Glu Tyr Val Phe Phe Leu Asp Ser Asp Asp 
                85                  90                  95      


Trp Ile Ala Asn Asp Ala Ile Ser Gln Leu Tyr Asp Asp Met Val Glu 
            100                 105                 110         


Tyr Asn Ala Asp Ile Thr Gly Ile Ser Phe Tyr Gln Ala Tyr Ser Asp 
        115                 120                 125             


Gly Asn Leu Val Leu Asn Thr His Leu Ile Glu Lys Gln Met Leu Ser 
    130                 135                 140                 


Lys Lys Glu Ala Leu Arg Thr Phe Leu Phe Asn Asn Tyr Leu Thr Pro 
145                 150                 155                 160 


Cys Ser Cys Gly Lys Leu Tyr Lys Ala Ser Leu Trp Lys Asp Ile Arg 
                165                 170                 175     


Phe Pro Glu Gly Arg Leu Phe Glu Asp Gln Leu Thr Thr Tyr Lys Val 
            180                 185                 190         


Ile Glu Leu Ala Asn Thr Ile Ile Phe Asn Pro Ala Ala Lys Tyr Phe 
        195                 200                 205             


Tyr Phe Lys Arg Ile Gly Ser Ile Gly His Ser Ala Phe Ser Glu Lys 
    210                 215                 220                 


Thr Tyr Asp Leu Tyr Glu Ala Val Asn Glu Gln Tyr Asn Glu Ile Thr 
225                 230                 235                 240 


Lys His His Pro Asp Ile Glu Ser Asp Leu Ala Val Ala Lys Ile Thr 
                245                 250                 255     


Trp Glu Ile Val Phe Ile Asn Met Met Leu Asn Ser Asn Tyr Ser Asp 
            260                 265                 270         


Gln Ala Ile Val Asp Lys Thr Arg Val Phe Ala Arg Lys Arg Ile Leu 
        275                 280                 285             


Asp Val Val Lys Cys Glu Phe Ile Pro Asn Leu Arg Lys Phe Gln Ile 
    290                 295                 300                 


Thr Leu Phe Ala Tyr Asn Phe Ser Leu Tyr Lys Val Leu Tyr Ala Arg 
305                 310                 315                 320 


Tyr Lys Lys Lys Asn Pro Leu Ser 
                325             


<210>  53
<211>  225
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene setA

<400>  53

Met Ile Pro Lys Lys Ile His Tyr Cys Trp Phe Gly Gly Asn Pro Leu 
1               5                   10                  15      


Pro Lys Ser Val Lys Lys Cys Ile Lys Ser Trp Lys Lys Tyr Cys Pro 
            20                  25                  30          


Asp Tyr Glu Ile Ile Glu Trp Asn Glu Ser Asn Tyr Asn Val His Lys 
        35                  40                  45              


Asn Leu Phe Ile Lys Glu Ala Tyr Glu Lys Lys Lys Phe Ala Phe Val 
    50                  55                  60                  


Ser Asp Tyr Ala Arg Leu Asp Val Val His Ser Glu Gly Gly Ile Tyr 
65                  70                  75                  80  


Leu Asp Thr Asp Val Glu Leu Ile Lys Pro Ile Asp Asp Leu Leu Ala 
                85                  90                  95      


His Ser Cys Phe Leu Ala Ser Glu Ser Ile Asp Asp Val Asn Thr Gly 
            100                 105                 110         


Leu Gly Phe Gly Ala Glu Lys Gly His Trp Phe Ile Ala Glu Asn Met 
        115                 120                 125             


Ser Val Tyr Glu Asn Met Tyr Phe Asn Met Glu Asn Ile Ile Thr Cys 
    130                 135                 140                 


Val Glu Ile Thr Thr Lys Leu Leu Ile Glu Arg Gly Phe Ser Ala Ser 
145                 150                 155                 160 


Asp Lys Ile Gln Asn Ile Asp Asp Ile Phe Ile Tyr Pro Thr Glu Tyr 
                165                 170                 175     


Phe Cys Pro Leu Asn Tyr Lys Thr His Glu Leu His Ile Thr Gln Asn 
            180                 185                 190         


Thr Tyr Ser Ile His His Tyr Asp Ala Thr Trp Gln Ser Pro Leu Met 
        195                 200                 205             


Lys Phe Lys Thr Lys Ile Lys Tyr Ile Leu Cys Leu Ala Gly Ile Ile 
    210                 215                 220                 


Lys 
225 


<210>  54
<211>  378
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene Hypothetical 
       protein

<400>  54

Met Asn Ser Leu Val Tyr Arg Ile Asp Ile Arg Thr Leu Ile Phe Ser 
1               5                   10                  15      


Ile Phe Tyr Phe Thr Phe Leu Val Ser Asp Phe Leu Leu Leu Ala Gln 
            20                  25                  30          


Asp Gly Thr Ile Thr Lys Asp Ile Ile Lys Trp Val Lys Leu Phe Ser 
        35                  40                  45              


Leu Leu Pro Leu Leu Leu Leu Ile Phe Lys Leu Pro Leu Asn Leu Leu 
    50                  55                  60                  


Ile Leu Gly Phe Phe Thr Ile Met Ile Ser Ala Phe Tyr Ser Ile Tyr 
65                  70                  75                  80  


Thr Gly Asp Ser Phe Leu Leu Tyr Ile Cys Leu Leu Met Ser Phe Ser 
                85                  90                  95      


Tyr Lys Val Asn Phe Asn Phe Leu Phe Lys Ile Gly Leu Tyr Leu Thr 
            100                 105                 110         


Ser Ile Leu Val Val Leu Ile Leu Thr Tyr Phe Phe Phe Glu Tyr Phe 
        115                 120                 125             


Leu Ile Gly Asp Ser His Phe Val Tyr Asp Ala Thr Tyr Trp Phe Lys 
    130                 135                 140                 


Arg Tyr Thr Phe Asn Phe Asp Asn Pro Asn Ala Phe Pro Met Arg Ile 
145                 150                 155                 160 


Phe Val Phe Phe Ile Phe Tyr Ile Leu His Val Gly Lys Leu Arg Leu 
                165                 170                 175     


Phe Asp Thr Phe Leu Phe Val Ile Leu Phe Gly Ile Val Phe Tyr Phe 
            180                 185                 190         


Ser Asn Ser Arg Thr Ala Phe Tyr Ile Phe Ile Leu Cys Val Leu Thr 
        195                 200                 205             


Ile His Phe Asn Gln Val Phe Asn Val Leu Asn Asn Thr Phe Val Lys 
    210                 215                 220                 


Leu Leu Ile Asn Asn Ser Ile Ile Phe Ile Thr Ile Phe Ser Ile Trp 
225                 230                 235                 240 


Ser Ala Ile Tyr Tyr Gln Asp Tyr Tyr Ser Tyr Leu Glu Pro Ile Asn 
                245                 250                 255     


Lys Ile Leu Ser Lys Arg Ile Tyr Phe Ala Asn Glu Ala Tyr Lys Ser 
            260                 265                 270         


Leu Gly Phe Glu Phe Tyr Pro Arg Asn Ile Lys Trp Trp Ile Glu Glu 
        275                 280                 285             


Ser Asp Trp His Ile Ile Asp Asn Gly Tyr Val Tyr Leu Phe Ile Ser 
    290                 295                 300                 


Gly Gly Leu Leu Val Gly Asn Leu Phe Ile Phe Ser Ile Thr Trp Leu 
305                 310                 315                 320 


Met Tyr Arg Leu Asn Lys Phe Asn Leu Ser Asn Glu Ala Ile Leu Leu 
                325                 330                 335     


Met Phe Ser Met Leu Tyr Leu Leu Ser Glu Ser His Phe Ile Asn Ile 
            340                 345                 350         


Phe Tyr Asn Ile Pro Ile Leu Leu Leu Ala Ile Phe Ile Asn Lys Thr 
        355                 360                 365             


Asn Ile Val Arg Tyr Leu Glu Cys Lys Lys 
    370                 375             


<210>  55
<211>  481
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene rfbX

<400>  55

Met Asn Lys Asn Leu Val Asn Asn Ser Ile Met Ser Phe Leu Leu Thr 
1               5                   10                  15      


Ile Ser Asn Phe Ile Phe Pro Leu Ile Thr Phe Thr Tyr Ala Ala Arg 
            20                  25                  30          


Ile Leu Gln Pro Asp Asn Met Gly Lys Phe Ala Phe Ser Leu Ser Val 
        35                  40                  45              


Val Asp Tyr Leu Ser Leu Phe Ala Thr Phe Gly Val Val Gly Tyr Gly 
    50                  55                  60                  


Val Arg Ala Cys Ala Glu Val Arg Asn Asn Lys Glu Glu Leu Thr Lys 
65                  70                  75                  80  


Thr Val Gln Glu Ile Leu Phe Ile Asn Ile Phe Leu Ala Ile Ile Ala 
                85                  90                  95      


Tyr Leu Val Ile Phe Leu Leu Ile Ser Tyr Gln His Ala Phe Arg Glu 
            100                 105                 110         


Asp Thr Leu Leu Phe Leu Ile Met Ser Ser Cys Ile Ile Phe Asn Val 
        115                 120                 125             


Ile Gly Ile Glu Trp Leu Tyr Lys Ser Leu Asp Glu Tyr Arg Tyr Ile 
    130                 135                 140                 


Thr Val Arg Ser Ile Leu Leu Lys Ile Ile Ser Leu Ile Met Ile Leu 
145                 150                 155                 160 


Cys Phe Val Lys Glu Lys Asp Asp Tyr Pro Leu Phe Ala Leu Phe Phe 
                165                 170                 175     


Val Leu Pro Ile Cys Leu Ser Ser Leu Leu Asn Ile Ile Asn Ser Arg 
            180                 185                 190         


Lys Ile Leu Leu Phe Lys Leu Phe Lys Leu Asp Leu Ser Lys His Ile 
        195                 200                 205             


Lys Pro Met Phe Val Leu Phe Leu Val Thr Leu Ser Tyr Thr Leu Tyr 
    210                 215                 220                 


Ala Asn Val Asn Asp Val Leu Leu Ala Thr Val Thr Asn Thr Glu Gln 
225                 230                 235                 240 


Val Gly Tyr Tyr Ser Val Ala Phe Lys Ile Lys Ala Ala Leu Leu Ala 
                245                 250                 255     


Phe Ile Thr Ser Thr Ser Met Val Phe Leu Pro Arg Leu Thr Glu Tyr 
            260                 265                 270         


Ile Lys Asn Asn Gln Asp Ile Glu Phe Ile Asp Leu Leu Arg Lys Ser 
        275                 280                 285             


Phe Asp Leu Val Phe Phe Leu Ala Val Pro Ile Thr Leu Phe Phe Phe 
    290                 295                 300                 


Leu Tyr Ala Lys Glu Thr Ile Phe Leu Leu Phe Gly Glu Lys Tyr Asn 
305                 310                 315                 320 


Lys Ser Ser Leu Leu Leu Gln Thr Met Ile Trp Ser Val Phe Phe Gly 
                325                 330                 335     


Gly Leu Asn Asn Ile Leu Ser Val Gln Met Leu Leu Pro Leu Lys Lys 
            340                 345                 350         


Asp Asn Gln Phe Leu Ile Ser Ile Leu Ser Gly Gly Cys Ile Ser Leu 
        355                 360                 365             


Val Val Asn Phe Ile Phe Leu Arg Glu Leu Gln Ser Leu Ser Thr Ser 
    370                 375                 380                 


Ile Ser Val Leu Val Ala Glu Val Val Ile Leu Ile Ile Gln Leu Val 
385                 390                 395                 400 


Ile Leu Arg Lys Tyr Ile Val Arg Ile Phe Asn Asn Leu Asn Pro Leu 
                405                 410                 415     


Lys Val Ile Met Ser Val Phe Phe Ser Ile Trp Phe Val Asn Leu Ile 
            420                 425                 430         


Tyr Ala Asn Phe Ile Ala Leu Gly Asn Ser Phe Leu Glu Tyr Ile Ile 
        435                 440                 445             


Ser Ile Phe Ile Phe Ser Leu Phe Tyr Val Phe Leu Leu Phe Phe Ser 
    450                 455                 460                 


Lys Glu Arg Phe Val His Asp Val Phe Phe Tyr Ile Arg Ser Lys Phe 
465                 470                 475                 480 


Asp 
    


<210>  56
<211>  236
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene vatD

<400>  56

Leu Ile Asn Leu Leu Ile Ser Ile Leu Ala Lys Ile Leu Ser Arg Ile 
1               5                   10                  15      


Ser Lys Leu Ile Leu Asn Ile Lys Lys Arg Lys Glu Tyr Lys Arg Val 
            20                  25                  30          


Gly Ser Ile Val Asp Ser Lys Asn Ile Asp Leu Ser Phe Ile Cys Gly 
        35                  40                  45              


Asn Tyr Cys Arg Val Gly Arg Asp Thr Val Ile Glu Lys Asn Val Ile 
    50                  55                  60                  


Met Gly Arg Leu Ser Tyr Ile Asn Ser Asp Met Gly Lys Thr Tyr Ile 
65                  70                  75                  80  


Gly Ser Asn Val Lys Ile Gly Ser Leu Cys Ser Ile Ser Ser Gly Val 
                85                  90                  95      


Ile Ile Ala Pro Val Asn His Tyr Leu Asn Tyr Val Thr Thr His Pro 
            100                 105                 110         


Leu Leu Tyr Asn Ser Tyr Tyr Ser Ser Ile Leu Asn Ile Asn Ser Asn 
        115                 120                 125             


Leu Leu Ser Gln Gln Glu Leu Asp Ala Asn Val Ser Thr Val Ile Gly 
    130                 135                 140                 


Asn Asp Val Trp Ile Gly Ala Asn Val Ile Ile Lys Arg Gly Val Thr 
145                 150                 155                 160 


Ile Gly Asp Gly Ala Val Ile Gly Ala Gly Ser Ile Ile Thr Lys Asp 
                165                 170                 175     


Ile Pro Ser Tyr Ala Val Val Ala Gly Val Pro Ala Lys Ile Ile Lys 
            180                 185                 190         


Tyr Arg Phe Ser Lys Asp Val Ile Glu Ser Leu Lys Asp Ser Lys Asn 
        195                 200                 205             


Val Trp Glu Leu Ser Thr Ser Glu Leu Glu Glu Asn Phe Ser His Leu 
    210                 215                 220                 


Tyr Asp Val Glu Lys Tyr Leu Asn Arg Phe Lys Leu 
225                 230                 235     


<210>  57
<211>  440
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene pglC

<400>  57

Met Ser Lys Lys Asn Ile Val Ala Gln Thr Leu Leu Leu Cys Leu Asp 
1               5                   10                  15      


Leu Leu Leu Ile Ser Met Ala Ile Phe Leu Ala Val Phe Ile Arg Asn 
            20                  25                  30          


Asn Ile Leu Pro Asn Ile Met Leu Phe Glu Pro Val Ser Tyr Ile Glu 
        35                  40                  45              


Tyr Leu Val Tyr Pro Phe Pro Tyr Val Ile Ile Val Thr Leu Phe Met 
    50                  55                  60                  


Trp Phe Gly Leu Tyr Thr Arg Arg Tyr Asp Leu Trp Gln Glu Ser Leu 
65                  70                  75                  80  


Phe Ile Ile Lys Val Cys Phe Ile Ser Phe Ile Ile Ile Phe Ala Thr 
                85                  90                  95      


Leu Ala Leu Gly Lys Asn Ile Glu Tyr Tyr Ser Arg Ala Val Leu Leu 
            100                 105                 110         


Leu Ser Leu Phe Leu Ser Val Ile Phe Leu Pro Ile Gly Arg Tyr Phe 
        115                 120                 125             


Leu Lys Lys Ser Leu Phe Arg Leu Gly Leu Trp Glu Arg Lys Val Lys 
    130                 135                 140                 


Phe Ile Gly Asn Leu Asn Lys Asn Glu Ile Gly Ile Phe Asn Ser Pro 
145                 150                 155                 160 


His Val Gly Tyr Val Leu Ser Lys Asp Asp Thr Tyr Asp Val Ile Phe 
                165                 170                 175     


Ile Ser Ser Gly Asp Lys Ser Val Ser Glu Leu Asn Asp Leu Ile Glu 
            180                 185                 190         


Ser Asn Lys Leu Leu Asn Arg Glu Val Leu Phe Ile Pro Val Leu Asn 
        195                 200                 205             


Gln Tyr Asp Phe Thr Gln Ser Val Leu Tyr Asn Asn Phe Ser Thr Arg 
    210                 215                 220                 


Leu Asn Leu Phe Thr Leu Glu Asn Lys Leu Leu Gly Lys Gln Asn Lys 
225                 230                 235                 240 


Ile Leu Lys Tyr Leu Leu Asp Tyr Val Leu Val Leu Ser Thr Leu Pro 
                245                 250                 255     


Phe Trp Gly Gly Leu Ile Leu Leu Ile Ser Ile Lys Leu Lys Leu Glu 
            260                 265                 270         


Asp Pro Lys Gly Lys Ile Phe Phe Leu Gln Lys Arg Leu Gly Gln Glu 
        275                 280                 285             


Gly Lys Ile Phe Tyr Cys Tyr Lys Phe Arg Thr Met Val Ser Asp Gln 
    290                 295                 300                 


Ser Phe Met Gln Gln Trp Leu Ile Asp Asn Pro Glu Glu Arg Asp Tyr 
305                 310                 315                 320 


Tyr Ala Val Tyr His Lys Tyr Ile Asn Asp Pro Arg Ile Thr Lys Phe 
                325                 330                 335     


Gly His Phe Leu Arg Arg Thr Ser Leu Asp Glu Leu Pro Gln Leu Phe 
            340                 345                 350         


Asn Val Leu Lys Gly Asp Met Ser Leu Val Gly Asn Arg Pro Tyr Met 
        355                 360                 365             


Val Glu Glu Gln Gln Lys Met Lys Asp Ala Ala Ser Ile Ile Leu Met 
    370                 375                 380                 


Ser Lys Pro Gly Val Thr Gly Leu Trp Gln Val Ser Gly Arg Ser Asp 
385                 390                 395                 400 


Val Ser Phe Glu Glu Arg Leu Gln Ile Asp Ser Trp Tyr Ile Lys Asn 
                405                 410                 415     


Trp Ser Ile Trp Asn Asp Ile Val Ile Leu Phe Lys Thr Val Gly Val 
            420                 425                 430         


Val Leu Arg Lys Asp Gly Ala Ser 
        435                 440 


<210>  58
<211>  354
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene rffG

<400>  58

Met Lys Lys Ile Leu Val Thr Gly Gly Ala Gly Phe Ile Gly Ser Ala 
1               5                   10                  15      


Val Val Arg His Ile Ile Asn Asp Thr Gln Asp Ser Val Val Asn Val 
            20                  25                  30          


Asp Lys Leu Thr Tyr Ala Gly Asn Leu Glu Ser Leu Leu Met Val Glu 
        35                  40                  45              


Asn Ser Pro Arg Tyr Val Phe Glu Gln Val Asp Ile Cys Asn Arg Ala 
    50                  55                  60                  


Glu Leu Asp Arg Val Phe Ala Gln His Gln Pro Asp Ala Val Met His 
65                  70                  75                  80  


Leu Ala Ala Glu Ser His Val Asp Arg Ser Ile Asp Gly Pro Ala Ala 
                85                  90                  95      


Phe Ile Glu Thr Asn Ile Val Gly Thr Tyr Thr Leu Leu Glu Ala Ala 
            100                 105                 110         


Arg Tyr Tyr Trp Asn Ser Leu Asp Ala Asp Lys Lys Ser Leu Phe Arg 
        115                 120                 125             


Phe His His Ile Ser Thr Asp Glu Val Tyr Gly Asp Leu Glu Gly Thr 
    130                 135                 140                 


Glu Asp Leu Phe Thr Glu Thr Thr Pro Tyr Ser Pro Ser Ser Pro Tyr 
145                 150                 155                 160 


Ser Ala Ser Lys Ala Ser Ser Asp His Leu Val Arg Ala Trp Leu Arg 
                165                 170                 175     


Thr Tyr Gly Leu Pro Thr Ile Val Thr Asn Cys Ser Asn Asn Tyr Gly 
            180                 185                 190         


Pro Phe His Phe Pro Glu Lys Leu Ile Pro Leu Met Ile Leu Asn Ala 
        195                 200                 205             


Leu Glu Gly Lys Pro Leu Pro Val Tyr Gly Asn Gly Gln Gln Ile Arg 
    210                 215                 220                 


Asp Trp Leu Phe Val Glu Asp His Ala Arg Ala Leu Tyr Lys Val Val 
225                 230                 235                 240 


Thr Glu Gly Lys Val Gly Glu Thr Tyr Asn Ile Gly Gly His Asn Glu 
                245                 250                 255     


Lys Ala Asn Ile Asp Val Val Arg Thr Ile Cys Ser Leu Leu Glu Glu 
            260                 265                 270         


Leu Val Pro Asn Lys Pro Ala Gly Val His Lys Tyr Glu Asp Leu Ile 
        275                 280                 285             


Thr Tyr Val Thr Asp Arg Pro Gly His Asp Val Arg Tyr Ala Ile Asp 
    290                 295                 300                 


Ala Thr Lys Ile Gly Arg Glu Leu Gly Trp Lys Pro Gln Glu Thr Phe 
305                 310                 315                 320 


Glu Thr Gly Ile Arg Lys Thr Val Glu Trp Tyr Leu Asn Asn Thr Glu 
                325                 330                 335     


Trp Trp Ser Arg Val Leu Asp Gly Ser Tyr Asn Arg Glu Arg Leu Gly 
            340                 345                 350         


Ser Asn 
        


<210>  59
<211>  292
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene rmlA2

<400>  59

Met Lys Gly Ile Ile Leu Ala Gly Gly Ser Gly Thr Arg Leu Tyr Pro 
1               5                   10                  15      


Ile Thr Arg Gly Val Ser Lys Gln Leu Leu Pro Val Tyr Asp Lys Pro 
            20                  25                  30          


Met Ile Tyr Tyr Pro Leu Ser Val Leu Met Leu Ala Gly Ile Arg Glu 
        35                  40                  45              


Val Leu Ile Ile Thr Thr Pro Glu Asp Asn Glu Ser Phe Lys Arg Leu 
    50                  55                  60                  


Leu Gly Asp Gly Ser Asp Phe Gly Ile Gln Leu Ser Tyr Ala Ile Gln 
65                  70                  75                  80  


Pro Ser Pro Asp Gly Leu Ala Gln Ala Phe Leu Ile Gly Glu Glu Phe 
                85                  90                  95      


Ile Gly Gln Asp Ser Val Cys Leu Val Leu Gly Asp Asn Ile Phe Tyr 
            100                 105                 110         


Gly Gln His Phe Thr Gln Ser Leu Gln Glu Ala Val Lys Ser Val Glu 
        115                 120                 125             


Thr Lys Gly Ala Thr Val Phe Gly Tyr Gln Val Lys Asp Pro Glu Arg 
    130                 135                 140                 


Phe Gly Val Val Glu Phe Asp Asp Asn Phe Arg Ala Leu Ser Ile Glu 
145                 150                 155                 160 


Glu Lys Pro Ile Gln Pro Lys Ser Asn Trp Ala Val Thr Gly Leu Tyr 
                165                 170                 175     


Phe Tyr Asp Asn Arg Val Val Glu Phe Ala Lys Gln Val Lys Pro Ser 
            180                 185                 190         


Ala Arg Gly Glu Leu Glu Ile Thr Thr Leu Asn Glu Met Tyr Leu Asn 
        195                 200                 205             


Asp Gly Ser Leu Asn Val Gln Leu Leu Gly Arg Gly Phe Ala Trp Leu 
    210                 215                 220                 


Asp Thr Gly Thr His Asp Ser Leu His Asp Ala Ala Ala Phe Val Lys 
225                 230                 235                 240 


Thr Val Gln Asn Leu Gln Asn Leu Gln Val Ala Cys Leu Glu Glu Ile 
                245                 250                 255     


Ala Tyr Arg Asn Gly Trp Leu Ser Leu Glu Gln Leu Glu Ala Leu Thr 
            260                 265                 270         


Lys Pro Met Ala Lys Asn Glu Tyr Gly Gln Tyr Leu Leu Arg Leu Thr 
        275                 280                 285             


Lys Gly Thr Lys 
    290         


<210>  60
<211>  291
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene rmlD

<400>  60

Met Ala Arg Phe Leu Ile Thr Gly Ala Lys Gly Gln Val Gly Tyr Cys 
1               5                   10                  15      


Leu Thr Lys Gln Leu Gln Ser Lys Ala Asp Val Leu Ala Val Asp Arg 
            20                  25                  30          


Asp Glu Leu Asp Ile Thr Asn Arg Asp Ala Val Phe Lys Val Val Arg 
        35                  40                  45              


Glu Phe His Pro Asp Val Ile Ile Asn Ala Ala Ala His Thr Ala Val 
    50                  55                  60                  


Asp Arg Ala Glu Ser Glu Ile Glu Leu Ser Glu Ala Ile Asn Val Lys 
65                  70                  75                  80  


Gly Pro Gln Tyr Leu Ala Glu Ala Ala Asn Glu Ile Asp Ala Ile Ile 
                85                  90                  95      


Leu His Ile Ser Thr Asp Tyr Val Phe Glu Gly Thr Gly Ser Gly Glu 
            100                 105                 110         


Tyr Lys Glu Asn Asp Glu Pro Asn Pro Gln Gly Val Tyr Gly Lys Thr 
        115                 120                 125             


Lys Leu Ala Gly Glu Ile Ala Val Gln Gln Ala Asn Lys Arg His Ile 
    130                 135                 140                 


Ile Leu Arg Thr Ala Trp Val Phe Gly Glu His Gly Asn Asn Phe Val 
145                 150                 155                 160 


Lys Thr Met Leu Arg Leu Ala Lys Glu Arg Glu Ser Leu Gly Ile Val 
                165                 170                 175     


Ser Asp Gln Phe Gly Gly Pro Thr Tyr Ala Gly Asp Ile Ala Ser Ser 
            180                 185                 190         


Leu Ile His Ile Ala Asn Ile Ile Leu Asn Ser Lys Ile Asp Val Phe 
        195                 200                 205             


Gly Val Tyr His Phe Thr Gly Lys Pro Tyr Val Ser Trp Ala Asp Phe 
    210                 215                 220                 


Ala Lys Lys Ile Phe Asp Glu Ala Val Ser Gln Lys Val Leu Glu Lys 
225                 230                 235                 240 


Ala Pro Leu Val Asn Phe Ile Ala Thr Ser Asn Tyr Pro Thr Ser Ala 
                245                 250                 255     


Lys Arg Pro Ala Asn Ser Arg Leu Asp Leu Thr Lys Ile Asp Glu Val 
            260                 265                 270         


Phe Gly Ile Lys Pro Ser Asn Trp Gln Gln Ala Leu Lys Asn Ile Lys 
        275                 280                 285             


Ala Tyr Ala 
    290     


<210>  61
<211>  180
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence of rfb (APP2) cluster gene rfbC

<400>  61

Met Lys Ile Ile Glu Thr Asn Ile Pro Asp Val Lys Leu Leu Glu Pro 
1               5                   10                  15      


Gln Val Phe Gly Asp Glu Arg Gly Phe Phe Met Glu Ile Phe Arg Asp 
            20                  25                  30          


Glu Trp Phe Arg Gln Tyr Val Ala Asp Arg Thr Phe Val Gln Glu Asn 
        35                  40                  45              


His Ser Lys Ser Ile Lys Gly Val Leu Arg Gly Leu His Tyr Gln Thr 
    50                  55                  60                  


Glu Asn Thr Gln Gly Lys Leu Val Arg Val Val Gln Gly Ser Val Phe 
65                  70                  75                  80  


Asp Val Ala Val Asp Leu Arg Lys Ser Ser Pro Thr Phe Gly Gln Trp 
                85                  90                  95      


Val Gly Glu Val Leu Ser Ala Glu Asn Lys Arg Gln Leu Trp Val Pro 
            100                 105                 110         


Glu Gly Phe Ala His Gly Phe Tyr Val Leu Thr Glu Thr Ala Glu Phe 
        115                 120                 125             


Thr Tyr Lys Cys Thr Asp Tyr Tyr Asn Pro Lys Ala Glu His Ser Leu 
    130                 135                 140                 


Ile Trp Asn Asp Pro Thr Val Ala Ile Asn Trp Asn Leu Gly Gly Ala 
145                 150                 155                 160 


Pro Ser Leu Ser Ala Lys Asp Leu Ala Gly Lys Val Leu Asn Glu Ala 
                165                 170                 175     


Val Leu Phe Glu 
            180 


<210>  62
<211>  267
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 DUF4422 domain-containing protein

<400>  62

Met Asn Lys Asp Ile Lys Ile Leu Ile Ala Thr His Lys Gln His Phe 
1               5                   10                  15      


Met Pro Ser Asp Glu Met Tyr Leu Pro Leu His Val Gly Lys Leu Gly 
            20                  25                  30          


Lys Ala Asp Leu Gly Tyr Gln Gly Asp Asp Ser Gly Asp Asn Ile Ser 
        35                  40                  45              


Ile Lys Asn Pro Asn Phe Cys Glu Leu Thr Gly Leu Tyr Trp Ala Trp 
    50                  55                  60                  


Lys Asn Leu Pro Asn Asp Tyr Leu Gly Leu Ile His Tyr Arg Arg Phe 
65                  70                  75                  80  


Phe Ser Val Lys Asn Arg Ala Glu Arg Lys Asn Asn Pro Leu Glu Thr 
                85                  90                  95      


Leu Tyr Leu Thr Asn Glu Glu Ala Asn Gln Leu Leu Ser Gln Tyr Asp 
            100                 105                 110         


Val Ile Val Pro Ser Lys Arg Asn Tyr Tyr Ile Glu Thr Leu Tyr Ser 
        115                 120                 125             


His Tyr Ala Asn Thr Leu His Ala Glu His Leu Asp Val Thr Arg Glu 
    130                 135                 140                 


Ile Ile Ala Glu Lys Cys Ser Glu Tyr Leu Ala Ser Phe Asp Ala Val 
145                 150                 155                 160 


Ile Lys Gln Arg Ser Gly Tyr Met Phe Asn Met Phe Ile Met Ser Lys 
                165                 170                 175     


Ala Leu Val Asn Asp Tyr Cys Ser Trp Leu Phe Pro Ile Leu Phe Glu 
            180                 185                 190         


Leu Glu Lys Arg Ile Pro Thr Asp Gln Tyr Ser Ala Phe His Ala Arg 
        195                 200                 205             


Phe Tyr Gly Arg Val Ser Glu Leu Leu Phe Asn Val Trp Leu Lys Gln 
    210                 215                 220                 


Tyr Ser Gln Ser Asn Pro Leu Lys Val Lys Ala Ile Pro Phe Val Tyr 
225                 230                 235                 240 


Gly Glu Lys Ile Asn Trp Leu Lys Lys Gly Thr Ala Phe Leu Val Ala 
                245                 250                 255     


Lys Phe Phe Gly Lys Lys Tyr Glu Lys Ser Phe 
            260                 265         


<210>  63
<211>  355
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Glycosyltransferase family 1 protein

<400>  63

Met Lys Arg Ile Leu Val Tyr Gly Met Thr Asp Asn Phe Gly Gly Met 
1               5                   10                  15      


Glu Ala Tyr Ile His Asn Ile Tyr Gln His Leu Asp Lys Thr Gln Ile 
            20                  25                  30          


Gln Phe Asp Phe Val Cys Asp Phe Pro Lys Met Thr Leu Ser Asp Tyr 
        35                  40                  45              


Tyr Leu Asp Asn Gly Cys Lys Ile His Phe Ile Pro Pro Lys Asn Gln 
    50                  55                  60                  


Gly Leu Phe Lys Ser Leu Trp Ala Met Trp Lys Val Ile Lys Glu Asn 
65                  70                  75                  80  


Asn Tyr Asp Val Ile Tyr Phe Asn Ile Met Asn Ala Gly Tyr Val Leu 
                85                  90                  95      


Asn Met Leu Pro Ala Phe Leu Leu Gly Lys Lys Ile Ile Ala His Ser 
            100                 105                 110         


His Asn Ala Asp Thr Asp Lys Lys Lys Leu His Tyr Gly Leu Arg Leu 
        115                 120                 125             


Leu Leu Asn Ile Val Thr Lys Ile Lys Leu Ala Cys Ser Lys Glu Ala 
    130                 135                 140                 


Gly Phe Phe Met Phe Gly Lys Glu Glu Asn Phe Ser Ile Ile Asn Asn 
145                 150                 155                 160 


Ala Ile Asn Leu Asp Arg Tyr Leu Tyr Ser Glu Glu Lys Tyr Arg Asp 
                165                 170                 175     


Leu Arg His Lys Leu Gly Trp Gly Asp Lys Lys Val Ile Leu Tyr Val 
            180                 185                 190         


Ala Arg Met Asn His Gln Lys Asn Pro Leu Phe Ala Leu Tyr Ile Met 
        195                 200                 205             


Arg Glu Leu Lys Gln Ser Met Pro Asn Ala Val Leu Val Tyr Val Gly 
    210                 215                 220                 


Thr Gly Glu Leu Lys Glu Gln Val Gln Gln Tyr Ile Leu Asp Asn Asn 
225                 230                 235                 240 


Leu Asp Asn Val Ile Leu Leu Gly Leu Arg Asn Asp Val Asn Glu Leu 
                245                 250                 255     


Met Ile Ala Ala Asp Leu Phe Ile Leu Pro Ser Leu Phe Glu Gly Leu 
            260                 265                 270         


Pro Ile Val Ala Val Glu Ala Gln Ala Ala Gly Leu Pro Ile Ile Leu 
        275                 280                 285             


Ser Glu Asn Ile Ser Ile Glu Ala Lys Leu Val Asn Ser Thr Tyr Phe 
    290                 295                 300                 


Leu Pro Ile Asn Asp Val Phe Leu Trp Val Asn Lys Ile Lys Lys Ile 
305                 310                 315                 320 


Leu Glu Ile Ser Gly Asn Lys Arg Phe Ser Asp Gln Leu Ala Leu Ser 
                325                 330                 335     


Lys Ala Gly Tyr Asn Ile Glu Ser Val Val Lys Asn Ile Gln Lys Ile 
            340                 345                 350         


Leu Val Asn 
        355 


<210>  64
<211>  350
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Glycosyltransferase family 2 protein

<400>  64

Met Asn Asn Thr Lys Ile Ser Leu Ile Phe Ala Cys Tyr Asn Val Ser 
1               5                   10                  15      


Gln Tyr Leu Asp Asn Leu Phe Gln Leu Leu Thr Asn Gln Pro Tyr Gln 
            20                  25                  30          


Asn Ile Glu Ile Ile Phe Val Glu Asp Cys Ala Thr Asp Asp Thr Lys 
        35                  40                  45              


Ala Lys Leu Gln Ser Phe Asn Asp Pro Arg Val Lys Leu Leu Cys Asn 
    50                  55                  60                  


Glu Lys Asn Ile Gly Ala Ala Glu Ser Arg Asn Arg Gly Ile Gln Ile 
65                  70                  75                  80  


Val Thr Gly Glu Tyr Ile Trp Phe Pro Asp Pro Asp Asp Leu Phe Asp 
                85                  90                  95      


Glu Leu Leu Leu Thr Lys Val Asn Thr Ile Ile Gln Lys Asn Arg Pro 
            100                 105                 110         


Asp Val Ile Ser Ile Gly Met Gln Glu Arg Tyr Glu Ile Asn Gly Lys 
        115                 120                 125             


Thr Asp Tyr Thr Lys Asp Ile Ile Ser Arg Tyr Asp Gly Leu Ile Thr 
    130                 135                 140                 


Gly Asp Phe Thr Asp Val Phe Val Asp Leu Glu Glu Ser Phe Leu Phe 
145                 150                 155                 160 


Gly Tyr Thr Asn Asn Lys Phe Tyr Lys Ala Asn Ile Ile His Lys Tyr 
                165                 170                 175     


Arg Ile Leu Asn Glu His Gln Ala Leu Lys Glu Asp Phe Glu Phe Asn 
            180                 185                 190         


Ile Lys Val Phe Lys Gln Val Ser Asn Phe Tyr Leu Leu Asn Glu Pro 
        195                 200                 205             


Leu Tyr Phe Tyr Met Lys Arg Asn Asn Gly Ser Leu Thr Ser Lys Phe 
    210                 215                 220                 


Val Pro Asp Tyr Phe Arg Ile His Met Gln Thr Leu Ala Ser Phe Lys 
225                 230                 235                 240 


Ser Leu Ile Glu Val Lys Ala Thr Ile Asn Asp Asn Val Asn Arg Leu 
                245                 250                 255     


Leu Val Asn Arg Phe Val Arg Tyr Cys Leu Ser Ala Ile Glu Arg Asn 
            260                 265                 270         


Ser Ser Leu Lys Ser Gly Met Ser Phe Leu Glu Gln Asn Gln Trp Ile 
        275                 280                 285             


Lys Glu Asn Ile Phe Asn Gln Glu Lys Tyr Asn Glu Tyr Leu Leu Leu 
    290                 295                 300                 


Ser Asp Leu Val Asn Lys Lys Gln Lys Leu Phe Tyr Phe Leu Ile Lys 
305                 310                 315                 320 


Tyr Arg Ile Gly Phe Leu Leu Val Thr Ala Ala Asn Ile Val Lys Leu 
                325                 330                 335     


Val Lys Ala Lys Phe Pro Ile Leu Phe Val Lys Leu Lys Gly 
            340                 345                 350 


<210>  65
<211>  349
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Beta-1,6-galactofuranosyltransferase

<400>  65

Met Lys Lys Tyr Gln Ile Val Glu Leu Ser Thr Glu His Asn His Ala 
1               5                   10                  15      


Gly Ser Lys Ala Val Gln Asp Val Tyr Glu Ile Ala Leu Ser Met Gly 
            20                  25                  30          


Tyr Lys Ala Asn Val Val Arg Thr Ala Thr Ser Val Asp Ser Leu Leu 
        35                  40                  45              


Ala Lys Ile Leu Arg Gln Val Ile Phe Phe Ile Asp Trp Leu Lys Ile 
    50                  55                  60                  


Tyr Phe Ser Ile Glu Ser Asn Ser Ile Val Leu Ile Gln Asn Pro Tyr 
65                  70                  75                  80  


Tyr His Lys Gln Leu Ile Arg Asn Trp Ile Leu Asn Arg Leu Lys Arg 
                85                  90                  95      


Ile Lys Lys Val Lys Phe Ile Ser Leu Val His Asp Val Glu Glu Leu 
            100                 105                 110         


Arg Lys Ser Leu Tyr Asn Asn Tyr Tyr Lys Asn Glu Phe Glu Thr Met 
        115                 120                 125             


Leu Ser Leu Ala Asp Ser Ile Ile Val His Asn Asp Lys Met Lys Ser 
    130                 135                 140                 


Phe Phe Ile Lys Lys Gly Tyr Ser Glu Asp Lys Leu Ile Ser Leu Gly 
145                 150                 155                 160 


Ile Phe Asp Tyr Leu Gln Lys Ser Val Asp Lys Lys Arg Val Ser Phe 
                165                 170                 175     


Glu Arg Ala Ile Ser Val Ala Gly Asn Leu Asp Ile Lys Lys Ser Ser 
            180                 185                 190         


Tyr Ile Ala Gln Leu Gly Ser Leu Pro Ala Ile Lys Ala His Leu Tyr 
        195                 200                 205             


Gly Pro Asn Phe Glu His Ser Leu Glu Ala Phe Pro Asn Ile Glu Tyr 
    210                 215                 220                 


His Gly Ser Phe Pro Ala Thr Glu Ile Pro Gln Lys Leu Val Ser Gly 
225                 230                 235                 240 


Phe Gly Leu Val Trp Asp Gly Gln Ser Ile Glu Thr Cys Thr Gly Asp 
                245                 250                 255     


Phe Gly Glu Tyr Leu Gln Tyr Asn Asn Pro His Lys Leu Ser Leu Tyr 
            260                 265                 270         


Leu Ser Ser Gly Met Pro Val Val Ile Trp Asp Lys Ala Ala Glu Ala 
        275                 280                 285             


Asp Phe Val Lys Lys His Asn Val Gly Leu Cys Val Ser Ser Leu Ser 
    290                 295                 300                 


Glu Leu Gln Asp Lys Leu Asn Val Met Thr Glu Gln Glu Phe Glu Glu 
305                 310                 315                 320 


Met Val Asn Asn Val Glu Lys Gln Thr Ala Cys Leu Ile Ser Gly Glu 
                325                 330                 335     


Tyr Thr Lys Lys Ala Ile Ser Glu Ala Glu Arg Val Ile 
            340                 345                 


<210>  66
<211>  443
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Oligosaccharide repeat unit 
       polymerase

<400>  66

Met Phe Leu Tyr Leu Leu Val Phe Ser Leu Leu Leu Ile Leu Ile Phe 
1               5                   10                  15      


Asn Leu Leu Ile Val Asn Leu Asp Tyr Met His Pro Ser Ile Leu Phe 
            20                  25                  30          


Val Val Pro Phe Leu Val Phe Gly Val Thr Ser Ile Leu Gly Glu Glu 
        35                  40                  45              


Ala Tyr Lys Ile Ile Phe His Glu Glu Thr Leu Leu Val Ile Val Ser 
    50                  55                  60                  


Ser Ala Leu Ile Phe Thr Phe Ile Thr Leu Leu Ser Gln Thr Val Tyr 
65                  70                  75                  80  


Lys Ser Lys Glu Asn Leu Asn Phe Pro Leu Thr Glu Ile Ile Ile Ser 
                85                  90                  95      


Lys Lys Val Thr Leu Phe Phe Ile Val Phe Phe Ile Val Thr Gln Leu 
            100                 105                 110         


Ala Phe Ile Lys Tyr Leu Glu Ala Ile Ser Leu Ala His Phe Gly Tyr 
        115                 120                 125             


Ser Gly Ser Leu Gly Glu Met Ile Ser Leu Tyr Asp Val Met Thr Lys 
    130                 135                 140                 


Phe Trp Thr Glu Ile Phe Ser Glu Leu Asn Val Pro Ile Pro Leu Leu 
145                 150                 155                 160 


Tyr Arg Ile Gly Asn Pro Ile Thr Gln Gly Phe Gly Tyr Leu Ile Val 
                165                 170                 175     


Tyr Ile Phe Ile His Asn Tyr Val Ala Thr Lys Arg Ile Asp Lys Leu 
            180                 185                 190         


His Leu Leu Ile Ile Leu Leu Leu Cys Leu Asn Ile Ile Leu Asn Gly 
        195                 200                 205             


Ser Arg Ser Pro Ile Phe Arg Ile Val Thr Met Met Leu Ile Thr Phe 
    210                 215                 220                 


Tyr Val Leu Tyr Asn Lys Gln Asn Asn Val Arg Arg Gly Asn Ile Lys 
225                 230                 235                 240 


Phe Leu Leu Lys Ser Leu Leu Ile Val Ile Phe Ser Gly Thr Phe Phe 
                245                 250                 255     


Ile Ala Leu Leu Ser Leu Met Gly Arg Glu Asn Asp Leu Asp Met Phe 
            260                 265                 270         


His Tyr Ile Phe Ile Tyr Val Gly Ala Pro Leu Val Asn Leu Asp Asn 
        275                 280                 285             


Tyr Leu Ala Phe Arg Pro Asp Gly Ser Tyr Ala Thr Ile Phe Gly Glu 
    290                 295                 300                 


Gln Thr Phe Arg Gly Leu Tyr Ala Tyr Ile Ala Lys Ile Ile Ser Asp 
305                 310                 315                 320 


Glu Ser Leu Ile Phe Pro Thr Ile Asp Gln Phe Thr Phe Ser Asn Asn 
                325                 330                 335     


Gly Leu Glu Ile Gly Asn Val Tyr Thr Thr Phe Tyr Ser Phe Ile Tyr 
            340                 345                 350         


Asp Phe Glu Tyr Val Gly Phe Ile Pro Leu Ile Leu Ile Ile Ala Leu 
        355                 360                 365             


Tyr Tyr Val Phe Thr Tyr Gln Arg Leu Lys Thr Arg Ala Ile Lys Thr 
    370                 375                 380                 


Asn Lys Val His Phe Ser Leu Phe Ile Tyr Ala Tyr Leu Phe Asn Asp 
385                 390                 395                 400 


Leu Ile Met Leu Ala Phe Ser Asn Arg Phe Tyr Thr Thr Val Leu Asp 
                405                 410                 415     


Ile Gly Phe Ile Lys Ile Val Ile Phe Ser Tyr Ile Cys His Leu Leu 
            420                 425                 430         


Phe Val His Arg Ser Lys Ile Lys Gly Thr Val 
        435                 440             


<210>  67
<211>  483
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Flippase

<400>  67

Met Asn Val Lys Ser Val Lys Phe Asn Phe Ile Met Asn Leu Ile Leu 
1               5                   10                  15      


Thr Val Ser Asn Phe Leu Phe Pro Leu Val Thr Phe Pro Tyr Val Ser 
            20                  25                  30          


Arg Ile Leu Gln Pro Glu Gly Thr Gly Lys Val Ala Phe Ala Ile Ser 
        35                  40                  45              


Val Val Ser Tyr Phe Ser Ile Phe Ala Ser Leu Gly Val Ala Thr Tyr 
    50                  55                  60                  


Gly Val Arg Ala Cys Ala Gln Val Arg Asp Asn Lys Asp Leu Leu Ser 
65                  70                  75                  80  


Arg Thr Val His Glu Leu Leu Phe Ile Asn Ile Ile Ala Thr Ile Ile 
                85                  90                  95      


Val Tyr Val Cys Phe Leu Leu Val Val Ala Phe Thr Pro Arg Phe Ser 
            100                 105                 110         


Ala Glu Lys Glu Leu Phe Trp Ala Thr Ser Ile Phe Ile Leu Phe Thr 
        115                 120                 125             


Ile Ile Gly Ile Glu Trp Leu Tyr Lys Gly Leu Glu Lys Tyr Gln Tyr 
    130                 135                 140                 


Ile Thr Ile Arg Thr Ile Ile Phe Lys Leu Ile Ala Leu Val Leu Val 
145                 150                 155                 160 


Phe Val Phe Ile Lys Thr Lys Asp Asp Tyr Val Ile Phe Ala Val Ile 
                165                 170                 175     


Ser Val Phe Ala Ile Val Gly Ser Gly Ile Phe Asn Leu Phe Asn Ser 
            180                 185                 190         


Arg Lys Leu Ile Asn Tyr His Leu Tyr Glu Asp Tyr Glu Phe Arg Lys 
        195                 200                 205             


His Phe Lys Pro Met Phe Leu Leu Phe Leu Thr Thr Leu Ser Ile Ala 
    210                 215                 220                 


Ile Tyr Thr Ser Val Asp Glu Ala Ile Leu Gly Leu Leu Thr Ser Pro 
225                 230                 235                 240 


Gln Asp Val Gly Tyr Tyr Asn Ala Ala Met Lys Val Lys Gly Ile Leu 
                245                 250                 255     


Phe Thr Leu Ile Thr Ser Leu Gly Ile Val Leu Leu Pro Arg Leu Ser 
            260                 265                 270         


Tyr Tyr Val Glu Asn Asn Met Thr Asp Glu Phe His Ala Ala Leu Lys 
        275                 280                 285             


Lys Ser Met Asn Phe Ile Ile Val Ile Ala Val Pro Val Val Ile Phe 
    290                 295                 300                 


Phe Met Leu Phe Ala Lys Glu Ile Ile Leu Leu Leu Ala Gly Glu Ser 
305                 310                 315                 320 


Tyr Ile Asn Ala Ile Leu Pro Leu Gln Ile Ile Val Trp Ala Leu Leu 
                325                 330                 335     


Leu Ser Ala Ile Thr Asn Ile Leu Gly Ile Gln Ile Leu Leu Pro Leu 
            340                 345                 350         


Lys Lys Asp Lys Glu Leu Leu Ile Ser Val Leu Leu Ala Ala Ile Val 
        355                 360                 365             


Asp Ile Val Ala Asn Leu Ile Leu Val Pro Gln Leu Ala Ser Val Gly 
    370                 375                 380                 


Thr Ala Ile Ser Val Val Met Ala Glu Leu Thr Val Leu Val Val Gln 
385                 390                 395                 400 


Leu Val Ile Leu Arg Lys Tyr Ile Trp Ile Leu Phe Ser Asn Leu Gln 
                405                 410                 415     


Phe Val Arg Ile Gly Leu Ser Ile Val Phe Ser Ile Val Leu Ser Leu 
            420                 425                 430         


Ser Ile Tyr Gln Trp Asn Ile Thr Asn Ser Ile Met Leu Thr Phe Leu 
        435                 440                 445             


Ile Met Gly Phe Ile Phe Phe Thr Thr Tyr Phe Ile Leu Leu Leu Ile 
    450                 455                 460                 


Leu Lys Glu Asn Phe Met Met Tyr Val Tyr Gln Thr Ile Gln His Lys 
465                 470                 475                 480 


Ile Leu Lys 
            


<210>  68
<211>  365
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 UDP-galactopyranose mutase

<400>  68

Met Lys Tyr Asp Tyr Leu Ile Val Gly Ala Gly Leu Phe Gly Ser Ile 
1               5                   10                  15      


Phe Ala Arg Glu Ala Thr Lys Arg Gly Lys Lys Cys Leu Val Ile Glu 
            20                  25                  30          


Lys Arg Asp His Ile Gly Gly Asn Cys Tyr Thr Gln Asn Val Glu Gly 
        35                  40                  45              


Ile Asn Val His Lys Tyr Gly Ala His Ile Phe His Thr Ser Asn Lys 
    50                  55                  60                  


Val Val Trp Asp Tyr Ile Gln Gln Phe Ala Glu Phe Asn Arg Phe Thr 
65                  70                  75                  80  


Asn Ser Pro Val Ala Arg Tyr Lys Asp Glu Leu Tyr Ser Leu Pro Phe 
                85                  90                  95      


Asn Met Leu Thr Phe Asn Lys Met Trp Gly Val Ile Thr Pro Gln Glu 
            100                 105                 110         


Ala Glu Ala Lys Ile Lys Glu Gln Ile Ala Lys Glu Asn Ile Thr Asp 
        115                 120                 125             


Pro Lys Asn Leu Glu Glu Gln Ala Ile Ser Leu Val Gly Arg Asp Ile 
    130                 135                 140                 


Tyr Glu Lys Leu Ile Lys Gly Tyr Thr Glu Lys Gln Trp Gly Arg Lys 
145                 150                 155                 160 


Cys Thr Glu Leu Pro Ala Phe Ile Ile Lys Arg Leu Pro Val Arg Tyr 
                165                 170                 175     


Thr Tyr Asp Asn Asn Tyr Phe Tyr Asp Thr Tyr Gln Gly Ile Pro Ile 
            180                 185                 190         


Gly Gly Tyr Thr Gly Ile Phe Glu Arg Met Leu Glu Gly Ile Glu Val 
        195                 200                 205             


Lys Leu Gly Val Asp Phe Phe Ala Glu Arg Glu His Tyr Glu Ser Leu 
    210                 215                 220                 


Ala Glu Lys Ile Val Phe Thr Gly Met Ile Asp Glu Tyr Phe Gly Tyr 
225                 230                 235                 240 


Gln Phe Gly Lys Leu Glu Tyr Arg Ser Leu Arg Phe Asp Asn Glu Val 
                245                 250                 255     


Leu Asn Ile Pro Asn Tyr Gln Gly Asn Ala Val Val Asn Tyr Thr Glu 
            260                 265                 270         


Ala Glu Val Pro Tyr Thr Arg Ile Ile Glu His Lys His Phe Glu Tyr 
        275                 280                 285             


Gly Thr Gln Pro Lys Thr Val Ile Thr Arg Glu His Ser Lys Glu Tyr 
    290                 295                 300                 


Glu Glu Gly Asp Glu Pro Tyr Tyr Pro Ile Asn Asp Ala Arg Asn Asn 
305                 310                 315                 320 


Glu Leu Tyr Ala Lys Tyr Lys Ala Leu Ala Asp Ala Thr Pro Asn Val 
                325                 330                 335     


Ile Phe Gly Gly Arg Leu Ala Gln Tyr Lys Tyr Phe Asp Met His Asn 
            340                 345                 350         


Ile Ile Ala Glu Ala Leu Glu Cys Val Lys Val His Phe 
        355                 360                 365 


<210>  69
<211>  436
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 UDP-phosphate galactose 
       phosphotransferase

<400>  69

Met Asn Lys Ile Ile Ala Lys Ile Ser Leu Ile Leu Val Asp Ile Val 
1               5                   10                  15      


Ala Ile Phe Val Ser Ile Leu Ile Ala Val Ser Leu Arg Lys Ile Leu 
            20                  25                  30          


Gly Leu Leu Phe Thr Leu Pro Glu Ile Asp Tyr Ser Tyr Ile Phe Phe 
        35                  40                  45              


Ala Tyr Val Tyr Leu Ile Leu Ile Leu Met Met Thr Tyr Leu Gly Ala 
    50                  55                  60                  


Tyr Thr Lys Arg Tyr Asp Phe Trp His Glu Ser Arg Leu Ile Val Arg 
65                  70                  75                  80  


Gly Ser Phe Leu Ser Leu Leu Ile Leu Leu Ser Ala Leu Ala Leu Gly 
                85                  90                  95      


Gln Asn Ala Glu Tyr Tyr Ser Arg Ser Thr Leu Val Leu Ile Phe Leu 
            100                 105                 110         


Cys Cys Ala Ile Val Leu Pro Ile Ala Lys Ile Phe Thr Lys Lys Ile 
        115                 120                 125             


Leu Phe Lys Leu Gly Ile Trp Gln Leu Pro Ala Lys Val Ile Ser Glu 
    130                 135                 140                 


Asn Asp Gln Phe Lys Asn Glu Leu Phe Glu Asp Gln Tyr Leu Gly Tyr 
145                 150                 155                 160 


Val Lys Ala Lys His Ser Glu His Lys Ile Ile Phe Ile Asp Gly Ala 
                165                 170                 175     


Asn Leu Gly Lys Asp Arg Leu Asn Gln Ile Ile Glu Asp Asn Ile Lys 
            180                 185                 190         


Asn Ser Arg Glu Ile Ile Phe Thr Pro Val Leu Asn Gly Tyr Asp Phe 
        195                 200                 205             


Ser His Ser Tyr Ile Tyr Asn Ile Phe Asn Thr Arg Thr Asn Ile Phe 
    210                 215                 220                 


Thr Leu Glu Asn Glu Leu Leu Ser Lys Ser Asn Arg Ile Phe Lys Leu 
225                 230                 235                 240 


Leu Met Asp Tyr Ile Leu Val Leu Gly Ser Ala Val Phe Trp Val Pro 
                245                 250                 255     


Val Leu Val Leu Ile Ala Phe Trp Ile Lys Lys Glu Asp Pro Lys Gly 
            260                 265                 270         


Glu Val Phe Phe Leu Gln Arg Arg Leu Gly Val Asn Gly Lys Glu Phe 
        275                 280                 285             


Met Cys Tyr Lys Phe Arg Ser Met Tyr Ser Asp Gln Ser Phe Met Gln 
    290                 295                 300                 


Glu Trp Leu Glu Lys Asn Pro Glu Glu Ala Ala Tyr Tyr Arg Ile Tyr 
305                 310                 315                 320 


His Lys Tyr Met Asn Asp Pro Arg Ile Thr Lys Ile Gly Ala Phe Leu 
                325                 330                 335     


Arg Lys Thr Ser Leu Asp Glu Leu Pro Gln Leu Ile Asn Val Leu Arg 
            340                 345                 350         


Gly Glu Met Ser Leu Val Gly Pro Arg Pro Tyr Met Val Ile Glu Lys 
        355                 360                 365             


Lys Asp Ile Gly Lys Lys Ala Pro Leu Val Leu Ala Val Lys Pro Gly 
    370                 375                 380                 


Ile Thr Gly Met Trp Gln Val Ser Gly Arg Ser Asp Val Asn Phe Asp 
385                 390                 395                 400 


Ser Arg Val Glu Met Asp Val Trp Tyr Met Lys Asn Trp Ser Leu Trp 
                405                 410                 415     


Asn Asp Ile Val Ile Leu Ile Lys Thr Val Gln Ala Val Phe Lys Arg 
            420                 425                 430         


Asp Gly Ala Tyr 
        435     


<210>  70
<211>  159
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Acyltransferase

<400>  70

Met Ile Thr Ser Ile Gln Tyr Leu Arg Gly Ile Ala Ala Leu Phe Val 
1               5                   10                  15      


Val Leu Phe His Met Lys Trp Met Leu Asn Asn Val Tyr Val Glu Lys 
            20                  25                  30          


Asn Leu Gly Asp Ile Phe Phe Ile Ser Gly Asn Phe Gly Val Asp Leu 
        35                  40                  45              


Phe Phe Val Ile Ser Gly Phe Val Ile Cys Leu Ser Thr Glu Arg Glu 
    50                  55                  60                  


Thr Leu His Pro Val Lys Glu Phe Phe Ile Arg Arg Phe Phe Arg Ile 
65                  70                  75                  80  


Tyr Pro Leu Leu Leu Leu Ser Val Cys Thr Ile Tyr Ile Leu Gly Asp 
                85                  90                  95      


Phe Lys Ile His Glu Leu Ile Leu Ser Met Ile Pro Ile His Leu Asp 
            100                 105                 110         


Tyr Ser Ser Pro Ser Pro Val Phe Gly Tyr Asn Ile Leu Val Ser Ala 
        115                 120                 125             


Trp Thr Ile Thr Tyr Glu Ile Ser Phe Tyr Ile Ile Leu Val Leu Ser 
    130                 135                 140                 


Leu Met Ile Asn His Arg Phe Arg Cys Glu Leu Thr Ile Leu Phe 
145                 150                 155                 


<210>  71
<211>  137
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 Acyltransferase

<400>  71

Met Lys Ser Ile Ile Ile Leu Asp Lys Tyr Phe Leu Tyr Ser Ile Leu 
1               5                   10                  15      


Leu Val Val Ile Ser Phe Val Phe Ile Lys His Pro Ile Phe Asp Gly 
            20                  25                  30          


His Gly Val Leu Lys Trp Gly Phe Leu Ser Phe Ile Ile Leu Leu Ile 
        35                  40                  45              


Leu Leu Ile Ile Glu Asn Thr Tyr Gly Ile Ala Lys Ser Asn Phe Leu 
    50                  55                  60                  


Phe Trp Leu Gly Glu Ile Ser Tyr Ser Leu Tyr Leu Thr His Ile Ile 
65                  70                  75                  80  


Ile Leu Glu Phe Ile Leu Lys His Ile Thr Pro Glu Ile Trp Asn Asn 
                85                  90                  95      


Pro Asn Leu Gly Met Ser Lys Ile Leu Phe Tyr Leu Ala Ile Ser Ile 
            100                 105                 110         


Ser Phe Ser Tyr Leu Val Tyr Leu Leu Val Glu Lys Pro Phe Ile Asn 
        115                 120                 125             


Leu Gly Lys Lys Leu Ile Thr Lys Leu 
    130                 135         


<210>  72
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  amino acid sequence rfb APP8 dTDP-glucose 4,6-dehydratase

<400>  72

Met Lys Ile Leu Ile Thr Gly Gly Ala Gly Phe Ile Gly Ser Ala Val 
1               5                   10                  15      


Ile Arg Tyr Ile Ile Gln His Thr Gln Asp Ser Val Val Asn Val Asp 
            20                  25                  30          


Lys Leu Thr Tyr Ala Gly Asn Leu Ala Ser Leu Glu Ser Val Ser Asn 
        35                  40                  45              


Ser Ser Arg Tyr His Phe Glu Gln Ala Asp Ile Cys Asp Ser Thr Arg 
    50                  55                  60                  


Ile Ser Gln Ile Phe Cys Lys Tyr Gln Pro Asp Val Val Met His Leu 
65                  70                  75                  80  


Ala Ala Glu Ser His Val Asp Arg Ser Ile Asp Gly Pro Ala Ala Phe 
                85                  90                  95      


Met Gln Thr Asn Ile Ile Gly Thr Tyr Thr Leu Leu Glu Ala Ser Arg 
            100                 105                 110         


Gln Tyr Trp Leu Ser Leu Pro Leu Glu Arg Lys Gln Thr Phe Arg Phe 
        115                 120                 125             


Gln His Ile Ser Thr Asp Glu Val Tyr Gly Asp Leu Asn Asp Ser Asn 
    130                 135                 140                 


Glu Leu Phe Ser Glu Asn Thr Ala Tyr Ser Pro Ser Ser Pro Tyr Ser 
145                 150                 155                 160 


Ala Ser Lys Ala Ala Ser Asp His Leu Val Arg Ala Trp Phe Arg Thr 
                165                 170                 175     


Tyr Gly Leu Pro Thr Leu Val Thr Asn Cys Ser Asn Asn Tyr Gly Pro 
            180                 185                 190         


Phe Gln Phe Pro Glu Lys Leu Ile Pro Leu Met Ile Leu Asn Ala Ile 
        195                 200                 205             


Ser Gly Lys Pro Leu Pro Ile Tyr Gly Asn Gly Leu Gln Ile Arg Asp 
    210                 215                 220                 


Trp Leu Phe Val Glu Asp His Ala Ile Ala Leu Tyr Gln Val Leu Cys 
225                 230                 235                 240 


Arg Gly Lys Val Gly Glu Thr Tyr Asn Ile Gly Gly His Asn Glu Lys 
                245                 250                 255     


Thr Asn Ile Glu Val Val Gln Ala Ile Cys Arg Leu Leu Asp Glu Leu 
            260                 265                 270         


Val Pro Asn Lys Pro Ser Gly Ile Glu Gln Tyr Glu Glu Leu Val Thr 
        275                 280                 285             


Tyr Val Ala Asp Arg Pro Gly His Asp Val Arg Tyr Ala Ile Asp Ala 
    290                 295                 300                 


Ser Lys Ile Glu Asn Gln Leu Gly Trp Thr Pro Lys Glu Thr Phe Glu 
305                 310                 315                 320 


Ser Gly Leu Arg Lys Thr Val Glu Trp Tyr Leu Asn Asn Gln Lys Trp 
                325                 330                 335     


Trp Gln Ser Val Leu Asp Gly Ser Tyr Cys Gly Glu Arg Leu Gly Leu 
            340                 345                 350         


Ser Leu Lys Ser Tyr 
        355         


