                         SEQUENCE LISTING

<110>  bluebird bio, Inc.
       Goss, Kendrick
       Parsons, Geoffrey
 
<120>  GENE THERAPY FOR MUCOPOLYSACCHARIDOSIS, TYPE II

<130>  BLBD-082/01WO

<150>  US 62/430,819
<151>  2016-12-06

<160>  17     

<170>  PatentIn version 3.5

<210>  1
<211>  7524
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthesized lentiviral vector encoding an iduronate 2-sulfatase 
       (I2S) polypeptide

<400>  1
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatcatat gccagcctat ggtgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca      600

gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat      660

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa      720

caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag      780

cagagctcgt ttagtgaacc gggtctctct ggttagacca gatctgagcc tgggagctct      840

ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgctcaaag      900

tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt      960

cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga aagtaaagcc     1020

agaggagatc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg     1080

gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagtagg     1140

gtgcgagagc gtcggtatta agcgggggag aattagataa atgggaaaaa attcggttaa     1200

ggccaggggg aaagaaacaa tataaactaa aacatatagt tagggcaagc agggagctag     1260

aacgattcgc agttaatcct ggccttttag agacatcaga aggctgtaga caaatactgg     1320

gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacaa     1380

tagcagtcct ctattgtgtg catcaaagga tagatgtaaa agacaccaag gaagccttag     1440

ataagataga ggaagagcaa aacaaaagta agaaaaaggc acagcaagca gcagctgaca     1500

caggaaacaa cagccaggtc agccaaaatt accctatagt gcagaacctc caggggcaaa     1560

tggtacatca ggccatatca cctagaactt taaattaaga cagcagtaca aatggcagta     1620

ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata     1680

gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt     1740

caaaattttc gggtttatta cagggacagc agagatccag tttggaaagg accagcaaag     1800

ctcctctgga aaggtgaagg ggcagtagta atacaagata atagtgacat aaaagtagtg     1860

ccaagaagaa aagcaaagat catcagggat tatggaaaac agatggcagg tgatgattgt     1920

gtggcaagta gacaggatga ggattaacac atggaaaaga ttagtaaaac accatagctc     1980

tagagcgatc ccgatcttca gacctggagg aggagatatg agggacaatt ggagaagtga     2040

attatataaa tataaagtag taaaaattga accattagga gtagcaccca ccaaggcaaa     2100

gagaagagtg gtgcagagag aaaaaagagc agtgggaata ggagctttgt tccttgggtt     2160

cttgggagca gcaggaagca ctatgggcgc agcgtcaatg acgctgacgg tacaggccag     2220

acaattattg tctggtatag tgcagcagca gaacaatttg ctgagggcta ttgaggcgca     2280

acagcatctg ttgcaactca cagtctgggg catcaagcag ctccaggcaa gaatcctggc     2340

tgtggaaaga tacctaaagg atcaacagct cctggggatt tggggttgct ctggaaaact     2400

catttgcacc actgctgtgc cttggaatgc tagttggagt aataaatctc tggaacagat     2460

ttggaatcac acgacctgga tggagtggga cagagaaatt aacaattaca caagcttggt     2520

aggtttaaga atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc     2580

accattatcg tttcagaccc acctcccaac cccgagggga cccgacaggc ccgaaggaat     2640

agaagaagaa ggtggagaga gagacagaga cagatccatt cgattagtga acggatccat     2700

ctcgacggaa tgaaagaccc cacctgtagg tttggcaagc taggatcaag gttaggaaca     2760

gagagacagc agaatatggg ccaaacagga tatctgtggt aagcagttcc tgccccggct     2820

cagggccaag aacagttgga acagcagaat atgggccaaa caggatatct gtggtaagca     2880

gttcctgccc cggctcaggg ccaagaacag atggtcccca gatgcggtcc cgccctcagc     2940

agtttctaga gaaccatcag atgtttccag ggtgccccaa ggacctgaaa tgaccctgtg     3000

ccttatttga actaaccaat cagttcgctt ctcgcttctg ttcgcgcgct tctgctcccc     3060

gagctcaata aaagagccca caacccctca ctcggcgcga ttcacctgac gcgtctacgc     3120

caccatgcct ccaccgcgga cagggagggg tctgctctgg ctgggactgg tcctttcatc     3180

cgtctgcgtg gcgctgggat ccgagactca ggccaattcc actactgacg cgctgaacgt     3240

gctgctgatt atcgtggatg atcttcgccc ctcgcttgga tgctacgggg ataagctcgt     3300

ccggagcccg aacattgacc agttggcatc ccactccctc ctgtttcaaa acgccttcgc     3360

acaacaagcc gtgtgcgctc cgagcagagt gtcgttcctg accggccggc gccctgacac     3420

cacccggttg tacgacttca actcctattg gcgcgtccac gcgggaaact tttcaaccat     3480

cccgcagtac ttcaaggaaa acggttacgt gaccatgtcc gtgggaaaag tgttccaccc     3540

cggcatctcg tcgaatcata ccgatgacag cccttactcc tggtcgttcc ccccctatca     3600

cccgtcaagc gaaaaatacg agaacaccaa gacctgtaga ggtcccgacg gagaactgca     3660

cgctaacctc ctgtgccccg tggacgtgct ggacgtgcct gaagggaccc ttcccgacaa     3720

gcagtcaacc gagcaggcca tccagctgct ggaaaagatg aaaacttcgg ccagcccctt     3780

cttcctcgcc gtgggatacc ataagcctca tatccccttc cggtatccca aggagttcca     3840

gaagctgtat ccactcgaga acatcactct ggccccggat ccggaggtgc ccgacggact     3900

gccacctgtg gcctacaacc catggatgga cattcggcag cgggaggatg tgcaggccct     3960

caacatttcc gtcccgtacg ggccgatccc tgtggacttc cagcgcaaga tccgacagtc     4020

ctacttcgcc tccgtgtctt acctggatac tcaagtcggg cggctgctct ccgcgctgga     4080

cgatctccag cttgcaaata gcacgatcat cgccttcacc tccgatcacg gatgggccct     4140

gggagaacac ggcgaatggg cgaagtactc caacttcgac gtggccactc acgtgccgct     4200

gatcttttac gtgccgggca gaaccgcctc cctcccggaa gccggagaga agctgtttcc     4260

gtacctggac ccgttcgact ccgcgagcca gctcatggag cccgggcgcc agagcatgga     4320

cctggtcgaa ctcgtgtcgc tgtttcccac cctggctggc ctcgccggtt tgcaagtgcc     4380

cccgaggtgc cctgtgccga gcttccatgt ggaactgtgc agggagggaa agaacctgtt     4440

gaagcacttc cggttccgcg acctggagga agatccgtac ttgcctggca accctagaga     4500

actgatcgcc tactcccaat accctcggcc ttcggacatc cctcagtgga actccgacaa     4560

gccatccctg aaagacatca agattatggg atacagcatt cgcactatcg actaccgcta     4620

cactgtgtgg gtcggcttca accccgatga gttcctggcc aacttctccg acattcatgc     4680

tggcgaactg tacttcgtgg actcagaccc actccaagac cacaacatgt acaacgactc     4740

acagggaggc gatctgtttc aactcctgat gccctagtaa tgacaggtac ctttaagacc     4800

aatgacttac aaggcagctg tagatcttag ccacttttta aaagaaaagg ggggactgga     4860

agggctaatt cactcccaaa gaagacaaga tctgcttttt gcctgtactg ggtctctctg     4920

gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc     4980

tcaataaagc ttgccttgag tgcttcaatg tgtgtgttgg ttttttgtgt gtcgaaattc     5040

tagcgattct agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc     5100

gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta     5160

atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     5220

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     5280

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     5340

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     5400

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     5460

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     5520

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     5580

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     5640

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     5700

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     5760

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     5820

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     5880

gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa     5940

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     6000

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     6060

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     6120

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     6180

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     6240

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     6300

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     6360

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     6420

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     6480

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     6540

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     6600

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     6660

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     6720

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     6780

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     6840

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     6900

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     6960

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     7020

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     7080

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     7140

gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt     7200

tccccgaaaa gtgccacctg ggactagctt tttgcaaaag cctaggcctc caaaaaagcc     7260

tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg cataaataaa     7320

aaaaattagt cagccatggg gcggagaatg ggcggaactg ggcggagtta ggggcgggat     7380

gggcggagtt aggggcggga ctatggttgc tgactaattg agatgagctt gcatgccgac     7440

attgattatt gactagtccc taagaaacca ttcttatcat gacattaacc tataaaaata     7500

ggcgtatcac gaggcccttt cgtc                                            7524


<210>  2
<211>  7658
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthesized lentiviral vector encoding an I2S polypeptide

<400>  2
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatcatat gccagcctat ggtgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca      600

gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat      660

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa      720

caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag      780

cagagctcgt ttagtgaacc gggtctctct ggttagacca gatctgagcc tgggagctct      840

ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgctcaaag      900

tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt      960

cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga aagtaaagcc     1020

agaggagatc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg     1080

gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagtagg     1140

gtgcgagagc gtcggtatta agcgggggag aattagataa atgggaaaaa attcggttaa     1200

ggccaggggg aaagaaacaa tataaactaa aacatatagt tagggcaagc agggagctag     1260

aacgattcgc agttaatcct ggccttttag agacatcaga aggctgtaga caaatactgg     1320

gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacaa     1380

tagcagtcct ctattgtgtg catcaaagga tagatgtaaa agacaccaag gaagccttag     1440

ataagataga ggaagagcaa aacaaaagta agaaaaaggc acagcaagca gcagctgaca     1500

caggaaacaa cagccaggtc agccaaaatt accctatagt gcagaacctc caggggcaaa     1560

tggtacatca ggccatatca cctagaactt taaattaaga cagcagtaca aatggcagta     1620

ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata     1680

gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt     1740

caaaattttc gggtttatta cagggacagc agagatccag tttggaaagg accagcaaag     1800

ctcctctgga aaggtgaagg ggcagtagta atacaagata atagtgacat aaaagtagtg     1860

ccaagaagaa aagcaaagat catcagggat tatggaaaac agatggcagg tgatgattgt     1920

gtggcaagta gacaggatga ggattaacac atggaaaaga ttagtaaaac accatagctc     1980

tagagcgatc ccgatcttca gacctggagg aggagatatg agggacaatt ggagaagtga     2040

attatataaa tataaagtag taaaaattga accattagga gtagcaccca ccaaggcaaa     2100

gagaagagtg gtgcagagag aaaaaagagc agtgggaata ggagctttgt tccttgggtt     2160

cttgggagca gcaggaagca ctatgggcgc agcgtcaatg acgctgacgg tacaggccag     2220

acaattattg tctggtatag tgcagcagca gaacaatttg ctgagggcta ttgaggcgca     2280

acagcatctg ttgcaactca cagtctgggg catcaagcag ctccaggcaa gaatcctggc     2340

tgtggaaaga tacctaaagg atcaacagct cctggggatt tggggttgct ctggaaaact     2400

catttgcacc actgctgtgc cttggaatgc tagttggagt aataaatctc tggaacagat     2460

ttggaatcac acgacctgga tggagtggga cagagaaatt aacaattaca caagcttggt     2520

aggtttaaga atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc     2580

accattatcg tttcagaccc acctcccaac cccgagggga cccgacaggc ccgaaggaat     2640

agaagaagaa ggtggagaga gagacagaga cagatccatt cgattagtga acggatccaa     2700

ggatctgcga tcgctccggt gcccgtcagt gggcagagcg cacatcgccc acagtccccg     2760

agaagttggg gggaggggtc ggcaattgaa cgggtgccta gagaaggtgg cgcggggtaa     2820

actgggaaag tgatgtcgtg tactggctcc gcctttttcc cgagggtggg ggagaaccgt     2880

atataagtgc agtagtcgcc gtgaacgttc tttttcgcaa cgggtttgcc gccagaacac     2940

agctgaagct tcgaggggct cgcatctctc cttcacgcgc ccgccgccct acctgaggcc     3000

gccatccacg ccggttgagt cgcgttctgc cgcctcccgc ctgtggtgcc tcctgaactg     3060

cgtccgccgt ctaggtaagt ttaaagctca ggtcgagacc gggcctttgt ccggcgctcc     3120

cttggagcct acctagactc agccggctct ccacgctttg cctgaccctg cttgctcaac     3180

tctacgtctt tgtttcgttt tctgttctgc gccgttacag atccaagctg tgaccggcgc     3240

ctacgcgtct acgccaccat gcctccaccg cggacaggga ggggtctgct ctggctggga     3300

ctggtccttt catccgtctg cgtggcgctg ggatccgaga ctcaggccaa ttccactact     3360

gacgcgctga acgtgctgct gattatcgtg gatgatcttc gcccctcgct tggatgctac     3420

ggggataagc tcgtccggag cccgaacatt gaccagttgg catcccactc cctcctgttt     3480

caaaacgcct tcgcacaaca agccgtgtgc gctccgagca gagtgtcgtt cctgaccggc     3540

cggcgccctg acaccacccg gttgtacgac ttcaactcct attggcgcgt ccacgcggga     3600

aacttttcaa ccatcccgca gtacttcaag gaaaacggtt acgtgaccat gtccgtggga     3660

aaagtgttcc accccggcat ctcgtcgaat cataccgatg acagccctta ctcctggtcg     3720

ttccccccct atcacccgtc aagcgaaaaa tacgagaaca ccaagacctg tagaggtccc     3780

gacggagaac tgcacgctaa cctcctgtgc cccgtggacg tgctggacgt gcctgaaggg     3840

acccttcccg acaagcagtc aaccgagcag gccatccagc tgctggaaaa gatgaaaact     3900

tcggccagcc ccttcttcct cgccgtggga taccataagc ctcatatccc cttccggtat     3960

cccaaggagt tccagaagct gtatccactc gagaacatca ctctggcccc ggatccggag     4020

gtgcccgacg gactgccacc tgtggcctac aacccatgga tggacattcg gcagcgggag     4080

gatgtgcagg ccctcaacat ttccgtcccg tacgggccga tccctgtgga cttccagcgc     4140

aagatccgac agtcctactt cgcctccgtg tcttacctgg atactcaagt cgggcggctg     4200

ctctccgcgc tggacgatct ccagcttgca aatagcacga tcatcgcctt cacctccgat     4260

cacggatggg ccctgggaga acacggcgaa tgggcgaagt actccaactt cgacgtggcc     4320

actcacgtgc cgctgatctt ttacgtgccg ggcagaaccg cctccctccc ggaagccgga     4380

gagaagctgt ttccgtacct ggacccgttc gactccgcga gccagctcat ggagcccggg     4440

cgccagagca tggacctggt cgaactcgtg tcgctgtttc ccaccctggc tggcctcgcc     4500

ggtttgcaag tgcccccgag gtgccctgtg ccgagcttcc atgtggaact gtgcagggag     4560

ggaaagaacc tgttgaagca cttccggttc cgcgacctgg aggaagatcc gtacttgcct     4620

ggcaacccta gagaactgat cgcctactcc caataccctc ggccttcgga catccctcag     4680

tggaactccg acaagccatc cctgaaagac atcaagatta tgggatacag cattcgcact     4740

atcgactacc gctacactgt gtgggtcggc ttcaaccccg atgagttcct ggccaacttc     4800

tccgacattc atgctggcga actgtacttc gtggactcag acccactcca agaccacaac     4860

atgtacaacg actcacaggg aggcgatctg tttcaactcc tgatgcccta gtaatgacag     4920

gtacctttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt tttaaaagaa     4980

aaggggggac tggaagggct aattcactcc caaagaagac aagatctgct ttttgcctgt     5040

actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac     5100

ccactgctta agcctcaata aagcttgcct tgagtgcttc aatgtgtgtg ttggtttttt     5160

gtgtgtcgaa attctagcga ttctagcttg gcgtaatcat ggtcatagct gtttcctgtg     5220

tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa     5280

gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct     5340

ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga     5400

ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc     5460

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa     5520

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt     5580

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa     5640

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt     5700

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg     5760

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc     5820

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc     5880

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta     5940

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct     6000

acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc     6060

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa     6120

caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa     6180

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa     6240

aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt     6300

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac     6360

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc     6420

atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc     6480

cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata     6540

aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc     6600

cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc     6660

aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca     6720

ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa     6780

gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca     6840

ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt     6900

tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt     6960

tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg     7020

ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga     7080

tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc     7140

agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg     7200

acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag     7260

ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg     7320

gttccgcgca catttccccg aaaagtgcca cctgggacta gctttttgca aaagcctagg     7380

cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc     7440

tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga     7500

gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatga     7560

gcttgcatgc cgacattgat tattgactag tccctaagaa accattctta tcatgacatt     7620

aacctataaa aataggcgta tcacgaggcc ctttcgtc                             7658

<210>  3
<211>  3
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  3

Gly Gly Gly 
1           


<210>  4
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  4

Asp Gly Gly Gly Ser 
1               5   


<210>  5
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  5

Thr Gly Glu Lys Pro 
1               5   


<210>  6
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  6

Gly Gly Arg Arg 
1               


<210>  7
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  7

Gly Gly Gly Gly Ser 
1               5   


<210>  8
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  8

Glu Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Val Asp 
1               5                   10                  


<210>  9
<211>  18
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  9

Lys Glu Ser Gly Ser Val Ser Ser Glu Gln Leu Ala Gln Phe Arg Ser 
1               5                   10                  15      


Leu Asp 
        


<210>  10
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  10

Gly Gly Arg Arg Gly Gly Gly Ser 
1               5               


<210>  11
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  11

Leu Arg Gln Arg Asp Gly Glu Arg Pro 
1               5                   


<210>  12
<211>  12
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  12

Leu Arg Gln Lys Asp Gly Gly Gly Ser Glu Arg Pro 
1               5                   10          


<210>  13
<211>  16
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary linker sequence

<400>  13

Leu Arg Gln Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro 
1               5                   10                  15      


<210>  14
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease


<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa is any amino acid

<220>
<221>  misc_feature
<222>  (5)..(5)
<223>  Xaa is any amino acid

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa = Gly or Ser

<400>  14

Glu Xaa Xaa Tyr Xaa Gln Xaa 
1               5           


<210>  15
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease

<400>  15

Glu Asn Leu Tyr Phe Gln Gly 
1               5           


<210>  16
<211>  7
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Cleavage sequence by TEV protease

<400>  16

Glu Asn Leu Tyr Phe Gln Ser 
1               5           


<210>  17
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Consensus Kozak sequence

<400>  17
gccrccatgg                                                              10


