SEQUENCE LISTING

<110>  Cellectis

<120>  DESIGN OF RARE-CUTTING ENDONUCLEASES FOR EFFICIENT AND SPECIFIC 
       TARGETING DNA SEQUENCES COMPRISING HIGHLY REPETITIVE MOTIVES

<130>  P81313675PCT00

<160>  16

<170>  PatentIn version 3.5

<210>  1
<211>  13481
<212>  DNA
<213>  Homo sapiens

<220>
<223>  huntingtin (HTT) mRNA

<400>  1
gctgccggga cgggtccaag atggacggcc gctcaggttc tgcttttacc tgcggcccag     60
agccccattc attgccccgg tgctgagcgg cgccgcgagt cggcccgagg cctccgggga    120
ctgccgtgcc gggcgggaga ccgccatggc gaccctggaa aagctgatga aggccttcga    180
gtccctcaag tccttccagc agcagcagca gcagcagcag cagcagcagc agcagcagca    240
gcagcagcag cagcagcagc aacagccgcc accgccgccg ccgccgccgc cgcctcctca    300
gcttcctcag ccgccgccgc aggcacagcc gctgctgcct cagccgcagc cgcccccgcc    360
gccgcccccg ccgccacccg gcccggctgt ggctgaggag ccgctgcacc gaccaaagaa    420
agaactttca gctaccaaga aagaccgtgt gaatcattgt ctgacaatat gtgaaaacat    480
agtggcacag tctgtcagaa attctccaga atttcagaaa cttctgggca tcgctatgga    540
actttttctg ctgtgcagtg atgacgcaga gtcagatgtc aggatggtgg ctgacgaatg    600
cctcaacaaa gttatcaaag ctttgatgga ttctaatctt ccaaggttac agctcgagct    660
ctataaggaa attaaaaaga atggtgcccc tcggagtttg cgtgctgccc tgtggaggtt    720
tgctgagctg gctcacctgg ttcggcctca gaaatgcagg ccttacctgg tgaaccttct    780
gccgtgcctg actcgaacaa gcaagagacc cgaagaatca gtccaggaga ccttggctgc    840
agctgttccc aaaattatgg cttcttttgg caattttgca aatgacaatg aaattaaggt    900
tttgttaaag gccttcatag cgaacctgaa gtcaagctcc cccaccattc ggcggacagc    960
ggctggatca gcagtgagca tctgccagca ctcaagaagg acacaatatt tctatagttg   1020
gctactaaat gtgctcttag gcttactcgt tcctgtcgag gatgaacact ccactctgct   1080
gattcttggc gtgctgctca ccctgaggta tttggtgccc ttgctgcagc agcaggtcaa   1140
ggacacaagc ctgaaaggca gcttcggagt gacaaggaaa gaaatggaag tctctccttc   1200
tgcagagcag cttgtccagg tttatgaact gacgttacat catacacagc accaagacca   1260
caatgttgtg accggagccc tggagctgtt gcagcagctc ttcagaacgc ctccacccga   1320
gcttctgcaa accctgaccg cagtcggggg cattgggcag ctcaccgctg ctaaggagga   1380
gtctggtggc cgaagccgta gtgggagtat tgtggaactt atagctggag ggggttcctc   1440
atgcagccct gtcctttcaa gaaaacaaaa aggcaaagtg ctcttaggag aagaagaagc   1500
cttggaggat gactctgaat cgagatcgga tgtcagcagc tctgccttaa cagcctcagt   1560
gaaggatgag atcagtggag agctggctgc ttcttcaggg gtttccactc cagggtcagc   1620
aggtcatgac atcatcacag aacagccacg gtcacagcac acactgcagg cggactcagt   1680
ggatctggcc agctgtgact tgacaagctc tgccactgat ggggatgagg aggatatctt   1740
gagccacagc tccagccagg tcagcgccgt cccatctgac cctgccatgg acctgaatga   1800
tgggacccag gcctcgtcgc ccatcagcga cagctcccag accaccaccg aagggcctga   1860
ttcagctgtt accccttcag acagttctga aattgtgtta gacggtaccg acaaccagta   1920
tttgggcctg cagattggac agccccagga tgaagatgag gaagccacag gtattcttcc   1980
tgatgaagcc tcggaggcct tcaggaactc ttccatggcc cttcaacagg cacatttatt   2040
gaaaaacatg agtcactgca ggcagccttc tgacagcagt gttgataaat ttgtgttgag   2100
agatgaagct actgaaccgg gtgatcaaga aaacaagcct tgccgcatca aaggtgacat   2160
tggacagtcc actgatgatg actctgcacc tcttgtccat tgtgtccgcc ttttatctgc   2220
ttcgtttttg ctaacagggg gaaaaaatgt gctggttccg gacagggatg tgagggtcag   2280
cgtgaaggcc ctggccctca gctgtgtggg agcagctgtg gccctccacc cggaatcttt   2340
cttcagcaaa ctctataaag ttcctcttga caccacggaa taccctgagg aacagtatgt   2400
ctcagacatc ttgaactaca tcgatcatgg agacccacag gttcgaggag ccactgccat   2460
tctctgtggg accctcatct gctccatcct cagcaggtcc cgcttccacg tgggagattg   2520
gatgggcacc attagaaccc tcacaggaaa tacattttct ttggcggatt gcattccttt   2580
gctgcggaaa acactgaagg atgagtcttc tgttacttgc aagttagctt gtacagctgt   2640
gaggaactgt gtcatgagtc tctgcagcag cagctacagt gagttaggac tgcagctgat   2700
catcgatgtg ctgactctga ggaacagttc ctattggctg gtgaggacag agcttctgga   2760
aacccttgca gagattgact tcaggctggt gagctttttg gaggcaaaag cagaaaactt   2820
acacagaggg gctcatcatt atacagggct tttaaaactg caagaacgag tgctcaataa   2880
tgttgtcatc catttgcttg gagatgaaga ccccagggtg cgacatgttg ccgcagcatc   2940
actaattagg cttgtcccaa agctgtttta taaatgtgac caaggacaag ctgatccagt   3000
agtggccgtg gcaagagatc aaagcagtgt ttacctgaaa cttctcatgc atgagacgca   3060
gcctccatct catttctccg tcagcacaat aaccagaata tatagaggct ataacctact   3120
accaagcata acagacgtca ctatggaaaa taacctttca agagttattg cagcagtttc   3180
tcatgaacta atcacatcaa ccaccagagc actcacattt ggatgctgtg aagctttgtg   3240
tcttctttcc actgccttcc cagtttgcat ttggagttta ggttggcact gtggagtgcc   3300
tccactgagt gcctcagatg agtctaggaa gagctgtacc gttgggatgg ccacaatgat   3360
tctgaccctg ctctcgtcag cttggttccc attggatctc tcagcccatc aagatgcttt   3420
gattttggcc ggaaacttgc ttgcagccag tgctcccaaa tctctgagaa gttcatgggc   3480
ctctgaagaa gaagccaacc cagcagccac caagcaagag gaggtctggc cagccctggg   3540
ggaccgggcc ctggtgccca tggtggagca gctcttctct cacctgctga aggtgattaa   3600
catttgtgcc cacgtcctgg atgacgtggc tcctggaccc gcaataaagg cagccttgcc   3660
ttctctaaca aacccccctt ctctaagtcc catccgacga aaggggaagg agaaagaacc   3720
aggagaacaa gcatctgtac cgttgagtcc caagaaaggc agtgaggcca gtgcagcttc   3780
tagacaatct gatacctcag gtcctgttac aacaagtaaa tcctcatcac tggggagttt   3840
ctatcatctt ccttcatacc tcaaactgca tgatgtcctg aaagctacac acgctaacta   3900
caaggtcacg ctggatcttc agaacagcac ggaaaagttt ggagggtttc tccgctcagc   3960
cttggatgtt ctttctcaga tactagagct ggccacactg caggacattg ggaagtgtgt   4020
tgaagagatc ctaggatacc tgaaatcctg ctttagtcga gaaccaatga tggcaactgt   4080
ttgtgttcaa caattgttga agactctctt tggcacaaac ttggcctccc agtttgatgg   4140
cttatcttcc aaccccagca agtcacaagg ccgagcacag cgccttggct cctccagtgt   4200
gaggccaggc ttgtaccact actgcttcat ggccccgtac acccacttca cccaggccct   4260
cgctgacgcc agcctgagga acatggtgca ggcggagcag gagaacgaca cctcgggatg   4320
gtttgatgtc ctccagaaag tgtctaccca gttgaagaca aacctcacga gtgtcacaaa   4380
gaaccgtgca gataagaatg ctattcataa tcacattcgt ttgtttgaac ctcttgttat   4440
aaaagcttta aaacagtaca cgactacaac atgtgtgcag ttacagaagc aggttttaga   4500
tttgctggcg cagctggttc agttacgggt taattactgt cttctggatt cagatcaggt   4560
gtttattggc tttgtattga aacagtttga atacattgaa gtgggccagt tcagggaatc   4620
agaggcaatc attccaaaca tctttttctt cttggtatta ctatcttatg aacgctatca   4680
ttcaaaacag atcattggaa ttcctaaaat cattcagctc tgtgatggca tcatggccag   4740
tggaaggaag gctgtgacac atgccatacc ggctctgcag cccatagtcc acgacctctt   4800
tgtattaaga ggaacaaata aagctgatgc aggaaaagag cttgaaaccc aaaaagaggt   4860
ggtggtgtca atgttactga gactcatcca gtaccatcag gtgttggaga tgttcattct   4920
tgtcctgcag cagtgccaca aggagaatga agacaagtgg aagcgactgt ctcgacagat   4980
agctgacatc atcctcccaa tgttagccaa acagcagatg cacattgact ctcatgaagc   5040
ccttggagtg ttaaatacat tatttgagat tttggcccct tcctccctcc gtccggtaga   5100
catgctttta cggagtatgt tcgtcactcc aaacacaatg gcgtccgtga gcactgttca   5160
actgtggata tcgggaattc tggccatttt gagggttctg atttcccagt caactgaaga   5220
tattgttctt tctcgtattc aggagctctc cttctctccg tatttaatct cctgtacagt   5280
aattaatagg ttaagagatg gggacagtac ttcaacgcta gaagaacaca gtgaagggaa   5340
acaaataaag aatttgccag aagaaacatt ttcaaggttt ctattacaac tggttggtat   5400
tcttttagaa gacattgtta caaaacagct gaaggtggaa atgagtgagc agcaacatac   5460
tttctattgc caggaactag gcacactgct aatgtgtctg atccacatct tcaagtctgg   5520
aatgttccgg agaatcacag cagctgccac taggctgttc cgcagtgatg gctgtggcgg   5580
cagtttctac accctggaca gcttgaactt gcgggctcgt tccatgatca ccacccaccc   5640
ggccctggtg ctgctctggt gtcagatact gctgcttgtc aaccacaccg actaccgctg   5700
gtgggcagaa gtgcagcaga ccccgaaaag acacagtctg tccagcacaa agttacttag   5760
tccccagatg tctggagaag aggaggattc tgacttggca gccaaacttg gaatgtgcaa   5820
tagagaaata gtacgaagag gggctctcat tctcttctgt gattatgtct gtcagaacct   5880
ccatgactcc gagcacttaa cgtggctcat tgtaaatcac attcaagatc tgatcagcct   5940
ttcccacgag cctccagtac aggacttcat cagtgccgtt catcggaact ctgctgccag   6000
cggcctgttc atccaggcaa ttcagtctcg ttgtgaaaac ctttcaactc caaccatgct   6060
gaagaaaact cttcagtgct tggaggggat ccatctcagc cagtcgggag ctgtgctcac   6120
gctgtatgtg gacaggcttc tgtgcacccc tttccgtgtg ctggctcgca tggtcgacat   6180
ccttgcttgt cgccgggtag aaatgcttct ggctgcaaat ttacagagca gcatggccca   6240
gttgccaatg gaagaactca acagaatcca ggaatacctt cagagcagcg ggctcgctca   6300
gagacaccaa aggctctatt ccctgctgga caggtttcgt ctctccacca tgcaagactc   6360
acttagtccc tctcctccag tctcttccca cccgctggac ggggatgggc acgtgtcact   6420
ggaaacagtg agtccggaca aagactggta cgttcatctt gtcaaatccc agtgttggac   6480
caggtcagat tctgcactgc tggaaggtgc agagctggtg aatcggattc ctgctgaaga   6540
tatgaatgcc ttcatgatga actcggagtt caacctaagc ctgctagctc catgcttaag   6600
cctagggatg agtgaaattt ctggtggcca gaagagtgcc ctttttgaag cagcccgtga   6660
ggtgactctg gcccgtgtga gcggcaccgt gcagcagctc cctgctgtcc atcatgtctt   6720
ccagcccgag ctgcctgcag agccggcggc ctactggagc aagttgaatg atctgtttgg   6780
ggatgctgca ctgtatcagt ccctgcccac tctggcccgg gccctggcac agtacctggt   6840
ggtggtctcc aaactgccca gtcatttgca ccttcctcct gagaaagaga aggacattgt   6900
gaaattcgtg gtggcaaccc ttgaggccct gtcctggcat ttgatccatg agcagatccc   6960
gctgagtctg gatctccagg cagggctgga ctgctgctgc ctggccctgc agctgcctgg   7020
cctctggagc gtggtctcct ccacagagtt tgtgacccac gcctgctccc tcatctactg   7080
tgtgcacttc atcctggagg ccgttgcagt gcagcctgga gagcagcttc ttagtccaga   7140
aagaaggaca aataccccaa aagccatcag cgaggaggag gaggaagtag atccaaacac   7200
acagaatcct aagtatatca ctgcagcctg tgagatggtg gcagaaatgg tggagtctct   7260
gcagtcggtg ttggccttgg gtcataaaag gaatagcggc gtgccggcgt ttctcacgcc   7320
attgctaagg aacatcatca tcagcctggc ccgcctgccc cttgtcaaca gctacacacg   7380
tgtgccccca ctggtgtgga agcttggatg gtcacccaaa ccgggagggg attttggcac   7440
agcattccct gagatccccg tggagttcct ccaggaaaag gaagtcttta aggagttcat   7500
ctaccgcatc aacacactag gctggaccag tcgtactcag tttgaagaaa cttgggccac   7560
cctccttggt gtcctggtga cgcagcccct cgtgatggag caggaggaga gcccaccaga   7620
agaagacaca gagaggaccc agatcaacgt cctggccgtg caggccatca cctcactggt   7680
gctcagtgca atgactgtgc ctgtggccgg caacccagct gtaagctgct tggagcagca   7740
gccccggaac aagcctctga aagctctcga caccaggttt gggaggaagc tgagcattat   7800
cagagggatt gtggagcaag agattcaagc aatggtttca aagagagaga atattgccac   7860
ccatcattta tatcaggcat gggatcctgt cccttctctg tctccggcta ctacaggtgc   7920
cctcatcagc cacgagaagc tgctgctaca gatcaacccc gagcgggagc tggggagcat   7980
gagctacaaa ctcggccagg tgtccataca ctccgtgtgg ctggggaaca gcatcacacc   8040
cctgagggag gaggaatggg acgaggaaga ggaggaggag gccgacgccc ctgcaccttc   8100
gtcaccaccc acgtctccag tcaactccag gaaacaccgg gctggagttg acatccactc   8160
ctgttcgcag tttttgcttg agttgtacag ccgctggatc ctgccgtcca gctcagccag   8220
gaggaccccg gccatcctga tcagtgaggt ggtcagatcc cttctagtgg tctcagactt   8280
gttcaccgag cgcaaccagt ttgagctgat gtatgtgacg ctgacagaac tgcgaagggt   8340
gcacccttca gaagacgaga tcctcgctca gtacctggtg cctgccacct gcaaggcagc   8400
tgccgtcctt gggatggaca aggccgtggc ggagcctgtc agccgcctgc tggagagcac   8460
gctcaggagc agccacctgc ccagcagggt tggagccctg cacggcgtcc tctatgtgct   8520
ggagtgcgac ctgctggacg acactgccaa gcagctcatc ccggtcatca gcgactatct   8580
cctctccaac ctgaaaggga tcgcccactg cgtgaacatt cacagccagc agcacgtact   8640
ggtcatgtgt gccactgcgt tttacctcat tgagaactat cctctggacg tagggccgga   8700
attttcagca tcaataatac agatgtgtgg ggtgatgctg tctggaagtg aggagtccac   8760
cccctccatc atttaccact gtgccctcag aggcctggag cgcctcctgc tctctgagca   8820
gctctcccgc ctggatgcag aatcgctggt caagctgagt gtggacagag tgaacgtgca   8880
cagcccgcac cgggccatgg cggctctggg cctgatgctc acctgcatgt acacaggaaa   8940
ggagaaagtc agtccgggta gaacttcaga ccctaatcct gcagcccccg acagcgagtc   9000
agtgattgtt gctatggagc gggtatctgt tctttttgat aggatcagga aaggctttcc   9060
ttgtgaagcc agagtggtgg ccaggatcct gccccagttt ctagacgact tcttcccacc   9120
ccaggacatc atgaacaaag tcatcggaga gtttctgtcc aaccagcagc cataccccca   9180
gttcatggcc accgtggtgt ataaggtgtt tcagactctg cacagcaccg ggcagtcgtc   9240
catggtccgg gactgggtca tgctgtccct ctccaacttc acgcagaggg ccccggtcgc   9300
catggccacg tggagcctct cctgcttctt tgtcagcgcg tccaccagcc cgtgggtcgc   9360
ggcgatcctc ccacatgtca tcagcaggat gggcaagctg gagcaggtgg acgtgaacct   9420
tttctgcctg gtcgccacag acttctacag acaccagata gaggaggagc tcgaccgcag   9480
ggccttccag tctgtgcttg aggtggttgc agccccagga agcccatatc accggctgct   9540
gacttgttta cgaaatgtcc acaaggtcac cacctgctga gcgccatggt gggagagact   9600
gtgaggcggc agctggggcc ggagcctttg gaagtctgcg cccttgtgcc ctgcctccac   9660
cgagccagct tggtccctat gggcttccgc acatgccgcg ggcggccagg caacgtgcgt   9720
gtctctgcca tgtggcagaa gtgctctttg tggcagtggc caggcaggga gtgtctgcag   9780
tcctggtggg gctgagcctg aggccttcca gaaagcagga gcagctgtgc tgcaccccat   9840
gtgggtgacc aggtcctttc tcctgatagt cacctgctgg ttgttgccag gttgcagctg   9900
ctcttgcatc tgggccagaa gtcctccctc ctgcaggctg gctgttggcc cctctgctgt   9960
cctgcagtag aaggtgccgt gagcaggctt tgggaacact ggcctgggtc tccctggtgg  10020
ggtgtgcatg ccacgccccg tgtctggatg cacagatgcc atggcctgtg ctgggccagt  10080
ggctgggggt gctagacacc cggcaccatt ctcccttctc tcttttcttc tcaggattta  10140
aaatttaatt atatcagtaa agagattaat tttaacgtaa ctctttctat gcccgtgtaa  10200
agtatgtgaa tcgcaaggcc tgtgctgcat gcgacagcgt ccggggtggt ggacagggcc  10260
cccggccacg ctccctctcc tgtagccact ggcatagccc tcctgagcac ccgctgacat  10320
ttccgttgta catgttcctg tttatgcatt cacaaggtga ctgggatgta gagaggcgtt  10380
agtgggcagg tggccacagc aggactgagg acaggccccc attatcctag gggtgcgctc  10440
acctgcagcc cctcctcctc gggcacagac gactgtcgtt ctccacccac cagtcaggga  10500
cagcagcctc cctgtcactc agctgagaag gccagccctc cctggctgtg agcagcctcc  10560
actgtgtcca gagacatggg cctcccactc ctgttccttg ctagccctgg ggtggcgtct  10620
gcctaggagc tggctggcag gtgttgggac ctgctgctcc atggatgcat gccctaagag  10680
tgtcactgag ctgtgttttg tctgagcctc tctcggtcaa cagcaaagct tggtgtcttg  10740
gcactgttag tgacagagcc cagcatccct tctgcccccg ttccagctga catcttgcac  10800
ggtgacccct tttagtcagg agagtgcaga tctgtgctca tcggagactg ccccacggcc  10860
ctgtcagagc cgccactcct atccccaggc caggtccctg gaccagcctc ctgtttgcag  10920
gcccagagga gccaagtcat taaaatggaa gtggattctg gatggccggg ctgctgctga  10980
tgtaggagct ggatttggga gctctgcttg ccgactggct gtgagacgag gcaggggctc  11040
tgcttcctca gccctagagg cgagccaggc aaggttggcg actgtcatgt ggcttggttt  11100
ggtcatgccc gtcgatgttt tgggtattga atgtggtaag tggaggaaat gttggaactc  11160
tgtgcaggtg ctgccttgag acccccaagc ttccacctgt ccctctccta tgtggcagct  11220
ggggagcagc tgagatgtgg acttgtatgc tgcccacata cgtgaggggg agctgaaagg  11280
gagcccctcc tctgagcagc ctctgccagg cctgtatgag gcttttccca ccagctccca  11340
acagaggcct cccccagcca ggaccacctc gtcctcgtgg cggggcagca ggagcggtag  11400
aaaggggtcc gatgtttgag gaggccctta agggaagcta ctgaattata acacgtaaga  11460
aaatcaccat tccgtattgg ttgggggctc ctgtttctca tcctagcttt ttcctggaaa  11520
gcccgctaga aggtttggga acgaggggaa agttctcaga actgttggct gctccccacc  11580
cgcctcccgc ctcccccgca ggttatgtca gcagctctga gacagcagta tcacaggcca  11640
gatgttgttc ctggctagat gtttacattt gtaagaaata acactgtgaa tgtaaaacag  11700
agccattccc ttggaatgca tatcgctggg ctcaacatag agtttgtctt cctcttgttt  11760
acgacgtgat ctaaaccagt ccttagcaag gggctcagaa caccccgctc tggcagtagg  11820
tgtcccccac ccccaaagac ctgcctgtgt gctccggaga tgaatatgag ctcattagta  11880
aaaatgactt cacccacgca tatacataaa gtatccatgc atgtgcatat agacacatct  11940
ataattttac acacacacct ctcaagacgg agatgcatgg cctctaagag tgcccgtgtc  12000
ggttcttcct ggaagttgac tttccttaga cccgccaggt caagttagcc gcgtgacgga  12060
catccaggcg tgggacgtgg tcagggcagg gctcattcat tgcccactag gatcccactg  12120
gcgaagatgg tctccatatc agctctctgc agaagggagg aagactttat catgttccta  12180
aaaatctgtg gcaagcaccc atcgtattat ccaaattttg ttgcaaatgt gattaatttg  12240
gttgtcaagt tttgggggtg ggctgtgggg agattgcttt tgttttcctg ctggtaatat  12300
cgggaaagat tttaatgaaa ccagggtaga attgtttggc aatgcactga agcgtgtttc  12360
tttcccaaaa tgtgcctccc ttccgctgcg ggcccagctg agtctatgta ggtgatgttt  12420
ccagctgcca agtgctcttt gttactgtcc accctcattt ctgccagcgc atgtgtcctt  12480
tcaaggggaa aatgtgaagc tgaaccccct ccagacaccc agaatgtagc atctgagaag  12540
gccctgtgcc ctaaaggaca cccctcgccc ccatcttcat ggagggggtc atttcagagc  12600
cctcggagcc aatgaacagc tcctcctctt ggagctgaga tgagccccac gtggagctcg  12660
ggacggatag tagacagcaa taactcggtg tgtggccgcc tggcaggtgg aacttcctcc  12720
cgttgcgggg tggagtgagg ttagttctgt gtgtctggtg ggtggagtca ggcttctctt  12780
gctacctgtg agcatccttc ccagcagaca tcctcatcgg gctttgtccc tcccccgctt  12840
cctccctctg cggggaggac ccgggaccac agctgctggc cagggtagac ttggagctgt  12900
cctccagagg ggtcacgtgt aggagtgaga agaaggaaga tcttgagagc tgctgaggga  12960
ccttggagag ctcaggatgg ctcagacgag gacactcgct tgccgggcct gggcctcctg  13020
ggaaggaggg agctgctcag aatgccgcat gacaactgaa ggcaacctgg aaggttcagg  13080
ggccgctctt cccccatgtg cctgtcacgc tctggtgcag tcaaaggaac gccttcccct  13140
cagttgtttc taagagcaga gtctcccgct gcaatctggg tggtaactgc cagccttgga  13200
ggatcgtggc caacgtggac ctgcctacgg agggtgggct ctgacccaag tggggcctcc  13260
ttgtccaggt ctcactgctt tgcaccgtgg tcagagggac tgtcagctga gcttgagctc  13320
ccctggagcc agcagggctg tgatgggcga gtcccggagc cccacccaga cctgaatgct  13380
tctgagagca aagggaagga ctgacgagag atgtatattt aattttttaa ctgctgcaaa  13440
cattgtacat ccaaattaaa ggaaaaaaat ggaaaccatc a                      13481


<210>  2
<211>  259
<212>  DNA
<213>  Homo sapiens

<220>
<223>  fragment of huntingtin (HTT) mRNA (residues 1 to 259)

<400>  2
gctgccggga cgggtccaag atggacggcc gctcaggttc tgcttttacc tgcggcccag     60
agccccattc attgccccgg tgctgagcgg cgccgcgagt cggcccgagg cctccgggga    120
ctgccgtgcc gggcgggaga ccgccatggc gaccctggaa aagctgatga aggccttcga    180
gtccctcaag tccttccagc agcagcagca gcagcagcag cagcagcagc agcagcagca    240
gcagcagcag cagcagcag                                                 259


<210>  3
<211>  31
<212>  DNA
<213>  Homo sapiens

<220>
<223>  fragment of huntingtin (HTT) mRNA (residues 181 to 211)

<400>  3
gtccctcaag tccttccagc agcagcagca g                                    31


<210>  4
<211>  1266
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  pCLS9303

<400>  4
atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac     60
gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc    120
aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt    180
acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc    240
aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc    300
ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg    360
agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc    420
gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac    480
ttgaccggag acgcccgggg gatcaggtca cgtgcgtctc ggagcattgt tgcccagtta    540
tctcgccctg atccggcgtt ggccgcgttg accaacgacc acctcgtcgc cttggcctgc    600
ctcggcgggc gtcctgcgct ggatgcagtg aaaaagggat tgggggatcc tatcagccgt    660
tcccagctgg tgaagtccga gctggaggag aagaaatccg agttgaggca caagctgaag    720
tacgtgcccc acgagtacat cgagctgatc gagatcgccc ggaacagcac ccaggaccgt    780
atcctggaga tgaaggtgat ggagttcttc atgaaggtgt acggctacag gggcaagcac    840
ctgggcggct ccaggaagcc cgacggcgcc atctacaccg tgggctcccc catcgactac    900
ggcgtgatcg tggacaccaa ggcctactcc ggcggctaca acctgcccat cggccaggcc    960
gacgaaatgc agaggtacgt ggaggagaac cagaccagga acaagcacat caaccccaac   1020
gagtggtgga aggtgtaccc ctccagcgtg accgagttca agttcctgtt cgtgtccggc   1080
cacttcaagg gcaactacaa ggcccagctg accaggctga accacatcac caactgcaac   1140
ggcgccgtgc tgtccgtgga ggagctcctg atcggcggcg agatgatcaa ggccggcacc   1200
ctgaccctgg aggaggtgag gaggaagttc aacaacggcg agatcaactt cgcggccgac   1260
tgataa                                                              1266


<210>  5
<211>  1284
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  pCLS9312

<400>  5
atgggcgatc ctaaaaagaa acgtaaggtc atcgataagg agaccgccgc tgccaagttc     60
gagagacagc acatggacag catcgatatc gccgatctac gcacgctcgg ctacagccag    120
cagcaacagg agaagatcaa accgaaggtt cgttcgacag tggcgcagca ccacgaggca    180
ctggtcggcc acgggtttac acacgcgcac atcgttgcgt taagccaaca cccggcagcg    240
ttagggaccg tcgctgtcaa gtatcaggac atgatcgcag cgttgccaga ggcgacacac    300
gaagcgatcg ttggcgtcgg caaacagtgg tccggcgcac gcgctctgga ggccttgctc    360
acggtggcgg gagagttgag aggtccaccg ttacagttgg acacaggcca acttctcaag    420
attgcaaaac gtggcggcgt gaccgcagtg gaggcagtgc atgcatggcg caatgcactg    480
acgggtgccc cgctcaactt gaccggagac gcccggggga tcaggtcacg tgcgtctcgg    540
agcattgttg cccagttatc tcgccctgat ccggcgttgg ccgcgttgac caacgaccac    600
ctcgtcgcct tggcctgcct cggcgggcgt cctgcgctgg atgcagtgaa aaagggattg    660
ggggatccta tcagccgttc ccagctggtg aagtccgagc tggaggagaa gaaatccgag    720
ttgaggcaca agctgaagta cgtgccccac gagtacatcg agctgatcga gatcgcccgg    780
aacagcaccc aggaccgtat cctggagatg aaggtgatgg agttcttcat gaaggtgtac    840
ggctacaggg gcaagcacct gggcggctcc aggaagcccg acggcgccat ctacaccgtg    900
ggctccccca tcgactacgg cgtgatcgtg gacaccaagg cctactccgg cggctacaac    960
ctgcccatcg gccaggccga cgaaatgcag aggtacgtgg aggagaacca gaccaggaac   1020
aagcacatca accccaacga gtggtggaag gtgtacccct ccagcgtgac cgagttcaag   1080
ttcctgttcg tgtccggcca cttcaagggc aactacaagg cccagctgac caggctgaac   1140
cacatcacca actgcaacgg cgccgtgctg tccgtggagg agctcctgat cggcggcgag   1200
atgatcaagg ccggcaccct gaccctggag gaggtgagga ggaagttcaa caacggcgag   1260
atcaacttcg cggccgactg ataa                                          1284

<210>  6
<211>  530
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  TiFLAN

<400>  6
Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
        35                  40                  45              


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    50                  55                  60                  


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
65                  70                  75                  80  


Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                85                  90                  95      


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
            100                 105                 110         


Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        115                 120                 125             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
    130                 135                 140                 


Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
145                 150                 155                 160 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 
                165                 170                 175     


Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val 
            180                 185                 190         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
        195                 200                 205             


Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 
    210                 215                 220                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
225                 230                 235                 240 


Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 
                245                 250                 255     


Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            260                 265                 270         


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
        275                 280                 285             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    290                 295                 300                 


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
305                 310                 315                 320 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                325                 330                 335     


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 
            340                 345                 350         


Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val 
        355                 360                 365             


Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
    370                 375                 380                 


Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
385                 390                 395                 400 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 
                405                 410                 415     


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            420                 425                 430         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
        435                 440                 445             


Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 
    450                 455                 460                 


Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
465                 470                 475                 480 


Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 
                485                 490                 495     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            500                 505                 510         


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 
        515                 520                 525             


Leu Glu 
    530 


<210>  7
<211>  530
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  TiFLAN2_T01.1

<400>  7
Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
        35                  40                  45              


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    50                  55                  60                  


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
65                  70                  75                  80  


Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                85                  90                  95      


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
            100                 105                 110         


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 
        115                 120                 125             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
    130                 135                 140                 


Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
145                 150                 155                 160 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
                165                 170                 175     


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
            180                 185                 190         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
        195                 200                 205             


Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 
    210                 215                 220                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
225                 230                 235                 240 


Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
                245                 250                 255     


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            260                 265                 270         


Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 
        275                 280                 285             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    290                 295                 300                 


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
305                 310                 315                 320 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                325                 330                 335     


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 
            340                 345                 350         


Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
        355                 360                 365             


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
    370                 375                 380                 


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 
385                 390                 395                 400 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
                405                 410                 415     


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            420                 425                 430         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
        435                 440                 445             


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
    450                 455                 460                 


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
465                 470                 475                 480 


Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 
                485                 490                 495     


Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            500                 505                 510         


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 
        515                 520                 525             


Leu Glu 
    530 


<210>  8
<211>  936
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  pCLS9984

<400>  8
Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Tyr Pro Tyr Asp 
1               5                   10                  15      


Val Pro Asp Tyr Ala Ile Asp Ile Ala Asp Leu Arg Thr Leu Gly Tyr 
            20                  25                  30          


Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val 
        35                  40                  45              


Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His 
    50                  55                  60                  


Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val 
65                  70                  75                  80  


Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala 
                85                  90                  95      


Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala 
            100                 105                 110         


Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp 
        115                 120                 125             


Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val 
    130                 135                 140                 


Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn 
145                 150                 155                 160 


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
                165                 170                 175     


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            180                 185                 190         


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
        195                 200                 205             


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    210                 215                 220                 


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
225                 230                 235                 240 


Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                245                 250                 255     


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
            260                 265                 270         


Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        275                 280                 285             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
    290                 295                 300                 


Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
305                 310                 315                 320 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 
                325                 330                 335     


Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val 
            340                 345                 350         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
        355                 360                 365             


Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 
    370                 375                 380                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
385                 390                 395                 400 


Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 
                405                 410                 415     


Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            420                 425                 430         


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
        435                 440                 445             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    450                 455                 460                 


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
465                 470                 475                 480 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                485                 490                 495     


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 
            500                 505                 510         


Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val 
        515                 520                 525             


Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
    530                 535                 540                 


Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
545                 550                 555                 560 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 
                565                 570                 575     


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            580                 585                 590         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
        595                 600                 605             


Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 
    610                 615                 620                 


Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
625                 630                 635                 640 


Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 
                645                 650                 655     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            660                 665                 670         


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 
        675                 680                 685             


Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala Leu Ala 
    690                 695                 700                 


Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly Gly Arg 
705                 710                 715                 720 


Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Gly Asp Pro Ile Ser Arg 
                725                 730                 735     


Ser Gln Leu Val Lys Ser Glu Leu Glu Glu Lys Lys Ser Glu Leu Arg 
            740                 745                 750         


His Lys Leu Lys Tyr Val Pro His Glu Tyr Ile Glu Leu Ile Glu Ile 
        755                 760                 765             


Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu Glu Met Lys Val Met Glu 
    770                 775                 780                 


Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly Lys His Leu Gly Gly Ser 
785                 790                 795                 800 


Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val Gly Ser Pro Ile Asp Tyr 
                805                 810                 815     


Gly Val Ile Val Asp Thr Lys Ala Tyr Ser Gly Gly Tyr Asn Leu Pro 
            820                 825                 830         


Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr Val Glu Glu Asn Gln Thr 
        835                 840                 845             


Arg Asn Lys His Ile Asn Pro Asn Glu Trp Trp Lys Val Tyr Pro Ser 
    850                 855                 860                 


Ser Val Thr Glu Phe Lys Phe Leu Phe Val Ser Gly His Phe Lys Gly 
865                 870                 875                 880 


Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn His Ile Thr Asn Cys Asn 
                885                 890                 895     


Gly Ala Val Leu Ser Val Glu Glu Leu Leu Ile Gly Gly Glu Met Ile 
            900                 905                 910         


Lys Ala Gly Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn 
        915                 920                 925             


Gly Glu Ile Asn Phe Ala Ala Asp 
    930                 935     



<210>  9
<211>  2814
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  pCLS9984

<400>  9
atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac     60
gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc    120
aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt    180
acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc    240
aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc    300
ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg    360
agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc    420
gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac    480
ttgacccccc agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag    540
acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg    600
gtggccatcg ccagccacga tggcggcaag caggcgctgg agacggtcca gcggctgttg    660
ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat    720
aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc    780
cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg    840
ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag    900
caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg    960
ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc   1020
agcaataatg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc   1080
caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag   1140
caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc   1200
ccggagcagg tggtggccat cgccagcaat attggtggca agcaggcgct ggagacggtg   1260
caggcgctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc   1320
atcgccagca ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg   1380
ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc   1440
ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc   1500
ttgaccccgg agcaggtggt ggccatcgcc agcaatattg gtggcaagca ggcgctggag   1560
acggtgcagg cgctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg   1620
gtggccatcg ccagcaataa tggtggcaag caggcgctgg agacggtcca gcggctgttg   1680
ccggtgctgt gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac   1740
gatggcggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc   1800
cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg   1860
ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag   1920
caggtggtgg ccatcgccag caataatggt ggcaagcagg cgctggagac ggtccagcgg   1980
ctgttgccgg tgctgtgcca ggcccacggc ttgacccctc agcaggtggt ggccatcgcc   2040
agcaatggcg gcggcaggcc ggcgctggag agcattgttg cccagttatc tcgccctgat   2100
ccggcgttgg ccgcgttgac caacgaccac ctcgtcgcct tggcctgcct cggcgggcgt   2160
cctgcgctgg atgcagtgaa aaagggattg ggggatccta tcagccgttc ccagctggtg   2220
aagtccgagc tggaggagaa gaaatccgag ttgaggcaca agctgaagta cgtgccccac   2280
gagtacatcg agctgatcga gatcgcccgg aacagcaccc aggaccgtat cctggagatg   2340
aaggtgatgg agttcttcat gaaggtgtac ggctacaggg gcaagcacct gggcggctcc   2400
aggaagcccg acggcgccat ctacaccgtg ggctccccca tcgactacgg cgtgatcgtg   2460
gacaccaagg cctactccgg cggctacaac ctgcccatcg gccaggccga cgaaatgcag   2520
aggtacgtgg aggagaacca gaccaggaac aagcacatca accccaacga gtggtggaag   2580
gtgtacccct ccagcgtgac cgagttcaag ttcctgttcg tgtccggcca cttcaagggc   2640
aactacaagg cccagctgac caggctgaac cacatcacca actgcaacgg cgccgtgctg   2700
tccgtggagg agctcctgat cggcggcgag atgatcaagg ccggcaccct gaccctggag   2760
gaggtgagga ggaagttcaa caacggcgag atcaacttcg cggccgactg ataa         2814


<210>  10
<211>  936
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  pCLS16715

<400>  10
Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Tyr Pro Tyr Asp 
1               5                   10                  15      


Val Pro Asp Tyr Ala Ile Asp Ile Ala Asp Leu Arg Thr Leu Gly Tyr 
            20                  25                  30          


Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val 
        35                  40                  45              


Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His 
    50                  55                  60                  


Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val 
65                  70                  75                  80  


Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala 
                85                  90                  95      


Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala 
            100                 105                 110         


Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp 
        115                 120                 125             


Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val 
    130                 135                 140                 


Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn 
145                 150                 155                 160 


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
                165                 170                 175     


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            180                 185                 190         


His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
        195                 200                 205             


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    210                 215                 220                 


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
225                 230                 235                 240 


Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                245                 250                 255     


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
            260                 265                 270         


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 
        275                 280                 285             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
    290                 295                 300                 


Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
305                 310                 315                 320 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
                325                 330                 335     


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
            340                 345                 350         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
        355                 360                 365             


Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 
    370                 375                 380                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
385                 390                 395                 400 


Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
                405                 410                 415     


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            420                 425                 430         


Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 
        435                 440                 445             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    450                 455                 460                 


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
465                 470                 475                 480 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                485                 490                 495     


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 
            500                 505                 510         


Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
        515                 520                 525             


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
    530                 535                 540                 


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 
545                 550                 555                 560 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
                565                 570                 575     


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            580                 585                 590         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
        595                 600                 605             


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
    610                 615                 620                 


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 
625                 630                 635                 640 


Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 
                645                 650                 655     


Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            660                 665                 670         


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 
        675                 680                 685             


Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala Leu Ala 
    690                 695                 700                 


Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly Gly Arg 
705                 710                 715                 720 


Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Gly Asp Pro Ile Ser Arg 
                725                 730                 735     


Ser Gln Leu Val Lys Ser Glu Leu Glu Glu Lys Lys Ser Glu Leu Arg 
            740                 745                 750         


His Lys Leu Lys Tyr Val Pro His Glu Tyr Ile Glu Leu Ile Glu Ile 
        755                 760                 765             


Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu Glu Met Lys Val Met Glu 
    770                 775                 780                 


Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly Lys His Leu Gly Gly Ser 
785                 790                 795                 800 


Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val Gly Ser Pro Ile Asp Tyr 
                805                 810                 815     


Gly Val Ile Val Asp Thr Lys Ala Tyr Ser Gly Gly Tyr Asn Leu Pro 
            820                 825                 830         


Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr Val Glu Glu Asn Gln Thr 
        835                 840                 845             


Arg Asn Lys His Ile Asn Pro Asn Glu Trp Trp Lys Val Tyr Pro Ser 
    850                 855                 860                 


Ser Val Thr Glu Phe Lys Phe Leu Phe Val Ser Gly His Phe Lys Gly 
865                 870                 875                 880 


Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn His Ile Thr Asn Cys Asn 
                885                 890                 895     


Gly Ala Val Leu Ser Val Glu Glu Leu Leu Ile Gly Gly Glu Met Ile 
            900                 905                 910         


Lys Ala Gly Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn 
        915                 920                 925             


Gly Glu Ile Asn Phe Ala Ala Asp 
    930                 935     


<210>  11
<211>  2814
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  pCLS16715

<400>  11
atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac     60
gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc    120
aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt    180
acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc    240
aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc    300
ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg    360
agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc    420
gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac    480
ttgacccccc agcaggtggt ggccatcgcc agcaataatg gtggcaagca ggcgctggag    540
acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg    600
gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg    660
ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat    720
aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc    780
cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg    840
ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag    900
caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg    960
ctgttgccgg tgctgtgcca ggcccacggc ttgaccccgg agcaggtggt ggccatcgcc   1020
agccacgatg gcggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc   1080
caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag   1140
caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc   1200
ccggagcagg tggtggccat cgccagccac gatggcggca agcaggcgct ggagacggtc   1260
cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccggagca ggtggtggcc   1320
atcgccagcc acgatggcgg caagcaggcg ctggagacgg tccagcggct gttgccggtg   1380
ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc   1440
ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc   1500
ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag   1560
acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg   1620
gtggccatcg ccagcaatat tggtggcaag caggcgctgg agacggtgca ggcgctgttg   1680
ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat   1740
aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc   1800
cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg   1860
ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag   1920
caggtggtgg ccatcgccag caatattggt ggcaagcagg cgctggagac ggtgcaggcg   1980
ctgttgccgg tgctgtgcca ggcccacggc ttgacccctc agcaggtggt ggccatcgcc   2040
agcaatggcg gcggcaggcc ggcgctggag agcattgttg cccagttatc tcgccctgat   2100
ccggcgttgg ccgcgttgac caacgaccac ctcgtcgcct tggcctgcct cggcgggcgt   2160
cctgcgctgg atgcagtgaa aaagggattg ggggatccta tcagccgttc ccagctggtg   2220
aagtccgagc tggaggagaa gaaatccgag ttgaggcaca agctgaagta cgtgccccac   2280
gagtacatcg agctgatcga gatcgcccgg aacagcaccc aggaccgtat cctggagatg   2340
aaggtgatgg agttcttcat gaaggtgtac ggctacaggg gcaagcacct gggcggctcc   2400
aggaagcccg acggcgccat ctacaccgtg ggctccccca tcgactacgg cgtgatcgtg   2460
gacaccaagg cctactccgg cggctacaac ctgcccatcg gccaggccga cgaaatgcag   2520
aggtacgtgg aggagaacca gaccaggaac aagcacatca accccaacga gtggtggaag   2580
gtgtacccct ccagcgtgac cgagttcaag ttcctgttcg tgtccggcca cttcaagggc   2640
aactacaagg cccagctgac caggctgaac cacatcacca actgcaacgg cgccgtgctg   2700
tccgtggagg agctcctgat cggcggcgag atgatcaagg ccggcaccct gaccctggag   2760
gaggtgagga ggaagttcaa caacggcgag atcaacttcg cggccgactg ataa         2814


<210>  12
<211>  69
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  TIFLAN

<400>  12
tctcaagatt tcgctgcagc agcagcagca gcagcagcag cagcagcagc agcagcagca     60
gcagcagca                                                             69


<210>  13
<211>  46
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  TIFLAN2

<400>  13
tgtgatcccc ccagcagcag cagcagcagc agcagcagca gcagca                    46


<210>  14
<211>  530
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  TiCAG

<400>  14
Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 
        35                  40                  45              


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    50                  55                  60                  


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
65                  70                  75                  80  


Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                85                  90                  95      


Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
            100                 105                 110         


Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        115                 120                 125             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 
    130                 135                 140                 


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
145                 150                 155                 160 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 
                165                 170                 175     


Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 
            180                 185                 190         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
        195                 200                 205             


Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 
    210                 215                 220                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
225                 230                 235                 240 


Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
                245                 250                 255     


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            260                 265                 270         


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
        275                 280                 285             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    290                 295                 300                 


His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly 
305                 310                 315                 320 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                325                 330                 335     


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 
            340                 345                 350         


Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
        355                 360                 365             


Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
    370                 375                 380                 


Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
385                 390                 395                 400 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
                405                 410                 415     


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            420                 425                 430         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
        435                 440                 445             


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
    450                 455                 460                 


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
465                 470                 475                 480 


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
                485                 490                 495     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            500                 505                 510         


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 
        515                 520                 525             


Leu Glu 
    530 


<210>  15
<211>  942
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polypeptide

<220>
<223>  pCLS9996

<400>  15
Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Lys Glu Thr Ala 
1               5                   10                  15      


Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser Ile Asp Ile Ala Asp 
            20                  25                  30          


Leu Arg Thr Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro 
        35                  40                  45              


Lys Val Arg Ser Thr Val Ala Gln His His Glu Ala Leu Val Gly His 
    50                  55                  60                  


Gly Phe Thr His Ala His Ile Val Ala Leu Ser Gln His Pro Ala Ala 
65                  70                  75                  80  


Leu Gly Thr Val Ala Val Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro 
                85                  90                  95      


Glu Ala Thr His Glu Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly 
            100                 105                 110         


Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly 
        115                 120                 125             


Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg 
    130                 135                 140                 


Gly Gly Val Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu 
145                 150                 155                 160 


Thr Gly Ala Pro Leu Asn Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
                165                 170                 175     


Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
            180                 185                 190         


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 
        195                 200                 205             


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
    210                 215                 220                 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 
225                 230                 235                 240 


Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 
                245                 250                 255     


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
            260                 265                 270         


Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 
        275                 280                 285             


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
    290                 295                 300                 


Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
305                 310                 315                 320 


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
                325                 330                 335     


Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
            340                 345                 350         


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
        355                 360                 365             


His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly 
    370                 375                 380                 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
385                 390                 395                 400 


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 
                405                 410                 415     


Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
            420                 425                 430         


Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 
        435                 440                 445             


Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
    450                 455                 460                 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 
465                 470                 475                 480 


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
                485                 490                 495     


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
            500                 505                 510         


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
        515                 520                 525             


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 
    530                 535                 540                 


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
545                 550                 555                 560 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
                565                 570                 575     


Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala 
            580                 585                 590         


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
        595                 600                 605             


Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 
    610                 615                 620                 


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
625                 630                 635                 640 


His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
                645                 650                 655     


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
            660                 665                 670         


Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 
        675                 680                 685             


Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg 
    690                 695                 700                 


Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu 
705                 710                 715                 720 


Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu 
                725                 730                 735     


Gly Asp Pro Ile Ser Arg Ser Gln Leu Val Lys Ser Glu Leu Glu Glu 
            740                 745                 750         


Lys Lys Ser Glu Leu Arg His Lys Leu Lys Tyr Val Pro His Glu Tyr 
        755                 760                 765             


Ile Glu Leu Ile Glu Ile Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu 
    770                 775                 780                 


Glu Met Lys Val Met Glu Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly 
785                 790                 795                 800 


Lys His Leu Gly Gly Ser Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val 
                805                 810                 815     


Gly Ser Pro Ile Asp Tyr Gly Val Ile Val Asp Thr Lys Ala Tyr Ser 
            820                 825                 830         


Gly Gly Tyr Asn Leu Pro Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr 
        835                 840                 845             


Val Glu Glu Asn Gln Thr Arg Asn Lys His Ile Asn Pro Asn Glu Trp 
    850                 855                 860                 


Trp Lys Val Tyr Pro Ser Ser Val Thr Glu Phe Lys Phe Leu Phe Val 
865                 870                 875                 880 


Ser Gly His Phe Lys Gly Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn 
                885                 890                 895     


His Ile Thr Asn Cys Asn Gly Ala Val Leu Ser Val Glu Glu Leu Leu 
            900                 905                 910         


Ile Gly Gly Glu Met Ile Lys Ala Gly Thr Leu Thr Leu Glu Glu Val 
        915                 920                 925             


Arg Arg Lys Phe Asn Asn Gly Glu Ile Asn Phe Ala Ala Asp 
    930                 935                 940         


<210>  16
<211>  2832
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Description of artificial sequence: Synthetic polynucleotide

<220>
<223>  pCLS9996

<400>  16
atgggcgatc ctaaaaagaa acgtaaggtc atcgataagg agaccgccgc tgccaagttc     60
gagagacagc acatggacag catcgatatc gccgatctac gcacgctcgg ctacagccag    120
cagcaacagg agaagatcaa accgaaggtt cgttcgacag tggcgcagca ccacgaggca    180
ctggtcggcc acgggtttac acacgcgcac atcgttgcgt taagccaaca cccggcagcg    240
ttagggaccg tcgctgtcaa gtatcaggac atgatcgcag cgttgccaga ggcgacacac    300
gaagcgatcg ttggcgtcgg caaacagtgg tccggcgcac gcgctctgga ggccttgctc    360
acggtggcgg gagagttgag aggtccaccg ttacagttgg acacaggcca acttctcaag    420
attgcaaaac gtggcggcgt gaccgcagtg gaggcagtgc atgcatggcg caatgcactg    480
acgggtgccc cgctcaactt gaccccccag caggtggtgg ccatcgccag caataatggt    540
ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc    600
ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag    660
acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg    720
gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg    780
ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat    840
aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc    900
cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg    960
ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag   1020
caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg   1080
ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc   1140
agcaataatg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc   1200
caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag   1260
caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc   1320
ccccagcagg tggtggccat cgccagcaat ggcggtggca agcaggcgct ggagacggtc   1380
cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc   1440
atcgccagca ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg   1500
ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc   1560
ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc   1620
ttgacccccc agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag   1680
acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg   1740
gtggccatcg ccagcaataa tggtggcaag caggcgctgg agacggtcca gcggctgttg   1800
ccggtgctgt gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac   1860
gatggcggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc   1920
cacggcttga ccccccagca ggtggtggcc atcgccagca atggcggtgg caagcaggcg   1980
ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gacccctcag   2040
caggtggtgg ccatcgccag caatggcggc ggcaggccgg cgctggagag cattgttgcc   2100
cagttatctc gccctgatcc ggcgttggcc gcgttgacca acgaccacct cgtcgccttg   2160
gcctgcctcg gcgggcgtcc tgcgctggat gcagtgaaaa agggattggg ggatcctatc   2220
agccgttccc agctggtgaa gtccgagctg gaggagaaga aatccgagtt gaggcacaag   2280
ctgaagtacg tgccccacga gtacatcgag ctgatcgaga tcgcccggaa cagcacccag   2340
gaccgtatcc tggagatgaa ggtgatggag ttcttcatga aggtgtacgg ctacaggggc   2400
aagcacctgg gcggctccag gaagcccgac ggcgccatct acaccgtggg ctcccccatc   2460
gactacggcg tgatcgtgga caccaaggcc tactccggcg gctacaacct gcccatcggc   2520
caggccgacg aaatgcagag gtacgtggag gagaaccaga ccaggaacaa gcacatcaac   2580
cccaacgagt ggtggaaggt gtacccctcc agcgtgaccg agttcaagtt cctgttcgtg   2640
tccggccact tcaagggcaa ctacaaggcc cagctgacca ggctgaacca catcaccaac   2700
tgcaacggcg ccgtgctgtc cgtggaggag ctcctgatcg gcggcgagat gatcaaggcc   2760
ggcaccctga ccctggagga ggtgaggagg aagttcaaca acggcgagat caacttcgcg   2820
gccgactgat aa                                                       2832


