                         SEQUENCE LISTING

<110>  BluCon Biotech GmbH
 
<120>  RESTRICTION/MODIFICATION SYSTEM AND USES THEROF

<130>  BLU-PA06-PCT/PCT

<160>  19    

<170>  PatentIn version 3.5

<210>  1
<211>  4380
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid pF25-5br (SEQ ID NO. 1)

<400>  1
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt      180

gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt      240

gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg      300

acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca      360

aggccgcatc tcgagggtct caaagatggt aacaggtgga ctttcaggcc ctgctataaa      420

gccaattgca gttagaatgg tttatgagtg tttcaagaaa gttagaatac ctattgttgg      480

catgggagga attatgaatt acaaagatgc cattgagttt tttattgctg gtgcaactgc      540

tattcagata ggtactgtaa attttattaa tccaaaagct gtttgcgaaa taaaagaggg      600

aattgaggct tatcttgaaa gaaagggttt taattccata aaagagcttg taggtaacat      660

aaatatctga ggtgaaaata aaaatgtcga agacagttgt atatcttatt cgccacgcag      720

aggcagaagg aaattttata agaaggtttc atggtattac agattctaat gtaacagaaa      780

agggtaaatt acaagctcaa aaacttgcag aaagattaaa aaatgtccat tttgatgtga      840

tttattcaag tcctctgaaa agagcccttt atactgcaag caaaatagca gagggaagag      900

atataaagat tatagtaaga gaagatttga tagagataaa cggcggagag tgggaagaca      960

ggtgctggga tgaacttcca ttgctttatc caacagagta tgaaatgtgg gagaaaatgc     1020

ctcacaagca ctgtatgcca aatggtgaaa gtatgtatga acttttcttg agagcaaaat     1080

ctgcttttga ggacataata aagtcaaata tgggaaaaag aatatgcatc gtaacccacg     1140

gaacattaat tagggctctt cttacgtata taaaaggata tgaatttgaa agactcaacg     1200

agattttgtg gcaggacaat actgctttaa atataatcga gtataaagaa ggaaaatacc     1260

accttgtggt tgagggcgac tggtctcacc ttggaaagga gctttccaca atagcatatc     1320

aagactggtg gcaacagttt ttaaaagaaa gaggaattga aaaacaagat ttaactatca     1380

ttgaaaggag agcagccctg gaagtagaaa aagtaagtag attaatttgc ttgcataaat     1440

gattaaaatt attaattcca aaaatgatat tgacagttaa attataaaag tgatagaata     1500

ttataaacat tagttggaaa tatggaggat aaataaaatt tgagaatatg tccaaaatgc     1560

ggtgaattaa atggtgaaaa taggacagaa tgttggaagt gcggatctat tttgggtcca     1620

gtagataaat acaaaaaaat ttgtctgaaa tgtgggcgta tataccctca aaaagcagag     1680

atatgcgatg aatgtggtgg aaagttggca gtttacgatg tagatacgaa ttacaataac     1740

acgaaaactg acagtagtgt aggttggcta tacatagttt caatattgtt tccaattgtt     1800

ggtattattt tgggttgtat ttatatagca agaagagaag ataatttggg aaaatctttg     1860

attataacaa gcatagttgt tatagttatt tcaatcttta tgagtttact ctttgtcagt     1920

tgttctccta acttttgatt tttgataaaa aatataaaaa aatgcagggt cttttaaaaa     1980

caactttgct gaacaagata accctgcatt tttttacatt tttaaaaatc agccacccaa     2040

ataagctttt ttgacctcag gatttgctgc aatctcttta gcctctccgg ataaaacaat     2100

tctacctgtt tcaattacat acgccctgtc tgcgattgaa agtgccatgt gggcattttg     2160

ttcaatcaaa agtatagttg tcccctgaga attaatttct tttattatct taaaaatctc     2220

tgttacaagt ataggtgcaa gtcccataga aggctcatca agcagaagta atttgggtct     2280

tgacattaga gaccttccga ttgccagcat ttgctgttct ccgccagaaa gtgtccctgc     2340

aagttgattt tttctttcat atagtcttgg aaatctttca aacacaagtt ctaagtcttt     2400

ttttattgcc tgtttatctt tcggtaccct gggcctcatg ggccttccgc tcactgcccg     2460

ctttccagtc gggaaacctg tcgtgccagc tgcattaaca tggtcatagc tgtttccttg     2520

cgtattgggc gctctccgct tcctcgctca ctgactcgct gcgctcggtc gttcgggtaa     2580

agcctggggt gcctaatgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc     2640

gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc     2700

aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag     2760

ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct     2820

cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta     2880

ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc     2940

cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc     3000

agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt     3060

gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct     3120

gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc     3180

tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca     3240

agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta     3300

agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa     3360

atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg     3420

cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg     3480

actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc     3540

aatgataccg cgagaaccac gctcaccggc tccagattta tcagcaataa accagccagc     3600

cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa     3660

ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc     3720

cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg     3780

ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc     3840

cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat     3900

ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg     3960

tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc     4020

ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg     4080

aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat     4140

gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg     4200

gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg     4260

ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct     4320

catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac     4380


<210>  2
<211>  5091
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid pTrc-EcMt1ORF02330 with gene DOCILOBI_02330 
       codon-optimized for expression in E. coli (SEQ ID NO. 2)

<400>  2
gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc       60

ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc      120

gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc      180

tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga      240

taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac tgctctttaa      300

caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac tttattatta      360

aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa ccatgatcta      420

taacaccatt ctgcatggtg attgcgtgac cattatgaaa gaacatattc cgagcgaaag      480

catcgatctg atttatgcag atccgcctta taatctgagc ggtcgtgatc tgattctgaa      540

aaataacaaa accggtggtc cgttctacaa gatgaatgaa gaatgggata gctgggacta      600

tgataaatac tgcgaattca cctataattg gctgctggca agctatagcg tgctgaaaaa      660

caatggtagc ctgtatatta gctgcaccta tcataatatt ggcgaggtga tttttctggc      720

caaaaagatt ggtttcaaac tgaacaatat tctgacctgg gttaaaacca atgccatgcc      780

gaatattacc aaacgcacct ttaaacacac caccgaattt gtttgctggt ttgttaaagg      840

tcctggctgg aaatttaact acaacgagat caaaatgctg aaccctcgca aaaccaaaga      900

tggtagcgtt aaacaaatgg acgatttctt cgactttttt gaaatgccgc tggttcaggg      960

taaagaacgt attaaactgg ataatggtcg tgcagcacat ccgaatcaga aaccggaaaa     1020

actgctggaa atcattatta ccgcaagcag tgatgaaggt gatattgttc tggatccgtt     1080

ttttggcacc ggcaccaccg gtgtggttgc agaacgtatg aatcgtaaat ggattggcat     1140

cgaaatcaac gaaacctata tcgagattgc caaaaagcgc attgaagagg aacgtcgtaa     1200

aaatgttcag agcaccttta tctaaaaacg gtctccagct tggctgtttt ggcggatgag     1260

agaagatttt cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga     1320

atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga     1380

aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg     1440

catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgttttat ctgttgtttg     1500

tcggtgaacg ctctcctgag taggacaaat ccgccgggag cggatttgaa cgttgcgaag     1560

caacggcccg gagggtggcg ggcaggacgc ccgccataaa ctgccaggca tcaaattaag     1620

cagaaggcca tcctgacgga tggccttttt gcgtttctac aaactctttt tgtttatttt     1680

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat     1740

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt     1800

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg     1860

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga     1920

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc     1980

tatgtggcgc ggtattatcc cgtgttgacg ccgggcaaga gcaactcggt cgccgcatac     2040

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg     2100

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca     2160

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg     2220

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg     2280

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg     2340

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag     2400

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg     2460

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct     2520

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac     2580

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact     2640

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     2700

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     2760

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     2820

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     2880

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc     2940

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     3000

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     3060

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     3120

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     3180

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     3240

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     3300

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     3360

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     3420

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     3480

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     3540

cagtgagcga ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg     3600

gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa     3660

gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc     3720

aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc     3780

tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc     3840

gaggcagcag atcaattcgc gcgcgaaggc gaagcggcat gcatttacgt tgacaccatc     3900

gaatggtgca aaacctttcg cggtatggca tgatagcgcc cggaagagag tcaattcagg     3960

gtggtgaatg tgaaaccagt aacgttatac gatgtcgcag agtatgccgg tgtctcttat     4020

cagaccgttt cccgcgtggt gaaccaggcc agccacgttt ctgcgaaaac gcgggaaaaa     4080

gtggaagcgg cgatggcgga gctgaattac attcccaacc gcgtggcaca acaactggcg     4140

ggcaaacagt cgttgctgat tggcgttgcc acctccagtc tggccctgca cgcgccgtcg     4200

caaattgtcg cggcgattaa atctcgcgcc gatcaactgg gtgccagcgt ggtggtgtcg     4260

atggtagaac gaagcggcgt cgaagcctgt aaagcggcgg tgcacaatct tctcgcgcaa     4320

cgcgtcagtg ggctgatcat taactatccg ctggatgacc aggatgccat tgctgtggaa     4380

gctgcctgca ctaatgttcc ggcgttattt cttgatgtct ctgaccagac acccatcaac     4440

agtattattt tctcccatga agacggtacg cgactgggcg tggagcatct ggtcgcattg     4500

ggtcaccagc aaatcgcgct gttagcgggc ccattaagtt ctgtctcggc gcgtctgcgt     4560

ctggctggct ggcataaata tctcactcgc aatcaaattc agccgatagc ggaacgggaa     4620

ggcgactgga gtgccatgtc cggttttcaa caaaccatgc aaatgctgaa tgagggcatc     4680

gttcccactg cgatgctggt tgccaacgat cagatggcgc tgggcgcaat gcgcgccatt     4740

accgagtccg ggctgcgcgt tggtgcggat atctcggtag tgggatacga cgataccgaa     4800

gacagctcat gttatatccc gccgtcaacc accatcaaac aggattttcg cctgctgggg     4860

caaaccagcg tggaccgctt gctgcaactc tctcagggcc aggcggtgaa gggcaatcag     4920

ctgttgcccg tctcactggt gaaaagaaaa accaccctgg cgcccaatac gcaaaccgcc     4980

tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa     5040

agcgggcagt gagcgcaacg caattaatgt gagttagcgc gaattgatct g              5091


<210>  3
<211>  5022
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence of plasmid pTrc-EcMt2ORF02331 with gene DOCILOBI_02331 
       codon-optimized for expression in E. coli (SEQ ID NO. 3)

<400>  3
gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc       60

ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc      120

gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc      180

tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga      240

taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac tgctctttaa      300

caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac tttattatta      360

aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa ccatgtgcaa      420

agtgcacctg tttaatgatg attgcctgaa cgtgctgaaa aagatcgaag ataatagcat      480

cgatctgatc tttgcagatc cgccttataa tctgagcagc gaaaatgcac tgaccacacg      540

tgcaggtaaa ccggttaaat gttataaagg cgagtgggat aaaatcgacg atatctttga      600

atttaacctg cgctggattg aacagtgtgt tcgtgttctg aaagaaaccg gcaccatttg      660

gattagcggc accctgcata atcatccgat tattggcacc attctgaaac agttaggtct      720

gtggattatc aacgatatca tttggttcaa accgaatgca acaccgctgc tgagccgtaa      780

tcgttttgtt ccgagcaccg aactgatttg ggttgcaagc aaaaacaaaa aatactactt      840

tgattatgaa atggcacgca aactgaatgg tggtaaacaa atgcgtaatc tgtgggaaat      900

tccggcacag cgtcataaaa caccgcatcc gaccgaaaaa ccggaagcac tgctggaacg      960

tattattctg attggtagca aagaaggtga cgttgttctg gatccgttta tgggtagcgg     1020

tacaaccggt gttgttgcaa aactgctgaa acgcaacttt attggcattg aaattgatcc     1080

ggtgtatttc gagattgcca aaaaacgcat cgaagaagaa aaactgattc agcagacctt     1140

tagcaacttc ctgtaaaaac ggtctccagc ttggctgttt tggcggatga gagaagattt     1200

tcagcctgat acagattaaa tcagaacgca gaagcggtct gataaaacag aatttgcctg     1260

gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta     1320

gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata     1380

aaacgaaagg ctcagtcgaa agactgggcc tttcgtttta tctgttgttt gtcggtgaac     1440

gctctcctga gtaggacaaa tccgccggga gcggatttga acgttgcgaa gcaacggccc     1500

ggagggtggc gggcaggacg cccgccataa actgccaggc atcaaattaa gcagaaggcc     1560

atcctgacgg atggcctttt tgcgtttcta caaactcttt ttgtttattt ttctaaatac     1620

attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa     1680

aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat     1740

tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc     1800

agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga     1860

gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg     1920

cggtattatc ccgtgttgac gccgggcaag agcaactcgg tcgccgcata cactattctc     1980

agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag     2040

taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc     2100

tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg     2160

taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg     2220

acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac     2280

ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac     2340

cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg     2400

agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg     2460

tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg     2520

agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac     2580

tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg     2640

ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg     2700

tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc     2760

aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc     2820

tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt     2880

agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc     2940

taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact     3000

caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac     3060

agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag     3120

aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg     3180

gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg     3240

tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga     3300

gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt     3360

ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct     3420

ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg     3480

aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac     3540

accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagtata     3600

cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc     3660

tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt     3720

ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgaggcagca     3780

gatcaattcg cgcgcgaagg cgaagcggca tgcatttacg ttgacaccat cgaatggtgc     3840

aaaacctttc gcggtatggc atgatagcgc ccggaagaga gtcaattcag ggtggtgaat     3900

gtgaaaccag taacgttata cgatgtcgca gagtatgccg gtgtctctta tcagaccgtt     3960

tcccgcgtgg tgaaccaggc cagccacgtt tctgcgaaaa cgcgggaaaa agtggaagcg     4020

gcgatggcgg agctgaatta cattcccaac cgcgtggcac aacaactggc gggcaaacag     4080

tcgttgctga ttggcgttgc cacctccagt ctggccctgc acgcgccgtc gcaaattgtc     4140

gcggcgatta aatctcgcgc cgatcaactg ggtgccagcg tggtggtgtc gatggtagaa     4200

cgaagcggcg tcgaagcctg taaagcggcg gtgcacaatc ttctcgcgca acgcgtcagt     4260

gggctgatca ttaactatcc gctggatgac caggatgcca ttgctgtgga agctgcctgc     4320

actaatgttc cggcgttatt tcttgatgtc tctgaccaga cacccatcaa cagtattatt     4380

ttctcccatg aagacggtac gcgactgggc gtggagcatc tggtcgcatt gggtcaccag     4440

caaatcgcgc tgttagcggg cccattaagt tctgtctcgg cgcgtctgcg tctggctggc     4500

tggcataaat atctcactcg caatcaaatt cagccgatag cggaacggga aggcgactgg     4560

agtgccatgt ccggttttca acaaaccatg caaatgctga atgagggcat cgttcccact     4620

gcgatgctgg ttgccaacga tcagatggcg ctgggcgcaa tgcgcgccat taccgagtcc     4680

gggctgcgcg ttggtgcgga tatctcggta gtgggatacg acgataccga agacagctca     4740

tgttatatcc cgccgtcaac caccatcaaa caggattttc gcctgctggg gcaaaccagc     4800

gtggaccgct tgctgcaact ctctcagggc caggcggtga agggcaatca gctgttgccc     4860

gtctcactgg tgaaaagaaa aaccaccctg gcgcccaata cgcaaaccgc ctctccccgc     4920

gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag     4980

tgagcgcaac gcaattaatg tgagttagcg cgaattgatc tg                        5022


<210>  4
<211>  4955
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid pF25-6brBglII (SEQ ID NO. 4)

<400>  4
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt      180

gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt      240

gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg      300

acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca      360

aggccgcatc tcgagggtct caaagatggt aacaggtgga ctttcaggcc ctgctataaa      420

gccaattgca gttagaatgg tttatgagtg tttcaagaaa gttagaatac ctattgttgg      480

catgggagga attatgaatt acaaagatgc cattgagttt tttattgctg gtgcaactgc      540

tattcagata ggtactgtaa attttattaa tccaaaagct gtttgcgaaa taaaagaggg      600

aattgaggct tatcttgaaa gaaagggttt taattccata aaagagcttg taggtaacat      660

aaatatctga ggtgaaaata aaaatgtcga agacagttgt atatcttatt cgccacgcag      720

aggcagaagg aaattttata agaaggtttc atggtattac agattctaat gtaacagaaa      780

agggtaaatt acaagctcaa aaacttgcag aaagattaaa aaatgtccat tttgatgtga      840

tttattcaag tcctctgaaa agagcccttt atactgcaag caaaatagca gagggaagag      900

atataaagat tatagtaaga gaagatttga tagagataaa cggcggagag tgggaagaca      960

ggtgctggga tgaacttcca ttgctttatc caacagagta tgaaatgtgg gagaaaatgc     1020

ctcacaagca ctgtatgcca aatggtgaaa gtatgtatga acttttcttg agagcaaaat     1080

ctgcttttga ggacataata aagtcaaata tgggaaaaag aatatgcatc gtaacccacg     1140

gaacattaat tagggctctt cttacgtata taaaaggata tgaatttgaa agactcaacg     1200

agattttgtg gcaggacaat actgctttaa atataatcga gtataaagaa ggaaaatacc     1260

accttgtggt tgagggcgac tggtctcacc ttggaaagga gctttccaca atagcatatc     1320

aagactggtg gcaacagttt ttaaaagaaa gaggaattga aaaacaagat ttaactatca     1380

ttgaaaggag agaaagttat gaataaagag gcttacattc aaatgttcaa agacacagat     1440

gcacttttgg aaggacattt tcttttgtcc tctggaaaac acagtgcaaa gtaccttcaa     1500

tgtgcaaaag tgttgcagta cccaaacttg gcagaaatga tctgcaggga ccttgcacaa     1560

tactttaaag ataagcaaat tgacgttgtt ataggccctg cgttgggagc agtaacgctt     1620

tcgtacgaac ttgcaagaca gttaaattgc cgttccatct ttgcagaaag agaagatggg     1680

ataatgaaac ttagaagagg atttaagatt gaagagggag aaaaagtttt ggtagttgaa     1740

gacgtcataa caacaggcgg gtctgtgaaa gaaataattg aaattgtaaa agagtacaaa     1800

ggagaaattg tggcagttgc tggcattgta gatagaagtg gtggaaaggt agaacttggc     1860

tatcctttga aaactcttct tacacttgag attgaaacat atgagcctga agagtgtccg     1920

ctttgtaaag aaggtatacc tattgtaaaa cctggaagta gaaaaagtaa gtagatctaa     1980

tttgcttgca taaatgatta aaattattaa ttccaaaaat gatattgaca gttaaattat     2040

aaaagtgata gaatattata aacattagtt ggaaatatgg aggataaata aaatttgaga     2100

atatgtccaa aatgcggtga attaaatggt gaaaatagga cagaatgttg gaagtgcgga     2160

tctattttgg gtccagtaga taaatacaaa aaaatttgtc tgaaatgtgg gcgtatatac     2220

cctcaaaaag cagagatatg cgatgaatgt ggtggaaagt tggcagttta cgatgtagat     2280

acgaattaca ataacacgaa aactgacagt agtgtaggtt ggctatacat agtttcaata     2340

ttgtttccaa ttgttggtat tattttgggt tgtatttata tagcaagaag agaagataat     2400

ttgggaaaat ctttgattat aacaagcata gttgttatag ttatttcaat ctttatgagt     2460

ttactctttg tcagttgttc tcctaacttt tgatttttga taaaaaatat aaaaaaatgc     2520

agggtctttt aaaaacaact ttgctgaaca agataaccct gcattttttt acatttttaa     2580

aaatcagcca cccaaataag cttttttgac ctcaggattt gctgcaatct ctttagcctc     2640

tccggataaa acaattctac ctgtttcaat tacatacgcc ctgtctgcga ttgaaagtgc     2700

catgtgggca ttttgttcaa tcaaaagtat agttgtcccc tgagaattaa tttcttttat     2760

tatcttaaaa atctctgtta caagtatagg tgcaagtccc atagaaggct catcaagcag     2820

aagtaatttg ggtcttgaca ttagagacct tccgattgcc agcatttgct gttctccgcc     2880

agaaagtgtc cctgcaagtt gattttttct ttcatatagt cttggaaatc tttcaaacac     2940

aagttctaag tcttttttta ttgcctgttt atctttcggt accctgggcc tcatgggcct     3000

tccgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taacatggtc     3060

atagctgttt ccttgcgtat tgggcgctct ccgcttcctc gctcactgac tcgctgcgct     3120

cggtcgttcg ggtaaagcct ggggtgccta atgagcaaaa ggccagcaaa aggccaggaa     3180

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca     3240

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc     3300

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata     3360

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta     3420

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca     3480

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga     3540

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg     3600

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg     3660

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg     3720

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag     3780

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa     3840

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat     3900

ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc     3960

tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc     4020

atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc     4080

tggccccagt gctgcaatga taccgcgaga accacgctca ccggctccag atttatcagc     4140

aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc     4200

catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt     4260

gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc     4320

ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa     4380

aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt     4440

atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg     4500

cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc     4560

gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa     4620

agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt     4680

gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt     4740

caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag     4800

ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta     4860

tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat     4920

aggggttccg cgcacatttc cccgaaaagt gccac                                4955


<210>  5
<211>  813
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid pTrc-EcMt1ORF02330 codon-optimized for expression in E. 
       coli (SEQ ID NO. 5)

<400>  5
atgatctata acaccattct gcatggtgat tgcgtgacca ttatgaaaga acatattccg       60

agcgaaagca tcgatctgat ttatgcagat ccgccttata atctgagcgg tcgtgatctg      120

attctgaaaa ataacaaaac cggtggtccg ttctacaaga tgaatgaaga atgggatagc      180

tgggactatg ataaatactg cgaattcacc tataattggc tgctggcaag ctatagcgtg      240

ctgaaaaaca atggtagcct gtatattagc tgcacctatc ataatattgg cgaggtgatt      300

tttctggcca aaaagattgg tttcaaactg aacaatattc tgacctgggt taaaaccaat      360

gccatgccga atattaccaa acgcaccttt aaacacacca ccgaatttgt ttgctggttt      420

gttaaaggtc ctggctggaa atttaactac aacgagatca aaatgctgaa ccctcgcaaa      480

accaaagatg gtagcgttaa acaaatggac gatttcttcg acttttttga aatgccgctg      540

gttcagggta aagaacgtat taaactggat aatggtcgtg cagcacatcc gaatcagaaa      600

ccggaaaaac tgctggaaat cattattacc gcaagcagtg atgaaggtga tattgttctg      660

gatccgtttt ttggcaccgg caccaccggt gtggttgcag aacgtatgaa tcgtaaatgg      720

attggcatcg aaatcaacga aacctatatc gagattgcca aaaagcgcat tgaagaggaa      780

cgtcgtaaaa atgttcagag cacctttatc taa                                   813


<210>  6
<211>  270
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation product of SEQ ID NO.5 (SEQ ID NO. 6)

<400>  6

Met Ile Tyr Asn Thr Ile Leu His Gly Asp Cys Val Thr Ile Met Lys 
1               5                   10                  15      


Glu His Ile Pro Ser Glu Ser Ile Asp Leu Ile Tyr Ala Asp Pro Pro 
            20                  25                  30          


Tyr Asn Leu Ser Gly Arg Asp Leu Ile Leu Lys Asn Asn Lys Thr Gly 
        35                  40                  45              


Gly Pro Phe Tyr Lys Met Asn Glu Glu Trp Asp Ser Trp Asp Tyr Asp 
    50                  55                  60                  


Lys Tyr Cys Glu Phe Thr Tyr Asn Trp Leu Leu Ala Ser Tyr Ser Val 
65                  70                  75                  80  


Leu Lys Asn Asn Gly Ser Leu Tyr Ile Ser Cys Thr Tyr His Asn Ile 
                85                  90                  95      


Gly Glu Val Ile Phe Leu Ala Lys Lys Ile Gly Phe Lys Leu Asn Asn 
            100                 105                 110         


Ile Leu Thr Trp Val Lys Thr Asn Ala Met Pro Asn Ile Thr Lys Arg 
        115                 120                 125             


Thr Phe Lys His Thr Thr Glu Phe Val Cys Trp Phe Val Lys Gly Pro 
    130                 135                 140                 


Gly Trp Lys Phe Asn Tyr Asn Glu Ile Lys Met Leu Asn Pro Arg Lys 
145                 150                 155                 160 


Thr Lys Asp Gly Ser Val Lys Gln Met Asp Asp Phe Phe Asp Phe Phe 
                165                 170                 175     


Glu Met Pro Leu Val Gln Gly Lys Glu Arg Ile Lys Leu Asp Asn Gly 
            180                 185                 190         


Arg Ala Ala His Pro Asn Gln Lys Pro Glu Lys Leu Leu Glu Ile Ile 
        195                 200                 205             


Ile Thr Ala Ser Ser Asp Glu Gly Asp Ile Val Leu Asp Pro Phe Phe 
    210                 215                 220                 


Gly Thr Gly Thr Thr Gly Val Val Ala Glu Arg Met Asn Arg Lys Trp 
225                 230                 235                 240 


Ile Gly Ile Glu Ile Asn Glu Thr Tyr Ile Glu Ile Ala Lys Lys Arg 
                245                 250                 255     


Ile Glu Glu Glu Arg Arg Lys Asn Val Gln Ser Thr Phe Ile 
            260                 265                 270 


<210>  7
<211>  813
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gene in Caldicellulosiruptor sp. strain DIB 104C (SEQ ID NO. 7)

<400>  7
ttgatttaca atacaatttt acatggtgat tgtgtaacga ttatgaaaga acatattcca       60

tcggaaagca tagatttaat ttatgctgac ccaccttaca atttgtcagg cagagatctt      120

atactaaaaa ataacaagac tggtggtcca ttttataaaa tgaatgaaga atgggacagc      180

tgggattatg acaaatactg tgaattcact tataattggc ttttagcctc atattctgtc      240

ttgaaaaata atggtagttt gtatatttct tgtacttacc ataacattgg agaagttata      300

tttttagcaa aaaagatagg ctttaaatta aacaatatat tgacatgggt caagacaaat      360

gctatgccaa atattactaa acgtacattt aaacacacaa cagaatttgt ttgttggttt      420

gttaaaggcc ctggatggaa atttaactat aatgaaatta aaatgcttaa tccaagaaaa      480

acaaaagatg gctctgttaa gcaaatggat gacttttttg atttctttga aatgcctctt      540

gttcaaggaa aagaaagaat taagttagac aatggcagag ccgcacatcc aaatcaaaaa      600

cctgaaaaat tgttggaaat aataattacc gcttcaagtg atgaaggaga tatagtatta      660

gatccttttt ttggaacagg aacaactggt gttgttgctg agcgtatgaa tagaaaatgg      720

attggaattg aaataaacga aacctacatt gaaattgcca aaaagagaat tgaagaggag      780

agaagaaaaa atgtgcaaag tacatttatt taa                                   813


<210>  8
<211>  270
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation product of SEQ ID NO.7 (SEQ ID NO.8)

<400>  8

Met Ile Tyr Asn Thr Ile Leu His Gly Asp Cys Val Thr Ile Met Lys 
1               5                   10                  15      


Glu His Ile Pro Ser Glu Ser Ile Asp Leu Ile Tyr Ala Asp Pro Pro 
            20                  25                  30          


Tyr Asn Leu Ser Gly Arg Asp Leu Ile Leu Lys Asn Asn Lys Thr Gly 
        35                  40                  45              


Gly Pro Phe Tyr Lys Met Asn Glu Glu Trp Asp Ser Trp Asp Tyr Asp 
    50                  55                  60                  


Lys Tyr Cys Glu Phe Thr Tyr Asn Trp Leu Leu Ala Ser Tyr Ser Val 
65                  70                  75                  80  


Leu Lys Asn Asn Gly Ser Leu Tyr Ile Ser Cys Thr Tyr His Asn Ile 
                85                  90                  95      


Gly Glu Val Ile Phe Leu Ala Lys Lys Ile Gly Phe Lys Leu Asn Asn 
            100                 105                 110         


Ile Leu Thr Trp Val Lys Thr Asn Ala Met Pro Asn Ile Thr Lys Arg 
        115                 120                 125             


Thr Phe Lys His Thr Thr Glu Phe Val Cys Trp Phe Val Lys Gly Pro 
    130                 135                 140                 


Gly Trp Lys Phe Asn Tyr Asn Glu Ile Lys Met Leu Asn Pro Arg Lys 
145                 150                 155                 160 


Thr Lys Asp Gly Ser Val Lys Gln Met Asp Asp Phe Phe Asp Phe Phe 
                165                 170                 175     


Glu Met Pro Leu Val Gln Gly Lys Glu Arg Ile Lys Leu Asp Asn Gly 
            180                 185                 190         


Arg Ala Ala His Pro Asn Gln Lys Pro Glu Lys Leu Leu Glu Ile Ile 
        195                 200                 205             


Ile Thr Ala Ser Ser Asp Glu Gly Asp Ile Val Leu Asp Pro Phe Phe 
    210                 215                 220                 


Gly Thr Gly Thr Thr Gly Val Val Ala Glu Arg Met Asn Arg Lys Trp 
225                 230                 235                 240 


Ile Gly Ile Glu Ile Asn Glu Thr Tyr Ile Glu Ile Ala Lys Lys Arg 
                245                 250                 255     


Ile Glu Glu Glu Arg Arg Lys Asn Val Gln Ser Thr Phe Ile 
            260                 265                 270 


<210>  9
<211>  744
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gene in plasmid pTrc-EcMt2ORF02331 codon-optimized for expression
       in E. coli (SEQ ID NO. 9)

<400>  9
atgtgcaaag tgcacctgtt taatgatgat tgcctgaacg tgctgaaaaa gatcgaagat       60

aatagcatcg atctgatctt tgcagatccg ccttataatc tgagcagcga aaatgcactg      120

accacacgtg caggtaaacc ggttaaatgt tataaaggcg agtgggataa aatcgacgat      180

atctttgaat ttaacctgcg ctggattgaa cagtgtgttc gtgttctgaa agaaaccggc      240

accatttgga ttagcggcac cctgcataat catccgatta ttggcaccat tctgaaacag      300

ttaggtctgt ggattatcaa cgatatcatt tggttcaaac cgaatgcaac accgctgctg      360

agccgtaatc gttttgttcc gagcaccgaa ctgatttggg ttgcaagcaa aaacaaaaaa      420

tactactttg attatgaaat ggcacgcaaa ctgaatggtg gtaaacaaat gcgtaatctg      480

tgggaaattc cggcacagcg tcataaaaca ccgcatccga ccgaaaaacc ggaagcactg      540

ctggaacgta ttattctgat tggtagcaaa gaaggtgacg ttgttctgga tccgtttatg      600

ggtagcggta caaccggtgt tgttgcaaaa ctgctgaaac gcaactttat tggcattgaa      660

attgatccgg tgtatttcga gattgccaaa aaacgcatcg aagaagaaaa actgattcag      720

cagaccttta gcaacttcct gtaa                                             744


<210>  10
<211>  247
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation product of SEQ ID NO. 9 (SEQ ID NO. 10)

<400>  10

Met Cys Lys Val His Leu Phe Asn Asp Asp Cys Leu Asn Val Leu Lys 
1               5                   10                  15      


Lys Ile Glu Asp Asn Ser Ile Asp Leu Ile Phe Ala Asp Pro Pro Tyr 
            20                  25                  30          


Asn Leu Ser Ser Glu Asn Ala Leu Thr Thr Arg Ala Gly Lys Pro Val 
        35                  40                  45              


Lys Cys Tyr Lys Gly Glu Trp Asp Lys Ile Asp Asp Ile Phe Glu Phe 
    50                  55                  60                  


Asn Leu Arg Trp Ile Glu Gln Cys Val Arg Val Leu Lys Glu Thr Gly 
65                  70                  75                  80  


Thr Ile Trp Ile Ser Gly Thr Leu His Asn His Pro Ile Ile Gly Thr 
                85                  90                  95      


Ile Leu Lys Gln Leu Gly Leu Trp Ile Ile Asn Asp Ile Ile Trp Phe 
            100                 105                 110         


Lys Pro Asn Ala Thr Pro Leu Leu Ser Arg Asn Arg Phe Val Pro Ser 
        115                 120                 125             


Thr Glu Leu Ile Trp Val Ala Ser Lys Asn Lys Lys Tyr Tyr Phe Asp 
    130                 135                 140                 


Tyr Glu Met Ala Arg Lys Leu Asn Gly Gly Lys Gln Met Arg Asn Leu 
145                 150                 155                 160 


Trp Glu Ile Pro Ala Gln Arg His Lys Thr Pro His Pro Thr Glu Lys 
                165                 170                 175     


Pro Glu Ala Leu Leu Glu Arg Ile Ile Leu Ile Gly Ser Lys Glu Gly 
            180                 185                 190         


Asp Val Val Leu Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Val Val 
        195                 200                 205             


Ala Lys Leu Leu Lys Arg Asn Phe Ile Gly Ile Glu Ile Asp Pro Val 
    210                 215                 220                 


Tyr Phe Glu Ile Ala Lys Lys Arg Ile Glu Glu Glu Lys Leu Ile Gln 
225                 230                 235                 240 


Gln Thr Phe Ser Asn Phe Leu 
                245         


<210>  11
<211>  744
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Gene from Caldicellulosiruptor sp. strain DIB 104C (SEQ ID NO. 
       11)

<400>  11
atgtgcaaag tacatttatt taatgatgat tgtttgaatg ttctaaaaaa gatagaagac       60

aatagtatag atctgatttt tgctgatcct ccttacaatt tgtcttcaga aaatgctcta      120

accacgagag ccggtaaacc agtaaaatgt tataaaggcg aatgggataa aatagatgat      180

atatttgagt ttaatctaag gtggatcgag caatgtgtca gagtacttaa agaaactgga      240

actatttgga tttctggaac attgcataat catcctataa ttggaactat tctgaagcag      300

ttgggtctct ggattatcaa tgacattata tggtttaaac ctaatgcaac tcctttactt      360

tcaagaaata gatttgttcc atctacagaa ttgatctggg ttgcaagtaa aaataaaaaa      420

tattattttg actatgagat ggcacgaaag cttaatggag gcaaacagat gagaaactta      480

tgggaaattc ctgctcaaag gcataagact cctcatccta ctgaaaaacc tgaagcattg      540

ttagaaagga ttattctaat aggcagtaaa gaaggggatg tggtcttaga tccttttatg      600

ggctctggaa caactggcgt tgtagctaaa ttgcttaaac gaaactttat tggaattgaa      660

attgatccag tatactttga gattgcaaaa aaacgtattg aggaagaaaa gcttattcag      720

caaacttttt caaattttct ttaa                                             744


<210>  12
<211>  247
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Translation product of SEQ ID NO. 11 (SEQ ID NO. 12)

<400>  12

Met Cys Lys Val His Leu Phe Asn Asp Asp Cys Leu Asn Val Leu Lys 
1               5                   10                  15      


Lys Ile Glu Asp Asn Ser Ile Asp Leu Ile Phe Ala Asp Pro Pro Tyr 
            20                  25                  30          


Asn Leu Ser Ser Glu Asn Ala Leu Thr Thr Arg Ala Gly Lys Pro Val 
        35                  40                  45              


Lys Cys Tyr Lys Gly Glu Trp Asp Lys Ile Asp Asp Ile Phe Glu Phe 
    50                  55                  60                  


Asn Leu Arg Trp Ile Glu Gln Cys Val Arg Val Leu Lys Glu Thr Gly 
65                  70                  75                  80  


Thr Ile Trp Ile Ser Gly Thr Leu His Asn His Pro Ile Ile Gly Thr 
                85                  90                  95      


Ile Leu Lys Gln Leu Gly Leu Trp Ile Ile Asn Asp Ile Ile Trp Phe 
            100                 105                 110         


Lys Pro Asn Ala Thr Pro Leu Leu Ser Arg Asn Arg Phe Val Pro Ser 
        115                 120                 125             


Thr Glu Leu Ile Trp Val Ala Ser Lys Asn Lys Lys Tyr Tyr Phe Asp 
    130                 135                 140                 


Tyr Glu Met Ala Arg Lys Leu Asn Gly Gly Lys Gln Met Arg Asn Leu 
145                 150                 155                 160 


Trp Glu Ile Pro Ala Gln Arg His Lys Thr Pro His Pro Thr Glu Lys 
                165                 170                 175     


Pro Glu Ala Leu Leu Glu Arg Ile Ile Leu Ile Gly Ser Lys Glu Gly 
            180                 185                 190         


Asp Val Val Leu Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Val Val 
        195                 200                 205             


Ala Lys Leu Leu Lys Arg Asn Phe Ile Gly Ile Glu Ile Asp Pro Val 
    210                 215                 220                 


Tyr Phe Glu Ile Ala Lys Lys Arg Ile Glu Glu Glu Lys Leu Ile Gln 
225                 230                 235                 240 


Gln Thr Phe Ser Asn Phe Leu 
                245         


<210>  13
<211>  1785
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence of gene DOCILOBI_02329 encoding for restriction enzyme 
       of type II restriction modification system (gene from 
       Caldicellulosiruptor sp. strain DIB 104C) (SEQ ID NO. 13)

<400>  13
atgcgaaagc cttggtctat ttccacaact gttagaaacc ccgaaagatt aagaggattt       60

ttgcaagtgt tatctgagtt tgaaggaatg aattttgatg agaacgttca aattcaatat      120

cagatacgat taatacaata caaactctac agacctatga atttacctga agatataaaa      180

cgtgagtttg atgaccccgc aattaaaaaa atagattaca aaaaagctaa gaagattttt      240

gatttacaaa aatataaaga tccttctatg cgtggtcggc agtctgtaaa tcccttgaac      300

aaattaggat ttgcaattgc caaaaaaagc ttaggcaaga ttagaataac tgagttggga      360

agaaaatttt tagatgaaaa taacgatgtc tccgagatac ttttcaaaag tcttcttaaa      420

ttacagtacc caaatccttt tagtagcgat tttaaaagca gagatggctt tgatatcatt      480

ccttttatag taactttgaa gtttttctat ttgttagaaa aacaaacgca aatagaagaa      540

atttcaaaaa gagaattttg tttattcatt ccaaccctca ttaactataa agaaatagaa      600

aatcaaatta accagctact tgattataga agatcaaaaa ataagagaga ttttgaagta      660

catttcgtct ctaagttttt tggttcagct aaaaatattg agactaaaat caacaatttg      720

tttgattatg gagataacat cttacgattt tttcttctta caaaatgttt taccgctaaa      780

aaaagtgaat ttggacaagt tgccagtgta aaactttcac aagatagaaa aaaggaaatc      840

gaagaattac taaatatgtt tgaaggaaaa gcaataagct tttctaattt agatgactac      900

atcgaatata tgactgatat cacaaaacct gaattgccgt gggaaaaaaa taagaataaa      960

cttattgaga tggcagaaag tatccggaaa gatatttcac agcaaataga aagcagtaaa     1020

attgcaataa atgaatactc taaaatggtt ttaggaaaaa acttatttga gttatctgag     1080

gaagaattga aaaaacatat caatgaactg agggcgataa aatccaaaat taacgaagct     1140

aaaaaggctc atatcttaaa atataacttt gctaaattag acgaacatat taaaatacta     1200

aggaataaag aatattggaa ggaattagaa ccagccgatt tagaacaaat aatattcgag     1260

ctattgttaa ttattgatag tgcagaaaaa attgaatcaa atgctattaa agatgatgaa     1320

ggaaatttta taaattttgc accagctaaa aaacctgata ttgaattttt cttcaaggaa     1380

tttgctggaa tttgtgaagt aacactaaac aaaactcaat atcaatggat tcaagaggga     1440

tatcctgtat tagatcacgt ggcaaaattt atgaatcaat accctaatta tactaatttt     1500

gtaaatattt ttattgctcc aaaaatacat gataatacct attataattt tttcattgct     1560

ttaaaatacg gatttaagag taagaaaatc agaattatcc ctttaaattt tgaacagttt     1620

actatgttca caaaagtgct tcaaagctat tttgaaaagt tcaatggttt aaattcaaac     1680

ttaattttga ccctttgtaa tgatatattt tacacaatgg aaaatttaga tgaccactca     1740

atgatatgta gtttaattga caataagctt tatagtctat tttag                     1785


<210>  14
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primers MCUP180

<400>  14
agatcaaagg atcttcttga gatc                                              24


<210>  15
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BLU32

<400>  15
aagaaatagc ggtctgacgc tcagtggaac g                                      31


<210>  16
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 054_pyrEup_f

<400>  16
cttgtccgaa cgtgaaagaa ggtggaatgg                                        30


<210>  17
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  055_pyrEdw_r

<400>  17
ttggcatttc tcacgtgcca gaaggaagac                                        30


<210>  18
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer BLU001

<400>  18
aggtggactt tcaggccctg ctataaagcc                                        30


<210>  19
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  BLU002

<400>  19
aacttgcagg gacactttct ggcggagaac                                        30


