                               SEQUENCE LISTING

<110> QTEROS, INC.
 
<120> BIOCATALYSTS SYNTHESIZING DEREGULATED CELLULASES

<130> 37836-740.601

<140>
<141>

<150> 13/270,166
<151> 2011-10-10

<150> 61/442,120
<151> 2011-02-11

<150> 61/436,575
<151> 2011-01-26

<160> 44

<170> PatentIn version 3.5

<210> 1
<211> 4904
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      plasmid pIMPCphy polynucleotide

<400> 1
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca       60

cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct      120

cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat      180

tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaaagcttt      240

ggctaacaca cacgccattc caaccaatag ttttctcggc ataaagccat gctctgacgc      300

ttaaatgcac taatgcctta aaaaaacatt aaagtctaac acactagact tatttacttc      360

gtaattaagt cgttaaaccg tgtgctctac gaccaaaagt ataaaacctt taagaacttt      420

cttttttctt gtaaaaaaag aaactagata aatctctcat atcttttatt caataatcgc      480

atcagattgc agtataaatt taacgatcac tcatcatgtt catatttatc agagctcctt      540

atattttatt tcgatttatt tgttatttat ttaacatttt tctattgacc tcatcttttc      600

tatgtgttat tcttttgtta attgtttaca aataatctac gatacataga aggaggaaaa      660

actagtatac tagtatgaac gagaaaaata taaaacacag tcaaaacttt attacttcaa      720

aacataatat agataaaata atgacaaata taagattaaa tgaacatgat aatatctttg      780

aaatcggctc aggaaaaggg cattttaccc ttgaattagt acagaggtgt aatttcgtaa      840

ctgccattga aatagaccat aaattatgca aaactacaga aaataaactt gttgatcacg      900

ataatttcca agttttaaac aaggatatat tgcagtttaa atttcctaaa aaccaatcct      960

ataaaatatt tggtaatata ccttataaca taagtacgga tataatacgc aaaattgttt     1020

ttgatagtat agctgatgag atttatttaa tcgtggaata cgggtttgct aaaagattat     1080

taaatacaaa acgctcattg gcattatttt taatggcaga agttgatatt tctatattaa     1140

gtatggttcc aagagaatat tttcatccta aacctaaagt gaatagctca cttatcagat     1200

taaatagaaa aaaatcaaga atatcacaca aagataaaca gaagtataat tatttcgtta     1260

tgaaatgggt taacaaagaa tacaagaaaa tatttacaaa aaatcaattt aacaattcct     1320

taaaacatgc aggaattgac gatttaaaca atattagctt tgaacaattc ttatctcttt     1380

tcaatagcta taaattattt aataagtaag ttaagggatg cataaactgc atcccttaac     1440

ttgtttttcg tgtacctatt ttttgtgaat cgatccggcc agcctcgcag agcaggattc     1500

ccgttgagca ccgccaggtg cgaataaggg acagtgaaga aggaacaccc gctcgcgggt     1560

gggcctactt cacctatcct gcccggatcg attatgtctt ttgcgcattc acttcttttc     1620

tatataaata tgagcgaagc gaataagcgt cggaaaagca gcaaaaagtt tcctttttgc     1680

tgttggagca tgggggttca gggggtgcag tatctgacgt caatgccgag cgaaagcgag     1740

ccgaagggta gcatttacgt tagataaccc cctgatatgc tccgacgctt tatatagaaa     1800

agaagattca actaggtaaa atcttaatat aggttgagat gataaggttt ataaggaatt     1860

tgtttgttct aatttttcac tcattttgtt ctaatttctt ttaacaaatg ttcttttttt     1920

tttagaacag ttatgatata gttagaatag tttaaaataa ggagtgagaa aaagatgaaa     1980

gaaagatatg gaacagtcta taaaggctct cagaggctca tagacgaaga aagtggagaa     2040

gtcatagagg tagacaagtt ataccgtaaa caaacgtctg gtaacttcgt aaaggcatat     2100

atagtgcaat taataagtat gttagatatg attggcggaa aaaaacttaa aatcgttaac     2160

tatatcctag ataatgtcca cttaagtaac aatacaatga tagctacaac aagagaaata     2220

gcaaaagcta caggaacaag tctacaaaca gtaataacaa cacttaaaat cttagaagaa     2280

ggaaatatta taaaaagaaa aactggagta ttaatgttaa accctgaact actaatgaga     2340

ggcgacgacc aaaaacaaaa atacctctta ctcgaatttg ggaactttga gcaagaggca     2400

aatgaaatag attgacctcc caataacacc acgtagttat tgggaggtca atctatgaaa     2460

tgcgattaag cttagcttgg ctgcaggtcg acggatcccc gggaattcac tggccgtcgt     2520

tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca     2580

tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca     2640

gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat tttctcctta cgcatctgtg     2700

cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt     2760

aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc     2820

ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc     2880

accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt     2940

taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg     3000

cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca     3060

ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt     3120

ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga     3180

aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga     3240

actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat     3300

gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca     3360

agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt     3420

cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac     3480

catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct     3540

aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga     3600

gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac     3660

aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat     3720

agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg     3780

ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc     3840

actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc     3900

aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg     3960

gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta     4020

atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg     4080

tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga     4140

tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt     4200

ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag     4260

agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa     4320

ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag     4380

tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca     4440

gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac     4500

cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa     4560

ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc     4620

agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg     4680

tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc     4740

ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc     4800

ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag     4860

ccgaacgccg agcgcagcga gtcagtgagc gaggaagcgg aaga                      4904


<210> 2
<211> 1434
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_2437; propionyl-CoA carboxylase

<400> 2
atgactagta catcacaact gtcagcaagg gacaggattg tcactttgct tgacgagaac       60

agttttgttg aggttggtgc tttggttact agaagaagta ctgatttcaa cctgcaacaa      120

aaagaagtac cttctgatgg tgttatcaca ggttatggaa ttattgacgg taatcctgtc      180

tatgtttata gccaggatgt agctgtaatg aatggttcca taggtgaaat gcacgcaaag      240

aaaatatcaa atttatatga tcttgcaatg aaggttggtg cacctgttat cggtttgatt      300

gattgtgccg gcttgagatt acaggaagca acggacgcat tagctggatt tggagatctt      360

tatctaaaac aatccttagc atccggggta atcccacaga tcacagcaat ttttggtaat      420

tgcggaggag gcagtgccat atccgcatct ctatgcgact ttgtcttcat ggaagagaag      480

aacgcaaaat tattcgtaaa tgctccaaat gcaattctag gaaacaattc tactaaatgt      540

gataccgcaa cagcttcttt tatggctgaa gcaggtgttg ttgattttgt cgaagcagat      600

gaagaatctg tactaaacag cattcgtaac ttagttgcaa tgttacctgc aaacaatgaa      660

gacgatgctt cttatgagga atgcaccgat gacttaaatc gtgctttatc ctcctttacc      720

tctgaactta gtgatacaac tcttgcactt aaagatttat ctgaccatgg tattttcatc      780

gaattgaaga aggcatatgc aaaggacatg gtgacaggtt ttatcaaatt aaatggttta      840

actgttggtg ctgtagcaaa tcgtactgcg ttactcgatg aagatggtaa agtaatcgaa      900

aaatatgatg attctttatc tacagcaggt tgttataaag cagagaagtt tgtaaagttc      960

tgcaatgctt ttcaaattcc ggtacttact ctaacgaatg ttgctggata cagtgcaaca     1020

atgaaggatg caggatccat tgctatagcg acagcgaaat taacctatgc cttcgcgaat     1080

gcaacagtgc caaaagtaaa tgtcatttta gaaaaggcat acggtagtgc atacataaca     1140

atgaattcga aacatattgg tgctgatatg gtattcgctt tggatggttc catgattggt     1200

actatggatg ctgaacttgc agcgcagatt atgtatgctg ataacaaaga agaacaagcg     1260

ttaaaggcta cagagtataa agtattacag caaagtgcag agtcagcggc aaaacgtggt     1320

tatgtggatg ctattatatc accagagagt gtaagacaac atgtaatcta tgcatttgaa     1380

atgttattca caaagagaga gagccgacca agcaaaaagc atggaacggt ttag           1434


<210> 3
<211> 477
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_2437; propionyl-CoA carboxylase

<400> 3
Met Thr Ser Thr Ser Gln Leu Ser Ala Arg Asp Arg Ile Val Thr Leu 
1               5                   10                  15      


Leu Asp Glu Asn Ser Phe Val Glu Val Gly Ala Leu Val Thr Arg Arg 
            20                  25                  30          


Ser Thr Asp Phe Asn Leu Gln Gln Lys Glu Val Pro Ser Asp Gly Val 
        35                  40                  45              


Ile Thr Gly Tyr Gly Ile Ile Asp Gly Asn Pro Val Tyr Val Tyr Ser 
    50                  55                  60                  


Gln Asp Val Ala Val Met Asn Gly Ser Ile Gly Glu Met His Ala Lys 
65                  70                  75                  80  


Lys Ile Ser Asn Leu Tyr Asp Leu Ala Met Lys Val Gly Ala Pro Val 
                85                  90                  95      


Ile Gly Leu Ile Asp Cys Ala Gly Leu Arg Leu Gln Glu Ala Thr Asp 
            100                 105                 110         


Ala Leu Ala Gly Phe Gly Asp Leu Tyr Leu Lys Gln Ser Leu Ala Ser 
        115                 120                 125             


Gly Val Ile Pro Gln Ile Thr Ala Ile Phe Gly Asn Cys Gly Gly Gly 
    130                 135                 140                 


Ser Ala Ile Ser Ala Ser Leu Cys Asp Phe Val Phe Met Glu Glu Lys 
145                 150                 155                 160 


Asn Ala Lys Leu Phe Val Asn Ala Pro Asn Ala Ile Leu Gly Asn Asn 
                165                 170                 175     


Ser Thr Lys Cys Asp Thr Ala Thr Ala Ser Phe Met Ala Glu Ala Gly 
            180                 185                 190         


Val Val Asp Phe Val Glu Ala Asp Glu Glu Ser Val Leu Asn Ser Ile 
        195                 200                 205             


Arg Asn Leu Val Ala Met Leu Pro Ala Asn Asn Glu Asp Asp Ala Ser 
    210                 215                 220                 


Tyr Glu Glu Cys Thr Asp Asp Leu Asn Arg Ala Leu Ser Ser Phe Thr 
225                 230                 235                 240 


Ser Glu Leu Ser Asp Thr Thr Leu Ala Leu Lys Asp Leu Ser Asp His 
                245                 250                 255     


Gly Ile Phe Ile Glu Leu Lys Lys Ala Tyr Ala Lys Asp Met Val Thr 
            260                 265                 270         


Gly Phe Ile Lys Leu Asn Gly Leu Thr Val Gly Ala Val Ala Asn Arg 
        275                 280                 285             


Thr Ala Leu Leu Asp Glu Asp Gly Lys Val Ile Glu Lys Tyr Asp Asp 
    290                 295                 300                 


Ser Leu Ser Thr Ala Gly Cys Tyr Lys Ala Glu Lys Phe Val Lys Phe 
305                 310                 315                 320 


Cys Asn Ala Phe Gln Ile Pro Val Leu Thr Leu Thr Asn Val Ala Gly 
                325                 330                 335     


Tyr Ser Ala Thr Met Lys Asp Ala Gly Ser Ile Ala Ile Ala Thr Ala 
            340                 345                 350         


Lys Leu Thr Tyr Ala Phe Ala Asn Ala Thr Val Pro Lys Val Asn Val 
        355                 360                 365             


Ile Leu Glu Lys Ala Tyr Gly Ser Ala Tyr Ile Thr Met Asn Ser Lys 
    370                 375                 380                 


His Ile Gly Ala Asp Met Val Phe Ala Leu Asp Gly Ser Met Ile Gly 
385                 390                 395                 400 


Thr Met Asp Ala Glu Leu Ala Ala Gln Ile Met Tyr Ala Asp Asn Lys 
                405                 410                 415     


Glu Glu Gln Ala Leu Lys Ala Thr Glu Tyr Lys Val Leu Gln Gln Ser 
            420                 425                 430         


Ala Glu Ser Ala Ala Lys Arg Gly Tyr Val Asp Ala Ile Ile Ser Pro 
        435                 440                 445             


Glu Ser Val Arg Gln His Val Ile Tyr Ala Phe Glu Met Leu Phe Thr 
    450                 455                 460                 


Lys Arg Glu Ser Arg Pro Ser Lys Lys His Gly Thr Val 
465                 470                 475         


<210> 4
<211> 1608
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_3282; Two component AraC family transcriptional 
      regulator

<400> 4
atgctaaaag taatcatagc agatgatgaa gataaaatat gccaactaat attcaaattg       60

attgattggg attcccttga tatgaaggtt gaggctattg cacataacgg tattgatgct      120

ttggagttag caaagttaaa taatccggat ataataatta cagatatcag gatgcccggt      180

tatgacggtt tggatttcat atcaagaacc agggagataa atccggacat ccaatttgtt      240

atcatcagcg gctatcaaca atttgaatat gcccacaaag caattaagta tggagtgatc      300

gactatctac ttaagccaat taaaaaaaat gaacttttag ctactttaac caagataaag      360

aatcaatatc tggaacgggt agatctgttg acgaaagaag aacagtcaga gttaagtata      420

agaaataata tttataaatt gagaaccggg ctgtttaact cggtgctatt taataatagg      480

acaaaaaaca taaccattga ttttataaat agtaattatg gatacagatt taaggaaggt      540

ttattccaaa tcataattgt aaaattagat ggcgtgggac ggtcttattc cactacaatt      600

acctttcttg aagagaagct actaaaaatt attcataaca atctaaagga atattgttat      660

gatatggaga attattttga aaataatata atctattgtt tcatgaacta caattttgat      720

aacaagaaga atatccggca ccaatgtaag agattaatgg atgaattact tttacaaaga      780

gaaatatttg aaaatttaga ggtaacaatt ggacttggca cagtggaaac tgagattcct      840

atgattggta attcatacaa ggcagcagtc tgggcatatc agcaaagatt agtacttggt      900

accaataaga taatagaggg taaaataatc acctcaaata actttgttga cagcgatctg      960

tttcatgatt ttaacaaaga tatgaaagca gctctagaaa ggctggataa agaatcagtt     1020

acctcggcaa tccattacat aagagaaggt ttaaggaatc gaccggaaac cagtggacat     1080

gaacttcttc aaatgaccaa agagatttgc aatctatttt tatttaccat gcgttacaat     1140

aaatttcccg tagatggggg agataatttc cttgagaatt ttagcatgaa tgctaatgac     1200

attggttctg cttataaatt atatcaatat cttacagacg ttattgttaa tagcatggat     1260

aagataatag aagataagag acagagtgat acaagaccca tcagggatgc aaagcaattt     1320

attcaaacga actatacgag acagataaca cttgacgaag ttagcggaaa ggttggtttt     1380

aatgctacat attttagttc cttatttaag aaagagactg gttacacatt tttggaatat     1440

ctttctgaag tgcgtataaa taaggcaaag gaacttttaa aggataccaa ttatagtgtt     1500

atggcaatct gtgagcatgt ggggtacagt gatattaaac attttacgaa aacgtttgta     1560

aaacacacga atttgaaacc aaatgagtat cgaaagttat attcatga                  1608


<210> 5
<211> 535
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_3282; Two component AraC family transcriptional 
      regulator

<400> 5
Met Leu Lys Val Ile Ile Ala Asp Asp Glu Asp Lys Ile Cys Gln Leu 
1               5                   10                  15      


Ile Phe Lys Leu Ile Asp Trp Asp Ser Leu Asp Met Lys Val Glu Ala 
            20                  25                  30          


Ile Ala His Asn Gly Ile Asp Ala Leu Glu Leu Ala Lys Leu Asn Asn 
        35                  40                  45              


Pro Asp Ile Ile Ile Thr Asp Ile Arg Met Pro Gly Tyr Asp Gly Leu 
    50                  55                  60                  


Asp Phe Ile Ser Arg Thr Arg Glu Ile Asn Pro Asp Ile Gln Phe Val 
65                  70                  75                  80  


Ile Ile Ser Gly Tyr Gln Gln Phe Glu Tyr Ala His Lys Ala Ile Lys 
                85                  90                  95      


Tyr Gly Val Ile Asp Tyr Leu Leu Lys Pro Ile Lys Lys Asn Glu Leu 
            100                 105                 110         


Leu Ala Thr Leu Thr Lys Ile Lys Asn Gln Tyr Leu Glu Arg Val Asp 
        115                 120                 125             


Leu Leu Thr Lys Glu Glu Gln Ser Glu Leu Ser Ile Arg Asn Asn Ile 
    130                 135                 140                 


Tyr Lys Leu Arg Thr Gly Leu Phe Asn Ser Val Leu Phe Asn Asn Arg 
145                 150                 155                 160 


Thr Lys Asn Ile Thr Ile Asp Phe Ile Asn Ser Asn Tyr Gly Tyr Arg 
                165                 170                 175     


Phe Lys Glu Gly Leu Phe Gln Ile Ile Ile Val Lys Leu Asp Gly Val 
            180                 185                 190         


Gly Arg Ser Tyr Ser Thr Thr Ile Thr Phe Leu Glu Glu Lys Leu Leu 
        195                 200                 205             


Lys Ile Ile His Asn Asn Leu Lys Glu Tyr Cys Tyr Asp Met Glu Asn 
    210                 215                 220                 


Tyr Phe Glu Asn Asn Ile Ile Tyr Cys Phe Met Asn Tyr Asn Phe Asp 
225                 230                 235                 240 


Asn Lys Lys Asn Ile Arg His Gln Cys Lys Arg Leu Met Asp Glu Leu 
                245                 250                 255     


Leu Leu Gln Arg Glu Ile Phe Glu Asn Leu Glu Val Thr Ile Gly Leu 
            260                 265                 270         


Gly Thr Val Glu Thr Glu Ile Pro Met Ile Gly Asn Ser Tyr Lys Ala 
        275                 280                 285             


Ala Val Trp Ala Tyr Gln Gln Arg Leu Val Leu Gly Thr Asn Lys Ile 
    290                 295                 300                 


Ile Glu Gly Lys Ile Ile Thr Ser Asn Asn Phe Val Asp Ser Asp Leu 
305                 310                 315                 320 


Phe His Asp Phe Asn Lys Asp Met Lys Ala Ala Leu Glu Arg Leu Asp 
                325                 330                 335     


Lys Glu Ser Val Thr Ser Ala Ile His Tyr Ile Arg Glu Gly Leu Arg 
            340                 345                 350         


Asn Arg Pro Glu Thr Ser Gly His Glu Leu Leu Gln Met Thr Lys Glu 
        355                 360                 365             


Ile Cys Asn Leu Phe Leu Phe Thr Met Arg Tyr Asn Lys Phe Pro Val 
    370                 375                 380                 


Asp Gly Gly Asp Asn Phe Leu Glu Asn Phe Ser Met Asn Ala Asn Asp 
385                 390                 395                 400 


Ile Gly Ser Ala Tyr Lys Leu Tyr Gln Tyr Leu Thr Asp Val Ile Val 
                405                 410                 415     


Asn Ser Met Asp Lys Ile Ile Glu Asp Lys Arg Gln Ser Asp Thr Arg 
            420                 425                 430         


Pro Ile Arg Asp Ala Lys Gln Phe Ile Gln Thr Asn Tyr Thr Arg Gln 
        435                 440                 445             


Ile Thr Leu Asp Glu Val Ser Gly Lys Val Gly Phe Asn Ala Thr Tyr 
    450                 455                 460                 


Phe Ser Ser Leu Phe Lys Lys Glu Thr Gly Tyr Thr Phe Leu Glu Tyr 
465                 470                 475                 480 


Leu Ser Glu Val Arg Ile Asn Lys Ala Lys Glu Leu Leu Lys Asp Thr 
                485                 490                 495     


Asn Tyr Ser Val Met Ala Ile Cys Glu His Val Gly Tyr Ser Asp Ile 
            500                 505                 510         


Lys His Phe Thr Lys Thr Phe Val Lys His Thr Asn Leu Lys Pro Asn 
        515                 520                 525             


Glu Tyr Arg Lys Leu Tyr Ser 
    530                 535 


<210> 6
<211> 939
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0329; ROK family glucokinase

<400> 6
atggacaagt tttgctttgg aattgacatt ggtggcacaa caattaaatg cggattattt       60

acagcgaatg gtgaattaaa agaaaaatgg gagattcctt ctagaacaga aaatggcgga      120

attcaagtac cacaggatgt agcggatacg attgatgcca aattaaaaga attatccatt      180

gagaaaaagg atgttcttgg agttggtatt ggtgtacccg gtccaattac cgaagatgga      240

actgtcttaa aatgtgctaa tctaggttgg gatattttta atgtaaatga aaaaatgagt      300

gcacttaccg gactaaaggt agcaactgca aatgatgcca atgttgctgc tctaggagaa      360

atgtggatgg gcggtggcaa aggttataag aacatcgtta tggttactct tggaaccggc      420

gtaggtggcg gagttatttt aaatggtaag attgttgcag gaagtaatgg cggtggtggt      480

gaaatcggcc atatgacaat gaatcttgat gagaaagaga catgcggttg cggtaaacat      540

ggccatttag agcaatatgc ttctgctaca ggcattgtac gtcttgctaa aaaacgttta      600

ttagatacaa gtgttactac ttcacttcgt gagcttgccg aggtaacagc gaaagatatt      660

tttgatcacg cgaaagcggg agatacggtc gcactagagc ttgttgaaga actaggtaga      720

tatcttgggt tggcattatc tcatgttgct gctgcggttg atccacaggt atttgtaatt      780

ggtggtggtg tatcaagagc tggctctatg ttacttgatg tgatttcaaa atattataat      840

cagaatatca tattcgcact atcaaataaa gagttccgtc ttgctgaact tggaaatgat      900

gcagggatct atggttgtgc aaaattagtg attggctaa                             939


<210> 7
<211> 312
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0329; ROK family glucokinase

<400> 7
Met Asp Lys Phe Cys Phe Gly Ile Asp Ile Gly Gly Thr Thr Ile Lys 
1               5                   10                  15      


Cys Gly Leu Phe Thr Ala Asn Gly Glu Leu Lys Glu Lys Trp Glu Ile 
            20                  25                  30          


Pro Ser Arg Thr Glu Asn Gly Gly Ile Gln Val Pro Gln Asp Val Ala 
        35                  40                  45              


Asp Thr Ile Asp Ala Lys Leu Lys Glu Leu Ser Ile Glu Lys Lys Asp 
    50                  55                  60                  


Val Leu Gly Val Gly Ile Gly Val Pro Gly Pro Ile Thr Glu Asp Gly 
65                  70                  75                  80  


Thr Val Leu Lys Cys Ala Asn Leu Gly Trp Asp Ile Phe Asn Val Asn 
                85                  90                  95      


Glu Lys Met Ser Ala Leu Thr Gly Leu Lys Val Ala Thr Ala Asn Asp 
            100                 105                 110         


Ala Asn Val Ala Ala Leu Gly Glu Met Trp Met Gly Gly Gly Lys Gly 
        115                 120                 125             


Tyr Lys Asn Ile Val Met Val Thr Leu Gly Thr Gly Val Gly Gly Gly 
    130                 135                 140                 


Val Ile Leu Asn Gly Lys Ile Val Ala Gly Ser Asn Gly Gly Gly Gly 
145                 150                 155                 160 


Glu Ile Gly His Met Thr Met Asn Leu Asp Glu Lys Glu Thr Cys Gly 
                165                 170                 175     


Cys Gly Lys His Gly His Leu Glu Gln Tyr Ala Ser Ala Thr Gly Ile 
            180                 185                 190         


Val Arg Leu Ala Lys Lys Arg Leu Leu Asp Thr Ser Val Thr Thr Ser 
        195                 200                 205             


Leu Arg Glu Leu Ala Glu Val Thr Ala Lys Asp Ile Phe Asp His Ala 
    210                 215                 220                 


Lys Ala Gly Asp Thr Val Ala Leu Glu Leu Val Glu Glu Leu Gly Arg 
225                 230                 235                 240 


Tyr Leu Gly Leu Ala Leu Ser His Val Ala Ala Ala Val Asp Pro Gln 
                245                 250                 255     


Val Phe Val Ile Gly Gly Gly Val Ser Arg Ala Gly Ser Met Leu Leu 
            260                 265                 270         


Asp Val Ile Ser Lys Tyr Tyr Asn Gln Asn Ile Ile Phe Ala Leu Ser 
        275                 280                 285             


Asn Lys Glu Phe Arg Leu Ala Glu Leu Gly Asn Asp Ala Gly Ile Tyr 
    290                 295                 300                 


Gly Cys Ala Lys Leu Val Ile Gly 
305                 310         


<210> 8
<211> 825
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_3487; Hypothetical polynucleotide

<400> 8
atgagcgtgg atgtaatgga aaagtcatta aacgaaaaaa tcatcagcgt aaaccaaatt       60

cgagagattt tggagatgta caacataggg aaaaaaccat tggcgaagtt attggggtgg      120

ggagagacca cagtaattcg ctatctggaa ggagatattc caacttccga atatacagaa      180

aagttacaag agattgcaga aagcccaatg tactattatc gtattcttat ggagaaccaa      240

agtaaaatta ccaatgttgc tttcaagaaa agcaaaagag cagtattgga ggctatgatg      300

cgttcgaagc ttcatgtagc aacacaatat attattaatc aggcaaatgc tgaaataaat      360

gcaaggcaga ttcaatatat attgttctac tctcaaatac tttctctagt ttttttggga      420

gaagagatat ttgaagaaga tgcacagctg acttacaatc agatgcctta tttaaacctc      480

tatgaggctc taaagaagca agggattcgt acaattgaaa tgaaccctga taagttaagt      540

agaaatgata aagacattat agattgtgtc cacaatgcct gtggttggta tgggaataaa      600

gcttttttgg ccatcgcttc agtggagaga aaggctacgg aggaattatt agaagggctt      660

aaaagcaaaa tcttaacgaa agactattta agaaatgtat acggtaggat gtttggtcaa      720

ttccaaataa aacgctatca agattttcca aagtatctac agcagaggct tattcaggca      780

attggaaaag acgtattttc tgtgaatata tataaagaga tataa                      825


<210> 9
<211> 274
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_3487; Hypothetical protein

<400> 9
Met Ser Val Asp Val Met Glu Lys Ser Leu Asn Glu Lys Ile Ile Ser 
1               5                   10                  15      


Val Asn Gln Ile Arg Glu Ile Leu Glu Met Tyr Asn Ile Gly Lys Lys 
            20                  25                  30          


Pro Leu Ala Lys Leu Leu Gly Trp Gly Glu Thr Thr Val Ile Arg Tyr 
        35                  40                  45              


Leu Glu Gly Asp Ile Pro Thr Ser Glu Tyr Thr Glu Lys Leu Gln Glu 
    50                  55                  60                  


Ile Ala Glu Ser Pro Met Tyr Tyr Tyr Arg Ile Leu Met Glu Asn Gln 
65                  70                  75                  80  


Ser Lys Ile Thr Asn Val Ala Phe Lys Lys Ser Lys Arg Ala Val Leu 
                85                  90                  95      


Glu Ala Met Met Arg Ser Lys Leu His Val Ala Thr Gln Tyr Ile Ile 
            100                 105                 110         


Asn Gln Ala Asn Ala Glu Ile Asn Ala Arg Gln Ile Gln Tyr Ile Leu 
        115                 120                 125             


Phe Tyr Ser Gln Ile Leu Ser Leu Val Phe Leu Gly Glu Glu Ile Phe 
    130                 135                 140                 


Glu Glu Asp Ala Gln Leu Thr Tyr Asn Gln Met Pro Tyr Leu Asn Leu 
145                 150                 155                 160 


Tyr Glu Ala Leu Lys Lys Gln Gly Ile Arg Thr Ile Glu Met Asn Pro 
                165                 170                 175     


Asp Lys Leu Ser Arg Asn Asp Lys Asp Ile Ile Asp Cys Val His Asn 
            180                 185                 190         


Ala Cys Gly Trp Tyr Gly Asn Lys Ala Phe Leu Ala Ile Ala Ser Val 
        195                 200                 205             


Glu Arg Lys Ala Thr Glu Glu Leu Leu Glu Gly Leu Lys Ser Lys Ile 
    210                 215                 220                 


Leu Thr Lys Asp Tyr Leu Arg Asn Val Tyr Gly Arg Met Phe Gly Gln 
225                 230                 235                 240 


Phe Gln Ile Lys Arg Tyr Gln Asp Phe Pro Lys Tyr Leu Gln Gln Arg 
                245                 250                 255     


Leu Ile Gln Ala Ile Gly Lys Asp Val Phe Ser Val Asn Ile Tyr Lys 
            260                 265                 270         


Glu Ile 
        


<210> 10
<211> 1413
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1543; Dihydrolipoamide dehydrogenase

<400> 10
atggagaaaa tatatgattt gcttgttatt ggtgcaggcc ctggaggata tgtagcagcg       60

attaaggctg caaagctagg aatgaagaca gcagtgatag aaaacaggga agtcggcggt      120

acctgcttaa accgcggctg cgttcctgcg aaagccatgc tgcatgctgc aaaattatat      180

caggaggttc tgtccggaga acagttcgga atcctcgtgg aagaggtaag ctttgattac      240

ggcaaagtga tgtcctataa aaatgagact tctgaaagtc tcagacttgg agtagaacag      300

ctcctaaagg gaaataaagt ggaacgctta caagggatcg ggacgctttt gaaggatgga      360

agagttagaa ttaagacaaa ggaaggtgaa gaaattctcc aggcgaaaaa tatattgctg      420

gcaacaggat cgaagcctgt tcttccgccc attgaaggaa tccatcttcc cggaatcatg      480

accagcgatg agatgtttca acttgatcat gtaccggaaa gtctacttat tataggcggc      540

ggggtcatag gggtagaatt tgccacagta tactcatcat ttggttccaa agttacattg      600

ttggaggcgg aagaaagact tctacctggt ctagataagg aaatctcaca gaatataaag      660

ctgcttttga aaaagagagg cgttgatatt cataccagag catttgttca gaaaatagaa      720

aaggtggact gtgagtttat ctgtaccttt ttagagaagg gaaaggatca agaaaaagcc      780

gaggtccgaa agattccata cttgctttca gctaccggac gtatcccaaa tactcatggt      840

ctgctggagg agacaacatt actggaaatg gacagaggta ggattttggt aaatgaaaat      900

tttgaaacaa gcatgcctaa cgtgtttgct atcggtgatg ttattggagg aagccaacta      960

gcacatgttg caagttctca gggtatctgt gcagtggaac gaatgaatgg gaaagaaccg     1020

tcaattgatt tatccgtagt tccatcctgt gtttatacag atcctgaaat tgcatgtgtt     1080

ggaataacag aacaggaagc gaaagagaaa ggaatcgaga ccgttactgg aaagttttta     1140

acacatgcca acagtaaatc attgataaca aaggaagaga gaggctttgt taaagttgtt     1200

atagataagg aaacgaacgt attgctggga gcgcagatga tgtgcgccag agcgacagat     1260

atgatcggtg agatgggaac tgccatttcc aataaactta ctgcgatgca gcttttaaag     1320

gctatgcggg cacatcctac ctataatgag tccatagcgg aagcattgga ggattgtaac     1380

catggtgcga ttcatgcgtt accgcgcagg taa                                  1413


<210> 11
<211> 470
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1543; Dihydrolipoamide dehydrogenase

<400> 11
Met Glu Lys Ile Tyr Asp Leu Leu Val Ile Gly Ala Gly Pro Gly Gly 
1               5                   10                  15      


Tyr Val Ala Ala Ile Lys Ala Ala Lys Leu Gly Met Lys Thr Ala Val 
            20                  25                  30          


Ile Glu Asn Arg Glu Val Gly Gly Thr Cys Leu Asn Arg Gly Cys Val 
        35                  40                  45              


Pro Ala Lys Ala Met Leu His Ala Ala Lys Leu Tyr Gln Glu Val Leu 
    50                  55                  60                  


Ser Gly Glu Gln Phe Gly Ile Leu Val Glu Glu Val Ser Phe Asp Tyr 
65                  70                  75                  80  


Gly Lys Val Met Ser Tyr Lys Asn Glu Thr Ser Glu Ser Leu Arg Leu 
                85                  90                  95      


Gly Val Glu Gln Leu Leu Lys Gly Asn Lys Val Glu Arg Leu Gln Gly 
            100                 105                 110         


Ile Gly Thr Leu Leu Lys Asp Gly Arg Val Arg Ile Lys Thr Lys Glu 
        115                 120                 125             


Gly Glu Glu Ile Leu Gln Ala Lys Asn Ile Leu Leu Ala Thr Gly Ser 
    130                 135                 140                 


Lys Pro Val Leu Pro Pro Ile Glu Gly Ile His Leu Pro Gly Ile Met 
145                 150                 155                 160 


Thr Ser Asp Glu Met Phe Gln Leu Asp His Val Pro Glu Ser Leu Leu 
                165                 170                 175     


Ile Ile Gly Gly Gly Val Ile Gly Val Glu Phe Ala Thr Val Tyr Ser 
            180                 185                 190         


Ser Phe Gly Ser Lys Val Thr Leu Leu Glu Ala Glu Glu Arg Leu Leu 
        195                 200                 205             


Pro Gly Leu Asp Lys Glu Ile Ser Gln Asn Ile Lys Leu Leu Leu Lys 
    210                 215                 220                 


Lys Arg Gly Val Asp Ile His Thr Arg Ala Phe Val Gln Lys Ile Glu 
225                 230                 235                 240 


Lys Val Asp Cys Glu Phe Ile Cys Thr Phe Leu Glu Lys Gly Lys Asp 
                245                 250                 255     


Gln Glu Lys Ala Glu Val Arg Lys Ile Pro Tyr Leu Leu Ser Ala Thr 
            260                 265                 270         


Gly Arg Ile Pro Asn Thr His Gly Leu Leu Glu Glu Thr Thr Leu Leu 
        275                 280                 285             


Glu Met Asp Arg Gly Arg Ile Leu Val Asn Glu Asn Phe Glu Thr Ser 
    290                 295                 300                 


Met Pro Asn Val Phe Ala Ile Gly Asp Val Ile Gly Gly Ser Gln Leu 
305                 310                 315                 320 


Ala His Val Ala Ser Ser Gln Gly Ile Cys Ala Val Glu Arg Met Asn 
                325                 330                 335     


Gly Lys Glu Pro Ser Ile Asp Leu Ser Val Val Pro Ser Cys Val Tyr 
            340                 345                 350         


Thr Asp Pro Glu Ile Ala Cys Val Gly Ile Thr Glu Gln Glu Ala Lys 
        355                 360                 365             


Glu Lys Gly Ile Glu Thr Val Thr Gly Lys Phe Leu Thr His Ala Asn 
    370                 375                 380                 


Ser Lys Ser Leu Ile Thr Lys Glu Glu Arg Gly Phe Val Lys Val Val 
385                 390                 395                 400 


Ile Asp Lys Glu Thr Asn Val Leu Leu Gly Ala Gln Met Met Cys Ala 
                405                 410                 415     


Arg Ala Thr Asp Met Ile Gly Glu Met Gly Thr Ala Ile Ser Asn Lys 
            420                 425                 430         


Leu Thr Ala Met Gln Leu Leu Lys Ala Met Arg Ala His Pro Thr Tyr 
        435                 440                 445             


Asn Glu Ser Ile Ala Glu Ala Leu Glu Asp Cys Asn His Gly Ala Ile 
    450                 455                 460                 


His Ala Leu Pro Arg Arg 
465                 470 


<210> 12
<211> 936
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_2570; binding-protein-dependent transport systems 
      inner membrane component

<400> 12
atgaaatctg atacggtaaa aattgtttca aagcgtaagt ctagaattaa ttcaatcaac       60

ctgcctactg agattggcat taatattata cttggtctct tttgtttgat ttgcgttgtt      120

ccgtttatat ttgttgtcat tatttctttt acttcagaag agagtatacg agagattgga      180

tactccttta ccccgaccaa atggtcactg gaagcatata aattcgcatt caattccagc      240

aaagcaattt ggcgtgccta ctttaacagc ttttttatta ctattctggg tacgatatta      300

agtgttttga tttgtgtact atattcttat ccattattcc gaaaggattt taagtatcga      360

aaattcttta ccttcttctg tttctttact atgttgtttg ggggaggctt agtaccaacc      420

tattatgtta gtaagaatat cttgggctta agcgataatt atgcggcgtt aattgtaccg      480

gcgttattta atccgtttaa tattattgtt atgcgtactt tctttcaaag ttcggtgccg      540

acggatatca ttgaagctgc agcaattgat ggcagcggag aatataacac cttgttcaaa      600

ataatcgttc cgattgcaaa gcccggaatt gctaccattg cattactgaa tgcactggca      660

tactggaatg aatggtatct agcaatgctg tatatccgaa ctgaaagtct ttatccactg      720

cagtatttgc tgatgagaac gcagaatcag attgactttt taaccaggaa tgccgccatg      780

ctgggttcgg agatttcaaa gcttgtacat gatttaccgc agcaaaattt aagaatggcg      840

cttgctgtac ttattgtagt accgattgcc tttgcctatc cattcttcca gcggtttatc      900

atatcgggtc ttacaatagg atcagtaaaa gggtaa                                936


<210> 13
<211> 311
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_2570; binding-protein-dependent transport systems 
      inner membrane component

<400> 13
Met Lys Ser Asp Thr Val Lys Ile Val Ser Lys Arg Lys Ser Arg Ile 
1               5                   10                  15      


Asn Ser Ile Asn Leu Pro Thr Glu Ile Gly Ile Asn Ile Ile Leu Gly 
            20                  25                  30          


Leu Phe Cys Leu Ile Cys Val Val Pro Phe Ile Phe Val Val Ile Ile 
        35                  40                  45              


Ser Phe Thr Ser Glu Glu Ser Ile Arg Glu Ile Gly Tyr Ser Phe Thr 
    50                  55                  60                  


Pro Thr Lys Trp Ser Leu Glu Ala Tyr Lys Phe Ala Phe Asn Ser Ser 
65                  70                  75                  80  


Lys Ala Ile Trp Arg Ala Tyr Phe Asn Ser Phe Phe Ile Thr Ile Leu 
                85                  90                  95      


Gly Thr Ile Leu Ser Val Leu Ile Cys Val Leu Tyr Ser Tyr Pro Leu 
            100                 105                 110         


Phe Arg Lys Asp Phe Lys Tyr Arg Lys Phe Phe Thr Phe Phe Cys Phe 
        115                 120                 125             


Phe Thr Met Leu Phe Gly Gly Gly Leu Val Pro Thr Tyr Tyr Val Ser 
    130                 135                 140                 


Lys Asn Ile Leu Gly Leu Ser Asp Asn Tyr Ala Ala Leu Ile Val Pro 
145                 150                 155                 160 


Ala Leu Phe Asn Pro Phe Asn Ile Ile Val Met Arg Thr Phe Phe Gln 
                165                 170                 175     


Ser Ser Val Pro Thr Asp Ile Ile Glu Ala Ala Ala Ile Asp Gly Ser 
            180                 185                 190         


Gly Glu Tyr Asn Thr Leu Phe Lys Ile Ile Val Pro Ile Ala Lys Pro 
        195                 200                 205             


Gly Ile Ala Thr Ile Ala Leu Leu Asn Ala Leu Ala Tyr Trp Asn Glu 
    210                 215                 220                 


Trp Tyr Leu Ala Met Leu Tyr Ile Arg Thr Glu Ser Leu Tyr Pro Leu 
225                 230                 235                 240 


Gln Tyr Leu Leu Met Arg Thr Gln Asn Gln Ile Asp Phe Leu Thr Arg 
                245                 250                 255     


Asn Ala Ala Met Leu Gly Ser Glu Ile Ser Lys Leu Val His Asp Leu 
            260                 265                 270         


Pro Gln Gln Asn Leu Arg Met Ala Leu Ala Val Leu Ile Val Val Pro 
        275                 280                 285             


Ile Ala Phe Ala Tyr Pro Phe Phe Gln Arg Phe Ile Ile Ser Gly Leu 
    290                 295                 300                 


Thr Ile Gly Ser Val Lys Gly 
305                 310     


<210> 14
<211> 1842
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1682; ABC transporter related

<400> 14
atgagccata aagaagaata taaattaggg caatctaatc gtagaaaaca aatgggacca       60

ccaggaccag gtatgggcgg cgctggtgag aaagccaaag actttaaaac aacatgggta      120

aagttactta agtattgtaa aaaatattgg gcagttatgg taattgcgtt aagttgcgta      180

gtgatgggta ctgtattaac gttaattggt cctgacaaat tatcagaact tactgatttg      240

attacaaagg gaattgttac tggaatcgat ttagatggtg ttgctagaat tggttggaca      300

ttggtaatat tttatggact tagttttctg ctgtcattaa tacaaggtgt ggtgatggct      360

acgattacac aaaatgtatc gaaacagtta cgtggagata tttctgctaa gattaatcgt      420

ttgccgatgt ggttttataa taagacaaca acaggagatg tactttctcg tgtgacgaat      480

gacgtagata ccattgggca atctctaaat caaagtgtgg ggacattagt ttctgctatt      540

actttattag ttggatcttt gattatgatg ttaaagacaa atgtatttat gacaattaca      600

gcgattgttg caacttgtat tggatttgta ttgatgatgc ttatcatggg aaaatcacaa      660

aaatactttg tacgtcagca acagcatcta ggtgaaatca atggacatat tgaagaaatt      720

tatgatgggc ataccattgt aaaagcttat aatggtgaag ccaaagcacg agagaaattt      780

gtttatttaa ataatgaact tcgagaaagt aattttatgt ctcaatgttt atcagggtta      840

atgatgcctc tgatgtcgtt tattggtaac ttcggttatg tcgcagtttg tgtagttggt      900

gccgttcttg taatgaataa cacgatttcc tttggagtaa tcgtggcttt tatattatat      960

gtaagatatt ttacacaacc attaagtcag cttgcacaag ctgcccagtc tttgcaatcg     1020

gctgcagcag caggagaacg tgtgtttgaa ttccttgaag cggaagaaat ggaagatgag     1080

ttggcaaaga caagaaagct tgaaaatgtg caaggtaagg ttgattttga acatgtaaat     1140

tttggctatg aaggttctga taaagcaatc attaatgatt tctcggtttc caccaagcca     1200

ggtcaaaaag ttgcaattgt tggcccaact ggagcaggga aaactaccat tgtaaatctt     1260

ttaatgcgtt tccatgaaat acaaagtgga actattaaga ttgataatat tcctgttcat     1320

cagttacgac gtgaagatgt tcatgaacaa ttttgcatgg tacttcaaga tacgtggatt     1380

tttgaaggaa cggtccgtga aaatctagta tatagtacag aaaatgtttc tgagcaaacg     1440

ttagaagagg catgtaaagc agttggcttg catcatttta tacgtacgct tccacatgga     1500

tatgatacga tgttaaatga tcaattgagt ctatcggcag gacaaaaaca gcagcttaca     1560

atagcgagag ccatgattgc agaccgacca atgttaattc ttgatgaagc tacaagttct     1620

gtagatactc gtactgagtt aatcattcaa aatgctatgg atgagttaat gaaaggacgt     1680

acttcattta ttattgctca taggctatca actattaaaa atgcggattt gattcttgtt     1740

atgaaagatg gtgatatcat agagagcgga aaccataacg aattaatgaa acaaggcgga     1800

ttctatgcag acctatataa tagtcaattc gatgcggcgt aa                        1842


<210> 15
<211> 613
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1682; ABC transporter related 

<400> 15
Met Ser His Lys Glu Glu Tyr Lys Leu Gly Gln Ser Asn Arg Arg Lys 
1               5                   10                  15      


Gln Met Gly Pro Pro Gly Pro Gly Met Gly Gly Ala Gly Glu Lys Ala 
            20                  25                  30          


Lys Asp Phe Lys Thr Thr Trp Val Lys Leu Leu Lys Tyr Cys Lys Lys 
        35                  40                  45              


Tyr Trp Ala Val Met Val Ile Ala Leu Ser Cys Val Val Met Gly Thr 
    50                  55                  60                  


Val Leu Thr Leu Ile Gly Pro Asp Lys Leu Ser Glu Leu Thr Asp Leu 
65                  70                  75                  80  


Ile Thr Lys Gly Ile Val Thr Gly Ile Asp Leu Asp Gly Val Ala Arg 
                85                  90                  95      


Ile Gly Trp Thr Leu Val Ile Phe Tyr Gly Leu Ser Phe Leu Leu Ser 
            100                 105                 110         


Leu Ile Gln Gly Val Val Met Ala Thr Ile Thr Gln Asn Val Ser Lys 
        115                 120                 125             


Gln Leu Arg Gly Asp Ile Ser Ala Lys Ile Asn Arg Leu Pro Met Trp 
    130                 135                 140                 


Phe Tyr Asn Lys Thr Thr Thr Gly Asp Val Leu Ser Arg Val Thr Asn 
145                 150                 155                 160 


Asp Val Asp Thr Ile Gly Gln Ser Leu Asn Gln Ser Val Gly Thr Leu 
                165                 170                 175     


Val Ser Ala Ile Thr Leu Leu Val Gly Ser Leu Ile Met Met Leu Lys 
            180                 185                 190         


Thr Asn Val Phe Met Thr Ile Thr Ala Ile Val Ala Thr Cys Ile Gly 
        195                 200                 205             


Phe Val Leu Met Met Leu Ile Met Gly Lys Ser Gln Lys Tyr Phe Val 
    210                 215                 220                 


Arg Gln Gln Gln His Leu Gly Glu Ile Asn Gly His Ile Glu Glu Ile 
225                 230                 235                 240 


Tyr Asp Gly His Thr Ile Val Lys Ala Tyr Asn Gly Glu Ala Lys Ala 
                245                 250                 255     


Arg Glu Lys Phe Val Tyr Leu Asn Asn Glu Leu Arg Glu Ser Asn Phe 
            260                 265                 270         


Met Ser Gln Cys Leu Ser Gly Leu Met Met Pro Leu Met Ser Phe Ile 
        275                 280                 285             


Gly Asn Phe Gly Tyr Val Ala Val Cys Val Val Gly Ala Val Leu Val 
    290                 295                 300                 


Met Asn Asn Thr Ile Ser Phe Gly Val Ile Val Ala Phe Ile Leu Tyr 
305                 310                 315                 320 


Val Arg Tyr Phe Thr Gln Pro Leu Ser Gln Leu Ala Gln Ala Ala Gln 
                325                 330                 335     


Ser Leu Gln Ser Ala Ala Ala Ala Gly Glu Arg Val Phe Glu Phe Leu 
            340                 345                 350         


Glu Ala Glu Glu Met Glu Asp Glu Leu Ala Lys Thr Arg Lys Leu Glu 
        355                 360                 365             


Asn Val Gln Gly Lys Val Asp Phe Glu His Val Asn Phe Gly Tyr Glu 
    370                 375                 380                 


Gly Ser Asp Lys Ala Ile Ile Asn Asp Phe Ser Val Ser Thr Lys Pro 
385                 390                 395                 400 


Gly Gln Lys Val Ala Ile Val Gly Pro Thr Gly Ala Gly Lys Thr Thr 
                405                 410                 415     


Ile Val Asn Leu Leu Met Arg Phe His Glu Ile Gln Ser Gly Thr Ile 
            420                 425                 430         


Lys Ile Asp Asn Ile Pro Val His Gln Leu Arg Arg Glu Asp Val His 
        435                 440                 445             


Glu Gln Phe Cys Met Val Leu Gln Asp Thr Trp Ile Phe Glu Gly Thr 
    450                 455                 460                 


Val Arg Glu Asn Leu Val Tyr Ser Thr Glu Asn Val Ser Glu Gln Thr 
465                 470                 475                 480 


Leu Glu Glu Ala Cys Lys Ala Val Gly Leu His His Phe Ile Arg Thr 
                485                 490                 495     


Leu Pro His Gly Tyr Asp Thr Met Leu Asn Asp Gln Leu Ser Leu Ser 
            500                 505                 510         


Ala Gly Gln Lys Gln Gln Leu Thr Ile Ala Arg Ala Met Ile Ala Asp 
        515                 520                 525             


Arg Pro Met Leu Ile Leu Asp Glu Ala Thr Ser Ser Val Asp Thr Arg 
    530                 535                 540                 


Thr Glu Leu Ile Ile Gln Asn Ala Met Asp Glu Leu Met Lys Gly Arg 
545                 550                 555                 560 


Thr Ser Phe Ile Ile Ala His Arg Leu Ser Thr Ile Lys Asn Ala Asp 
                565                 570                 575     


Leu Ile Leu Val Met Lys Asp Gly Asp Ile Ile Glu Ser Gly Asn His 
            580                 585                 590         


Asn Glu Leu Met Lys Gln Gly Gly Phe Tyr Ala Asp Leu Tyr Asn Ser 
        595                 600                 605             


Gln Phe Asp Ala Ala 
    610             


<210> 16
<211> 327
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0056; Hypothetical polynucleotide

<400> 16
atgataaaaa agcaaattgt ttatactata atttttttgg tagtactatt tacttcagga       60

tgtgaaaaaa aacaagatta tttaaatgga actgtttcag aagttttcga tacgtatatt      120

atagtaactc caaaagatga tgaattatta aaagagaaag cggaaaaaat tgtagtaact      180

aaaactatag ctctagcaac gggatttcca gatttagatg ttggggacga aattagagta      240

gtatattctg gaataaaaaa agaagatgat ttaatgattt tagatagtgt atttgctgta      300

tataaatcta atgaattaaa atactaa                                          327


<210> 17
<211> 108
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0056; Hypothetical protein

<400> 17
Met Ile Lys Lys Gln Ile Val Tyr Thr Ile Ile Phe Leu Val Val Leu 
1               5                   10                  15      


Phe Thr Ser Gly Cys Glu Lys Lys Gln Asp Tyr Leu Asn Gly Thr Val 
            20                  25                  30          


Ser Glu Val Phe Asp Thr Tyr Ile Ile Val Thr Pro Lys Asp Asp Glu 
        35                  40                  45              


Leu Leu Lys Glu Lys Ala Glu Lys Ile Val Val Thr Lys Thr Ile Ala 
    50                  55                  60                  


Leu Ala Thr Gly Phe Pro Asp Leu Asp Val Gly Asp Glu Ile Arg Val 
65                  70                  75                  80  


Val Tyr Ser Gly Ile Lys Lys Glu Asp Asp Leu Met Ile Leu Asp Ser 
                85                  90                  95      


Val Phe Ala Val Tyr Lys Ser Asn Glu Leu Lys Tyr 
            100                 105             


<210> 18
<211> 624
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1910; TetR family transcriptional regulator

<400> 18
atgggaagag taacggaaaa taaacaagca aaacgaaatt caatctttca ttcttcgttt       60

gatttgtttc ggaccaaagg tttttttcag acatcgatct cggatatcgt gacaaaggct      120

ggagttgcca aaggtacctt ttatctatat tttaaggaca agtatgattt acgagacaag      180

cttattgcat ttaaagcagg aactttgttt catgcagctg aacttaactt aagtgaaaca      240

aatattaagg attttccgga acgactgata tttattgcag atcagattat tgatcggttg      300

gctatggaac ctgattttct aaaatttata tcaaaaaatt taagctgggg cgtatttaag      360

actgcattgt taaagggaca tgaaaccgga gaaacagatt tttataatgc gtatttgcat      420

atggtagagg aaagtcctta tgaatttaag aatcccgaaa ttatgttatt tatgattgtt      480

gaattagtag gttcaagctg ttatagttgt attcttgatg atgaaccatt aacgatggaa      540

gactataagc catatcttta cgaggtaatt cgaggaatca ttcagatgca gatagttaaa      600

aaaaaatcta gcaatgaggt ttaa                                             624


<210> 19
<211> 207
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1910; TetR family transcriptional regulator

<400> 19
Met Gly Arg Val Thr Glu Asn Lys Gln Ala Lys Arg Asn Ser Ile Phe 
1               5                   10                  15      


His Ser Ser Phe Asp Leu Phe Arg Thr Lys Gly Phe Phe Gln Thr Ser 
            20                  25                  30          


Ile Ser Asp Ile Val Thr Lys Ala Gly Val Ala Lys Gly Thr Phe Tyr 
        35                  40                  45              


Leu Tyr Phe Lys Asp Lys Tyr Asp Leu Arg Asp Lys Leu Ile Ala Phe 
    50                  55                  60                  


Lys Ala Gly Thr Leu Phe His Ala Ala Glu Leu Asn Leu Ser Glu Thr 
65                  70                  75                  80  


Asn Ile Lys Asp Phe Pro Glu Arg Leu Ile Phe Ile Ala Asp Gln Ile 
                85                  90                  95      


Ile Asp Arg Leu Ala Met Glu Pro Asp Phe Leu Lys Phe Ile Ser Lys 
            100                 105                 110         


Asn Leu Ser Trp Gly Val Phe Lys Thr Ala Leu Leu Lys Gly His Glu 
        115                 120                 125             


Thr Gly Glu Thr Asp Phe Tyr Asn Ala Tyr Leu His Met Val Glu Glu 
    130                 135                 140                 


Ser Pro Tyr Glu Phe Lys Asn Pro Glu Ile Met Leu Phe Met Ile Val 
145                 150                 155                 160 


Glu Leu Val Gly Ser Ser Cys Tyr Ser Cys Ile Leu Asp Asp Glu Pro 
                165                 170                 175     


Leu Thr Met Glu Asp Tyr Lys Pro Tyr Leu Tyr Glu Val Ile Arg Gly 
            180                 185                 190         


Ile Ile Gln Met Gln Ile Val Lys Lys Lys Ser Ser Asn Glu Val 
        195                 200                 205         


<210> 20
<211> 897
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1914; Hypothetical polynucleotide (AraC-like)

<400> 20
atgcttaatg tcgatagtat aataaagaat gattccgtat ttgaactctc gagtacgata       60

gattcaattt accgaactta tcaaagtgag aaagagcaac ttttggccga ttatgaagtg      120

aaaaagaaag aatgccaaca gttattaata gcgattgata atattagaac tgcttattta      180

accccttaca taaatcagct atatcaggaa ttagaagagt ttggaagtcc tagtaaaaag      240

cccaattata ctgtactttt aacagatcaa tcattaacca aggactggaa agatagtgat      300

attagacgca cctatctgat agctaaaaag cgtttatcaa aatacaaaaa tgattcaaaa      360

gatatagtta caaaatttct ttatgattgg acaagcaata agaaaaagtt agaggtagga      420

aatcaagatc taaagaaatt acgagaattg gtggcattac aaaaacaaga atacttagaa      480

gaaataggaa aagtatcgga gataatacca aagctcaagc tttatcttga tgtattaatc      540

gatttgaaaa ttaatataaa agagattatt ttaccagaaa cagaagggat tcatgcattt      600

ttggtagcaa aacacataac agaacagatt acgattaggc aatacccagg gaccgttaac      660

ttagattcta ttttaaagct agcaaataca tcatatgaaa aacattattt gtttataaaa      720

caagtatttc gtttttattc taatttggtg gaactgttgg aacactcaat cagtattgac      780

ttttgtacaa agaatagcat cacaccgaag gagttttctt ggatactaca aaagaaagaa      840

atcatggaag attctgtcaa tgaaatgaaa aggaatttat tttatataca aaattag         897


<210> 21
<211> 298
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1914; Hypothetical protein (AraC-like)

<400> 21
Met Leu Asn Val Asp Ser Ile Ile Lys Asn Asp Ser Val Phe Glu Leu 
1               5                   10                  15      


Ser Ser Thr Ile Asp Ser Ile Tyr Arg Thr Tyr Gln Ser Glu Lys Glu 
            20                  25                  30          


Gln Leu Leu Ala Asp Tyr Glu Val Lys Lys Lys Glu Cys Gln Gln Leu 
        35                  40                  45              


Leu Ile Ala Ile Asp Asn Ile Arg Thr Ala Tyr Leu Thr Pro Tyr Ile 
    50                  55                  60                  


Asn Gln Leu Tyr Gln Glu Leu Glu Glu Phe Gly Ser Pro Ser Lys Lys 
65                  70                  75                  80  


Pro Asn Tyr Thr Val Leu Leu Thr Asp Gln Ser Leu Thr Lys Asp Trp 
                85                  90                  95      


Lys Asp Ser Asp Ile Arg Arg Thr Tyr Leu Ile Ala Lys Lys Arg Leu 
            100                 105                 110         


Ser Lys Tyr Lys Asn Asp Ser Lys Asp Ile Val Thr Lys Phe Leu Tyr 
        115                 120                 125             


Asp Trp Thr Ser Asn Lys Lys Lys Leu Glu Val Gly Asn Gln Asp Leu 
    130                 135                 140                 


Lys Lys Leu Arg Glu Leu Val Ala Leu Gln Lys Gln Glu Tyr Leu Glu 
145                 150                 155                 160 


Glu Ile Gly Lys Val Ser Glu Ile Ile Pro Lys Leu Lys Leu Tyr Leu 
                165                 170                 175     


Asp Val Leu Ile Asp Leu Lys Ile Asn Ile Lys Glu Ile Ile Leu Pro 
            180                 185                 190         


Glu Thr Glu Gly Ile His Ala Phe Leu Val Ala Lys His Ile Thr Glu 
        195                 200                 205             


Gln Ile Thr Ile Arg Gln Tyr Pro Gly Thr Val Asn Leu Asp Ser Ile 
    210                 215                 220                 


Leu Lys Leu Ala Asn Thr Ser Tyr Glu Lys His Tyr Leu Phe Ile Lys 
225                 230                 235                 240 


Gln Val Phe Arg Phe Tyr Ser Asn Leu Val Glu Leu Leu Glu His Ser 
                245                 250                 255     


Ile Ser Ile Asp Phe Cys Thr Lys Asn Ser Ile Thr Pro Lys Glu Phe 
            260                 265                 270         


Ser Trp Ile Leu Gln Lys Lys Glu Ile Met Glu Asp Ser Val Asn Glu 
        275                 280                 285             


Met Lys Arg Asn Leu Phe Tyr Ile Gln Asn 
    290                 295             


<210> 22
<211> 2496
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0430; glycosyl transferase 36

<400> 22
atgaaatttg gattttttga tgatttaaac aaagaatacg ttatcacaac tcctgaaacg       60

ccgtatcctt ggattaacta tcttggatcg caagcgttct tttccttgat atcaaatact      120

gctggaggtt acagtttcta taaagatgca aaactacgta ggattacaag atttcgttat      180

aataatgtac cattagatct cggtggtggt cgttattact acttgtacga taatggtgat      240

ttttggtctc cgggttttgc ccctgcaaaa aaggaactag aatattatga gtgtcgtcat      300

ggcatgggtt atactaaaat aacaggaaga agaaatggaa ttgaaacaaa gataaccttc      360

tttgtaccat tagactataa tggtgaagta cataaagtgt caattgttaa tacaagtaat      420

caagtgaaga atgtgaaatt gttttctttc ctcgagtggt gtctgtggaa tgcacaagaa      480

gattctacaa attttcaacg aaatttctcg acaggtgaag tagaagttaa agggtctgta      540

atctatcata agacagagta taaagaaaga cgggatcatt atgcattttt ctctgtaaat      600

gcaccgatta gcggattcga ttctgatcgc gagagttttc ttggtaccta tcatggtttt      660

gataacccgc aggtagttgc aaggggagaa tcaaataatt ctgtagcaga tggttggtct      720

ccaatcgcct ctcattgtgt agaaatttcc ttacagccta aggaaagcaa ggaactagta      780

tttattcttg gatatgctga aaataaatta gaagaaaaat gggagtatca agttggtgag      840

gtagataccg atgggttatt actatcttcc gtagcgccaa atggtacgaa tgttattaat      900

aagaagaaag cagaagagat gatagcaaaa tatgctacag taaaggctgt tgatcaagcg      960

ttactggagc ttaaagatta ttgggataaa ctgctttcga aatataatgt aaagtctcac     1020

gatgataaac taaatcgtat ggtgaatatc tggaaccaat atcaatgtat ggtaaccttt     1080

aatttatcaa gaagtgcttc ctactttgaa tctggtattg gaaggggaat gggatttcga     1140

gattccaatc aggatttact tggatttgtt catcagattc cagatcgtgc aagagagaga     1200

atcattgatc ttgcatccac gcaattacca gacggtggag catatcatca gtatcaacca     1260

ttaacgaaaa aaggaaatga tgagattggt ggtaatttca acgatgatcc attgtggctt     1320

attctagcag ttacagctta tataaaggag acaggagact atgatatctt agaagtgaat     1380

actccttatg ataataagtt agaacttgcg aaaccattgt cagatcattt aaaaagagca     1440

tttaaccacg taatagagaa tcttggtcca cacggactac ctttgattgg tcgtgcagat     1500

tggaacgatt gtttgaatct aaactgtttt tccatggatc caggagaatc ttttcaaaca     1560

gttacaagta aggatggtaa agtagcggaa tctgttatga ttgccggaat gtttacttat     1620

attggggaag aatatgctat tcttatggat aaggtaggaa ataacgaaga gtctattcgc     1680

gccatgaaag aagtagagaa tatgcgagat gtaattatga agcatggcta tgatggatct     1740

tggtttttac gtgcctacga tgattttgga agaaaaattg gtagtgatga gtgtgaagaa     1800

ggaaaaatat ttatcgaatc gcaaggcttc tgcgttatgg gaaaatgcgg tcttaacgat     1860

gggaaagcac aaaaagcttt agattctgta gaaaaacgtt tgggtacaaa attcggatta     1920

gttctaaata atccagcatt tacacgatat tatgtggaat atggcgagat ttctacctat     1980

ccagctggat ataaagaaaa tgcaggtatc ttctgtcata acaatgcctg gattatgtgt     2040

gcggaagcag tgataggccg tggggataaa gcctttgatt actatacaaa aattgcacca     2100

gcgtatacag aagaatactc tgaaattcat cgtttggagc catatgtcta cgctcagatg     2160

gttgcaggta aggatgccag acgttttggt gaagcaaaaa attcttggtt aaccggaact     2220

gcatcttgga attttgtagc gatatcacaa tatatacttg gaattaaacc agaatatgat     2280

ggtttaatga ttgaccctgc aatcccaact gattgggagg tatatcaaat aacacgtgaa     2340

ttccgtggtg ataggtatga tattacaata gtaaacccat atcatgtttc gaaaggtgtt     2400

aagaaactag ttgttgatgc gaaagaggtg gagggtaata tcattccggt atttggtgac     2460

ggattagaac ataaagtaaa agtcatctta ggttaa                               2496


<210> 23
<211> 831
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0430; glycosyl transferase 36

<400> 23
Met Lys Phe Gly Phe Phe Asp Asp Leu Asn Lys Glu Tyr Val Ile Thr 
1               5                   10                  15      


Thr Pro Glu Thr Pro Tyr Pro Trp Ile Asn Tyr Leu Gly Ser Gln Ala 
            20                  25                  30          


Phe Phe Ser Leu Ile Ser Asn Thr Ala Gly Gly Tyr Ser Phe Tyr Lys 
        35                  40                  45              


Asp Ala Lys Leu Arg Arg Ile Thr Arg Phe Arg Tyr Asn Asn Val Pro 
    50                  55                  60                  


Leu Asp Leu Gly Gly Gly Arg Tyr Tyr Tyr Leu Tyr Asp Asn Gly Asp 
65                  70                  75                  80  


Phe Trp Ser Pro Gly Phe Ala Pro Ala Lys Lys Glu Leu Glu Tyr Tyr 
                85                  90                  95      


Glu Cys Arg His Gly Met Gly Tyr Thr Lys Ile Thr Gly Arg Arg Asn 
            100                 105                 110         


Gly Ile Glu Thr Lys Ile Thr Phe Phe Val Pro Leu Asp Tyr Asn Gly 
        115                 120                 125             


Glu Val His Lys Val Ser Ile Val Asn Thr Ser Asn Gln Val Lys Asn 
    130                 135                 140                 


Val Lys Leu Phe Ser Phe Leu Glu Trp Cys Leu Trp Asn Ala Gln Glu 
145                 150                 155                 160 


Asp Ser Thr Asn Phe Gln Arg Asn Phe Ser Thr Gly Glu Val Glu Val 
                165                 170                 175     


Lys Gly Ser Val Ile Tyr His Lys Thr Glu Tyr Lys Glu Arg Arg Asp 
            180                 185                 190         


His Tyr Ala Phe Phe Ser Val Asn Ala Pro Ile Ser Gly Phe Asp Ser 
        195                 200                 205             


Asp Arg Glu Ser Phe Leu Gly Thr Tyr His Gly Phe Asp Asn Pro Gln 
    210                 215                 220                 


Val Val Ala Arg Gly Glu Ser Asn Asn Ser Val Ala Asp Gly Trp Ser 
225                 230                 235                 240 


Pro Ile Ala Ser His Cys Val Glu Ile Ser Leu Gln Pro Lys Glu Ser 
                245                 250                 255     


Lys Glu Leu Val Phe Ile Leu Gly Tyr Ala Glu Asn Lys Leu Glu Glu 
            260                 265                 270         


Lys Trp Glu Tyr Gln Val Gly Glu Val Asp Thr Asp Gly Leu Leu Leu 
        275                 280                 285             


Ser Ser Val Ala Pro Asn Gly Thr Asn Val Ile Asn Lys Lys Lys Ala 
    290                 295                 300                 


Glu Glu Met Ile Ala Lys Tyr Ala Thr Val Lys Ala Val Asp Gln Ala 
305                 310                 315                 320 


Leu Leu Glu Leu Lys Asp Tyr Trp Asp Lys Leu Leu Ser Lys Tyr Asn 
                325                 330                 335     


Val Lys Ser His Asp Asp Lys Leu Asn Arg Met Val Asn Ile Trp Asn 
            340                 345                 350         


Gln Tyr Gln Cys Met Val Thr Phe Asn Leu Ser Arg Ser Ala Ser Tyr 
        355                 360                 365             


Phe Glu Ser Gly Ile Gly Arg Gly Met Gly Phe Arg Asp Ser Asn Gln 
    370                 375                 380                 


Asp Leu Leu Gly Phe Val His Gln Ile Pro Asp Arg Ala Arg Glu Arg 
385                 390                 395                 400 


Ile Ile Asp Leu Ala Ser Thr Gln Leu Pro Asp Gly Gly Ala Tyr His 
                405                 410                 415     


Gln Tyr Gln Pro Leu Thr Lys Lys Gly Asn Asp Glu Ile Gly Gly Asn 
            420                 425                 430         


Phe Asn Asp Asp Pro Leu Trp Leu Ile Leu Ala Val Thr Ala Tyr Ile 
        435                 440                 445             


Lys Glu Thr Gly Asp Tyr Asp Ile Leu Glu Val Asn Thr Pro Tyr Asp 
    450                 455                 460                 


Asn Lys Leu Glu Leu Ala Lys Pro Leu Ser Asp His Leu Lys Arg Ala 
465                 470                 475                 480 


Phe Asn His Val Ile Glu Asn Leu Gly Pro His Gly Leu Pro Leu Ile 
                485                 490                 495     


Gly Arg Ala Asp Trp Asn Asp Cys Leu Asn Leu Asn Cys Phe Ser Met 
            500                 505                 510         


Asp Pro Gly Glu Ser Phe Gln Thr Val Thr Ser Lys Asp Gly Lys Val 
        515                 520                 525             


Ala Glu Ser Val Met Ile Ala Gly Met Phe Thr Tyr Ile Gly Glu Glu 
    530                 535                 540                 


Tyr Ala Ile Leu Met Asp Lys Val Gly Asn Asn Glu Glu Ser Ile Arg 
545                 550                 555                 560 


Ala Met Lys Glu Val Glu Asn Met Arg Asp Val Ile Met Lys His Gly 
                565                 570                 575     


Tyr Asp Gly Ser Trp Phe Leu Arg Ala Tyr Asp Asp Phe Gly Arg Lys 
            580                 585                 590         


Ile Gly Ser Asp Glu Cys Glu Glu Gly Lys Ile Phe Ile Glu Ser Gln 
        595                 600                 605             


Gly Phe Cys Val Met Gly Lys Cys Gly Leu Asn Asp Gly Lys Ala Gln 
    610                 615                 620                 


Lys Ala Leu Asp Ser Val Glu Lys Arg Leu Gly Thr Lys Phe Gly Leu 
625                 630                 635                 640 


Val Leu Asn Asn Pro Ala Phe Thr Arg Tyr Tyr Val Glu Tyr Gly Glu 
                645                 650                 655     


Ile Ser Thr Tyr Pro Ala Gly Tyr Lys Glu Asn Ala Gly Ile Phe Cys 
            660                 665                 670         


His Asn Asn Ala Trp Ile Met Cys Ala Glu Ala Val Ile Gly Arg Gly 
        675                 680                 685             


Asp Lys Ala Phe Asp Tyr Tyr Thr Lys Ile Ala Pro Ala Tyr Thr Glu 
    690                 695                 700                 


Glu Tyr Ser Glu Ile His Arg Leu Glu Pro Tyr Val Tyr Ala Gln Met 
705                 710                 715                 720 


Val Ala Gly Lys Asp Ala Arg Arg Phe Gly Glu Ala Lys Asn Ser Trp 
                725                 730                 735     


Leu Thr Gly Thr Ala Ser Trp Asn Phe Val Ala Ile Ser Gln Tyr Ile 
            740                 745                 750         


Leu Gly Ile Lys Pro Glu Tyr Asp Gly Leu Met Ile Asp Pro Ala Ile 
        755                 760                 765             


Pro Thr Asp Trp Glu Val Tyr Gln Ile Thr Arg Glu Phe Arg Gly Asp 
    770                 775                 780                 


Arg Tyr Asp Ile Thr Ile Val Asn Pro Tyr His Val Ser Lys Gly Val 
785                 790                 795                 800 


Lys Lys Leu Val Val Asp Ala Lys Glu Val Glu Gly Asn Ile Ile Pro 
                805                 810                 815     


Val Phe Gly Asp Gly Leu Glu His Lys Val Lys Val Ile Leu Gly 
            820                 825                 830     


<210> 24
<211> 849
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_2337; diaminopimelate epimerase

<400> 24
atggaatttg tctttgagaa atatcacgga aatggaaacg attatttagt atttgatacc       60

aaaaaatttg agtatgagct gcaaggttca caaattcgtg aaatatgtga tcgtcacttt      120

ggggtaggtt ctgacggtat tttaatgggg ccttatgaaa aggaagatgg tatttatgta      180

cgtatcttta atccagatgg tagtgaagca gagaagagtg gaaatggaat cagtatattt      240

gcacagttct taaaagacca taagtatgta acagaggatg taattacgat caatacctta      300

ggagggccaa tcgaaattca ttacatcaat gataaaaatg gtaccttaaa tgttgcgatg      360

ggtaaagttt cttatatgag tgatgaaatt cctgtaacag gtgaatctag agaagtgatt      420

gaagaagcaa tggaatttca aggacagaag gtaatcacta cctgtttaac cattggaaat      480

ccacattgtg taattataag agacgaatta gttaagaaag aagttaagag cttaggcaaa      540

attattgaat ctgatgaaca cttccctaat aaaatcaatg tgcagtttat gcaagtttta      600

aatcgggaag aaatcgtgat tgaaatctat gagcgtggag ccggatatac tttatcttct      660

ggtagtagca gttgtgccgc agctaatgtt gcctataaat tagggttaac agaccgtaaa      720

gtgaaagttc atatgcctgg tggaactgta gaggtcgaaa tcgaggaaga tggtatgact      780

tttatgacta gtcatgtaaa acgtatcgga aagattatta ccgcagatgt ttttatagaa      840

gaattataa                                                              849


<210> 25
<211> 282
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_2337; diaminopimelate epimerase

<400> 25
Met Glu Phe Val Phe Glu Lys Tyr His Gly Asn Gly Asn Asp Tyr Leu 
1               5                   10                  15      


Val Phe Asp Thr Lys Lys Phe Glu Tyr Glu Leu Gln Gly Ser Gln Ile 
            20                  25                  30          


Arg Glu Ile Cys Asp Arg His Phe Gly Val Gly Ser Asp Gly Ile Leu 
        35                  40                  45              


Met Gly Pro Tyr Glu Lys Glu Asp Gly Ile Tyr Val Arg Ile Phe Asn 
    50                  55                  60                  


Pro Asp Gly Ser Glu Ala Glu Lys Ser Gly Asn Gly Ile Ser Ile Phe 
65                  70                  75                  80  


Ala Gln Phe Leu Lys Asp His Lys Tyr Val Thr Glu Asp Val Ile Thr 
                85                  90                  95      


Ile Asn Thr Leu Gly Gly Pro Ile Glu Ile His Tyr Ile Asn Asp Lys 
            100                 105                 110         


Asn Gly Thr Leu Asn Val Ala Met Gly Lys Val Ser Tyr Met Ser Asp 
        115                 120                 125             


Glu Ile Pro Val Thr Gly Glu Ser Arg Glu Val Ile Glu Glu Ala Met 
    130                 135                 140                 


Glu Phe Gln Gly Gln Lys Val Ile Thr Thr Cys Leu Thr Ile Gly Asn 
145                 150                 155                 160 


Pro His Cys Val Ile Ile Arg Asp Glu Leu Val Lys Lys Glu Val Lys 
                165                 170                 175     


Ser Leu Gly Lys Ile Ile Glu Ser Asp Glu His Phe Pro Asn Lys Ile 
            180                 185                 190         


Asn Val Gln Phe Met Gln Val Leu Asn Arg Glu Glu Ile Val Ile Glu 
        195                 200                 205             


Ile Tyr Glu Arg Gly Ala Gly Tyr Thr Leu Ser Ser Gly Ser Ser Ser 
    210                 215                 220                 


Cys Ala Ala Ala Asn Val Ala Tyr Lys Leu Gly Leu Thr Asp Arg Lys 
225                 230                 235                 240 


Val Lys Val His Met Pro Gly Gly Thr Val Glu Val Glu Ile Glu Glu 
                245                 250                 255     


Asp Gly Met Thr Phe Met Thr Ser His Val Lys Arg Ile Gly Lys Ile 
            260                 265                 270         


Ile Thr Ala Asp Val Phe Ile Glu Glu Leu 
        275                 280         


<210> 26
<211> 1131
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1048; oxidoreductase domain-containing polynucleotide

<400> 26
atgaatcaga acactaaagt aagaatagca ctgattggcg ttggtagtat ggggaaaaag       60

tatgctacta tgttagacat ggataagata tcaaatctta ctttaacagc cgtttgttgt      120

agaagtaaag aaaatcaaag ttgggtaaag gaaaacctaa gtgagagcgt taagttatat      180

ccaagtagtg aagaattatt tgcacatcat gaagaatacg atgcagtact aattgtaact      240

cctcacaaac aacatccagg actagcaata aaagcatttg aattaaagaa acatgtattt      300

tgtgataaac ccgctggagt ttctttacta gacgcgcaga gaatggaaca ggcacaaaag      360

gaatctggat gtaagtatgc tatgatgttt cataatcgaa cctatcctgt cttaaaaaag      420

gtaaagcagt tattggacga tgggtttgtt ggagaattaa aacgtataca acttgtgaac      480

accatttact atcgtacgga atattatcac caatccggtg attggagaag cagttggcat      540

ggggaaggtg gaggagcact catcaaccaa ggacaacata ttttagatta ttggcaatgg      600

ctattcggaa tgccatattc aatctatgct tctattccgt ttggaaaata caattccttt      660

gccgtagatg atgaggcgac tttactaatg gaatacccaa acaaagtgac agccaccttt      720

cttctttcta cgggtgaaat tccaaaagaa gaggtattaa ccgtagtagg tacgaaaggt      780

tgtattaaag taactggaaa tgaaattgag ctcacgtgtt atagtatgga ctctatgcaa      840

tatggaaaaa ctgcaaaaac taattcaaga gaagagatac tgcaaacaaa ggaattattt      900

atatgcaatg aaccaaagga gtcttatgag caaatgcttg taaactttgg ggaagcaatc      960

cttcgtggga aagctcttat tgcgcccgga gaagaaggta caaaggcatt agagttaaca     1020

aatgctgctt acctttctgc atgtttggga gagcgtgtaa ttttaccaat cgatagtact     1080

cagtatgaag aactattaaa gaaaatgatt gaaaatgaga aagttcaata g              1131


<210> 27
<211> 376
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1048; oxidoreductase domain-containing protein

<400> 27
Met Asn Gln Asn Thr Lys Val Arg Ile Ala Leu Ile Gly Val Gly Ser 
1               5                   10                  15      


Met Gly Lys Lys Tyr Ala Thr Met Leu Asp Met Asp Lys Ile Ser Asn 
            20                  25                  30          


Leu Thr Leu Thr Ala Val Cys Cys Arg Ser Lys Glu Asn Gln Ser Trp 
        35                  40                  45              


Val Lys Glu Asn Leu Ser Glu Ser Val Lys Leu Tyr Pro Ser Ser Glu 
    50                  55                  60                  


Glu Leu Phe Ala His His Glu Glu Tyr Asp Ala Val Leu Ile Val Thr 
65                  70                  75                  80  


Pro His Lys Gln His Pro Gly Leu Ala Ile Lys Ala Phe Glu Leu Lys 
                85                  90                  95      


Lys His Val Phe Cys Asp Lys Pro Ala Gly Val Ser Leu Leu Asp Ala 
            100                 105                 110         


Gln Arg Met Glu Gln Ala Gln Lys Glu Ser Gly Cys Lys Tyr Ala Met 
        115                 120                 125             


Met Phe His Asn Arg Thr Tyr Pro Val Leu Lys Lys Val Lys Gln Leu 
    130                 135                 140                 


Leu Asp Asp Gly Phe Val Gly Glu Leu Lys Arg Ile Gln Leu Val Asn 
145                 150                 155                 160 


Thr Ile Tyr Tyr Arg Thr Glu Tyr Tyr His Gln Ser Gly Asp Trp Arg 
                165                 170                 175     


Ser Ser Trp His Gly Glu Gly Gly Gly Ala Leu Ile Asn Gln Gly Gln 
            180                 185                 190         


His Ile Leu Asp Tyr Trp Gln Trp Leu Phe Gly Met Pro Tyr Ser Ile 
        195                 200                 205             


Tyr Ala Ser Ile Pro Phe Gly Lys Tyr Asn Ser Phe Ala Val Asp Asp 
    210                 215                 220                 


Glu Ala Thr Leu Leu Met Glu Tyr Pro Asn Lys Val Thr Ala Thr Phe 
225                 230                 235                 240 


Leu Leu Ser Thr Gly Glu Ile Pro Lys Glu Glu Val Leu Thr Val Val 
                245                 250                 255     


Gly Thr Lys Gly Cys Ile Lys Val Thr Gly Asn Glu Ile Glu Leu Thr 
            260                 265                 270         


Cys Tyr Ser Met Asp Ser Met Gln Tyr Gly Lys Thr Ala Lys Thr Asn 
        275                 280                 285             


Ser Arg Glu Glu Ile Leu Gln Thr Lys Glu Leu Phe Ile Cys Asn Glu 
    290                 295                 300                 


Pro Lys Glu Ser Tyr Glu Gln Met Leu Val Asn Phe Gly Glu Ala Ile 
305                 310                 315                 320 


Leu Arg Gly Lys Ala Leu Ile Ala Pro Gly Glu Glu Gly Thr Lys Ala 
                325                 330                 335     


Leu Glu Leu Thr Asn Ala Ala Tyr Leu Ser Ala Cys Leu Gly Glu Arg 
            340                 345                 350         


Val Ile Leu Pro Ile Asp Ser Thr Gln Tyr Glu Glu Leu Leu Lys Lys 
        355                 360                 365             


Met Ile Glu Asn Glu Lys Val Gln 
    370                 375     


<210> 28
<211> 603
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_2965; Hypothetical polynucleotide

<400> 28
atgcaatggt tattagactt aatagcaaaa cacacaaaag agggagtact agatacagaa       60

gctcttacaa aagaactcaa tacagagttc ccaaagcatg cagtaccaaa gacagagttt      120

aatactgtca atgaacaact taagactgca aataatacaa ttactgattt aaagaagagc      180

aatggtgaca acgagacatt gcagacaacc attaagactc acgaaactac tatagcaaca      240

cttaaggctg aatctgaaaa ggttaagaag gagtatgcct taaaggataa gctgaaggat      300

ttaggggtta ctgatgctga ctatctgatt tacaagcatg gtggaattga taagtttaac      360

tatgataagg acggtaatct gataggcctt gaagattcca taaagccata caaagaatct      420

cttccacata tttttaagaa tgccaaatca ggaactgatt acaatcctgc aggtggagga      480

agttataccg gaaaaaatcc atttgcaaaa gattctttca acttgacgga gcaagggaaa      540

ctattaaaag aaaacccggc acaagctaag gagttagcaa gtgctgccgg aataacaatt      600

taa                                                                    603


<210> 29
<211> 200
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_2965; Hypothetical protein

<400> 29
Met Gln Trp Leu Leu Asp Leu Ile Ala Lys His Thr Lys Glu Gly Val 
1               5                   10                  15      


Leu Asp Thr Glu Ala Leu Thr Lys Glu Leu Asn Thr Glu Phe Pro Lys 
            20                  25                  30          


His Ala Val Pro Lys Thr Glu Phe Asn Thr Val Asn Glu Gln Leu Lys 
        35                  40                  45              


Thr Ala Asn Asn Thr Ile Thr Asp Leu Lys Lys Ser Asn Gly Asp Asn 
    50                  55                  60                  


Glu Thr Leu Gln Thr Thr Ile Lys Thr His Glu Thr Thr Ile Ala Thr 
65                  70                  75                  80  


Leu Lys Ala Glu Ser Glu Lys Val Lys Lys Glu Tyr Ala Leu Lys Asp 
                85                  90                  95      


Lys Leu Lys Asp Leu Gly Val Thr Asp Ala Asp Tyr Leu Ile Tyr Lys 
            100                 105                 110         


His Gly Gly Ile Asp Lys Phe Asn Tyr Asp Lys Asp Gly Asn Leu Ile 
        115                 120                 125             


Gly Leu Glu Asp Ser Ile Lys Pro Tyr Lys Glu Ser Leu Pro His Ile 
    130                 135                 140                 


Phe Lys Asn Ala Lys Ser Gly Thr Asp Tyr Asn Pro Ala Gly Gly Gly 
145                 150                 155                 160 


Ser Tyr Thr Gly Lys Asn Pro Phe Ala Lys Asp Ser Phe Asn Leu Thr 
                165                 170                 175     


Glu Gln Gly Lys Leu Leu Lys Glu Asn Pro Ala Gln Ala Lys Glu Leu 
            180                 185                 190         


Ala Ser Ala Ala Gly Ile Thr Ile 
        195                 200 


<210> 30
<211> 375
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0987; desulfoferrodoxin ferrous iron-binding region

<400> 30
atgatgaaat tttataaatg cgatgacgac agtctaatta ttccagttac gaatacttat       60

gcttctgttg aataccttaa gaacgcggaa gaaataagcc ctaatactat ggaggcaagt      120

acggagaaac acatccctat tgtcacctgt agcggtaata ccgtaaaagt aaacgttggt      180

agtgttgctc atccaatgac agaagaacat tctatcacga ccgtcatttt agaaactcgg      240

agtggtggac agtataaatt cttacaccat ggagatgaac caatcgttca ttttgatgta      300

tcctctggtg ataaggcaat ggctgcttat gcttactgca atttgcatgg tctatggaaa      360

gcagatattg attaa                                                       375


<210> 31
<211> 124
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0987; desulfoferrodoxin ferrous iron-binding region

<400> 31
Met Met Lys Phe Tyr Lys Cys Asp Asp Asp Ser Leu Ile Ile Pro Val 
1               5                   10                  15      


Thr Asn Thr Tyr Ala Ser Val Glu Tyr Leu Lys Asn Ala Glu Glu Ile 
            20                  25                  30          


Ser Pro Asn Thr Met Glu Ala Ser Thr Glu Lys His Ile Pro Ile Val 
        35                  40                  45              


Thr Cys Ser Gly Asn Thr Val Lys Val Asn Val Gly Ser Val Ala His 
    50                  55                  60                  


Pro Met Thr Glu Glu His Ser Ile Thr Thr Val Ile Leu Glu Thr Arg 
65                  70                  75                  80  


Ser Gly Gly Gln Tyr Lys Phe Leu His His Gly Asp Glu Pro Ile Val 
                85                  90                  95      


His Phe Asp Val Ser Ser Gly Asp Lys Ala Met Ala Ala Tyr Ala Tyr 
            100                 105                 110         


Cys Asn Leu His Gly Leu Trp Lys Ala Asp Ile Asp 
        115                 120                 


<210> 32
<211> 3792
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_1063; Hypothetical polynucleotide

<400> 32
atgaaacatc gagttactgc attgcttctt tgtctaacta tgttattagg aataataggt       60

atccatcctg taacagtaaa ggcaagtgat ccatctttcc aattaatgat aggaacaaaa      120

gatggagaga agaaaacaat tgattttgta aatcctgtat ttacagcaag tggaggggcc      180

atacaaatta agaattctat atcgcttttt tctacaggga acaaagcata tgttagtgag      240

ggtacgatta agactaatgt cgctgtagtt gtaggtagcg acatgaatgt tattcaagtc      300

attaatcaat ccagtaatgg tgcaaaacca tcatttacag aatctactga tgttatgcca      360

ccggagggag gatttgtttt actagcttac gatgactcct atgcgaatgc agggtttaag      420

tcctttttag caacaaagtt ccaagctggt gatagtgtaa agctatacaa tggagatgtt      480

gaaatatcac tagaggaagc actcaacagc taccctgtgg aaattccaga cggtggaaat      540

aataataatc aaacagattt tacatatacc ttagaaaact tagcagttga gactgcattc      600

gaccctaaca aaggaagtac tggtgaaaca aaagcaattg attttgtaaa tcctgacttt      660

tcaggaccag tagcagttaa aaactcaata tccatattta cctatggtaa tcgtccatat      720

gtcagcgctg gatctatact aaccaatgtt gcagttgtag ttgataaaaa tatgactgta      780

atccaggtga tcaatcaatc gatcaatgga ggaaaaccat cattttcaga atctacagat      840

gttgctgtac cagcgggtgg ttttattcta ctagcatgcg atgacagtta tgcaaacgca      900

ggatataaga gttttcttgc tactaagttt aaagctggcg atgctataaa attgaaagta      960

aatgaaaaga cagtaactgt ggcggacatt ttacagttaa ccggccaaaa tgggaatatt     1020

aagaagcatg cttccttatc gattgcacag gaaggtatgc aaaccattac agaaagttct     1080

ttcacaatct caggtaaagt taataactta gaagatagtg taacttattc cgtaagagta     1140

gatcagataa ataatgggaa ggtatatcca ggtgtagttg gggcagatgg atcttacact     1200

gttccagtat cagtagatcc aggagcgaat tactttgata taacattaat tgaaaatgga     1260

gtagattatt ctgaaagtac aaagagtgta atcctattcc agagagttag aacatcagag     1320

aataaaccta ttattttatg gattgaccag tttgcgagcg ttaaaaattt aaatactgta     1380

gaaaagattc aaaagatgat ggcaaatgcg aaaagagctg gaatcactgc attagcattc     1440

gacgtaaagg gagtcgaagg ttatgtctcc tataaaaagg caaccgtaag caatacacca     1500

tatatgacag aaaccaaaaa tccaaacaaa gcagttgcta tggatattga ttttcttgag     1560

gagatgctag cagaagcaca tgcaaatgga attaagttat atgctagttc taatttcttt     1620

acggaaggta acattgcaac aaatgattat gcttttgata ttagaaatac acatcctgat     1680

tgggcagaag tatttcagac tccagaagat aaaggagagt taaaaagcat tctaaattcc     1740

tccagaaact ccaccttact ttttgtaaat ccagcgaatg aagaagttag ggcacatgag     1800

ttggctatag taaaagatgt acttgaaaac tatgctgttg atggtatcat cttagaccgt     1860

gctagatatg ataatcagta tgcagatttt agtaatttga gcaaagagca atttatggca     1920

taccttcagg gaaaaggtaa aacattgcag aactggccag acgatgcgtt caagattaaa     1980

gcagatggct ctatggtaac cggacagcat tatcttgaat ggttatccta tcgtagtact     2040

gtgattgaaa gctttgtttc tgaagtaaga accctaatag atcaatataa aacatctcaa     2100

aatcgaaaca tagatttagc agcatatgtt ggatcatggt atgagtcata ttatcaaaat     2160

ggcgttaact gggcagattc ttcttttgaa tacaatgaac gccttggttt ccctatggaa     2220

gaactatatg cgaaagagtt tgaatattct aagactagtt atgttaaaca tattgacttt     2280

attatgacag gatgttatta cacaaccgaa gcattgatgc aaaaatatac aacattaaat     2340

aacatcttaa taaataacca agttccatta tatgcttcta ttgatttaac gaatttatca     2400

gaagcaccag accagagaat gatattccaa gcagcgtatc agcatagcga aggatcaatg     2460

atctttgatt tgtgttttgt tgattgggat aaaattcgat gcgcgatagc agatattgaa     2520

tataagaatt cagcagtgat tggggtttac gatcctaaga ctaagaatgt tttaacagtt     2580

gataacattg atactgccag agcagaagat aaattaacaa tttataccga tgcctatggt     2640

acaagtacag gtaccaatca atggggtgta gaagttgtag tagatgcgaa aggtaatgta     2700

atagaattaa aaaaccagaa acaagcggca gactggaact gggcaacacc agaaattaat     2760

gatagtacaa tacctgtagg tggatttgta ttatcaacag ttgatcgttc tggatctcgt     2820

acttatagac agcttctagc aaattcattc catgtcggtg ataaagttgc agcagccatc     2880

ttaactggat tcgttgatta tgaagaaaaa gtatatactt ctgcaacagc ggatattgaa     2940

gtgaaggtac aaacctttgg ccaagatcaa aatactgttg taaagattgg tgataaagaa     3000

gcaatcgcga aagaagataa taattattta gcgaagctta acttaagcaa tggtgtcaat     3060

ttgataccaa ttaccgttta tgtggatggc cttaatgttt tagagaaaac tatatcactt     3120

acagctacat taacacctgt aacaccaacg ataactcctc cagtaaccgg gccatctgaa     3180

aatggaaatg gaaatggtaa agataagact gatgataaga atgttgttat agatgacaaa     3240

atgcttgata atatttctat aaatgcttct aactttaagc tttatgtggg tggtactaaa     3300

gactatgagc gtcagcttaa ggttaattta ccaaactcta tcatgaagct tgagaaagaa     3360

aaaaaggcaa ttgttgaaat tacttatcaa tcttcaaatc ctaaggtagc taaagtaggt     3420

aacgatggaa atattacagc agtagctgtt ggtaaagcag taattactac caccgtcaca     3480

gtgaatgata agactactac gttcgaaaca gttgtaaacg ttttgaaagc ttccattaag     3540

atagtatacg atgaaagtac aataaaagtt ggtacaaaag tgactgcaca ttgtatggca     3600

agtggctatg atatttcaaa aatacagtgg ggaactacga agaaaaagat tgctgttgtt     3660

ggtaagaata ccggcaatca aaaggtaaat gtctccacac agtccgctgg aaaagatgtt     3720

gttattgtgt atgtcatgaa tggaaaagaa aaagtagtat tagcagagaa ggatattaca     3780

atagtgaaat aa                                                         3792


<210> 33
<211> 1263
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_1063; Hypothetical protein

<400> 33
Met Lys His Arg Val Thr Ala Leu Leu Leu Cys Leu Thr Met Leu Leu 
1               5                   10                  15      


Gly Ile Ile Gly Ile His Pro Val Thr Val Lys Ala Ser Asp Pro Ser 
            20                  25                  30          


Phe Gln Leu Met Ile Gly Thr Lys Asp Gly Glu Lys Lys Thr Ile Asp 
        35                  40                  45              


Phe Val Asn Pro Val Phe Thr Ala Ser Gly Gly Ala Ile Gln Ile Lys 
    50                  55                  60                  


Asn Ser Ile Ser Leu Phe Ser Thr Gly Asn Lys Ala Tyr Val Ser Glu 
65                  70                  75                  80  


Gly Thr Ile Lys Thr Asn Val Ala Val Val Val Gly Ser Asp Met Asn 
                85                  90                  95      


Val Ile Gln Val Ile Asn Gln Ser Ser Asn Gly Ala Lys Pro Ser Phe 
            100                 105                 110         


Thr Glu Ser Thr Asp Val Met Pro Pro Glu Gly Gly Phe Val Leu Leu 
        115                 120                 125             


Ala Tyr Asp Asp Ser Tyr Ala Asn Ala Gly Phe Lys Ser Phe Leu Ala 
    130                 135                 140                 


Thr Lys Phe Gln Ala Gly Asp Ser Val Lys Leu Tyr Asn Gly Asp Val 
145                 150                 155                 160 


Glu Ile Ser Leu Glu Glu Ala Leu Asn Ser Tyr Pro Val Glu Ile Pro 
                165                 170                 175     


Asp Gly Gly Asn Asn Asn Asn Gln Thr Asp Phe Thr Tyr Thr Leu Glu 
            180                 185                 190         


Asn Leu Ala Val Glu Thr Ala Phe Asp Pro Asn Lys Gly Ser Thr Gly 
        195                 200                 205             


Glu Thr Lys Ala Ile Asp Phe Val Asn Pro Asp Phe Ser Gly Pro Val 
    210                 215                 220                 


Ala Val Lys Asn Ser Ile Ser Ile Phe Thr Tyr Gly Asn Arg Pro Tyr 
225                 230                 235                 240 


Val Ser Ala Gly Ser Ile Leu Thr Asn Val Ala Val Val Val Asp Lys 
                245                 250                 255     


Asn Met Thr Val Ile Gln Val Ile Asn Gln Ser Ile Asn Gly Gly Lys 
            260                 265                 270         


Pro Ser Phe Ser Glu Ser Thr Asp Val Ala Val Pro Ala Gly Gly Phe 
        275                 280                 285             


Ile Leu Leu Ala Cys Asp Asp Ser Tyr Ala Asn Ala Gly Tyr Lys Ser 
    290                 295                 300                 


Phe Leu Ala Thr Lys Phe Lys Ala Gly Asp Ala Ile Lys Leu Lys Val 
305                 310                 315                 320 


Asn Glu Lys Thr Val Thr Val Ala Asp Ile Leu Gln Leu Thr Gly Gln 
                325                 330                 335     


Asn Gly Asn Ile Lys Lys His Ala Ser Leu Ser Ile Ala Gln Glu Gly 
            340                 345                 350         


Met Gln Thr Ile Thr Glu Ser Ser Phe Thr Ile Ser Gly Lys Val Asn 
        355                 360                 365             


Asn Leu Glu Asp Ser Val Thr Tyr Ser Val Arg Val Asp Gln Ile Asn 
    370                 375                 380                 


Asn Gly Lys Val Tyr Pro Gly Val Val Gly Ala Asp Gly Ser Tyr Thr 
385                 390                 395                 400 


Val Pro Val Ser Val Asp Pro Gly Ala Asn Tyr Phe Asp Ile Thr Leu 
                405                 410                 415     


Ile Glu Asn Gly Val Asp Tyr Ser Glu Ser Thr Lys Ser Val Ile Leu 
            420                 425                 430         


Phe Gln Arg Val Arg Thr Ser Glu Asn Lys Pro Ile Ile Leu Trp Ile 
        435                 440                 445             


Asp Gln Phe Ala Ser Val Lys Asn Leu Asn Thr Val Glu Lys Ile Gln 
    450                 455                 460                 


Lys Met Met Ala Asn Ala Lys Arg Ala Gly Ile Thr Ala Leu Ala Phe 
465                 470                 475                 480 


Asp Val Lys Gly Val Glu Gly Tyr Val Ser Tyr Lys Lys Ala Thr Val 
                485                 490                 495     


Ser Asn Thr Pro Tyr Met Thr Glu Thr Lys Asn Pro Asn Lys Ala Val 
            500                 505                 510         


Ala Met Asp Ile Asp Phe Leu Glu Glu Met Leu Ala Glu Ala His Ala 
        515                 520                 525             


Asn Gly Ile Lys Leu Tyr Ala Ser Ser Asn Phe Phe Thr Glu Gly Asn 
    530                 535                 540                 


Ile Ala Thr Asn Asp Tyr Ala Phe Asp Ile Arg Asn Thr His Pro Asp 
545                 550                 555                 560 


Trp Ala Glu Val Phe Gln Thr Pro Glu Asp Lys Gly Glu Leu Lys Ser 
                565                 570                 575     


Ile Leu Asn Ser Ser Arg Asn Ser Thr Leu Leu Phe Val Asn Pro Ala 
            580                 585                 590         


Asn Glu Glu Val Arg Ala His Glu Leu Ala Ile Val Lys Asp Val Leu 
        595                 600                 605             


Glu Asn Tyr Ala Val Asp Gly Ile Ile Leu Asp Arg Ala Arg Tyr Asp 
    610                 615                 620                 


Asn Gln Tyr Ala Asp Phe Ser Asn Leu Ser Lys Glu Gln Phe Met Ala 
625                 630                 635                 640 


Tyr Leu Gln Gly Lys Gly Lys Thr Leu Gln Asn Trp Pro Asp Asp Ala 
                645                 650                 655     


Phe Lys Ile Lys Ala Asp Gly Ser Met Val Thr Gly Gln His Tyr Leu 
            660                 665                 670         


Glu Trp Leu Ser Tyr Arg Ser Thr Val Ile Glu Ser Phe Val Ser Glu 
        675                 680                 685             


Val Arg Thr Leu Ile Asp Gln Tyr Lys Thr Ser Gln Asn Arg Asn Ile 
    690                 695                 700                 


Asp Leu Ala Ala Tyr Val Gly Ser Trp Tyr Glu Ser Tyr Tyr Gln Asn 
705                 710                 715                 720 


Gly Val Asn Trp Ala Asp Ser Ser Phe Glu Tyr Asn Glu Arg Leu Gly 
                725                 730                 735     


Phe Pro Met Glu Glu Leu Tyr Ala Lys Glu Phe Glu Tyr Ser Lys Thr 
            740                 745                 750         


Ser Tyr Val Lys His Ile Asp Phe Ile Met Thr Gly Cys Tyr Tyr Thr 
        755                 760                 765             


Thr Glu Ala Leu Met Gln Lys Tyr Thr Thr Leu Asn Asn Ile Leu Ile 
    770                 775                 780                 


Asn Asn Gln Val Pro Leu Tyr Ala Ser Ile Asp Leu Thr Asn Leu Ser 
785                 790                 795                 800 


Glu Ala Pro Asp Gln Arg Met Ile Phe Gln Ala Ala Tyr Gln His Ser 
                805                 810                 815     


Glu Gly Ser Met Ile Phe Asp Leu Cys Phe Val Asp Trp Asp Lys Ile 
            820                 825                 830         


Arg Cys Ala Ile Ala Asp Ile Glu Tyr Lys Asn Ser Ala Val Ile Gly 
        835                 840                 845             


Val Tyr Asp Pro Lys Thr Lys Asn Val Leu Thr Val Asp Asn Ile Asp 
    850                 855                 860                 


Thr Ala Arg Ala Glu Asp Lys Leu Thr Ile Tyr Thr Asp Ala Tyr Gly 
865                 870                 875                 880 


Thr Ser Thr Gly Thr Asn Gln Trp Gly Val Glu Val Val Val Asp Ala 
                885                 890                 895     


Lys Gly Asn Val Ile Glu Leu Lys Asn Gln Lys Gln Ala Ala Asp Trp 
            900                 905                 910         


Asn Trp Ala Thr Pro Glu Ile Asn Asp Ser Thr Ile Pro Val Gly Gly 
        915                 920                 925             


Phe Val Leu Ser Thr Val Asp Arg Ser Gly Ser Arg Thr Tyr Arg Gln 
    930                 935                 940                 


Leu Leu Ala Asn Ser Phe His Val Gly Asp Lys Val Ala Ala Ala Ile 
945                 950                 955                 960 


Leu Thr Gly Phe Val Asp Tyr Glu Glu Lys Val Tyr Thr Ser Ala Thr 
                965                 970                 975     


Ala Asp Ile Glu Val Lys Val Gln Thr Phe Gly Gln Asp Gln Asn Thr 
            980                 985                 990         


Val Val Lys Ile Gly Asp Lys Glu  Ala Ile Ala Lys Glu  Asp Asn Asn 
        995                 1000                 1005             


Tyr Leu  Ala Lys Leu Asn Leu  Ser Asn Gly Val Asn  Leu Ile Pro 
    1010                 1015                 1020             


Ile Thr  Val Tyr Val Asp Gly  Leu Asn Val Leu Glu  Lys Thr Ile 
    1025                 1030                 1035             


Ser Leu  Thr Ala Thr Leu Thr  Pro Val Thr Pro Thr  Ile Thr Pro 
    1040                 1045                 1050             


Pro Val  Thr Gly Pro Ser Glu  Asn Gly Asn Gly Asn  Gly Lys Asp 
    1055                 1060                 1065             


Lys Thr  Asp Asp Lys Asn Val  Val Ile Asp Asp Lys  Met Leu Asp 
    1070                 1075                 1080             


Asn Ile  Ser Ile Asn Ala Ser  Asn Phe Lys Leu Tyr  Val Gly Gly 
    1085                 1090                 1095             


Thr Lys  Asp Tyr Glu Arg Gln  Leu Lys Val Asn Leu  Pro Asn Ser 
    1100                 1105                 1110             


Ile Met  Lys Leu Glu Lys Glu  Lys Lys Ala Ile Val  Glu Ile Thr 
    1115                 1120                 1125             


Tyr Gln  Ser Ser Asn Pro Lys  Val Ala Lys Val Gly  Asn Asp Gly 
    1130                 1135                 1140             


Asn Ile  Thr Ala Val Ala Val  Gly Lys Ala Val Ile  Thr Thr Thr 
    1145                 1150                 1155             


Val Thr  Val Asn Asp Lys Thr  Thr Thr Phe Glu Thr  Val Val Asn 
    1160                 1165                 1170             


Val Leu  Lys Ala Ser Ile Lys  Ile Val Tyr Asp Glu  Ser Thr Ile 
    1175                 1180                 1185             


Lys Val  Gly Thr Lys Val Thr  Ala His Cys Met Ala  Ser Gly Tyr 
    1190                 1195                 1200             


Asp Ile  Ser Lys Ile Gln Trp  Gly Thr Thr Lys Lys  Lys Ile Ala 
    1205                 1210                 1215             


Val Val  Gly Lys Asn Thr Gly  Asn Gln Lys Val Asn  Val Ser Thr 
    1220                 1225                 1230             


Gln Ser  Ala Gly Lys Asp Val  Val Ile Val Tyr Val  Met Asn Gly 
    1235                 1240                 1245             


Lys Glu  Lys Val Val Leu Ala  Glu Lys Asp Ile Thr  Ile Val Lys 
    1250                 1255                 1260             


<210> 34
<211> 2301
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0928; AraC family transcriptional regulator

<400> 34
atgatatcta tatccatgaa agaaaaaatt cttttatcat tcaagaaaca taatttttac       60

tatcgcattc ttcttccatt ttctatcttc agcattacta tagtctgcct aatgtcagga      120

ataaattggt accgaataga aaacgaatat aataaaaaga ttattgaatc caaccaacag      180

tttctaaaga gggcttctca gctaagtgat caatatcttt atggtaattt cacttccatt      240

ttaaatagtt cttttttgga cggttttaag acttcaaaaa tggaccgttt tattacatat      300

ggagatagac taaaaccttc tgaatttcta aatttacacc agagtataac taatatctgt      360

gctcaaaact cctcagtaat tcaggtttct ttatatcagt atgcatcaga cctttatcta      420

gatagttttg aaggtcttgt ttataatgct tccaaaagac cattgaatag tgatcaatca      480

cttaagaatt acctatccac catttccggg catgagaaag ggattcttta ctacacttct      540

ggtcccaatt cttttcctgt aaagaaaata gttatgctac gttccgttcc cttatatacc      600

aattttacaa acggaagtgg atttattgca attactttgg atactaatag cctttgggaa      660

caattaaata tgtccactac ctccaaagat gattcttttt ttattttaga ctctgaaaaa      720

aagcttctct tggaaaaaag cccatcctct atttcctatg aatatctaaa aagtgtgtcg      780

gactccaccg aaatttctgc atttttaact tataacggta ttcgttaccg tttggatgaa      840

attgtttctg aggattccgg atggcattat atctcctgtg tcccaataaa tatactcaat      900

gcagaggtta gagcacagca tcagcttact cttatgatta cactaatttg catccttctt      960

tcactagtta tcgttcaaag aatttcaagt aaagcctatc aaccgattgt aaatcttcga     1020

aatcgtctgg caaaaaatta ttcatctatt gaaagcaaag atgatctttc tataattgaa     1080

ggaacctttt cttttttgga gaatcaggta gatgatatac agaaaatgct ccataaaaat     1140

agccaagtca tactctacaa actttttatg gatattctca ataaaaaaga actttccgat     1200

agccaacttt tacacaagtt agagcttagt ggaattcata ttacccaaag taattattgt     1260

ctgcttttaa tcgaatttga taaacatgta ttctacaaac tttctcttga acaacgggaa     1320

tatttgatta caaagtccga ttccctactg aaagatttcc ttagcgaacc tatcattcag     1380

accgcagagg cccagcctga taatcgaatt gctattttgc taaatctgaa tcctgataat     1440

tatttatcac ttacagaaca actagagctt cttccagacc atttatttga catttttcat     1500

ataaaaataa atctggcgtt ctccgctcct gttctctatc tttcagaaat atccaaggtg     1560

tattctcgaa tttccgaata catgaaatat ttttttcttt tcggctatgg gaatatattt     1620

acagatgaac tgatccacaa acttgataat acttcctatt ctttttcact gcaagattac     1680

cagcaaatcg aacgaatggt gagaaatggt actccggatg aattttcatt tcttatggaa     1740

agttatcaac aaatcataga atcaggggag tgttcctatc aggaagcgaa caatttttta     1800

attcagactt acagaattgc atttaatata gggaaagagt tggggttatt tgatgatcca     1860

cataaaaaag atcaaatatt aaatgatttt aatcatgcaa taaactttgc acactctatt     1920

gagtgtatct gtctggtcgt tcagatgtgc catgaatttc ttaatgaaga agttcttaat     1980

gcggattctt attttatcca acagatactt gagtacatta aaactcatca gaaagaagaa     2040

atttccctct ctttagttgc tcaagttttt catgtcagta ccggacattt aagccgcttg     2100

ttcaaaagtg tgaccaacca gaatttttca gcctatgtta taaacattaa attagaaact     2160

gcagctgaac ttctgaataa tgaacctgaa aaaagtattt caaacatagc tgctgaactt     2220

ggatattata ccccggctta ttttactaga ttatttaaag aaaagtttgg cgttacacct     2280

tctcagtttc gtaaaaagta a                                               2301


<210> 35
<211> 766
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0928; AraC family transcriptional regulator

<400> 35
Met Ile Ser Ile Ser Met Lys Glu Lys Ile Leu Leu Ser Phe Lys Lys 
1               5                   10                  15      


His Asn Phe Tyr Tyr Arg Ile Leu Leu Pro Phe Ser Ile Phe Ser Ile 
            20                  25                  30          


Thr Ile Val Cys Leu Met Ser Gly Ile Asn Trp Tyr Arg Ile Glu Asn 
        35                  40                  45              


Glu Tyr Asn Lys Lys Ile Ile Glu Ser Asn Gln Gln Phe Leu Lys Arg 
    50                  55                  60                  


Ala Ser Gln Leu Ser Asp Gln Tyr Leu Tyr Gly Asn Phe Thr Ser Ile 
65                  70                  75                  80  


Leu Asn Ser Ser Phe Leu Asp Gly Phe Lys Thr Ser Lys Met Asp Arg 
                85                  90                  95      


Phe Ile Thr Tyr Gly Asp Arg Leu Lys Pro Ser Glu Phe Leu Asn Leu 
            100                 105                 110         


His Gln Ser Ile Thr Asn Ile Cys Ala Gln Asn Ser Ser Val Ile Gln 
        115                 120                 125             


Val Ser Leu Tyr Gln Tyr Ala Ser Asp Leu Tyr Leu Asp Ser Phe Glu 
    130                 135                 140                 


Gly Leu Val Tyr Asn Ala Ser Lys Arg Pro Leu Asn Ser Asp Gln Ser 
145                 150                 155                 160 


Leu Lys Asn Tyr Leu Ser Thr Ile Ser Gly His Glu Lys Gly Ile Leu 
                165                 170                 175     


Tyr Tyr Thr Ser Gly Pro Asn Ser Phe Pro Val Lys Lys Ile Val Met 
            180                 185                 190         


Leu Arg Ser Val Pro Leu Tyr Thr Asn Phe Thr Asn Gly Ser Gly Phe 
        195                 200                 205             


Ile Ala Ile Thr Leu Asp Thr Asn Ser Leu Trp Glu Gln Leu Asn Met 
    210                 215                 220                 


Ser Thr Thr Ser Lys Asp Asp Ser Phe Phe Ile Leu Asp Ser Glu Lys 
225                 230                 235                 240 


Lys Leu Leu Leu Glu Lys Ser Pro Ser Ser Ile Ser Tyr Glu Tyr Leu 
                245                 250                 255     


Lys Ser Val Ser Asp Ser Thr Glu Ile Ser Ala Phe Leu Thr Tyr Asn 
            260                 265                 270         


Gly Ile Arg Tyr Arg Leu Asp Glu Ile Val Ser Glu Asp Ser Gly Trp 
        275                 280                 285             


His Tyr Ile Ser Cys Val Pro Ile Asn Ile Leu Asn Ala Glu Val Arg 
    290                 295                 300                 


Ala Gln His Gln Leu Thr Leu Met Ile Thr Leu Ile Cys Ile Leu Leu 
305                 310                 315                 320 


Ser Leu Val Ile Val Gln Arg Ile Ser Ser Lys Ala Tyr Gln Pro Ile 
                325                 330                 335     


Val Asn Leu Arg Asn Arg Leu Ala Lys Asn Tyr Ser Ser Ile Glu Ser 
            340                 345                 350         


Lys Asp Asp Leu Ser Ile Ile Glu Gly Thr Phe Ser Phe Leu Glu Asn 
        355                 360                 365             


Gln Val Asp Asp Ile Gln Lys Met Leu His Lys Asn Ser Gln Val Ile 
    370                 375                 380                 


Leu Tyr Lys Leu Phe Met Asp Ile Leu Asn Lys Lys Glu Leu Ser Asp 
385                 390                 395                 400 


Ser Gln Leu Leu His Lys Leu Glu Leu Ser Gly Ile His Ile Thr Gln 
                405                 410                 415     


Ser Asn Tyr Cys Leu Leu Leu Ile Glu Phe Asp Lys His Val Phe Tyr 
            420                 425                 430         


Lys Leu Ser Leu Glu Gln Arg Glu Tyr Leu Ile Thr Lys Ser Asp Ser 
        435                 440                 445             


Leu Leu Lys Asp Phe Leu Ser Glu Pro Ile Ile Gln Thr Ala Glu Ala 
    450                 455                 460                 


Gln Pro Asp Asn Arg Ile Ala Ile Leu Leu Asn Leu Asn Pro Asp Asn 
465                 470                 475                 480 


Tyr Leu Ser Leu Thr Glu Gln Leu Glu Leu Leu Pro Asp His Leu Phe 
                485                 490                 495     


Asp Ile Phe His Ile Lys Ile Asn Leu Ala Phe Ser Ala Pro Val Leu 
            500                 505                 510         


Tyr Leu Ser Glu Ile Ser Lys Val Tyr Ser Arg Ile Ser Glu Tyr Met 
        515                 520                 525             


Lys Tyr Phe Phe Leu Phe Gly Tyr Gly Asn Ile Phe Thr Asp Glu Leu 
    530                 535                 540                 


Ile His Lys Leu Asp Asn Thr Ser Tyr Ser Phe Ser Leu Gln Asp Tyr 
545                 550                 555                 560 


Gln Gln Ile Glu Arg Met Val Arg Asn Gly Thr Pro Asp Glu Phe Ser 
                565                 570                 575     


Phe Leu Met Glu Ser Tyr Gln Gln Ile Ile Glu Ser Gly Glu Cys Ser 
            580                 585                 590         


Tyr Gln Glu Ala Asn Asn Phe Leu Ile Gln Thr Tyr Arg Ile Ala Phe 
        595                 600                 605             


Asn Ile Gly Lys Glu Leu Gly Leu Phe Asp Asp Pro His Lys Lys Asp 
    610                 615                 620                 


Gln Ile Leu Asn Asp Phe Asn His Ala Ile Asn Phe Ala His Ser Ile 
625                 630                 635                 640 


Glu Cys Ile Cys Leu Val Val Gln Met Cys His Glu Phe Leu Asn Glu 
                645                 650                 655     


Glu Val Leu Asn Ala Asp Ser Tyr Phe Ile Gln Gln Ile Leu Glu Tyr 
            660                 665                 670         


Ile Lys Thr His Gln Lys Glu Glu Ile Ser Leu Ser Leu Val Ala Gln 
        675                 680                 685             


Val Phe His Val Ser Thr Gly His Leu Ser Arg Leu Phe Lys Ser Val 
    690                 695                 700                 


Thr Asn Gln Asn Phe Ser Ala Tyr Val Ile Asn Ile Lys Leu Glu Thr 
705                 710                 715                 720 


Ala Ala Glu Leu Leu Asn Asn Glu Pro Glu Lys Ser Ile Ser Asn Ile 
                725                 730                 735     


Ala Ala Glu Leu Gly Tyr Tyr Thr Pro Ala Tyr Phe Thr Arg Leu Phe 
            740                 745                 750         


Lys Glu Lys Phe Gly Val Thr Pro Ser Gln Phe Arg Lys Lys 
        755                 760                 765     


<210> 36
<211> 2601
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0788; Phage tape measure polynucleotide

<400> 36
atggcagata tatctgcatt ttttaattta agtcaaaaaa tttccgatac ggtcataaat       60

caaattacaa acgtgtattc caaatcaata agtgcttccg ttactcatac cgtaaataag      120

gttatagaga actctgtcgt caatgtcgat agaacaatta ataatataga aaaaatggat      180

cgctctatta atataaatat cggtagattc agaaagttta aagaagagac gcaagagcca      240

ataaaaacgg aaagctatgg tgaagctatt gaaaagatag gtatggttgt agaagcaatt      300

aataaaatgg gtgaagtcct tcaaagaatt gagacgcaag agtcaataaa aacggaaaaa      360

tttgatgaag ctattgaaaa gatagataag gtcgatgaat caattaataa aatgggtgaa      420

tcccttcaaa gaattgagac gcaagagtca ataaaaacgg aaaaatttga tgaagctatt      480

gaaaagatag ataaggtcga tgaatcaatt aataaaatgg atgaatccct tcaaagaatt      540

gagacgcaag agtcaataaa aacggaaaaa tttgatgaag ctattgaaaa gatagataag      600

gtcgatgaat caattaataa aatgggtgaa tcccttcaaa aagctgagat gcaagagcca      660

ataaaaacgg aaagctttga tgaagctact gaaaagataa gtaagaccga agaagcaatt      720

aataaaatag aggaagccct tcaaaaaacc gaggtgaaga gcgaggggac aggggcaaaa      780

ttaaaaaaga gttttagttc catatttagt tcagtaagag ataatttagg aaataatttt      840

gctgcggtgg gaaaagggat aggctctgtt ggaaatataa ttaatagtgt gacatccttt      900

ggtaccaaat atttagataa ggttgaaaat agtaagatat taaagacagc agatgcacta      960

gctcaaacca gaacaaaatt aacagcaatg acaggtagtc aagcagaggc tgatcaattt     1020

caacaaagaa tttttgattc cgcacaaaat tccagaactt cttatgagtc aacagccaat     1080

atggttcttg ggctaagtgc aaagggctcc ttttcaaata aggagcagat tgttaccttt     1140

actgaacttg ttaataaaaa cagtgtatta ggaggggcaa gcgctgaagg tacgaaaggc     1200

gtacaaacag cagttacaga agctatggtt tctggaacac ttagcggaga aggatttaat     1260

aatgtattag aaaatgctta tccaattata gaaaacatag cagcatacct taacaaacca     1320

atagaagcag ttcaaaaaat gggtgcacaa ggtgaaatca gtggtgaatt cttagcaaat     1380

gctatgtttg cttctgcaca aaaaacgaat gaagagttta gcaaaactcc tatgaccttt     1440

gaacaattga ttagttcaat aaaagataaa gctctgatgg tatttcaacc agtattacaa     1500

aagataagtg aattgacaca aaatcaagag tttatgaaca tgatacaaaa tattatgagt     1560

gggttaactt ttgtgggcga tttggcatta aggattgttg gcgtattaat aaatgctgca     1620

agtgcaattg ttgataattg gtcctggatt gctcctatga ttcttctaat tgcagttgct     1680

tttggaatat ggaagttatc tgttctacta agtagtttta gtattaaaga attaactgct     1740

tccttgctgg catgcccatt ggtatggatt attggtatta ttatggctat tatagcagtc     1800

atcaagatcg taatagatca cataaataag gttggagata agacatacac tgtagcaggc     1860

gttatttgcg gaattttagg tggagtggga gcctttgttt ggaacttatt tttgggatta     1920

ggagatttta ttcttagttt tgtaaatctc attgcaaatg catttatagg agttgcgaac     1980

ttttttgcta atgtatttaa gaacccgata tcttccatta tctatttatt tcaaggaatg     2040

gctgacggag tattaggtat tctggaaggt attgcaaatg cgattgattt tgtctttggt     2100

agtaattttg gtggaacagt tgctggttgg agaagtggac taaaagacat ggctgatgca     2160

gcggttcaaa aattagcacc agatgaaaaa tatgaacaaa aaattgatta tcttaattta     2220

tccatggaaa gctttggcct tacgagagca gaatattcag attggtggga taaggggaat     2280

gaatttggta ataaaatcaa tgatctcttt aaaggaagca cgggagatga caagagtttc     2340

gatgatactt gggatggaat tctaaaaaat acagataaaa tcgctcataa tacggaactt     2400

caaccagatg atttgtccta tttactcgaa cttgcagagc gtgatgcaat caaccgtttc     2460

acaacagcgg aagttaaaat tgatatgggt ggtgtttata atacggtatc aagcaaacag     2520

aatctggatg gaatcgtaga gtatctgacg gataagttac gagacgaact taataatact     2580

gcaagagctt gtaacgctta a                                               2601


<210> 37
<211> 866
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0788; Phage tape measure protein

<400> 37
Met Ala Asp Ile Ser Ala Phe Phe Asn Leu Ser Gln Lys Ile Ser Asp 
1               5                   10                  15      


Thr Val Ile Asn Gln Ile Thr Asn Val Tyr Ser Lys Ser Ile Ser Ala 
            20                  25                  30          


Ser Val Thr His Thr Val Asn Lys Val Ile Glu Asn Ser Val Val Asn 
        35                  40                  45              


Val Asp Arg Thr Ile Asn Asn Ile Glu Lys Met Asp Arg Ser Ile Asn 
    50                  55                  60                  


Ile Asn Ile Gly Arg Phe Arg Lys Phe Lys Glu Glu Thr Gln Glu Pro 
65                  70                  75                  80  


Ile Lys Thr Glu Ser Tyr Gly Glu Ala Ile Glu Lys Ile Gly Met Val 
                85                  90                  95      


Val Glu Ala Ile Asn Lys Met Gly Glu Val Leu Gln Arg Ile Glu Thr 
            100                 105                 110         


Gln Glu Ser Ile Lys Thr Glu Lys Phe Asp Glu Ala Ile Glu Lys Ile 
        115                 120                 125             


Asp Lys Val Asp Glu Ser Ile Asn Lys Met Gly Glu Ser Leu Gln Arg 
    130                 135                 140                 


Ile Glu Thr Gln Glu Ser Ile Lys Thr Glu Lys Phe Asp Glu Ala Ile 
145                 150                 155                 160 


Glu Lys Ile Asp Lys Val Asp Glu Ser Ile Asn Lys Met Asp Glu Ser 
                165                 170                 175     


Leu Gln Arg Ile Glu Thr Gln Glu Ser Ile Lys Thr Glu Lys Phe Asp 
            180                 185                 190         


Glu Ala Ile Glu Lys Ile Asp Lys Val Asp Glu Ser Ile Asn Lys Met 
        195                 200                 205             


Gly Glu Ser Leu Gln Lys Ala Glu Met Gln Glu Pro Ile Lys Thr Glu 
    210                 215                 220                 


Ser Phe Asp Glu Ala Thr Glu Lys Ile Ser Lys Thr Glu Glu Ala Ile 
225                 230                 235                 240 


Asn Lys Ile Glu Glu Ala Leu Gln Lys Thr Glu Val Lys Ser Glu Gly 
                245                 250                 255     


Thr Gly Ala Lys Leu Lys Lys Ser Phe Ser Ser Ile Phe Ser Ser Val 
            260                 265                 270         


Arg Asp Asn Leu Gly Asn Asn Phe Ala Ala Val Gly Lys Gly Ile Gly 
        275                 280                 285             


Ser Val Gly Asn Ile Ile Asn Ser Val Thr Ser Phe Gly Thr Lys Tyr 
    290                 295                 300                 


Leu Asp Lys Val Glu Asn Ser Lys Ile Leu Lys Thr Ala Asp Ala Leu 
305                 310                 315                 320 


Ala Gln Thr Arg Thr Lys Leu Thr Ala Met Thr Gly Ser Gln Ala Glu 
                325                 330                 335     


Ala Asp Gln Phe Gln Gln Arg Ile Phe Asp Ser Ala Gln Asn Ser Arg 
            340                 345                 350         


Thr Ser Tyr Glu Ser Thr Ala Asn Met Val Leu Gly Leu Ser Ala Lys 
        355                 360                 365             


Gly Ser Phe Ser Asn Lys Glu Gln Ile Val Thr Phe Thr Glu Leu Val 
    370                 375                 380                 


Asn Lys Asn Ser Val Leu Gly Gly Ala Ser Ala Glu Gly Thr Lys Gly 
385                 390                 395                 400 


Val Gln Thr Ala Val Thr Glu Ala Met Val Ser Gly Thr Leu Ser Gly 
                405                 410                 415     


Glu Gly Phe Asn Asn Val Leu Glu Asn Ala Tyr Pro Ile Ile Glu Asn 
            420                 425                 430         


Ile Ala Ala Tyr Leu Asn Lys Pro Ile Glu Ala Val Gln Lys Met Gly 
        435                 440                 445             


Ala Gln Gly Glu Ile Ser Gly Glu Phe Leu Ala Asn Ala Met Phe Ala 
    450                 455                 460                 


Ser Ala Gln Lys Thr Asn Glu Glu Phe Ser Lys Thr Pro Met Thr Phe 
465                 470                 475                 480 


Glu Gln Leu Ile Ser Ser Ile Lys Asp Lys Ala Leu Met Val Phe Gln 
                485                 490                 495     


Pro Val Leu Gln Lys Ile Ser Glu Leu Thr Gln Asn Gln Glu Phe Met 
            500                 505                 510         


Asn Met Ile Gln Asn Ile Met Ser Gly Leu Thr Phe Val Gly Asp Leu 
        515                 520                 525             


Ala Leu Arg Ile Val Gly Val Leu Ile Asn Ala Ala Ser Ala Ile Val 
    530                 535                 540                 


Asp Asn Trp Ser Trp Ile Ala Pro Met Ile Leu Leu Ile Ala Val Ala 
545                 550                 555                 560 


Phe Gly Ile Trp Lys Leu Ser Val Leu Leu Ser Ser Phe Ser Ile Lys 
                565                 570                 575     


Glu Leu Thr Ala Ser Leu Leu Ala Cys Pro Leu Val Trp Ile Ile Gly 
            580                 585                 590         


Ile Ile Met Ala Ile Ile Ala Val Ile Lys Ile Val Ile Asp His Ile 
        595                 600                 605             


Asn Lys Val Gly Asp Lys Thr Tyr Thr Val Ala Gly Val Ile Cys Gly 
    610                 615                 620                 


Ile Leu Gly Gly Val Gly Ala Phe Val Trp Asn Leu Phe Leu Gly Leu 
625                 630                 635                 640 


Gly Asp Phe Ile Leu Ser Phe Val Asn Leu Ile Ala Asn Ala Phe Ile 
                645                 650                 655     


Gly Val Ala Asn Phe Phe Ala Asn Val Phe Lys Asn Pro Ile Ser Ser 
            660                 665                 670         


Ile Ile Tyr Leu Phe Gln Gly Met Ala Asp Gly Val Leu Gly Ile Leu 
        675                 680                 685             


Glu Gly Ile Ala Asn Ala Ile Asp Phe Val Phe Gly Ser Asn Phe Gly 
    690                 695                 700                 


Gly Thr Val Ala Gly Trp Arg Ser Gly Leu Lys Asp Met Ala Asp Ala 
705                 710                 715                 720 


Ala Val Gln Lys Leu Ala Pro Asp Glu Lys Tyr Glu Gln Lys Ile Asp 
                725                 730                 735     


Tyr Leu Asn Leu Ser Met Glu Ser Phe Gly Leu Thr Arg Ala Glu Tyr 
            740                 745                 750         


Ser Asp Trp Trp Asp Lys Gly Asn Glu Phe Gly Asn Lys Ile Asn Asp 
        755                 760                 765             


Leu Phe Lys Gly Ser Thr Gly Asp Asp Lys Ser Phe Asp Asp Thr Trp 
    770                 775                 780                 


Asp Gly Ile Leu Lys Asn Thr Asp Lys Ile Ala His Asn Thr Glu Leu 
785                 790                 795                 800 


Gln Pro Asp Asp Leu Ser Tyr Leu Leu Glu Leu Ala Glu Arg Asp Ala 
                805                 810                 815     


Ile Asn Arg Phe Thr Thr Ala Glu Val Lys Ile Asp Met Gly Gly Val 
            820                 825                 830         


Tyr Asn Thr Val Ser Ser Lys Gln Asn Leu Asp Gly Ile Val Glu Tyr 
        835                 840                 845             


Leu Thr Asp Lys Leu Arg Asp Glu Leu Asn Asn Thr Ala Arg Ala Cys 
    850                 855                 860                 


Asn Ala 
865     


<210> 38
<211> 906
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_2125; D-isomer specific 2-hydroxyacid dehydrogenase 
      NAD-binding

<400> 38
atgtttaata aattagttgc aatcgaacca gttagcttaa ttccatctgc ggaacaaaaa       60

ctttatgatt atgcgaaaga agtaatctta tttgatgata tcccaaagga taataatgaa      120

atcattcacc gaatcggtga tgccgatgct gttttactta gctataccag cagtattgat      180

aaagaagttc taaatgcctg cccaaatatc cgatatatcg gtatgtgctg tagcttatat      240

tcaccggaaa gcgcgaatgt agacattctt accgccaatt ctaaaaatat cactgtctat      300

ggaatacgtg attatggcga ccagggagtt gtggaatatg taatcagcga acttgttcgg      360

tacttacatg ggttcggtga gaaacaatgg aaggaacttc cgatagagat tacagattta      420

aaagtaggaa tcgtaggcct tggtacttcc gggcaaatga tagccgtagc acttcaggct      480

cttggcgcag atttatatta ttatagccgc acccgtaaac cggaagagga ggcaagagat      540

ataaaatatc taccattgaa tgaattactt cagaccgtag atgtagtttg cacctgtctg      600

aataaaaatg taatcttatt ccatgaagaa caatttgaat gtcttggtaa tcataagatc      660

atgtttaata cttcaatagg tccatcccat gacattcctg cacttaccaa atggctttcc      720

catggtgata atgaattttt ctgtgatacc gctggcgcat taggagatac tactggtgag      780

ctattatcac atcctcatgt aaattgtatg aaagtatcta ccggaaggac aaagcaagcc      840

tttgatcgtc tgagtgaaaa ggtgttgaat aatattgaga cattcttaaa agaaaataat      900

atgtaa                                                                 906


<210> 39
<211> 301
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_2125; D-isomer specific 2-hydroxyacid dehydrogenase 
      NAD-binding

<400> 39
Met Phe Asn Lys Leu Val Ala Ile Glu Pro Val Ser Leu Ile Pro Ser 
1               5                   10                  15      


Ala Glu Gln Lys Leu Tyr Asp Tyr Ala Lys Glu Val Ile Leu Phe Asp 
            20                  25                  30          


Asp Ile Pro Lys Asp Asn Asn Glu Ile Ile His Arg Ile Gly Asp Ala 
        35                  40                  45              


Asp Ala Val Leu Leu Ser Tyr Thr Ser Ser Ile Asp Lys Glu Val Leu 
    50                  55                  60                  


Asn Ala Cys Pro Asn Ile Arg Tyr Ile Gly Met Cys Cys Ser Leu Tyr 
65                  70                  75                  80  


Ser Pro Glu Ser Ala Asn Val Asp Ile Leu Thr Ala Asn Ser Lys Asn 
                85                  90                  95      


Ile Thr Val Tyr Gly Ile Arg Asp Tyr Gly Asp Gln Gly Val Val Glu 
            100                 105                 110         


Tyr Val Ile Ser Glu Leu Val Arg Tyr Leu His Gly Phe Gly Glu Lys 
        115                 120                 125             


Gln Trp Lys Glu Leu Pro Ile Glu Ile Thr Asp Leu Lys Val Gly Ile 
    130                 135                 140                 


Val Gly Leu Gly Thr Ser Gly Gln Met Ile Ala Val Ala Leu Gln Ala 
145                 150                 155                 160 


Leu Gly Ala Asp Leu Tyr Tyr Tyr Ser Arg Thr Arg Lys Pro Glu Glu 
                165                 170                 175     


Glu Ala Arg Asp Ile Lys Tyr Leu Pro Leu Asn Glu Leu Leu Gln Thr 
            180                 185                 190         


Val Asp Val Val Cys Thr Cys Leu Asn Lys Asn Val Ile Leu Phe His 
        195                 200                 205             


Glu Glu Gln Phe Glu Cys Leu Gly Asn His Lys Ile Met Phe Asn Thr 
    210                 215                 220                 


Ser Ile Gly Pro Ser His Asp Ile Pro Ala Leu Thr Lys Trp Leu Ser 
225                 230                 235                 240 


His Gly Asp Asn Glu Phe Phe Cys Asp Thr Ala Gly Ala Leu Gly Asp 
                245                 250                 255     


Thr Thr Gly Glu Leu Leu Ser His Pro His Val Asn Cys Met Lys Val 
            260                 265                 270         


Ser Thr Gly Arg Thr Lys Gln Ala Phe Asp Arg Leu Ser Glu Lys Val 
        275                 280                 285             


Leu Asn Asn Ile Glu Thr Phe Leu Lys Glu Asn Asn Met 
    290                 295                 300     


<210> 40
<211> 1917
<212> DNA
<213> Clostridium phytofermentans

<220>
<223> Cphy_0935; HD superfamily phosphohydrolase-like 
      polynucleotide

<400> 40
ttgaaaaacg gtatttattt gaacgattct attcatggat taattccatt aacagaatat       60

gaaagaagaa tcgtttcaac aattggtttt aatcgtctcc atgacgtata tcaaaattct      120

acggtatatc taacatttcc aaccaatcga accaaacgat ttgaacactc cgttggtacg      180

atgaaacttt gttctgatat gttttttcaa tcgttactga atacaacaga ttctatgttg      240

agtgaattct ttgaaatctt tgatagagaa tatgcaacga ttttagatag attaagacag      300

cagatggatg tttgtgaaga aaaattaggt agcagactcc cgaaggcgat gccgcagatt      360

gaactagata agttgagaca ttcccttata ccgaacaata tccctgatca gtataaagtc      420

attcatctta tcctcataca atctatccgt gcagcagcat tgttacatga tattggacat      480

cctcccttta gtcatatagt cgaatttgcg ttgaaagatg tttacctgga atataaagac      540

aaagcagtta gtgaacagga aaatgcaaaa gaatttgtct cgattatgtc gaaatacttt      600

gagggtaata agaaattaca cgaacaaatg ggtgacgaga tcagcgaggg tattttaagc      660

aaaattatca tgccaatttc agaggatgat gagaaatacg atgaaaattt atttgaactt      720

ctgatattag aaagtgtaaa aagaattttt gcagaggatg gggcattcaa gtatcttcat      780

aggattattg attctagttt ggacggagac cgcttggatt atgtgacgag agatgtaatc      840

aattcgggaa aggattctgg aaagatagag tatagcagaa ttattaatga tatgcagttg      900

tttgtcgaga atggagagat ttttttctgt gttccaataa aagcagtaag ctcggtagag      960

gattttgtta agcgtagata taatagctat aaagatatta tttatcatca tagagtcatc     1020

caatctgatt atattctgga aggaatcgta aaagatttgg tgaagaaata tttgaatgaa     1080

acagtatcgg aaacacagcg tgacagcgaa gtactgatac catttgatat ctcaggactt     1140

tggtttccat tgggggataa aaagtcagca atccaagcaa atgcactttc tcaatggaat     1200

gactcctggt taacgaccgt actccggcaa atttattata cagaatatta tcataatgaa     1260

gaaatcgaag aaggctctgg ggattttgtg ttggaacaac gtctggcgga attgctgcgg     1320

aatagcaagc ggtatcattc cctgattaaa cggagtgaga attttaagat tattgacgac     1380

gctgtgaaac tagagataat aaaacaaaaa ggcaagattg aagagttcct agggaaagaa     1440

aatgagccat cttgtgggga taaatccacg gagctgattc atcaaatgtt agagaattct     1500

atgaaaaatt cttctggctt tattttgtcc tttatttgga gatatagtaa agaaatgaaa     1560

atcgaagcct ttgaacagat ggttagagaa atcgtggaag tcgaaacaaa tcatatcgtt     1620

actaatctta agacttatga tacggttact ttatttaaac ggatttcgat tggattagat     1680

tctccaattt atttctataa ccataaggaa aagatcagta ccttaaagga tatcagcgga     1740

attgctgaca ttttacagct tgattccgat tatctgccag tcttctatat ttatatttta     1800

gcaaaagaca aggatggggt tctaaaggag aaaagagaag aactgttagg ttgcatcggt     1860

aaacgaatcg gtgcccagat tatgagaaga ttgggaattt gggaggagat atcatga        1917


<210> 41
<211> 638
<212> PRT
<213> Clostridium phytofermentans

<220>
<223> Cphy_0935; HD superfamily phosphohydrolase-like protein 

<400> 41
Leu Lys Asn Gly Ile Tyr Leu Asn Asp Ser Ile His Gly Leu Ile Pro 
1               5                   10                  15      


Leu Thr Glu Tyr Glu Arg Arg Ile Val Ser Thr Ile Gly Phe Asn Arg 
            20                  25                  30          


Leu His Asp Val Tyr Gln Asn Ser Thr Val Tyr Leu Thr Phe Pro Thr 
        35                  40                  45              


Asn Arg Thr Lys Arg Phe Glu His Ser Val Gly Thr Met Lys Leu Cys 
    50                  55                  60                  


Ser Asp Met Phe Phe Gln Ser Leu Leu Asn Thr Thr Asp Ser Met Leu 
65                  70                  75                  80  


Ser Glu Phe Phe Glu Ile Phe Asp Arg Glu Tyr Ala Thr Ile Leu Asp 
                85                  90                  95      


Arg Leu Arg Gln Gln Met Asp Val Cys Glu Glu Lys Leu Gly Ser Arg 
            100                 105                 110         


Leu Pro Lys Ala Met Pro Gln Ile Glu Leu Asp Lys Leu Arg His Ser 
        115                 120                 125             


Leu Ile Pro Asn Asn Ile Pro Asp Gln Tyr Lys Val Ile His Leu Ile 
    130                 135                 140                 


Leu Ile Gln Ser Ile Arg Ala Ala Ala Leu Leu His Asp Ile Gly His 
145                 150                 155                 160 


Pro Pro Phe Ser His Ile Val Glu Phe Ala Leu Lys Asp Val Tyr Leu 
                165                 170                 175     


Glu Tyr Lys Asp Lys Ala Val Ser Glu Gln Glu Asn Ala Lys Glu Phe 
            180                 185                 190         


Val Ser Ile Met Ser Lys Tyr Phe Glu Gly Asn Lys Lys Leu His Glu 
        195                 200                 205             


Gln Met Gly Asp Glu Ile Ser Glu Gly Ile Leu Ser Lys Ile Ile Met 
    210                 215                 220                 


Pro Ile Ser Glu Asp Asp Glu Lys Tyr Asp Glu Asn Leu Phe Glu Leu 
225                 230                 235                 240 


Leu Ile Leu Glu Ser Val Lys Arg Ile Phe Ala Glu Asp Gly Ala Phe 
                245                 250                 255     


Lys Tyr Leu His Arg Ile Ile Asp Ser Ser Leu Asp Gly Asp Arg Leu 
            260                 265                 270         


Asp Tyr Val Thr Arg Asp Val Ile Asn Ser Gly Lys Asp Ser Gly Lys 
        275                 280                 285             


Ile Glu Tyr Ser Arg Ile Ile Asn Asp Met Gln Leu Phe Val Glu Asn 
    290                 295                 300                 


Gly Glu Ile Phe Phe Cys Val Pro Ile Lys Ala Val Ser Ser Val Glu 
305                 310                 315                 320 


Asp Phe Val Lys Arg Arg Tyr Asn Ser Tyr Lys Asp Ile Ile Tyr His 
                325                 330                 335     


His Arg Val Ile Gln Ser Asp Tyr Ile Leu Glu Gly Ile Val Lys Asp 
            340                 345                 350         


Leu Val Lys Lys Tyr Leu Asn Glu Thr Val Ser Glu Thr Gln Arg Asp 
        355                 360                 365             


Ser Glu Val Leu Ile Pro Phe Asp Ile Ser Gly Leu Trp Phe Pro Leu 
    370                 375                 380                 


Gly Asp Lys Lys Ser Ala Ile Gln Ala Asn Ala Leu Ser Gln Trp Asn 
385                 390                 395                 400 


Asp Ser Trp Leu Thr Thr Val Leu Arg Gln Ile Tyr Tyr Thr Glu Tyr 
                405                 410                 415     


Tyr His Asn Glu Glu Ile Glu Glu Gly Ser Gly Asp Phe Val Leu Glu 
            420                 425                 430         


Gln Arg Leu Ala Glu Leu Leu Arg Asn Ser Lys Arg Tyr His Ser Leu 
        435                 440                 445             


Ile Lys Arg Ser Glu Asn Phe Lys Ile Ile Asp Asp Ala Val Lys Leu 
    450                 455                 460                 


Glu Ile Ile Lys Gln Lys Gly Lys Ile Glu Glu Phe Leu Gly Lys Glu 
465                 470                 475                 480 


Asn Glu Pro Ser Cys Gly Asp Lys Ser Thr Glu Leu Ile His Gln Met 
                485                 490                 495     


Leu Glu Asn Ser Met Lys Asn Ser Ser Gly Phe Ile Leu Ser Phe Ile 
            500                 505                 510         


Trp Arg Tyr Ser Lys Glu Met Lys Ile Glu Ala Phe Glu Gln Met Val 
        515                 520                 525             


Arg Glu Ile Val Glu Val Glu Thr Asn His Ile Val Thr Asn Leu Lys 
    530                 535                 540                 


Thr Tyr Asp Thr Val Thr Leu Phe Lys Arg Ile Ser Ile Gly Leu Asp 
545                 550                 555                 560 


Ser Pro Ile Tyr Phe Tyr Asn His Lys Glu Lys Ile Ser Thr Leu Lys 
                565                 570                 575     


Asp Ile Ser Gly Ile Ala Asp Ile Leu Gln Leu Asp Ser Asp Tyr Leu 
            580                 585                 590         


Pro Val Phe Tyr Ile Tyr Ile Leu Ala Lys Asp Lys Asp Gly Val Leu 
        595                 600                 605             


Lys Glu Lys Arg Glu Glu Leu Leu Gly Cys Ile Gly Lys Arg Ile Gly 
    610                 615                 620                 


Ala Gln Ile Met Arg Arg Leu Gly Ile Trp Glu Glu Ile Ser 
625                 630                 635             


<210> 42
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 42
ccgcggagga gggttttgta tgagtaaaat cagaagaata gtttc                       45


<210> 43
<211> 51
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      primer

<400> 43
cccgggttag tggtggtggt ggtggtgttt tccataatat tgccctaatg a                51


<210> 44
<211> 7887
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 44
cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt       60

atcaggaaac agctatgacc gcggggattt tacacgtttc attaataatt tcttatattt      120

ctttatttgt ttgtaaaatt tacttaaatt tcgccagaaa acaaaagaaa gcctttacta      180

attaatagtt tagtgatact cttttatgta ggtatttttt aaaatacatt aaacctaggt      240

aattgaggaa agttacaatt accattatat aaggaggata ttcatatgaa aagaaaactg      300

aaacaaagat gtgctgtttt agtggcagtt gcaacgatga tagcttcgtt gcaatggggg      360

agagtgccag tacaagcagt aacagcagac ggtcttacct ctcaacagta tgttgaggca      420

atgggcgaag gctggaactt aggaaattcc tttgatggtt ttgattctga tacttcaaaa      480

ccagatcaag gcgagaccgc ttggggaaat cctaaggtta caaaagagct aatccatgca      540

gtcaaacaaa aaggctatag tagtatccgc ataccaatga ccctatatcg tagatatacg      600

gagagcaatg gtgtatgcac tatcgatagc gcatggatag cacgttacaa agaagtagta      660

gattatgcag ttgcagaagg tttatacgtt atgataaaca ttcaccatga ttcctggata      720

tggttatctt catgggatgg aaataagagt tctgtgcaat atgtaagatt tactcagatg      780

tgggatcaac ttgcgaaggc atttaaagat tatccgttac aagtatgttt tgaaacgata      840

aatgagccga actttcaaaa ctctggaaac gttactgcac agaataaatt agatatgctt      900

aaccaagcgg cttacaatat aattcgtgcc tctggtggat caaatgcaaa gagaatgatt      960

gttttaccat cactaaatac gaaccatgat aatagtgtac cattagctga tttcataact     1020

aaattgaatg attctaatat cattgcaacc gttcattatt atagtgaatg ggtatttagt     1080

gctaaccttg gtaagacaag ctttgatgaa gatttatggg gaaatggtga ttacactcct     1140

cgtgatgcgg taaataaggc gtttgatacc atttccaatg catttacagc aaaaaaaatc     1200

ggtgttgtta tcggagaatt tggtctttta ggttatgact ctgattttga aaataatcaa     1260

ccaggcgaag aattaaaata ttatgagtat atgaattatg tagctagaca aaagaaaatg     1320

tgccttatgt tttgggataa cggatctgga attaatcgta acgactctaa gtatagttgg     1380

aaaaaaccta tagttggaaa gatgttagaa gtatctatga caggacgttc ctcttatgca     1440

acaggccttg ataccattta cctaaacggc agctcattta atgatattaa tatcccgctt     1500

actctaaacg gtaacacctt tgttggagtt acaggattaa ccagtggtac cgattttacg     1560

tataaccaat ccaatgcaac actaacatta aaatcatcct acgtgaagaa ggtttatgat     1620

gcaatgggaa gtaattatgg tacggtagct gatttggtac ttaagttttc aagtggagct     1680

gattggcatg agtatttagt gaaatacaaa gcaccagtat ttcaaaatgc gaatggaact     1740

gtttccaatg gaattaatat tccagttcaa tttaacggaa gtaaactccg tcgttctaca     1800

gcttatatag gttctaatcg agttggcccg aatcaaagct ggtggatgta tttagagtat     1860

ggtgcaactt ttgtggcgaa ctatacgaac aatattttaa ccattaagcc tgatttcttt     1920

aaggatggtt ctgtttatga tggaaatata tcatttgaga tggagtttta tgatggacaa     1980

aagttaaaat ataatcttaa taaatcaaat ggtaacataa caggaactgc agcagcagta     2040

acccctacac caacaccaac ggcgacacca acaccaacag cgacgccaac accaaccgta     2100

acaccaaaac caacaataac cccaacagta acgccgacac caacagtaac gccaaaacca     2160

acaataacac cgacagtaac accaactcct actccaatcc caggaacagg tccagttaca     2220

ttaaaatacg aagtaacgaa tacttgggat aagcatacac aggcgaatat tacattaacc     2280

aatacctcta atacagcact aaagaatttt gttgtatcat ttacttataa agggtatata     2340

gaccaaatgt ggagtgcaga tttggttagt caaaattcgg gtaccattac agtgaaggga     2400

ccagcatggg ctacgaatct agatccaggg caaagtataa catttggttt tattgcttca     2460

catgatacac cgtctgttga tccaccatca aatgttactt tagttagttc aaattaaaat     2520

tgtattcaaa tctcgaggcc tgcagacatg caagcttggc actggccgtc gttttacaac     2580

gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt     2640

tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca     2700

gcctgaatgg cgaatggcgc tagcataaaa ataagaagcc tgcatttgca ggcttcttat     2760

ttttatggcg cgccgttctg aatccttagc taatggttca acaggtaact atgacgaaga     2820

tagcaccctg gataagtctg taatggattc taaggcattt aatgaagacg tgtatataaa     2880

atgtgctaat gaaaaagaaa atgcgttaaa agagcctaaa atgagttcaa atggttttga     2940

aattgattgg tagtttaatt taatatattt tttctattgg ctatctcgat acctatagaa     3000

tcttctgttc acttttgttt ttgaaatata aaaaggggct ttttagcccc ttttttttaa     3060

aactccggag gagtttcttc attcttgata ctatacgtaa ctattttcga tttgacttca     3120

ttgtcaatta agctagtaaa atcaatggtt aaaaaacaaa aaacttgcat ttttctacct     3180

agtaatttat aattttaagt gtcgagttta aaagtataat ttaccaggaa aggagcaagt     3240

tttttaataa ggaaaaattt ttccttttaa aattctattt cgttatatga ctaattataa     3300

tcaaaaaaat gaaaataaac aagaggtaaa aactgcttta gagaaatgta ctgataaaaa     3360

aagaaaaaat cctagattta cgtcatacat agcaccttta actactaaga aaaatattga     3420

aaggacttcc acttgtggag attatttgtt tatgttgagt gatgcagact tagaacattt     3480

taaattacat aaaggtaatt tttgcggtaa tagattttgt ccaatgtgta gttggcgact     3540

tgcttgtaag gatagtttag aaatatctat tcttatggag catttaagaa aagaagaaaa     3600

taaagagttt atatttttaa ctcttacaac tccaaatgta aaaagttatg atcttaatta     3660

ttctattaaa caatataata aatcttttaa aaaattaatg gagcgtaagg aagttaagga     3720

tataactaaa ggttatataa gaaaattaga agtaacttac caaaaggaaa aatacataac     3780

aaaggattta tggaaaataa aaaaagatta ttatcaaaaa aaaggacttg aaattggtga     3840

tttagaacct aattttgata cttataatcc tcattttcat gtagttattg cagttaataa     3900

aagttatttt acagataaaa attattatat aaatcgagaa agatggttgg aattatggaa     3960

gtttgctact aaggatgatt ctataactca agttgatgtt agaaaagcaa aaattaatga     4020

ttataaagag gtttacgaac ttgcgaaata ttcagctaaa gacactgatt atttaatatc     4080

gaggccagta tttgaaattt tttataaagc attaaaaggc aagcaggtat tagtttttag     4140

tggatttttt aaagatgcac acaaattgta caagcaagga aaacttgatg tttataaaaa     4200

gaaagatgaa attaaatatg tctatatagt ttattataat tggtgcaaaa aacaatatga     4260

aaaaactaga ataagggaac ttacggaaga tgaaaaagaa gaattaaatc aagatttaat     4320

agatgaaata gaaatagatt aaagtgtaac tatactttat atatatatga ttaaaaaaat     4380

aaaaaacaac agcctattag gttgttgttt tttattttct ttattaattt ttttaatttt     4440

tagtttttag ttctttttta aaataagttt cagcctcttt ttcaatattt tttaaagaag     4500

gagtatttgc atgaattgcc ttttttctaa cagacttagg aaatatttta acagtatctt     4560

cttgcgccgg tgattttgga acttcataac ttactaattt ataattatta ttttcttttt     4620

taattgtaac agttgcaaaa gaagctgaac ctgttccttc aactagttta tcatcttcaa     4680

tataatattc ttgacctata tagtataaat atatttttat tatattttta cttttttctg     4740

aatctattat tttataatca taaaaagttt taccaccaaa agaaggttgt actccttctg     4800

gtccaacata tttttttact atattatcta aataattttt gggaactggt gttgtaattt     4860

gattaatcga acaaccagtt atacttaaag gaattataac tataaaaata tataggatta     4920

tctttttaaa tttcattatt ggcctccttt ttattaaatt tatgttacca taaaaaggac     4980

ataacgggaa tatgtagaat atttttaatg tagacaaaat tttacataaa tataaagaaa     5040

ggaagtgttt gtttaaattt tatagcaaac tatcaaaaat tagggggata aaaatttatg     5100

aaaaaaaggt tttcgatgtt atttttatgt ttaactttaa tagtttgtgg tttatttaca     5160

aattcggccg gcccaatgaa taggtttaca cttactttag ttttatggaa atgaaagatc     5220

atatcatata taatctagaa taaaattaac taaaataatt attatctaga taaaaaattt     5280

agaagccaat gaaatctata aataaactaa attaagttta tttaattaac aactatggat     5340

ataaaatagg tactaatcaa aatagtgagg aggatatatt tgaatacata cgaacaaatt     5400

aataaagtga aaaaaatact tcggaaacat ttaaaaaata accttattgg tacttacatg     5460

tttggatcag gagttgagag tggactaaaa ccaaatagtg atcttgactt tttagtcgtc     5520

gtatctgaac cattgacaga tcaaagtaaa gaaatactta tacaaaaaat tagacctatt     5580

tcaaagaaaa taggagataa aagcaactta cgatatattg aattaacaat tattattcag     5640

caagaaatgg taccgtggaa tcatcctccc aaacaagaat ttatttatgg agaatggtta     5700

caagagcttt atgaacaagg atacattcct cagaaggaat taaattcaga tttaaccata     5760

atgctttacc aagcaaaacg aaaaaataaa agaatatacg gaaattatga cttagaggaa     5820

ttactacctg atattccatt ttctgatgtg agaagagcca ttatggattc gtcagaggaa     5880

ttaatagata attatcagga tgatgaaacc aactctatat taactttatg ccgtatgatt     5940

ttaactatgg acacgggtaa aatcatacca aaagatattg cgggaaatgc agtggctgaa     6000

tcttctccat tagaacatag ggagagaatt ttgttagcag ttcgtagtta tcttggagag     6060

aatattgaat ggactaatga aaatgtaaat ttaactataa actatttaaa taacagatta     6120

aaaaaattat aaaaaaattg aaaaaatggt ggaaacactt ttttcaattt ttttgtttta     6180

ttatttaata tttgggaaat attcattcta attggtaatc agattttaga agtttaaact     6240

cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc     6300

agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg     6360

ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct     6420

accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct     6480

tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct     6540

cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg     6600

gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc     6660

gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga     6720

gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg     6780

cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta     6840

tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg     6900

ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg     6960

ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat     7020

taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc     7080

agtgagcgag gaagcggaag agcgcccaat acgcagggcc ccctgcttcg gggtcattat     7140

agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt gccaaagggt     7200

tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg     7260

cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa     7320

cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg agggcaagcg     7380

gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg tgtactgcct     7440

tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc     7500

ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact atgagcacgt     7560

ccgcgagctg gcccgcatca atggcgacct gggccgcctg ggcggcctgc tgaaactctg     7620

gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc acgatcctcg ccctgctggc     7680

gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg tccgcccgag     7740

ggcagagcca tgactttttt agccgctaaa acggccgggg ggtgcgcgtg attgccaagc     7800

acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag tacatcaccg     7860

acgagcaagg caagaccgat cgggccc                                         7887
