                                SEQUENCE LISTING

<110> Chen, Zhiwei
      Friedland, Gregory D.
      Chhabra, Swapnil R.
      Chivian, Dylan C.
      Simmons, Blake A.
      The Regents of the University of California
      Sandia Corporation

<120> Glycoside Hydrolases Having Multiple
  Hydrolase Activities

<130> 77429-839195

<140> WO Not yet assigned 
<141> Not yet assigned  

<150> US 61/481,642       
<151> 2011-05-02  

<160> 12

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 317
<212> PRT
<213> Thermotoga maritima

<220> 
<223> Thermotoga maritima strain MSB8 endoglucanase,
      Cel5A, Cel5ATma, locus TM_1751, locus B72216

<400> 1
Met Gly Val Asp Pro Phe Glu Arg Asn Lys Ile Leu Gly Arg Gly Ile
 1               5                  10                  15      
Asn Ile Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val
            20                  25                  30          
Val Ile Lys Asp Glu Phe Phe Asp Ile Ile Lys Glu Ala Gly Phe Ser
        35                  40                  45              
His Val Arg Ile Pro Ile Arg Trp Ser Thr His Ala Tyr Ala Phe Pro
    50                  55                  60                  
Pro Tyr Lys Ile Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val Ile
65                  70                  75                  80  
Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Val Ile Asn Ile His His
                85                  90                  95      
Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu
            100                 105                 110         
Ala Leu Trp Lys Gln Ile Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr
        115                 120                 125             
Leu Phe Phe Glu Ile Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu
    130                 135                 140                 
Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val Ile Arg Ser Ile
145                 150                 155                 160 
Asp Lys Lys His Thr Ile Ile Ile Gly Thr Ala Glu Trp Gly Gly Ile
                165                 170                 175     
Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser Ile
            180                 185                 190         
Val Thr Ile His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gln Gly Ala
        195                 200                 205             
Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser
    210                 215                 220                 
Pro Asp Asp Gln Lys His Leu Ile Glu Glu Phe Asn Phe Ile Glu Glu
225                 230                 235                 240 
Trp Ser Lys Lys Asn Lys Arg Pro Ile Tyr Ile Gly Glu Phe Gly Ala
                245                 250                 255     
Tyr Arg Lys Ala Asp Leu Glu Ser Arg Ile Lys Trp Thr Ser Phe Val
            260                 265                 270         
Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe
        275                 280                 285             
Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys
    290                 295                 300                 
Asp Leu Leu Glu Ala Leu Ile Gly Gly Asp Ser Ile Glu
305                 310                 315         


<210> 2
<211> 954
<212> DNA
<213> Thermotoga maritima

<220> 
<223> Thermotoga maritima strain MSB8 endoglucanase,
      Cel5A, Cel5ATma, locus TM_1751, locus B72216 CDS

<400> 2
atgggggttg atccgtttga gcgtaataaa attctgggcc gcggtattaa tatcggcaac 60
gcactggagg ctccgaatga aggtgattgg ggcgtggtta ttaaggatga attcttcgat 120
attatcaaag aagcgggatt tagccatgtg cgtattccga ttcgttggtc gactcatgcc 180
tatgcatttc cgccatacaa aattatggat cgctttttca aacgtgtgga cgaagttatt 240
aacggtgccc tgaaacgcgg actggccgtt gttattaata tccaccacta tgaagagctg 300
atgaatgatc ctgaagaaca taaagaacgc tttctggcac tgtggaaaca gattgcggac 360
cgttataaag attatccgga aactctgttt ttcgaaattc tgaacgagcc gcatgggaac 420
ctgacgccgg aaaaatggaa tgaactgctg gaagaagctc tgaaagtaat ccgttcgatt 480
gacaagaaac ataccatcat tattggcacc gccgaatggg gtggtatcag tgcactggaa 540
aaactgtcag ttccgaagtg ggagaaaaac tccattgtga cgattcatta ttataacccg 600
tttgagttta cccaccaggg ggcagaatgg gtggaaggca gcgaaaaatg gctgggccgt 660
aaatggggta gtcctgatga tcaaaaacac ctgattgaag agtttaactt catcgaagag 720
tggtcaaaaa agaataaacg cccgatttat attggcgagt tcggtgccta tcgcaaagct 780
gatctggaat cgcgtattaa atggacaagt tttgttgtac gtgaaatgga aaagcgccgt 840
tggtcctggg cctattggga attctgtagc ggttttggtg tctacgatac gctgcgcaaa 900
acttggaaca aagatctgct ggaagccctg attggcggtg acagtatcga ataa       954

<210> 3
<211> 488
<212> PRT
<213> Epidinium ecaudatum

<220> 
<223> rumen anaerobic protozoan Epidinium ecaudatum cellulase
      family 5 protein (partial), Cel5AEec, cmc-06

<400> 3
Lys Thr Ala Ile Glu Thr Val Asn Asp Met Gly Leu Gly Trp Asn Leu
 1               5                  10                  15      
Gly Asn Thr Phe Asp Cys Phe Gly Thr Trp Lys Glu Ile Lys Thr Pro
            20                  25                  30          
Asp Asp Gln Ile Thr Met Trp Gly Asn Val Val Pro Thr Glu Ala Met
        35                  40                  45              
Val Thr Thr Ile Lys Lys Tyr Gly Phe Lys Thr Val Arg Phe Pro Val
    50                  55                  60                  
Thr Trp Met Asn Phe Met Asp Glu Ser Gly Lys Val Lys Ala Glu Trp
65                  70                  75                  80  
Met Ala Arg Val Lys Glu Val Val Asp Trp Ile Val Lys Ala Gly Leu
                85                  90                  95      
Tyr Cys Ile Leu Asn Val His His Asp Gly Val Ser Gly Asn Trp Leu
            100                 105                 110         
Ala Gln Gly Ala His Val Lys Ala Arg Tyr Val Thr Leu Trp Thr Gln
        115                 120                 125             
Ile Ala Thr Glu Phe Lys Asp Tyr Asp Asp His Leu Val Phe Glu Ser
    130                 135                 140                 
Met Asn Glu Val Glu Tyr Lys Asn Gly Asn Ser Phe Asp Tyr Asn Ser
145                 150                 155                 160 
Leu Leu Thr Leu Thr Gln Ala Phe Val Asp Thr Val Arg Gly Leu Gly
                165                 170                 175     
Gly Lys Asn Ser Asp Arg Leu Leu Leu Ile Ser Gly Met Asn Thr Asn
            180                 185                 190         
Leu Glu Asn Thr Cys Ser Ser Ser Tyr Lys Met Pro Thr Asp Lys Ala
        195                 200                 205             
Asn Lys Leu Ala Ile Ser Ile His Tyr Tyr Leu Pro Pro Gln Phe Thr
    210                 215                 220                 
Val Glu Ser Asp Lys Asn Pro Trp Thr Trp Thr Asp Asp Gln Gly Val
225                 230                 235                 240 
Val His Glu Ile Thr Pro Leu Gln Lys Trp Gly Asp Glu Gly Asn Tyr
                245                 250                 255     
Gln Glu Met Val Thr Asn Phe Glu Thr Met Lys Lys Ala Phe Val Asp
            260                 265                 270         
Lys Gly Ile Pro Val Ile Leu Gly Glu Val Gly Val Leu Thr Glu Glu
        275                 280                 285             
Lys Lys Asp Lys Ala Ser Ile Arg Glu Phe Leu Leu Ala Glu Tyr Ser
    290                 295                 300                 
Phe Thr Ala Gly Tyr Asn Gly Phe Met Ser Ile Leu Trp Asp Thr Ser
305                 310                 315                 320 
Lys Asn Thr Ala Gly Asp Met Asn Phe Tyr Asn Arg Glu Thr Asp Lys
                325                 330                 335     
Trp Tyr Asp Glu Gln Ile Arg Asp Asn Phe Ile Asn Ile Ala Ala Gly
            340                 345                 350         
Lys Phe Val Asp Pro Thr Lys Tyr Leu Val Asn Ser Asn Ser Glu Thr
        355                 360                 365             
Ser Thr Lys Val Asp Ser Asp Gly Asn Val Gln Ile Asn Ile Gly Ser
    370                 375                 380                 
Lys Lys Val Asn Lys Val Ile Phe Asn Ala Lys Ile Ser Gly Ala Val
385                 390                 395                 400 
Asn Ile Trp Asp Val Gly Phe Gly Val Ala Ser Ala Asp Lys Thr Gly
                405                 410                 415     
Lys Trp Phe Gly Asp Pro Val Gly Gly Ala Glu Gly Val Lys Gln Asn
            420                 425                 430         
Asp Gly Thr Tyr Thr Phe Thr Val Asp Val Ser Ala Lys Asp Phe Asn
        435                 440                 445             
Asp Tyr Val Gln Val Gln Arg Trp Trp Gly Asn Asp Asn Ile Thr Ile
    450                 455                 460                 
Asn Ser Val Thr Val Glu Phe Glu Gly Thr Ala Lys Arg Leu Asp Phe
465                 470                 475                 480 
Asn Ala Tyr Lys Ala Ala Leu Lys
                485             


<210> 4
<211> 1464
<212> DNA
<213> Epidinium ecaudatum

<220> 
<223> rumen anaerobic protozoan Epidinium ecaudatum cellulase
      family 5 protein (partial), Cel5AEec, cmc-06

<400> 4
aaaacggcga ttgaaaccgt gaacgatatg ggcctgggtt ggaacctggg caatacgttt 60
gactgcttcg gcacctggaa agaaattaaa acgccggatg accagatcac catgtggggc 120
aacgtggttc cgaccgaagc gatggtgacc acgatcaaaa aatacggttt caaaacggtg 180
cgtttcccgg ttacctggat gaattttatg gatgaatctg gcaaagtgaa agcggaatgg 240
atggcccgcg ttaaagaagt cgtggattgg attgtgaaag caggcctgta ctgtatcctg 300
aacgttcatc acgacggcgt cagcggtaat tggctggcac agggtgctca tgtgaaagca 360
cgttatgtta cgctgtggac ccaaattgct accgaattta aagattacga tgaccacctg 420
gtcttcgaat ccatgaacga agttgaatac aaaaacggta attcgttcga ttacaatagc 480
ctgctgaccc tgacgcaggc ctttgtcgat accgtgcgtg gcctgggcgg taaaaactcg 540
gaccgcctgc tgctgatcag cggtatgaac acgaatctgg aaaacacctg tagcagcagc 600
tataaaatgc cgacggataa agcaaacaaa ctggctatct ccatccatta ctacctgccg 660
ccgcagttta ccgttgaatc agataaaaat ccgtggacct ggacggatga ccaaggcgtt 720
gtccacgaaa ttaccccgct gcaaaaatgg ggcgatgagg gtaactacca agaaatggtg 780
acgaattttg aaaccatgaa aaaagcattc gtcgataaag gtattccggt gatcctgggc 840
gaagtgggtg ttctgacgga agagaaaaaa gacaaagcga gcattcgcga atttctgctg 900
gcagaatata gtttcaccgc tggctacaac ggttttatgt ccatcctgtg ggatacgtca 960
aaaaataccg cgggcgacat gaacttctat aatcgtgaaa ccgataaatg gtacgacgaa 1020
cagattcgcg ataacttcat taatatcgcg gcgggtaaat ttgtggaccc gaccaaatat 1080
ctggttaact ccaattcaga aacctctacg aaagttgata gtgacggcaa cgtccaaatc 1140
aacatcggtt cgaaaaaagt taacaaagtc atcttcaatg cgaaaatcag cggcgccgtg 1200
aatatctggg atgtcggttt cggtgtggcg agcgccgata aaaccggcaa atggtttggt 1260
gacccggtgg gcggtgcaga aggcgttaaa cagaacgatg gcacctatac gttcaccgtg 1320
gatgtgtctg ccaaagattt taatgactac gttcaggtcc aacgttggtg gggcaacgat 1380
aatattacga tcaacagtgt gaccgttgaa tttgaaggca ccgcgaaacg cctggatttt 1440
aatgcctata aagcagctct gaaa                                        1464

<210> 5
<211> 352
<212> PRT
<213> Prevotella bryantii

<220> 
<223> Prevotella bryantii strain B14
      beta-1,4-endoglucanase (partial), Cel5APbr

<400> 5
Ile Asn Gln Asn Ala Thr Tyr Met Glu Glu Ser Ala Gln Ser Ala Val
 1               5                  10                  15      
Asp Asn Phe Gly Leu Gly Phe Asn Leu Gly Asn Thr Leu Asp Ala Asn
            20                  25                  30          
Gly Cys Gly Thr Gly Lys Pro Val Ala Thr Tyr Glu Thr Phe Trp Gly
        35                  40                  45              
Gln Pro Glu Thr Thr Gln Asp Met Met Thr Phe Leu Met Gln Asn Gly
    50                  55                  60                  
Phe Asn Ala Val Arg Ile Pro Val Thr Trp Tyr Glu His Met Asp Ala
65                  70                  75                  80  
Glu Gly Asn Val Asp Glu Ala Trp Met Met Arg Val Lys Ala Ile Val
                85                  90                  95      
Glu Tyr Ala Met Asn Ala Gly Leu Tyr Ala Ile Val Asn Val His His
            100                 105                 110         
Asp Thr Ala Ala Gly Ser Gly Ala Trp Ile Lys Ala Asp Thr Asp Val
        115                 120                 125             
Tyr Ala Ala Thr Lys Glu Lys Phe Lys Lys Leu Trp Thr Gln Ile Ala
    130                 135                 140                 
Asn Ala Leu Ala Asp Tyr Asp Gln His Leu Leu Phe Glu Gly Tyr Asn
145                 150                 155                 160 
Glu Met Leu Asp Gly Asn Asn Ser Trp Asp Glu Pro Gln Lys Ala Ser
                165                 170                 175     
Gly Tyr Glu Ala Leu Asn Asn Tyr Ala Gln Asp Phe Val Asp Ala Val
            180                 185                 190         
Arg Ala Thr Gly Gly Asn Asn Ala Thr Arg Asn Leu Ile Val Asn Thr
        195                 200                 205             
Tyr Ala Ala Ala Lys Gly Glu Asn Val Leu Asn Asn Phe Met Leu Pro
    210                 215                 220                 
Thr Asp Ala Val Asn Asn His Leu Ile Val Gln Val His Ser Tyr Asp
225                 230                 235                 240 
Pro Trp Asn Phe Phe Asn Thr Lys Thr Thr Trp Asp Ser Glu Cys His
                245                 250                 255     
Asn Thr Leu Thr Glu Ile Phe Ser Ala Leu Ser Lys Lys Phe Thr Thr
            260                 265                 270         
Ile Pro Tyr Ile Ile Gly Glu Tyr Gly Thr His Gly Glu Ser Asp Ile
        275                 280                 285             
Ser Val Ser Lys Ser Ser Pro Ala Glu Lys Ile Lys Leu Ala Ala Asp
    290                 295                 300                 
Gln Ala Ala Asp Met Val Lys Leu Ala Lys Asp His His Ser Ala Thr
305                 310                 315                 320 
Phe Tyr Trp Met Ser Ile Phe Asp Gly Ser Asp Arg Ile Gln Pro Gln
                325                 330                 335     
Trp Ser Leu Pro Thr Val Val Glu Ala Met Gln Glu Ala Tyr Asn Asn
            340                 345                 350         


<210> 6
<211> 1056
<212> DNA
<213> Prevotella bryantii

<220> 
<223> Prevotella bryantii strain B14
      beta-1,4-endoglucanase (partial), Cel5APbr

<400> 6
attaaccaaa atgcaaccta catggaggag agcgcgcaat ccgcagtgga caatttcggc 60
ctgggcttca acctgggtaa cactctggac gcgaacggct gcggcaccgg caaaccggtg 120
gcgacctacg agactttctg gggtcaaccg gagactaccc aggacatgat gaccttcctg 180
atgcagaacg gtttcaacgc ggttcgtatt ccagtgacct ggtacgaaca catggacgcg 240
gaaggtaacg tggacgaagc gtggatgatg cgcgtgaagg cgattgttga gtacgcgatg 300
aacgcgggcc tgtacgcgat tgttaacgtt caccacgaca ccgcggctgg cagcggcgcg 360
tggatcaagg ctgacaccga cgtttacgcg gcgaccaaag aaaagtttaa aaagctgtgg 420
acccaaatcg ctaacgcgct ggcggactac gaccaacacc tgctgttcga gggctacaac 480
gaaatgctgg acggcaacaa cagctgggac gagccgcaaa aagcgagcgg ttacgaagcg 540
ctgaataact acgcgcaaga cttcgttgac gcggtgcgcg ctaccggtgg caacaacgcg 600
acccgcaatc tgatcgttaa cacctacgcc gcagcgaagg gtgagaacgt tctgaacaat 660
ttcatgctgc cgaccgacgc ggttaacaat cacctgatcg ttcaggtgca tagctacgac 720
ccgtggaact tctttaacac taagaccacc tgggactccg agtgccacaa cactctgacc 780
gagattttta gcgcgctgag caagaaattt accaccatcc cgtacatcat cggtgagtac 840
ggcacccacg gtgaaagcga catcagcgtt agcaaaagca gcccggctga aaagatcaaa 900
ctggcggcgg accaagcggc ggacatggtt aagctggcga aggaccacca cagcgcgacc 960
ttctattgga tgtccatctt tgacggtagc gaccgcatcc aaccgcaatg gagcctgccg 1020
accgttgtgg aagcgatgca agaggcgtac aacaac                           1056

<210> 7
<211> 482
<212> PRT
<213> Unknown

<220> 
<223> unidentified microorganism from rumen contents of
      a New Zealand dairy cow, bovine rumen microflora

<220> 
<223> cellulase, Cel5AUI

<400> 7
Ile Asn Val Leu Gln His His Pro Glu Val Glu Leu Ser Val Asp Lys
 1               5                  10                  15      
Thr Ser Val Ser Phe Asn Arg Ser Gly Gly Glu Glu Thr Phe Thr Val
            20                  25                  30          
Thr Ser Ser Thr Gln Pro His Val Ser Ala Asp Val Ser Trp Val Val
        35                  40                  45              
Val Glu Thr Gly Lys Ile Asp Lys Asp His His Thr Glu Val Arg Val
    50                  55                  60                  
Leu Ala Gly Ala Asn Arg Lys Glu Ala Ser Ala Gly Thr Leu Thr Val
65                  70                  75                  80  
Ser Cys Ser Asp Lys Lys Val Ser Val Ser Val Lys Gln Glu Ala Phe
                85                  90                  95      
Val Ala Pro Ser Val Ala Ser Thr Thr Ala Val Thr Pro Gln Met Val
            100                 105                 110         
Phe Asp Ala Met Gly Pro Gly Trp Asn Met Gly Asn His Met Asp Ala
        115                 120                 125             
Ile Ser Asn Gly Val Ser Gly Glu Thr Val Trp Gly Asn Pro Lys Cys
    130                 135                 140                 
Thr Gln Ala Thr Met Asp Gly Val Lys Ala Ala Gly Tyr Lys Ala Val
145                 150                 155                 160 
Arg Ile Cys Thr Thr Trp Glu Gly His Ile Gly Ala Ala Pro Ala Tyr
                165                 170                 175     
Ala Leu Glu Gln Lys Trp Leu Asp Arg Val Ala Glu Ile Val Gly Tyr
            180                 185                 190         
Ala Glu Lys Ala Gly Leu Val Ala Ile Val Asn Thr His His Asp Glu
        195                 200                 205             
Ser Tyr Trp Gln Asp Ile Ser Lys Cys Tyr Asn Asn Ala Ala Asn His
    210                 215                 220                 
Glu Lys Val Lys Asp Glu Val Phe Ser Val Trp Thr Gln Ile Ala Glu
225                 230                 235                 240 
Lys Phe Lys Asp Lys Gly Glu Trp Leu Val Phe Glu Ser Phe Asn Glu
                245                 250                 255     
Ile Gln Asp Gly Gly Trp Gly Trp Ser Asp Ala Phe Arg Lys Asn Pro
            260                 265                 270         
Asp Ala Gln Tyr Lys Val Leu Asn Glu Trp Asn Gln Thr Phe Val Asp
        275                 280                 285             
Ala Val Arg Ser Thr Gly Gly Gln Asn Ala Thr Arg Trp Leu Gly Ile
    290                 295                 300                 
Pro Gly Tyr Ala Cys Asn Pro Gly Phe Thr Ile Ala Gly Leu Val Leu
305                 310                 315                 320 
Pro Lys Asp Tyr Thr Thr Ala Asn Arg Leu Met Val Ala Val His Asp
                325                 330                 335     
Tyr Asp Pro Tyr Asp Tyr Thr Leu Lys Asp Pro Leu Ile Arg Gln Trp
            340                 345                 350         
Gly His Thr Ala Asp Ala Asp Lys Arg Pro Ser Gly Asp Asn Glu Lys
        355                 360                 365             
Ala Val Val Asp Val Phe Asn Asn Leu Lys Ala Ala Tyr Leu Asp Lys
    370                 375                 380                 
Gly Ile Pro Val Tyr Leu Gly Glu Met Gly Cys Ser Arg His Thr Ala
385                 390                 395                 400 
Ala Asp Phe Pro Tyr Gln Lys Tyr Tyr Met Glu Tyr Phe Cys Lys Ala
                405                 410                 415     
Ala Ala Asp Arg Leu Leu Pro Met Tyr Leu Trp Asp Asn Gly Ala Lys
            420                 425                 430         
Gly Val Gly Ser Glu Arg His Ala Tyr Ile Asp His Gly Thr Gly Gln
        435                 440                 445             
Phe Val Asp Glu Asp Ala Arg Thr Leu Val Gly Leu Met Val Lys Ala
    450                 455                 460                 
Val Thr Thr Lys Asp Ala Ser Tyr Thr Leu Glu Ser Val Tyr Asn Ser
465                 470                 475                 480 
Ala Pro
        

<210> 8
<211> 1446
<212> DNA
<213> Unknown

<220> 
<223> unidentified microorganism from rumen contents of
      a New Zealand dairy cow, bovine rumen microflora

<220> 
<223> cellulase, Cel5AUI

<400> 8
attaacgtgc tgcaacatca cccggaagtc gaactgagcg tggataaaac gagtgtgtcc 60
tttaatcgtt ctggcggtga agaaaccttc acggttacca gctctaccca accgcatgtt 120
tcagcggatg tctcgtgggt ggttgtcgaa acgggcaaaa tcgataaaga ccatcacacc 180
gaagtccgtg tgctggccgg cgcaaaccgc aaagaagcga gcgcgggcac cctgaccgtg 240
tcatgctcgg ataaaaaagt tagcgtctct gtgaaacagg aagcctttgt ggcaccgagt 300
gttgcctcca ccacggcagt tacgccgcaa atggtcttcg acgcaatggg cccgggttgg 360
aacatgggca atcacatgga tgcgattagc aacggcgtgt ctggtgaaac cgtttggggt 420
aatccgaaat gcacgcaggc tacgatggat ggcgtcaaag cggcgggtta taaagccgtg 480
cgtatttgta ccacgtggga aggccacatc ggtgcagctc cggcctatgc actggaacag 540
aaatggctgg atcgcgtcgc cgaaattgtg ggctacgctg aaaaagcggg tctggtggca 600
atcgttaaca cccatcacga tgaatcatat tggcaagaca tctcgaaatg ctacaacaat 660
gcggccaatc atgaaaaagt gaaagacgaa gtctttagtg tgtggaccca gattgccgaa 720
aaattcaaag ataaaggcga atggctggtt tttgaaagtt tcaacgaaat ccaagatggc 780
ggttggggtt ggtccgacgc ttttcgtaaa aatccggatg cgcagtataa agtgctgaac 840
gaatggaatc aaaccttcgt tgatgctgtc cgtagcacgg gcggtcagaa cgcgacccgc 900
tggctgggca ttccgggtta tgcctgtaat ccgggcttta ccatcgcagg tctggttctg 960
ccgaaagatt ataccacggc taaccgcctg atggttgcgg tccatgatta cgacccgtat 1020
gattacacgc tgaaagaccc gctgattcgt cagtggggcc acaccgctga tgcggacaaa 1080
cgtccgtcag gtgacaatga aaaagcggtg gttgatgtgt tcaacaatct gaaagcagct 1140
tatctggaca aaggtatccc ggtttacctg ggcgaaatgg gttgctcgcg tcacaccgcg 1200
gcggatttcc cgtaccagaa atactacatg gaatacttct gtaaagcagc tgcggaccgc 1260
ctgctgccga tgtatctgtg ggataacggc gccaaaggcg tgggttcaga acgtcatgca 1320
tacattgacc acggcacggg tcaatttgtt gatgaagacg cccgcaccct ggtcggtctg 1380
atggtgaaag ccgttaccac gaaagatgca tcttataccc tggaaagtgt ttacaattcc 1440
gcgccg                                                            1446

<210> 9
<211> 312
<212> PRT
<213> Dictyoglomus turgidum

<220> 
<223> Dictyoglomus turgidum strain DSM 6724 glycoside
      hydrolase family 5, Cel5BDtu, locus Dtur_0670

<400> 9
Met Asn Asn Leu Pro Ile Lys Arg Gly Ile Asn Phe Gly Asp Ala Leu
 1               5                  10                  15      
Glu Ala Pro Tyr Glu Gly Ala Trp Ser Gly Tyr Ile Ile Lys Asp Glu
            20                  25                  30          
Tyr Phe Lys Ile Val Lys Asp Ala Gly Phe Asp His Val Arg Ile Pro
        35                  40                  45              
Ile Lys Trp Ser Val Tyr Thr Gln Lys Glu Ala Pro Tyr Ser Ile Glu
    50                  55                  60                  
Lys Arg Ile Phe Asp Arg Val Asp His Leu Ile Glu Glu Gly Leu Lys
65                  70                  75                  80  
Asn Asn Leu His Val Ile Ile Asn Ile His His Tyr Glu Glu Ile Met
                85                  90                  95      
Glu Asp Pro Leu Gly Glu Lys Glu Arg Phe Leu Ala Ile Trp Arg Gln
            100                 105                 110         
Ile Ser Glu His Tyr Lys Asp Tyr Pro Asn Asn Leu Tyr Phe Glu Leu
        115                 120                 125             
Leu Asn Glu Pro Thr Gln Asn Leu Ser Ser Glu Leu Trp Asn Gln Phe
    130                 135                 140                 
Leu Lys Glu Ala Ile Glu Val Ile Arg Arg Thr Asn Pro Glu Arg Lys
145                 150                 155                 160 
Ile Ile Val Gly Pro Asp Asn Trp Asn Ser Leu Tyr Asn Leu Glu Lys
                165                 170                 175     
Leu Ile Ile Pro Glu Asn Asp Glu Asn Ile Ile Ile Thr Phe His Tyr
            180                 185                 190         
Tyr Asn Pro Phe Pro Phe Thr His Gln Gly Ala Gly Trp Val Lys Ile
        195                 200                 205             
Asp Leu Pro Val Gly Val Lys Trp Leu Gly Thr Glu Glu Glu Lys Arg
    210                 215                 220                 
Glu Ile Glu Arg Glu Leu Asp Met Ala Val Ser Trp Ala Glu Glu His
225                 230                 235                 240 
Gly Asn Ile Pro Leu Tyr Met Gly Glu Phe Gly Ala Tyr Ser Lys Ala
                245                 250                 255     
Asp Met Glu Ser Arg Val Arg Trp Thr Asp Phe Val Ala Arg Ser Ala
            260                 265                 270         
Glu Lys Arg Gly Ile Ala Trp Ser Tyr Trp Glu Phe Tyr Ser Gly Phe
        275                 280                 285             
Gly Val Phe Asp Pro Glu Lys Asn Glu Trp Arg Thr Pro Leu Leu Arg
    290                 295                 300                 
Ala Leu Ile Pro Glu Arg Asn Ile
305                 310         


<210> 10
<211> 936
<212> DNA
<213> Dictyoglomus turgidum

<220> 
<223> Dictyoglomus turgidum strain DSM 6724 glycoside
      hydrolase family 5, Cel5BDtu, locus Dtur_0670

<400> 10
atgaacaatc tgccgatcaa acgtggcatt aacttcggtg atgcgctgga agccccgtat 60
gaaggcgcgt ggagcggtta catcatcaaa gacgaatact tcaaaatcgt taaagatgcc 120
ggcttcgacc atgtccgcat cccgattaaa tggagcgtgt atacccagaa agaagcaccg 180
tactctatcg aaaaacgtat tttcgatcgc gtggaccatc tgattgaaga aggcctgaaa 240
aacaacctgc acgttatcat caacatccat cactacgaag aaatcatgga agatccgctg 300
ggtgaaaaag aacgttttct ggcgatctgg cgccaaatta gcgaacacta taaagactac 360
ccgaacaatc tgtacttcga actgctgaac gaaccgaccc agaatctgag cagcgaactg 420
tggaaccaat ttctgaaaga agccatcgaa gttattcgtc gcacgaatcc ggaacgtaaa 480
attatcgtcg gtccggataa ctggaacagc ctgtataacc tggaaaaact gattatcccg 540
gaaaacgacg aaaacatcat catcaccttc cattactaca atccgtttcc gttcacgcac 600
cagggtgcag gttgggtcaa aattgatctg ccggtgggcg ttaaatggct gggtacggaa 660
gaagaaaaac gtgaaatcga acgcgaactg gatatggccg tgagttgggc cgaagaacat 720
ggcaacattc cgctgtatat gggcgaattt ggtgcataca gtaaagctga tatggaatcc 780
cgtgtccgct ggaccgactt cgtggcacgt tccgctgaaa aacgcggtat tgcatggtca 840
tattgggaat tttactcggg ctttggtgtt ttcgatccgg agaaaaacga atggcgtacg 900
ccgctgctgc gcgctctgat cccggaacgc aatatt                           936

<210> 11
<211> 366
<212> PRT
<213> Clostridium thermocellum

<220> 
<223> endo-beta-1,4-glucanase (partial), Cel5CCth, celE

<400> 11
Ser Gly Thr Lys Leu Leu Asp Ala Ser Gly Asn Glu Leu Val Met Arg
 1               5                  10                  15      
Gly Met Arg Asp Ile Ser Ala Ile Asp Leu Val Lys Glu Ile Lys Ile
            20                  25                  30          
Gly Trp Asn Leu Gly Asn Thr Leu Asp Ala Pro Thr Glu Thr Ala Trp
        35                  40                  45              
Gly Asn Pro Arg Thr Thr Lys Ala Met Ile Glu Lys Val Arg Glu Met
    50                  55                  60                  
Gly Phe Asn Ala Val Arg Val Pro Val Thr Trp Asp Thr His Ile Gly
65                  70                  75                  80  
Pro Ala Pro Asp Tyr Lys Ile Asp Glu Ala Trp Leu Asn Arg Val Glu
                85                  90                  95      
Glu Val Val Asn Tyr Val Leu Asp Cys Gly Met Tyr Ala Ile Ile Asn
            100                 105                 110         
Leu His His Asp Asn Thr Trp Ile Ile Pro Thr Tyr Ala Asn Glu Gln
        115                 120                 125             
Arg Ser Lys Glu Lys Leu Val Lys Val Trp Glu Gln Ile Ala Thr Arg
    130                 135                 140                 
Phe Lys Asp Tyr Asp Asp His Leu Leu Phe Glu Thr Met Asn Glu Pro
145                 150                 155                 160 
Arg Glu Val Gly Ser Pro Met Glu Trp Met Gly Gly Thr Tyr Glu Asn
                165                 170                 175     
Arg Asp Val Ile Asn Arg Phe Asn Leu Ala Val Val Asn Thr Ile Arg
            180                 185                 190         
Ala Ser Gly Gly Asn Asn Asp Lys Arg Phe Ile Leu Val Pro Thr Asn
        195                 200                 205             
Ala Ala Thr Gly Leu Asp Val Ala Leu Asn Asp Leu Val Ile Pro Asn
    210                 215                 220                 
Asn Asp Ser Arg Val Ile Val Ser Ile His Ala Tyr Ser Pro Tyr Phe
225                 230                 235                 240 
Phe Ala Met Asp Val Asn Gly Thr Ser Tyr Trp Gly Ser Asp Tyr Asp
                245                 250                 255     
Lys Ala Ser Leu Thr Ser Glu Leu Asp Ala Ile Tyr Asn Arg Phe Val
            260                 265                 270         
Lys Asn Gly Arg Ala Val Ile Ile Gly Glu Phe Gly Thr Ile Asp Lys
        275                 280                 285             
Asn Asn Leu Ser Ser Arg Val Ala His Ala Glu His Tyr Ala Arg Glu
    290                 295                 300                 
Ala Val Ser Arg Gly Ile Ala Val Phe Trp Trp Asp Asn Gly Tyr Tyr
305                 310                 315                 320 
Asn Pro Gly Asp Ala Glu Thr Tyr Ala Leu Leu Asn Arg Lys Thr Leu
                325                 330                 335     
Ser Trp Tyr Tyr Pro Glu Ile Val Gln Ala Leu Met Arg Gly Ala Gly
            340                 345                 350         
Val Glu Pro Leu Val Ser Pro Thr Pro Thr Pro Thr Leu Met
        355                 360                 365     


<210> 12
<211> 1098
<212> DNA
<213> Clostridium thermocellum

<220> 
<223> endo-beta-1,4-glucanase (partial), Cel5CCth, celE

<400> 12
tccggcacca aactgctgga tgcgtcaggc aacgaactgg ttatgcgtgg tatgcgcgat 60
atttccgcca tcgacctggt caaagaaatt aaaatcggct ggaacctggg taataccctg 120
gatgcaccga ccgaaacggc atggggtaac ccgcgtacca cgaaagcaat gattgaaaaa 180
gtgcgtgaaa tgggcttcaa tgctgttcgc gtcccggtga cctgggatac gcatattggt 240
ccggcaccgg attataaaat cgacgaagcg tggctgaacc gcgtcgaaga agtggttaat 300
tatgtgctgg attgcggcat gtacgcaatt atcaacctgc atcacgacaa tacctggatt 360
atcccgacgt atgctaacga acagcgtagc aaagaaaaac tggttaaagt ctgggaacaa 420
attgcaaccc gctttaaaga ttacgatgac cacctgctgt tcgaaacgat gaatgaaccg 480
cgtgaagtgg gctcgccgat ggaatggatg ggcggcacct atgaaaaccg tgatgttatt 540
aaccgcttta atctggcagt cgtgaatacg atccgtgcga gcggcggcaa caatgacaaa 600
cgcttcattc tggtcccgac caacgcagcc acgggtctgg atgtcgcact gaatgacctg 660
gtgatcccga acaatgatag ccgcgtgatt gtttctatcc atgcgtatag tccgtacttt 720
ttcgcgatgg atgtgaacgg cacctcatat tggggttcgg attacgacaa agcgagcctg 780
acctccgaac tggatgccat ctacaaccgt ttcgttaaaa atggccgcgc ggtcattatc 840
ggcgaatttg gcaccatcga taaaaacaat ctgagcagcc gtgtggcgca tgctgaacac 900
tatgcgcgtg aagccgtgtc tcgcggtatt gccgtgtttt ggtgggataa cggctattac 960
aatccgggtg acgcagaaac ctacgctctg ctgaatcgca aaacgctgtc atggtattac 1020
ccggaaatcg tgcaagcgct gatgcgtggt gctggcgtgg aaccgctggt gtctccgacc 1080
ccgaccccga ccctgatg                                               1098


