                         SEQUENCE LISTING

<110>  Krichevsky, Alexander
 
<120>  PLANTS CAPABLE OF NITROGEN FIXATION

<130>  KRI0001-401-PC

<150>  61/991,103
<151>  2014-05-09

<150>  62/008,597
<151>  2014-06-06

<150>  62/091,046
<151>  2014-12-12

<160>  30    

<170>  PatentIn version 3.5

<210>  1
<211>  20
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(20)
<223>  St1-L subunit, partial sequence; Xaa is an unspecified or unknown
       amino acid

<400>  1

Ala Leu Pro Gln Thr Glu Leu Arg Pro Met Gly Lys Pro Ile Leu Arg 
1               5                   10                  15      


Lys Xaa Asp Pro 
            20  


<210>  2
<211>  30
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(30)
<223>  St1-M subunit, partial sequence

<400>  2

Met Phe Pro Asn Ala Phe Lys Tyr Glu Ala Pro Ala Ser Val Asp Glu 
1               5                   10                  15      


Ala Val Arg Leu Leu Ala Glu Tyr Gly Tyr Asp Gly Lys Val 
            20                  25                  30  


<210>  3
<211>  18
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(18)
<223>  St1-S subunit, partial sequence

<400>  3

Met Lys Ile Arg Val Lys Val Asn Gly Thr Leu Tyr Glu Ala Asp Val 
1               5                   10                  15      


Glu Pro 
        


<210>  4
<211>  33
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(33)
<223>  St2-D subunit, partial sequence

<400>  4

Met Phe Glu Leu Pro Pro Leu Pro Tyr Pro Tyr Asp Ala Leu Glu Pro 
1               5                   10                  15      


Tyr Phe Asp Ala Lys Lys Met Glu Ile His Tyr Tyr Gly Gly His Gly 
            20                  25                  30          


Ala 
    


<210>  5
<211>  2493
<212>  DNA
<213>  Streptomyces thermoautotrophicus


<220>
<221>  gene
<222>  (1)..(2493)
<223>  Streptomyces thermoautotrophicus strain St1 putative 
       Mo-hydroxylase (sdnL) gene

<400>  5
gtggcactgc cgcagactga actgcgcccg atgggcaaac cgattctccg caaggaggat       60

ccccggctga tccgcgggaa gggccggttt gtggacgaca tcctgttgcc gaatatgctc      120

catctttgca tcttgcggag cccgtacgcc cacgcccgca ttcgccgcat cgatacgtcg      180

aaagcagaag ccgcgccggg cgtcaagctg gtgctcacgg gagaagatct ggccaagatg      240

aacctcgcct ggatgccgac cttggcgggg gacgtgcaga tggtgctggc gacgggcaag      300

gtcctgttcc agtaccagga ggtcgcggcg gtcgtcgcgg agacgcgcgc ccaggccgag      360

gacgcgattc agctgatcga ggtcgactac gagcccctgc cggtggtggt cgatccgttc      420

aaggcgctgg agccggacgc gcccatcctc cgggaggaca aggagaaaaa gtcgaaccac      480

atctggcact gggaggcggg cgaccgggaa gagaccgacg cgatcttccg cgaagcgccg      540

gtcgtcgtca agcaggatgt gcgttttcag cgcgtccatc cctcgccgct tgaaccgtgc      600

ggctgcgtgg ccgactacaa cccggcgacg gggaagctcg tggtctacgt cacgtcgcag      660

gcgccgcacg tccaccggac ggcgatcgct ttgacgacgg gcttccccga acacatgatt      720

caggtcattt cgcccgatgt gggcggcggg ttcgggaaca aggtgcccct ctaccccggc      780

tacgtggtgg cgatcgtcgc ttccttgaag ctgggagtcc ccgtgaagtg gatcgagacg      840

cggacggaaa acatcgccag cacccacttc gcccgcgatt accacatgac ggcggagatc      900

gcggcgacgg aagacggcaa gatgctggcg ctccgcgtga agacgatcgc cgaccacggc      960

gcgttcgacg cgaccgccaa cccgaccaaa taccccgccg gattgtacag catcgtgacg     1020

gggtcgtacg acttcaaggc ggcgttcgtc gaagtggacg gtgttcatac gaacaaaccg     1080

ccgggcggtg tggcctaccg ctgctcgttc cgggtcacgg aagcctccta tctgattgaa     1140

cgcgtcgtgg acgttttggc ccgtcggctc aagatggatc cggccgagtt gcgcctgcgc     1200

aatttcattc gcaaggagca gttcccgtac cgcagcccga cgggatgggt gtacgacagc     1260

ggggattacg aaaagacgtt caagctcgcg ctggagcgca tcggatatga agagctgcgc     1320

aaggagcaga aggagaagtg ggcccgggga gaattcatgg gcatcggcat ctccaccttc     1380

acggagatcg tcggcgcggg tccggcgcac tccttcgata ttctcggcat caagatgttc     1440

gacagcgcgg agatccgcgt ccatccgacg ggcaaggtga tcgcccggct cggcgtgcgc     1500

catcagggac aggggcatga gacgacgttc gcccagatca tcgccgagga gctggggctc     1560

agcgtcgacg acgtcgtggt cgaagaaggc gataccgaca cggcccccta cgggttgggc     1620

acgtacgcca gccgttccac gccgacggcc ggggcggcgg cggccctctg tgcgcgccgg     1680

atccgggaca aggcgcgtaa gatcgcggcc catttgctgg aggtcaacga agacgacgtc     1740

gtctgggacg gcgccgcctt ttcggtcaag ggacttccgg gccgttcggt gacgatgaaa     1800

gatgtggcct ttgccgccta cacgaacgtg cccgacggca tcgagccggg cttggaggcg     1860

tcgtactact acaatccgcc gaacctcacc ttcccctacg gggcctacat cgccgtggtc     1920

gacatcgaca agggaacggg cgccgtgaag gtgcggcggt tcttggccgt cgacgattgc     1980

ggcaacgtga tcaatccgat gatcgtcgaa ggtcaggtgc acggcggcct gacggaagga     2040

tttgcgatcg cgttcatgca ggacatcccg tatgacgccg acggcaactg cctggcgccg     2100

aactggatgg actacctggt tcccaccgct tgggacacgc cccagctgga gacggatcgg     2160

acggtcacgc cctcgcctca ccatccgctt ggcgccaaag gggtcggcga gtcgcccaac     2220

gtcggttcgc cggcggcgtt cgtcaatgcg gtgctggacg cgctgtcgcc gctcggcgta     2280

gaacacatcg acatgccgat ctatccgtgg aaggtgtgga agatcttgcg ggacacggca     2340

ttacggagtg attcgatggc cattcctgcg tcattccaga gcgcgaggag ggaaaagccc     2400

ggaggcggta tagcctccgg gcccatcaaa tggacaacct ctgggagaca gcgagggcgt     2460

tggatgaacg cgcggagcct tacgtctggg tga                                  2493


<210>  6
<211>  516
<212>  DNA
<213>  Streptomyces thermoautotrophicus


<220>
<221>  gene
<222>  (1)..(516)
<223>  Streptomyces thermoautotrophicus strain St1 putative 
       2Fe2S-binding dinitrogenase (sdnS) gene

<400>  6
atgaagatcc gggtcaaagt caacgggacg ctgtacgagg cggacgtgga accgcggacg       60

cttctggcgt actttctgcg cgaggaattg aagttgacgg gcacgcacat cggctgcgac      120

acgaccacct gcggagcttg cacggtgctt ttggacggga aggcggtcaa gtcgtgcacg      180

gtcctcgcgg tgcaggcgaa cggacgcgag gtcatgacgg tcgaagggct ggaaaaagac      240

ggccagctgc atcctctgca agtcgcgttc tgggaagaac acgcgcttca ttgcggatat      300

tgcacgcccg gtatgttgat ggcctcttac gcgctgttgc aagaaaatcc gatgcccacc      360

gaggaagaga ttcgttttgg attgtccggg aacgtctgcc gttgcaccgg ttacatgaac      420

atcgtcaagg ccgttcaatc cgcggcgcgc aggctttccg gcgcgtccgg cgaagccgtt      480

ggggaggtgg cgaccagtgg cactgccgca gactga                                516


<210>  7
<211>  885
<212>  DNA
<213>  Streptomyces thermoautotrophicus


<220>
<221>  gene
<222>  (1)..(885)
<223>  Streptomyces thermoautotrophicus strain St1 putative 
       dinitrogenase (sdnM) gene

<400>  7
gtgtttccca atgcgttcaa gtacgaggcg ccggcatcgg tcgacgaggc cgtccgtctg       60

ctggccgagt acggctacga cggaaaggtg ttggcgggcg ggcagagctt gctcccgatg      120

atgaagctgc gcgtcgcggc gccggccgtg ctcatcgaca tcaacggcat cgatgcgctc      180

caggggtggc gcgaggtcga cgggaaactg cgggtgggcg cgatgacgcg ccacgccgaa      240

ctggagcatg ccaaagagct ccgcgacacg tatccgctgt ttttccagac ggcccgatgg      300

atcgccgatc cgctcatccg caaccgcggg accatcggag gctcgctcgc gcacgccgat      360

cccggctccg actggggggc ggcgatgatc gcgcttcggg ccgaagtgga agcgcgaggc      420

ccccagggaa gccggctcat tcccatcgac gaattttttg tcgatacgtt tgcaaccgct      480

ttaaatgaag acgaactcgc cgtcgcggtg cacgtgccga cgccgaaggg gccggcggcc      540

tcccggtata tgaagctgga gcgccgggcg ggcgatttcg ccatcgccgc gctcgccgtc      600

cacgtcgccc tcggaaccga cggccgcgtg tccgaagccg gcatcggcat ttgcgcgtgc      660

ggtccgatcc ccctccgggc agccaaagcg gaggcggcgc tcatcggccg gccgctgacg      720

gaagaggtca tcgtcgaggc gtcgaggctg gttccggaag atgccgagcc cgccgacgat      780

ctgcgaggaa gcgcggaata taagcgcgac gtgttgcgcg tgtttgccgc gcgcgccctc      840

cgcgacatcg ccaaagagct gcaaggaaag gtggggatcc aatga                      885


<210>  8
<211>  624
<212>  DNA
<213>  Streptomyces thermoautotrophicus


<220>
<221>  gene
<222>  (1)..(624)
<223>  Streptomyces thermoautotrophicus strain UBT1 superoxide 
       oxidoreductase (sdnO) gene

<400>  8
atgttcgaac tgccgccgct tccgtacccc tacgacgcgc tggagccgta ttttgacgcc       60

aagacgatgg aaattcacta caacgggcac cacggcgctt acgtcaagaa cctgaacgcc      120

gccctcgaaa aatatcccgc atggcaaaat aagccgattg aagagctgct tcagtccctc      180

gaccaactgc cggaagacat ccggacggcg gtccggaaca acggcggggg ccactacaac      240

cacagcttct ggtggccgat gctgaagaaa aacgaagggg gccagccggt cggcaagttt      300

gccgaagcga tcaaccggga cttcggcagc tttgaggcct ttaaggacgc cttttccaag      360

gcggcggcgg gacggttcgg aagcggctgg gcgtgggtcg tcgtcgaacc ggatgggaag      420

ctcaccgtca cgacgacgcc gaaccaggac aacccggtca tggaagggaa gacggtcgtc      480

ttcggcctcg acgtctggga gcacgcctac tacctgaagt atcagaaccg gcggccggag      540

tacatccagg cgttctggaa cgtcgtcaac tgggacgtcg tcaacgagcg gtacgaagaa      600

gcgctgaaaa agttcgggcg gtaa                                             624


<210>  9
<211>  24206
<212>  DNA
<213>  Klebsiella pneumoniae


<220>
<221>  misc_feature
<222>  (1)..(24206)
<223>  Klebsiella pneumoniae DNA for nif gene cluster

<400>  9
ggtaacccgc tacggcttga gattatccgc atccttgccg acggcagcga gcagagctgt       60

aacgccctgc gtcacgaaga tgtggcgaag tcgaccatga cccaccactg gcgcgtcctg      120

cgcgacagcg gtgtgatctg gcagcgccca caggggcggg agaacttgat ttcgctgcgc      180

cgggaagatt tagacgcgcg ctttcccggc ctgctggata cgctgcttaa ggtcatgcag      240

caggagaact aaaggcccgc tactcctcgc cggccagccg ccgatactgg gcaaagcggg      300

cccgcgcgtc ctcctcggtt cggctaaaga gcgcatccgc cagatgcggc gtcgttttgt      360

gcagcgaggc gtagcgcact tcgccaagca aaaagtcgcg gaagctctcc tccggctctt      420

cggaatcgag cataaacggc gtcttacctt ccgcttcccg ctgcggatga tagcgccaca      480

ggtgccagta tcccgcctca accgcccgtt tcgcctcgcg ctggctgcag cgcataccgg      540

ctttcagccc gtggttaatg caggcggcgt aggcaatcac cagcgacggt cccggccagg      600

cttcggcctc ggcgatcgcc cgtagggtct gatctttatc agcgcccatc gcgacctggg      660

ccacgtacac attgccgtag ctcatcgcca tcatgccgag atcttttttc cgcgtgcgtt      720

tgccctgcgc ggcaaacttc gcgatggccg ccaccggggt cgatttagac gactggccgc      780

cggtattgga gtaaacctcg gtgtcaaaca ccagaatatt gacgtcttcc ccgctcgcca      840

gcacgtgatc gagaccgccg aagccgatat cgtaggccca gccgtcgccg ccgaaaatcc      900

actgcgaacg acgaacaaaa tagtcgcggt tctgccacag ctgctccaac agcggcacgc      960

cctctttttc cgccgccagc cgttcgctga gccggtccgc gcgctcgcgg gtgccctcgc     1020

cttcatcctg cttcgccagc cactggcgca ttgcgtcgct aagttcgtcg ctgaccggta     1080

gcgccagcgc ggcggtcata tcatcggcga tttgttgacg caccgcctgg ccgccgagca     1140

tcatgccgag gccaaactcc gcattatcct caaacagcga gttcgcccat gccgggccat     1200

ggccgcggtg gttggtggta tagggaatcg acggcgcgct ggctccccag atagaagagc     1260

agccggtggc gttagcgatc agcatccggt cgccaaacag ctgggttatc aggcgggcat     1320

aaggcgtttc accgcatccc gcgcaggcgc cggaaaactc cagcagcggg gtttcaaact     1380

ggctgccttt gaccgtcgtc ttacgaaacg gattgctctt cggcgtcagc gccagcgcat     1440

agtcccagac cggcgccatc tgacgctggc tatcgagaga ctgcattttt aacgccttgc     1500

cgcgcgcggg acagatatcc acgcagttgc cgcagccgga acaatccagc ggcgagatag     1560

ccagatggta gtgatactcc ttcgctccct gcgcgggttt gctcagcagc ccaaccggcg     1620

cggcgtcatg ctcttcgccg ttgagcagcg ccgggcggat cgccgcatgc gggcagataa     1680

aggcgcactg gttacactgc gtgcagccct ccggctgcca gaccggcact tccagcgcga     1740

tcccgcgttt ctcccacgcg gcggtgcccg aaggaaaggt cccgtcctcc ataccgacga     1800

acgcgctcac cggcagctgg tcgccgcact ggcggttcat cggctgcaga atatcgcgga     1860

tgaaatccgg catcatggct gatgcttgcg ccgcgggttc atccagcgtc gcccagtgcg     1920

ccggaatcgt cacctgatgc agcgaggcca tgcccagctc gatcgcccgc tggttcatct     1980

caatcaccgc cgcccctttg ctgccgtagc ttttttcaac cgcctgcttg aggtaatccg     2040

ccgcggtctg cgggtcgata atcgccgcca gcttaaagaa cgccgcctgc atcagcatat     2100

taaagcgccc gcccagcccg agctcgcggg cgatatccac ggcgttcagg gtataaaaat     2160

ggatattttc ccgcgccaga tagcgtttaa agccgaccgg cagatgctgc tccagctccg     2220

catcggacca gctgcagttg agtaaaaagg tcccgcccgg ctttaatccg tccagcagat     2280

cgtagcgctc aacgtaggac tgctgcgaac aggagataaa atcggcccga tggatcaggt     2340

agggcgaatt gatcggccgg tcgccgaagc gtaaatgtga aacggtaatg ccgccggatt     2400

ttttcgagtc ataagaaaag taggcctgcg cgtagagcgg cgttttatcg ccgataattt     2460

tgatcgcgct tttattggcc ccgacggtgc cgtccgagcc catgccccaa aatttacagg     2520

cggtgatgcc gtcatgcgag accgccagcg tctgctgggc cggcggtaac gaagtaaagg     2580

ttacatcatc gacaatcccg agggtaaacc cgtccatcgg cagcggttta ttgaggttat     2640

caaagacggc cgcgatatcg ttgggcagaa catccttccc gccaagcgca tagcggccgc     2700

cgacgattag cggcgcatcg tcgtggtggt agaaggcgtt tttcacatcc aggcacagcg     2760

gttcagcctg agcgccgggc tctttggtac ggtcaaggac ggcaatccgc tgcacggttt     2820

tcggcagctg ggcgaagaag tgggccagcg aaaaagggcg aaacagatgc acgctgagca     2880

gcccgacctt ctctcccgcc gcgttcagcg tatccaccac ttcctgaacg gtatcgcaga     2940

ccgatcccat tgcgataatc acccgttcgg catccgccgc gccggtatag ttaaacagat     3000

gatactcccg gccggtgagc gcgctgattt gcgtcatata gctttcgaca atgtcgggca     3060

gcgcctgata aaaacggttg cccgcctccc gctcctggaa gtagatatcc gggttctgcg     3120

ccgttccgcg gatgaccgga tgatccggat gcagcgcgtt acggcggaag ctgtcgagcg     3180

cgggccggtc cagcagcgtc gccagctgct catattccaa cacctcgatt ttttgaattt     3240

cgtgcgaggt gcgaaaaccg tcgaagaagt taacaaacgg gatgcgtccc ttaatcgccg     3300

ccagatgcgc caccgccgac aaatccatca cctgctgcac gttgttctcc gccagcatcg     3360

cgcagccggt ctggcggacc gccatcacat cctggtgatc gccaaaaata ttcagcgaat     3420

tggtcgccag cgcccgggcg ctgacgtgaa agacgcccgg cagcagttca ccggcgattt     3480

tgtacatgtt ggggatcatc agcagcagcc cctgggaggc cgtataggtg gtggtgagcg     3540

ccccggcctg cagcgcgccg tggaccgcgc ctgccgcgcc ggcctccgac tgcatctcca     3600

ttaagcgcac cggctggcca aaaaggttct ttttcccctg cgccgcccac tcgtcgacgt     3660

tttccgccat cggcgtggag ggggttatgg ggtaaatcgc cgcgacctcg gtaaaggcat     3720

aagagatcca ggccgccgcg gcgttgccat ccattgtttt catttttccg gacattgttc     3780

aatcctcgaa ggtgagaggc atcttcgccg cctcaaataa gcggcaaacc cagttgttgc     3840

ctcaagcaca gcctgtgcca gctcgcggat gacagaagag ttagcgcgaa ttcaacgcgt     3900

tatgaagaga gtcgccgcgc agcgcgccaa gagattgcgt ggaataagac acagggggcg     3960

acaagctgtt gaacaggcga caaagcgccc atggccccgg caggcgcaat tgttctgttt     4020

cccacatttg gtcgccttat tgtgccgttt tgttttacgt cctgcgcggc gacaaataac     4080

taacttcata aaaatcataa gaatacataa acaggcacgg ctggtatgtt ccctgcactt     4140

ctctgctggc aaacactcaa caacaggaga agtcaccatg accatgcgtc aatgcgctat     4200

ttacggtaaa ggcggtatcg gtaaatccac caccacgcag aacctcgtcg ccgcgctggc     4260

ggagatgggt aagaaagtga tgatcgtcgg ctgcgatccg aaggcggact ccacccgtct     4320

gattctgcac gccaaagcac agaacaccat tatggagatg gccgcggaag tcggctcggt     4380

cgaggacctc gaactcgaag acgtgctgca aattggctac ggcgatgtgc gctgcgcgga     4440

atccggcggc ccggagccag gcgtcggctg cgcgggacgc ggcgtgatca cggcgatcaa     4500

ctttcttgaa gaagaaggcg cctacgagga cgatctcgat ttcgtgttct atgacgtgct     4560

cggcgacgtg gtctgcggcg gcttcgccat gccgatccgc gaaaacaaag cccaggagat     4620

ctacatcgtc tgctccggcg aaatgatggc gatgtacgcg gccaacaata tctccaaagg     4680

gatcgttaaa tacgccaaat ccggcaaggt gcgcctcggc ggcctgatct gtaactcacg     4740

tcagaccgac cgtgaagacg aactgattat tgccctggcg gaaaagctcg gtacccagat     4800

gatccacttt gtgccccgcg acaacatcgt gcagcgcgcg gagatccgcc gcatgacggt     4860

tatcgagtac gaccccgcct gtaaacaggc caacgaatac cgcaccctgg cgcagaagat     4920

cgtcaacaac accatgaaag tggtgccgac gccctgcacc atggatgagc tggaatcgct     4980

gctgatggag ttcggcatca tggaagagga agacaccagc atcattggca aaaccgccgc     5040

cgaagaaaac gcggcctgag cacaggacaa ttatgatgac caacgcaacg ggcgaacgta     5100

atctggcgct gatccaggaa gtcctggagg tgttcccgga aaccgcgcga aaagagcgca     5160

gaaagcacat gatggtcagc gatccgaaaa tgaagagcgt cggcaagtgc attatctcta     5220

accgcaaatc acaacccggc gtaatgaccg tacgcggctg cgcctacgcc ggttccaaag     5280

gggtggtatt tgggccgatt aaggatatgg cccatatttc gcacggaccg gctggctgcg     5340

gccagtattc ccgcgccgaa cgacgcaact actacaccgg agtcagcggc gtcgatagct     5400

tcggcacgct gaacttcacc tctgattttc aggagcgcga catcgtcttc ggcggcgata     5460

aaaagctcag caagctgatt gaagagatgg agttgctgtt cccgctcacc aaagggatca     5520

ccattcagtc ggaatgcccg gtggggctga tcggtgatga tatcagcgcg gtggccaacg     5580

ccagcagcaa ggcgctggat aaaccggtga tcccggtacg ctgcgaaggc tttcgcggcg     5640

tgtcgcagtc tctggggcac catatcgcca acgacgtggt gcgcgactgg atcctgaaca     5700

atcgcgaagg acagccgttt gaaaccaccc cttacgatgt ggcgatcatc ggcgactaca     5760

acatcggcgg cgacgcctgg gcctcgcgca ttctgctgga agagatgggg ctacgggtag     5820

tcgcgcagtg gtccggcgac ggcacgctgg tggagatgga gaatacccca ttcgtcaagc     5880

tgaacctggt tcactgctac cgttcgatga actatatcgc ccgccatatg gaggagaaac     5940

atcagattcc gtggatggag tacaacttct tcgggccgac caaaatcgcc gaatcgctgc     6000

gcaaaatcgc cgaccagttc gacgatacca ttcgcgcgaa cgccgaagcg gtgatcgccc     6060

ggtatgaggg gcagatggcg gcgattatcg ccaaatatcg cccgcgcctg gaggggcgta     6120

aggtgctgct ctatatcgga ggcctgcggc cgcgccacgt tattggcgcc tatgaggatc     6180

tcgggatgga gatcatcgcc gccggctacg agtttgccca taacgatgat tacgaccgca     6240

ccctgccgga tctgaaagag ggcacgctgc tgttcgatga cgccagcagc tacgagctgg     6300

aagcgttcgt caaggcgctg aagcccgacc ttatcggctc cggcatcaag gaaaaatata     6360

tcttccagaa aatgggcgtg ccgttccgcc agatgcactc gtgggactat tccggcccgt     6420

accacggcta cgatggtttc gccattttcg cccgcgatat ggatatgacc ctgaacaacc     6480

cggcgtggaa cgaactgacc gctccgtggc tgaagtctgc gtgattgccc actcactgtc     6540

ccgtctgttc accgatttgt ggcgcgggag gagaacacca tgagccaaac gattgataaa     6600

attaatagct gttatccgct attcgaacag gatgaatacc aggagctgtt ccgcaataag     6660

cggcagctgg aagaggcgca cgatgcgcag cgcgtgcagg aggtctttgc ctggaccacc     6720

accgccgagt atgaagcgct gaatttccga cgcgaggcgc tgaccgttga cccggcgaaa     6780

gcctgccagc cgcttggcgc ggtgctttgc tcgctgggat ttgccaacac cctgccgtat     6840

gtgcacggct ctcaggggtg cgtggcctac tttcgcacct attttaaccg ccatttcaaa     6900

gagccgatcg cctgcgtctc cgactcgatg accgaagacg cggcggtctt cggcggcaac     6960

aacaatatga acctgggcct gcagaacgcc agcgcgctgt acaaaccgga gatcattgcg     7020

gtgtccacca cctgcatggc ggaagttatc ggcgatgacc tgcaggcgtt tatcgccaac     7080

gctaaaaaag atggcttcgt cgacagcagc atcgccgtgc cccacgccca tacgccaagc     7140

tttatcggca gccacgtcac cggctgggat aacatgtttg aaggcttcgc caaaaccttc     7200

actgcggact accaggggca gccgggcaaa ttgccgaagc tcaatctggt gaccggcttt     7260

gaaacctatc tcggcaactt ccgcgtatta aagcggatga tggaacagat ggcggtgccg     7320

tgcagcctgc tctccgatcc gtcggaagtt ctcgacacgc ccgccgacgg tcactatcgg     7380

atgtattccg gcggcaccac gcagcaggag atgaaagagg cccctgacgc catcgatacg     7440

ctgctcctgc agccgtggca gctgctgaag agcaaaaaag tggtgcagga gatgtggaac     7500

cagcccgcca ccgaggtcgc cattccgctg gggctggccg ccaccgatga actgctgatg     7560

accgtcagcc agcttagcgg caagccgatt gccgacgccc tcacccttga gcgcggccgg     7620

ctggttgaca tgatgctcga ctcccacacc tggctgcacg gcaagaagtt tggcctgtac     7680

ggcgatccgg acttcgtgat gggcctcacc cgcttcctgc tggagctggg ctgcgagcca     7740

acggtgatcc tgagccataa cgccaacaaa cgctggcaaa aagcgatgaa caaaatgctc     7800

gatgcctcgc cgtacgggcg cgatagcgaa gtgtttatca actgcgattt gtggcacttc     7860

cgttcgctga tgttcacccg tcagccggac tttatgatcg gcaactccta cggcaagttt     7920

atccagcgcg ataccctggc gaagggtaaa gcctttgaag tgccgcttat ccgcctcggc     7980

tttccgctgt tcgaccgcca ccatctgcac cgccagacaa cctggggtta tgaaggggcg     8040

atgaacattg tgacgacgct ggtgaacgcc gtgctggaga aactggatag cgataccagc     8100

cagctgggca aaaccgatta cagcttcgat ctcgtccgtt aaccatcagg tgccccgcgt     8160

catgcggcgg caggagggag tatgcccatc gtgattttcc gtgagcgcgg cgcggacctg     8220

tacgcctata tcgcgaaaca ggatctggaa gcgcgagtga tccagattga gcataacgac     8280

gctgaacgct ggggcggcgc gatttcgctg gaggggggac gccgctacta cgtgcatccg     8340

cagccggggc gtcccgtctt tccgataagc ctgcgcgcga cgcgcaatac cttgatataa     8400

ggagctagtg atgtccgaca acgataccct attctggcgt atgctggcgc tgtttcagtc     8460

tctgccggac ctacagccgg cgcaaatcgt cgactggctg gcgcaggaga gcggcgagac     8520

gctgacgcca gagcgtctgg cgaccctgac ccagccgcag ctggccgcca gctttccctc     8580

cgcgacggcg gtgatgtccc ccgctcgctg gtcgcgggtg atggcgagcc tgcagggcgc     8640

gctgcccgcc catttacgca tcgttcgccc tgcccagcgc acgccgcagc tgctggcggc     8700

attttgctcc caggatgggc tggtgattaa cggccatttc ggccagggac gactgttttt     8760

tatctacgcg ttcgatgaac aaggcggctg gttgtacgat ctgcgccgct atccctccgc     8820

cccccaccag caggaggcca acgaagtgcg cgcccggctt attgaggact gtcagctgct     8880

gttttgccag gagataggcg ggcccgccgc cgcgcggccg atccgccatc gcatccaccc     8940

gatgaaagcg cagcccggga cgacgattca ggcacagtgc gaggcgatca atacgctgct     9000

ggccggccgt ttgccgccgt ggctggcgaa gcggcttaac agggataacc ctctggaaga     9060

acgcgttttt taatccctgt tttgtgcttg ttgcccgctg accccgcggg ctttttttcg     9120

cgtatggacg ctcttcccca cgttacgctc aggggaatat tccgttcacg gttgttccgg     9180

gcttcttgat gcgcctaacc ccctcgctgc cagcctttca tcaacaaata gccatcccag     9240

cgcgataggt cataaagcat cacatgccgc catcccttgt ccgattgttg gctttgtcgc     9300

aaagccaaca acctcttttc tttaaaaatc aaggctccgc tctggagcgc gaattgcatc     9360

ttccccctca tcccccaccg tcaacgaggt cactatgaag ggaaatgaaa ttctggcgct     9420

gctggatgaa ccggcctgtg aacacaacca taaacaaaaa tccggctgca gcgcgcccaa     9480

acccggcgcc accgccgcgg gctgcgcgtt cgacggcgcg cagataaccc tgctgcccat     9540

cgccgacgtg gcgcatctgg tccacggccc catcggctgc gccggaagct catgggataa     9600

ccgcggcagc gccagctccg gccccaccct taatcggctc gggttcacca ccgatctcaa     9660

cgaacaggac gtgattatgg gccgcggcga acgccgactg tttcacgccg tgcgccatat     9720

cgtcacccgc tatcatccgg cggcggtctt tatctacaac acctgcgtac cggccatgga     9780

gggcgatgac ctggaagcgg tatgccaggc cgcgcagacc gccaccggcg taccggttat     9840

cgctattgac gccgccggtt tctacggcag taaaaatctc ggtaaccggc cggcgggcga     9900

cgtcatggtc aaacgggtca tcggccagcg cgagcccgcc ccctggccgg agagcacgct     9960

ctttgccccg gagcagcgtc acgatattgg cctgattggc gaattcaata ttgccggcga    10020

gttctggcat attcagccgc tgctcgacga actggggatc cgcgtgctcg gcagcctctc    10080

cggtgatggc cgcttcgccg agatccagac catgcaccgg gcgcaggcca atatgctggt    10140

ctgctcgcgg gcgttaatta acgtcgccag agccctggag cagcgctacg gcacgccgtg    10200

gttcgaaggc agcttttacg ggatccgcgc cacctctgac gccctgcgcc agctggcggc    10260

gctgctgggc gacgacgacc ttcgccagcg caccgaagcg ctgattgcgc gggaggaaca    10320

ggcggcggaa ctggcgctac agccgtggcg cgaacagctg cgcggccgca aagcgctgct    10380

ctataccggc ggggtgaaat cctggtcggt ggtatcggcg ctgcaggatt tgggcatgac    10440

cgtggtggca accggcacgc gtaaatccac cgaagaggat aaacagcgga tccgcgagct    10500

gatgggcgaa gaggcggtaa tgctggaaga gggcaacgcc cgcacgctgc tggatgtggt    10560

ctatcgctat caggccgacc tgatgattgc cggcggacgc aatatgtaca ccgcctataa    10620

agccaggctg ccgtttctcg atatcaatca ggagcgcgaa cacgccttcg ctggctatca    10680

ggggatcgtc accctcgccc gccagctgtg tcagaccatc aacagcccca tctggccgca    10740

aacccattct cgcgccccgt ggcgctaagg agctcaccat ggcagacatt ttccgcaccg    10800

ataagccgct ggcggtcagc cccatcaaaa ccggccagcc gctcggcgca atcctcgcca    10860

gcctcgggat cgaacacagc atccctctgg tccacggcgc gcaggggtgc agcgccttcg    10920

ccaaagtctt ttttattcaa catttccacg acccggttcc cctgcagtcg acggcgatgg    10980

accccacgtc gacgattatg ggcgcggacg gcaatatttt taccgccctg gataccctct    11040

gccagcgcaa caatccgcag gctatcgtac tgctcagcac cgggctgtcg gaggcccagg    11100

gcagcgatat ttcccgcgtg gttcgccagt ttcgcgaaga gtatccccgg cataaggggg    11160

tggcgatatt gacggttaac acgccggatt tttatggctc catggagaac ggcttcagcg    11220

cggtgttaga gagcgtcatt gagcagtggg tgccgccggc gccgcgcccg gctcagcgca    11280

atcgccgggt caatctgctg gtcagccatc tctgttcgcc gggcgatatc gagtggctgc    11340

gccgatgcgt cgaagccttt ggtctgcagc cgataatcct gccggacctg gcgcaatcga    11400

tggacggcca cctggcgcag ggcgatttct cgccgctgac ccagggcggg acgccgctgc    11460

gccagataga gcagatgggg caaagcctgt gcagcttcgc cattggcgtc tcccttcatc    11520

gcgcctcatc gctgctggcc ccgcgctgcc gcggcgaggt tatcgccctg ccgcacctga    11580

tgaccctcga acgctgcgac gcctttattc atcaactggc gaaaatttcc ggacgcgccg    11640

ttcccgagtg gctggaacgc cagcgcggcc agctacagga tgcgatgatc gactgccata    11700

tgtggctcca gggccagcgc atggcgatag cggcggaagg cgatttgctg gcggcgtggt    11760

gtgatttcgc caacagccag gggatgcagc ccggcccgct ggtggcccct accggtcatc    11820

ccagcctgcg ccagctgccg gtggaacggg tggtgccggg ggatctggag gatctgcaaa    11880

ccctgctgtg cgcgcatccc gccgacctgc tggtggcgaa ctcgcacgcc cgcgacctgg    11940

cggagcagtt tgcgctgccg ctggtgcgcg cgggttttcc gctctttgac aagctcggcg    12000

aattccgccg ggtgcgacag gggtatagcg ggatgcgcga tacgctgttt gagctggcaa    12060

acctgatacg cgagcgtcac caccacctcg cccactaccg atcgccgctg cgccagaacc    12120

ccgaatcgtc actctccaca ggaggcgctt atgccgccga ttaaccgtca gtttgatatg    12180

gtccactccg atgagtggtc tatgaaggtc gccttcgcca gctccgacta tcgtcacgtc    12240

gatcagcact tcggcgctac cccgcggctg gtggtgtacg gcgtcaaggc ggatcgggtc    12300

actctcatcc gggtggttga tttctcggtc gagaacggcc accagacgga gaagatcgcc    12360

aggcggatcc acgccctgga ggattgcgtc acgctgttct gcgtggcgat tggcgacgcg    12420

gtttttcgcc agctgttgca ggtgggcgtg cgtgccgaac gcgttcccgc cgacaccacc    12480

atcgtcggct tactgcagga gattcagctc tactggtacg acaaagggca gcgcaaaaat    12540

cagcgccagc gcgacccgga gcgctttacc cgtctgctgc aggagcagga gtggcatggg    12600

gatccggacc cgcgccgcta gccgtgtcgt ttctgtgaca aagcccacaa aacatcgcga    12660

cactgtagga cgaaccttgt caggactaat acacaaccat ttgaaaaata ttaattttat    12720

tctctggtat cgcaattgct agttcgttat cgccaccgcg cttccgcggt gaaccgcgcc    12780

ccggcgtttt ccgtcaacat ccctggagct gacagcatgt ggaattactc cgagaaagtg    12840

aaagaccatt tttttaaccc ccgcaatgcg cgcgtggtgg acaacgccaa cgcggtaggc    12900

gacgtcggtt cgttaagctg cggcgacgcc ctgcgcctga tgctgcgcgt cgacccgcaa    12960

agcgaaatca ttgaggaggc gggcttccag accttcggct gcggcagcgc catcgcctcc    13020

tcctccgcgc tgacggagct gattatcggc cataccctcg ccgaagccgg gcagataacc    13080

aatcagcaga ttgccgatta tctcgacgga ctgccgccgg agaaaatgca ctgctcggtg    13140

atgggccagg aggccctgcg cgcggccatc gccaactttc gcggcgaaag ccttgaagag    13200

gagcacgacg agggcaagct gatctgcaaa tgcttcggcg tcgatgaagg gcatattcgc    13260

cgcgcggtac agaacaacgg gctgaccacc cttgccgagg tgatcaacta caccaaagcg    13320

ggcggcggct gcacctcttg ccacgaaaaa atcgagctgg ccctggcgga gatcctcgcc    13380

cagcagccgc agacgacgcc agccgtggcc agcggcaaag atccgcactg gcagagcgtc    13440

gtcgatacca tcgcagaact gcggccgcat attcaggccg acggcggcga tatggcgcta    13500

ctcagcgtca ccaaccacca ggtgaccgtc agcctctccg gcagctgtag cggctgcatg    13560

atgaccgata tgaccctggc ctggctgcag caaaaactga tggaacgtac cggctgttat    13620

atggaagtgg tggcggcctg agccggcgtt aactgaccca agggggacaa gatgaaacag    13680

gtttatctcg ataacaacgc caccacccgt ctggacccga tggtcctgga agcgatgatg    13740

ccctttttga ccgattttta cggcaacccc tcgtcgatac acgattttgg cattccggcc    13800

caggcggctc tggaacgcgc gcatcagcag gctgcggcgc tgctgggcgc ggagtatccc    13860

agcgagatca tctttacctc ctgcgccacc gaagccaccg ccaccgccat cgcctcggcg    13920

atcgccctgc tgcctgagcg tcgcgaaatc atcaccagcg tggtcgaaca tccggcgacg    13980

ctggcggcct gcgagcacat ggagcgcgag ggctaccgga ttcatcgcat cgcggtagat    14040

ggcgaggggg cgctggacat ggcgcagttc cgcgcggcgc tcagcccgcg cgtcgcgttg    14100

gtcagcgtga tgtgggcgaa taacgaaacc ggggtgcttt tcccgatcgg cgaaatggcg    14160

gagctggccc atgaacaagg ggcgctgttt cactgcgatg cggtgcaggt ggtcgggaaa    14220

ataccgatcg ccgtgggcca gacccgcatc gatatgctct cctgctcggc gcataagttc    14280

cacgggccaa aaggcgtagg ctgtctttat ctgcggcggg gaacgcgctt tcgcccgctg    14340

ctgcgcggcg gtcaccagga gtacggtcgg cgagccggga cagaaaatat ctgcggaatc    14400

gtcggcatgg gcgcggcctg cgagctggcg aatattcatc tgccgggaat gacgcatatc    14460

ggccaattgc gcaacaggct ggagcatcgc ctgctggcca gcgtgccgtc ggtcatggtg    14520

atgggcggcg gccagccggc ggtgcccggc acggtgaatc tggcctttga gtttattgaa    14580

ggtgaagcca ttctgctgct gttaaaccag gccgggatcg ccgcctccag cggcagcgcc    14640

tgcacctcag gctcgctgga accctcccac gtgatgcggg cgatgaatat cccctacacc    14700

gccgcccacg gcaccatccg cttttctctc tcgcgctaca cccgggagaa agagatcgat    14760

tacgtcgtcg ccacgctgcc gccgattatc gaccggctgc gcgcgctgtc gccctactgg    14820

cagaacggca agccgcgccc ggcggacgcc gtattcacgc cggtttacgg ctaaggcgga    14880

ggtggctgat ggaacgcgtg ctgattaacg ataccaccct gcgcgacggc gagcagagcc    14940

ccggcgtcgc ctttcgcacc agcgaaaagg tcgccattgc cgaggcgctt tacgccgcag    15000

gaataacggc gatggaggtc ggcaccccgg cgatgggcga cgaggagatc gcgcggatcc    15060

agctggtgcg tcgccagctg cccgacgcga ccctgatgac ctggtgtcgg atgaacgcgc    15120

tggagatccg ccagagcgcc gatctgggca tcgactgggt ggatatctcg attccggctt    15180

cggataagct gcggcagtac aaactgcgcg agccgctggc ggtgctgctg gagcggctgg    15240

cgatgtttat ccatcttgcg cataccctcg gcctgaaggt atgcatcggc tgcgaggacg    15300

cctcgcgggc cagcggccag accctgcgcg ctatcgccga ggtcgcgcag caatgcgccg    15360

ccgcccgcct gcgctatgcc gatacggtcg gcctgctcga cccttttacc accgcggcgc    15420

aaatctcggc cctgcgcgac gtctggtccg gcgaaatcga aatgcatgcc cataacgatc    15480

tgggtatggc gaccgccaat acgctggcgg cggtaagcgc cggggccacc agcgtgaata    15540

cgacggtcct cggtctcggc gagcgggcgg gcaacgcggc gctggaaacc gtcgcgctgg    15600

gccttgaacg ctgcctgggc gtggagaccg gcgtgcattt ttcggcgctg cccgcgtcct    15660

gtcagagggt cgcggaagcc gcgcagcgcg ccatcgaccc gcagcagccg ctggtcggcg    15720

agctggtgtt tacccatgag tcaggtgtcc acgtggcggc gctgctgcgg cacagcgaga    15780

gctaccagtc catcgcccct tccctgatgg gccgcagcta ccggctggtg ctgggcaaac    15840

actccgggcg tcaggcggtc aacggcgttt ttgaccagat gggctatcac ctcaacgccg    15900

cgcagattaa ccagctgctg cccgccatcc gccgcttcgc cgagaactgg aagcgcagcc    15960

cgaaagatta cgagctggtg gctatctacg acgagctgtg cggtgaatcc gctctgcggg    16020

cgagggggta atgatggagt ggttttatca aattcccggc gtggacgaac ttcgctccgc    16080

cgaatctttt tttcagtttt tcgccgtccc ctatcagccc gagctgcttg gccgctgcag    16140

cctgccggtg ctggcaacgt ttcatcgcaa actccgcgcg gaggtgccgc tgcaaaaccg    16200

gctcgaggat aacgaccgcg cgccctggct gctggcgcga agactgctcg cggagagcta    16260

tcagcaacag tttcaggaga gcggaacatg agaccgaaat tcacctttag cgaagaggtc    16320

cgcgtcgtac gcgcgattcg taacgacggc accgtggcgg gcttcgcgcc cggcgcgctg    16380

ctggtcaggc gcggcagcac cggctttgtg cgcgactggg gcgttttttt gcaagatcag    16440

attatctacc agatccactt tccggaaacc gatcggatca tcggctgccg cgagcaggag    16500

ctgatcccca tcacccagcc gtggctggcc ggaaatttgc aatacaggga tagcgtgacc    16560

tgccagatgg cgctcgcggt caacggcgat gtggtcgtga gcgccggcca gcggggacgc    16620

gttgaggcta ccgatcgggg agagctcggc gacagctaca ccgtcgactt tagcggccgc    16680

tggttcaggg tcccggtgca ggccatcgcc cttatagagg aaagagaaga atgaacccgt    16740

ggcaacgttt tgcccggcag cggctggcgc gcagccgctg gaatcgcgat ccggcggccc    16800

tggatccggc cgacacgccg gcttttgaac aggcctggca acgccagtgc catatggagc    16860

agacgatcgt cgcgcgggtc cctgaaggcg atattccggc ggcgttgctg gagaatatcg    16920

ctgcctccct tgccatctgg ctcgacgagg gggattttgc gccgcccgag cgcgctgcca    16980

tcgtgcgcca tcacgcccgg ctggaactcg ccttcgccga tatcgcccgc caggcgccgc    17040

agccggatct ctccacggta caggcatggt atctgcgcca ccagacgcag tttatgcgcc    17100

cggaacagcg tctgacccgc catttactgc tgacggtcga taacgaccgc gaagccgtgc    17160

accagcggat cctcggcctg tatcggcaaa tcaacgcctc gcgggacgct ttcgcgccgc    17220

tggcccagcg ccattcccac tgcccgagcg cgctggaaga gggtcgttta ggctggatta    17280

gccgtggcct gctctatccg cagctcgaga ccgcgctgtt ttcactggcg gaaaacgcgc    17340

taagccttcc catcgccagc gaactgggct ggcatctttt atggtgcgaa gcgattcgcc    17400

ccgccgcgcc catggagccg cagcaggcgc tggagagcgc gcgcgattat ctttggcagc    17460

agagccagca gcgccatcag cgccagtggc tggaacagat gatttcccgt cagccgggac    17520

tgtgcgggta gcctcggcgg ctacccgtta acgcctacag cacggtgcgt ttaatctcct    17580

caagccagct cgccagacgc gcttcggtct ggtcgaactg gttatcctga tccagcacca    17640

gcccaacaaa gcggtcgcct tccagcgccg aggacgcgct gaattcataa ccctcatttg    17700

gccagctgcc aatcatctgc gcgccgcgcg cgctcagggc gtcgaacagc gggcgcatcc    17760

cgctgacgaa gttgtccgga tagcctctct gatcgccgag gccgaacagc gccacggttt    17820

tccctttcag gctggcgtcg tcgaggccgc tgataaattc gctccatgac tcgctttcgc    17880

atccggcctc cagccccggc agctggccgt cgccgagcgt cggcgtgccc agcagcagca    17940

ccggataggc cataaagtcg tccagcgtcg tgcggttaat gttgaccggg gcatccgcca    18000

gctcgcccag ttgcttatgg atcattttcg cgattttgcg ggttttaccg gtatcggtgc    18060

caaagaaaat accaatgttc gccatgttgc gctcctgtcg gaaaaggggg ttgaaaatac    18120

gcgttctcgc aggggtattg cgaaggctgt gccaggttgc tttgcactac cgcggcccat    18180

ccctgcccca aaacgatcgc ttcagccctc tcccgccgcg cgcggcgggg ctggcggggc    18240

gcttaaaatg caaaaagcgc ctgcttttcc cctaccggat caatgtttct gcacatcacg    18300

ccgataaggg cgcacggttt gcatggttat caccgttcgg aaaacaccgc ggcgtccctg    18360

tcacggtgtc ggacaaattg tcataactgc gacacaggag tttgcgatga ccctgaatat    18420

gatgctcgat aacgccgtac ccgaggcgat tgccggtgcg ctgactcaac aacatccggg    18480

gctgtttttt acaatggtcg aacaggcatc ggtagcgatt tccctcaccg atgcccgggc    18540

gaatattacc tacgccaacc cggcgttttg ccgccagact ggatactcgc tggcgcaatt    18600

gctcaatcaa aacccgcgcc tgctggccag cagccagacg ccgcgcgaga tctaccagga    18660

gatgtggcaa accctgctcc agcgccagcc gtggcgcggt cagctaatta atcaggcccg    18720

cgacggcggc ctgtatctgg tagatatcga tatcacgccg gtgctgaatc cgcagggcga    18780

gctggagcat tatctggcga tgcagcggga tatcagcgtc agctataccc tggaacagcg    18840

gctgcgcaat catatgacgc taatggaagc ggtgctcaat aacatccccg ccgccgtggt    18900

cgtggtcgat gagcaggatc gggtggtgat ggataatctc gcctacaaaa cgttctgcgc    18960

ggactgcggc gggaaagagc tgctggtcga gctccaggtt tccccgcgca aaatggggcc    19020

cggcgcggag caaatcctgc cggtggtggt tcgcggcgcg gtccgctggc tgtcggtaac    19080

ctgctgggcg ctgcccggcg tgagtgaaga agccagccgc tacttcgtcg acagcgcccc    19140

ggcgcgcacg ctgatggtga tcgccgactg tacccagcag cgccagcagc aggagcaggg    19200

ccggctcgac cgtctgaaac agcaaatgac cgccggtaag ctgctggccg cgattcgcga    19260

gtcgctggac gcggcgctga ttcagcttaa ttgcccaatc aatatgctgg cggcggcccg    19320

ccggctgaac ggcgaaggca gcggcaacgt ggcgctggac gcggcgtggc gcgaaggtga    19380

agaggccatg gcgcgcctgc agcgctgccg cccttctctt gagctggaaa gcaatgccgt    19440

ctggccgctt cagccctttt ttgacgacct gtacgccctc taccgcaccc gctttgacga    19500

tcgcgcgcgg ctgcaggtgg acatggcatc gccgcatctg gtcggcttcg gccagcgtac    19560

ccagctgctg gcctgcttga gtttatggct cgaccggacg ctggccctcg ccgccgagct    19620

gccctccgta ccgctggaga tcgagcttta cgccgaagag gacgagggct ggctctcttt    19680

gtatctcaac gacaatgtcc cgctgctgca ggtgcgctac gcccactccc ccgatgccct    19740

aaactctccc ggcaaaggga tggagctgcg gctgatccaa acgctggtcg cctaccaccg    19800

cggcgcgatt gaactggctt cgcgaccgca gggaggcacc agcctggttc tgcgtttccc    19860

gctctttaat accctgaccg gaggtgagca atgatccata aatccgattc ggacaccacc    19920

gtcagacgtt tcgatctctc ccagcagttt accgccatgc agcggataag cgtggtcctg    19980

agtcgcgcca ccgaagcgag caaaaccctg caggaggttc tgagcgtgct acataacgat    20040

gcctttatgc agcacgggat gatttgcctg tacgacagcc agcaggagat cctgagcatc    20100

gaagcgctgc agcaaacgga agatcagacg ctgcccggca gtacgcaaat tcgctaccgg    20160

ccgggggaag gattagtcgg taccgtgctg gcgcagggcc agtcgctggt gctgccgcgc    20220

gtcgccgacg accagcgttt tctcgatcgt ctgagcctgt acgactatga cctgccgttt    20280

atcgccgttc cgctgatggg cccccactcc cggcccatcg gcgtactggc ggcgcacgcg    20340

atggcgcgtc aggaagagcg gctgcccgcc tgcacgcgct ttctcgaaac cgtcgccaat    20400

ctgatcgccc agacgattcg cctgatgatc ctgccaacct ccgccgcgca ggcgccgcag    20460

cagagcccca gaatagagcg cccgcgcgcc tgtacccctt cgcgcggttt cggcctggaa    20520

aatatggtcg gtaaaagccc ggcgatgcgg cagattatgg atattattcg tcaggtttcc    20580

cgctgggata ccacggtgct ggtacgcggc gagagcggca ccgggaaaga gctcatcgcc    20640

aacgccatcc accataattc tccgcgcgcc gccgcggcgt tcgtcaaatt taactgcgcg    20700

gcgctgccgg acaacctgct ggagagcgag ctgtttggtc atgagaaagg cgcgtttacc    20760

ggcgcggtgc gccagcggaa aggccgcttt gagctggcgg acggcggcac cttattcctc    20820

gatgagatcg gcgaaagcag cgcctcgttt caggctaagc tactgcgtat tctgcaagag    20880

ggggagatgg agcgcgtcgg cggcgacgaa accctgcggg tcaacgtgcg cattatcgcg    20940

gcgaccaacc gccatctgga agaggaggtg cggctgggtc atttccgcga ggatctatac    21000

taccgcctga acgtaatgcc tatcgcgctg ccgccgctgc gcgagcgcca ggaggatatc    21060

gccgagctgg cgcactttct ggtgcgaaaa atcgcccaca gccaggggcg aacgctgcgc    21120

atcagcgatg gggcgattcg cctgctgatg gagtacagct ggccgggaaa cgtgcgcgaa    21180

ctggaaaact gtctcgaacg ttcggcggtg ctgtcggaaa gcggcctgat agaccgggac    21240

gtgattctgt tcaaccatcg cgataacccg ccgaaagcgc tcgccagcag cggcccggcg    21300

gaggacggct ggctcgataa cagcctcgac gagcgccagc ggctgatcgc cgccctggaa    21360

aaagcgggct gggtgcaggc caaagcggcg cggctgctcg gcatgacccc gcgccaggtg    21420

gcgtatcgca ttcagattat ggatatcacc atgccgcgac tgtgaagcct tatgtgagat    21480

tcaggacatt gtcgccagcg cggcggaatt gcgacaattc agggacgcgg gttgccggtt    21540

aaaaagtcta cttttcatgc ggttgcgaaa ttaacctctg gtacagcatt tgcagcagga    21600

aggtatcgcc caaccacgaa ggtacgacca tgacttcctg ctcctctttt tctggcggca    21660

aagcctgccg cccggcggat gacagcgcat tgacgccgct tgtggccgat aaagctgccg    21720

cgcacccctg ctactctcgc catgggcatc accgtttcgc gcggatgcat ctgcccgtcg    21780

cgcccgcctg caatttgcag tgcaactact gtaatcgcaa attcgattgc agcaacgagt    21840

cccgccccgg ggtatcgtca acgctgctga cgcctgaaca ggcggtcgtg aaagtgcgtc    21900

aggtcgcgca ggcgatcccg cagctttcgg tggtgggcat cgccgggccc ggcgatccgc    21960

tcgccaatat cgcccgcacc tttcgcaccc tggagctgat ccgcgaacag ctgccggacc    22020

tgaaattatg cctgtcgacc aacggactgg tgctgcctga cgcggtggac cgcctgctgg    22080

atgtcggcgt tgaccacgtc acggtcacca ttaacaccct cgacgcggag attgccgcgc    22140

aaatctacgc ctggctatgg ctggacggcg aacgctacag cgggcgcgaa gcgggagaga    22200

tcctgattgc ccgtcagctt gagggcgtac gcaggctgac cgccaaaggc gtgctggtga    22260

aaataaattc ggtgctgatc cccggtatca acgatagcgg catggccggc gtgagccgcg    22320

cgctgcgggc cagcggcgcg tttatccata atattatgcc gctgatcgcc aggccggagc    22380

acggcacggt gtttggcctc aacggccagc cggagccgga cgccgagacg ctcgccgcca    22440

cccgcagccg gtgcggcgaa gtgatgccgc agatgaccca ctgccaccag tgtcgcgccg    22500

acgccattgg gatgctcggc gaagaccgca gccagcagtt tacccagctt ccggcgccag    22560

agagtctccc ggcctggctg ccgatcctcc accagcgcgc gcagctgcac gccagcattg    22620

cgacccgcgg cgaatctgaa gccgatgacg cctgcctggt cgccgtggcg tcaagccgcg    22680

gggacgtcat tgattgtcac tttggtcacg ccgaccggtt ctacatttac agcctctcgg    22740

ccgccggtat ggtgctggtc aacgagcgct ttacgcccaa atattgtcag gggcgcgatg    22800

actgcgagcc gcaggataac gcagcccggt ttgcggcgat cctcgaactg ctggcggacg    22860

ttaaagccgt attctgcgtg cgtatcggcc atacgccgtg gcaacagctg gaacaggaag    22920

gcattgaacc ctgcgttgac ggcgcgtggc ggccggtctc cgaagtgctg cccgcgtggt    22980

ggcaacagcg tcgggggagc tggcctgccg cgttgccgca taagggggtc gcctgatgcc    23040

gccgctcgac tggttgcggc gcttatggct gctgtaccac gcggggaaag gcagctttcc    23100

gctgcgcatg gggcttagcc cgcgcgattg gcaggcgctg cggcggcgcc tgggcgaggt    23160

ggaaacgccg ctcgacggcg agacgctcac ccgtcgccgc ctgatggcgg agctcaacgc    23220

cacccgcgaa gaggagcgcc agcagctggg cgcctggctg gcgggctgga tgcagcagga    23280

tgccgggccg atggcgcaga ttatcgccga ggtttcgctg gcgtttaacc atctctggca    23340

ggatcttggt ctggcatcgc gcgccgaatt gcgcctgctg atgagcgact gctttccaca    23400

gctggtggtg atgaacgaac acaatatgcg ctggaaaaag ttcttttatc gtcagcgctg    23460

tttgctgcaa cagggggaag ttatctgccg ttcgccaagc tgcgacgagt gctgggaacg    23520

cagcgcctgt tttgagtagc cgtttcccga agggggcgct gcaaacaaaa agccggaggt    23580

ttccctccgg cttttcacat catcaaatgt gattatgcga cgtcttcgta ctgcggcacc    23640

gggttgcgga agcttttggt cacgcaggcc tccgtagacc agaccaatac cgccccagat    23700

caggccgaga accatggagc tctcttcgag gttaatccac agtgcgccga cggtcagcgc    23760

gccgcagacc ggcagaatca gatagttgaa gtggtctttc agcgttttgt tgcgcttttc    23820

acggatccag aactgggaga tcaccgacag gttaacgaag gtgaacgcca ccagcgcgcc    23880

gaggttaatc ggcgccgtcg ccgtgacgag gtcgagttta atcgccagca gcgcgatcgc    23940

gcaaccagca gcacgttcca tgccggagta cgccgtttcg gatgcacgta gccgaagaaa    24000

cgcgtcggga acacgccgtc gcggcccatc acgtacatca gacgggaaac gcccgcgtgc    24060

gcggccgtgc cggatgccag tacggtaacg ctggagaaaa tcagcacgcc ccactggaag    24120

gttttgcccg ccacgtacag catgatttca ggctgcgagg cgtccggatc tttgaagcgc    24180

gagatgtccg ggaagtacag ctgcag                                         24206


<210>  10
<211>  4983
<212>  DNA
<213>  Azotobacter vinelandii


<220>
<221>  misc_feature
<222>  (1)..(4983)
<223>  Azotobacter vinelandii nifHDK gene cluster; n is a, c, g, or t

<400>  10
cccgggccca gatagggaac gatgtcgccc gagccgagct gggcgaggat ttcctttaat       60

aagctgtcgg tcactgaact ctcctgctga gggaagggca agaatcgaca ccttattgca      120

ataagtgtgc caagatttcg ttgtttaact aattgaattt aaaagaaatc attggtgatt      180

tcggaatggc ttgtcgtatc cgtgggccag gatggggcgt ggcttcacga caattgtcag      240

ttttgtcaca gggggccgga ccaggatggt ggacgctcga tggggatgtc gggccattgt      300

tcggttgtag caattacaca catgtcggag tagggggatt gtgaggggga ttgttgtgta      360

tcaccccctg cagctcccgt cgatggataa ttaatcattt aaaatcaatg gtttatttat      420

gtgttgcggg tgctggcaca gacgctgcat tacctttggt gcgcggagtt gttcgggctt      480

acggccgaac gttcaagtgg aaatgcaacc tgaggaaatt aactatggct atgcgtcaat      540

gcgccatcta cggcaaaggt ggtatcggta agtccaccac tactcagaac ctggtggcag      600

ccctggctga gatgggcaag aaggtcatga tcgttggttg tgacccgaaa gctgactcca      660

cccgcctgat cctgcactcc aaggcccaga acaccatcat ggaaatggct gccgaagccg      720

gtaccgtgga agatctggag ctggaagacg tgctgaaggc tggctacggc ggcgtcaagt      780

gcgttgagtc cggtggtccg gagccgggcg ttggctgcgc cggccgtggt gttatcacag      840

caatcaactt cctggaagag gaaggcgcct acgaagacga tctggacttc gtattctacg      900

acgtcctggg cgacgtggtg tgtggcggct tcgccatgcc gatccgcgag aacaagcccc      960

aagaaatcta catcgtctgc tccggtgaga tgatggccat gtacgccgcc aacaacatct     1020

ccaagggcat cgtgaagtat gccaactccg gcagcgtgcg tctgggcggc ctgatctgca     1080

acagccgtaa caccgaccgc gaagacgagc tgatcatcgc tctggccaac aagctgggca     1140

cccagatgat ccacttcgtg ccgcgtgaca acgtcgtgca gcgcgccgaa atccgccgca     1200

tgaccgtgat cgaatacgat ccgaaagcca agcaagccga cgaataccgc gctctggccc     1260

gcaaggtcgt cgacaacaaa ctgctggtca tcccgaaccc gatcaccatg gacgagctcg     1320

aagagctgct gatggaattc ggtatcatgg aagtcgaaga cgaatccatc gtcggcaaaa     1380

ccgccgaaga agtctgatag ccgctccggt ttcagaagga ctttacaggg cagattggct     1440

ctgtcggggt ggcgcccccc gcattgggcg ggcgcccacc cgttacccgc attatgaacg     1500

ctaaggcaag aggagtcata cccatgaccc gtatgtcgcg cgaagaggtt gaatccctca     1560

tccaggaagt tctggaagtt tatcccgaga aggctcgcaa ggatcgtaac aagcacctgg     1620

ccgtcaacga cccggcggtt acccagtcca agaagtgcat catctccaac aagaagtccc     1680

agcccggtct gatgaccatc cgcggctgcg cctacgccgg ttccaaaggc gtggtctggg     1740

gccccatcaa ggacatgatc cacatctccc acggtccggt aggctgcggc cagtattcgc     1800

gcgccggccg tcgtaactac tacatcggta ccaccggtgt gaacgccttc gtcaccatga     1860

acttcacctc ggacttccag gagaaggaca tcgtgttcgg tggcgacaag aagctcgcca     1920

aactgatcga cgaagtggaa accctgttcc cgctgaacaa gggtatctcc gtccagtccg     1980

agtgcccgat cggcctgatc ggcgacgaca tcgaatccgt gtccaaggtc aagggcgccg     2040

agctcagcaa gaccatcgta ccggtccgtt gcgaaggctt ccgcggcgtt tgccagtccc     2100

tgggccacca catcgccaac gacgcagtcc gcgactgggt cctgggcaag cgtgacgccg     2160

acaccacctt cgccagcact ccttacgatg tggccatcat cggcgactac aacatcggcg     2220

gcgacgcctg gtcttcccgc atcctgctgg aagaaatggg cctgcgttgc gtagcccagt     2280

ggtccggcga cggctacatc tcccaaatcg agctgacccc gaaggtcaag ctgaacctgg     2340

ttcactgcta ccgctcgatg aactacatct cccgtcacat ggaagagaag tacggtatcc     2400

catggatgga gtacaacttc ttcggcccga ccaagaccat cgagtcgctg cgtgccatcg     2460

ccgccaagtt cgacgagagc atccagaaga agtgcgaaga ggtcatcgcc aagtacaagc     2520

ccgagtggga agcggtggtc gccaagtacc gtccgcgcct ggaaggcaag cgcgtcatgc     2580

tctacatcgg tggcctgcgt ccgcgccacg tgatcggcgc ctacgaagac ctgggcatgg     2640

aagtggtggg taccggctac gagttcgccc acaacgacga ctatgaccgg accatgaaag     2700

aaatgggtga ctccaccctg ctgtacgatg acgtgaccgg catggaattc gaagaattcg     2760

tcaagcgcat caagcccgac ctgatcggct ccggtatcaa ggagaagttc atcttccaga     2820

agatgggcat ccccttccgt caaatgcact cctgggatta ttccggcccc taccacggct     2880

tcgatggctt cgccatcttc gcccgtgaca tggacatgac cctgaacaat ccgtgctgga     2940

agaaactgca ggctccctgg gaagcttccg aaggcgccga gaaagtcgcc gccagcgcct     3000

gatagcagag caatcgtacg caacgtccgc tgcgggcggt ttccgccggc cgacattccg     3060

ctaacgccgt tcacagatga gtgaggcgta ggagagagtc atgagccagc aagtcgataa     3120

aatcaaagcc agctacccgc tgttcctcga tcaggactac aaggacatgc ttgccaagaa     3180

gcgcgacggc ttcgaggaaa agtatccgca ggacaagatc gacgaagtat tccagtggac     3240

caccaccaag gaataccagg agctgaactt ccagcgcgaa gccctgaccg tcaacccggc     3300

caaggcttgc cagccgctgg gcgccgttct ctgcgccctc ggtttcgaga agaccatgcc     3360

ctacgtgcac ggttcccagg gttgcgtcgc ctacttccgc tcctacttga accgtcattt     3420

ccgcgagccg gtttcctgcg tttccgactc catgaccgaa gacgcggcag tgttcggcgg     3480

ccagcagaac atgaaggacg gtctgcagaa ctgtaaggct acctacaagc ccgacatgat     3540

cgcagtgtcc accacctgca tggccgaggt catcggtgac gacctcaacg ccttcatcaa     3600

caactcgaag aaggaaggtt tcattcctga cgagttcccg gtgccgttcg cccatacccc     3660

gagcttcgtg ggcagccacg tgaccggctg ggacaacatg ttcgaaggca ttgctcgcta     3720

cttcaccctg aagtccatgg acgacaaggt ggttggcagc aacaagaaga tcaacatcgt     3780

ccccggcttc gagacctacc tgggcaactt ccgcgtgatc aagcgcatgc tttcggaaat     3840

gggcgtgggc tacagcctgc tctccgatcc ggaagaagtg ctggacaccc cggctgacgg     3900

ccagttccgc atgtacgcgg gcggcaccac tcaggaagag atgaaggacg ctccgaacgc     3960

cctcaacacc gtcctgctgc agccgtggca cctngagaag accaagaagt tcgtcgaggg     4020

tacctggaag cacgaagtac cgaagctgaa catcccgatg ggcctggact ggaccgacga     4080

gttcctgatg aaagtcagcg aaatcagcgg ccagccgatt ccggcgagcc tgaccaagga     4140

gcgtggccgt ctggtcgaca tgatgaccga ctcccacacc tggctgcacg gcaagcgttt     4200

cgccctgtgg ggtgatccgg acttcgtgat gggcctggtc aagttcctgc tggaactggg     4260

ttgcgagccg gtacacattc tctgccacaa cggcaacaag cgttggaaga aggcggtcga     4320

cgccatcctc gccgcttcgc cctacggcaa gaatgctacc gtctacatcg gcaaggacct     4380

gtggcacctg cgttcgctgg tcttcaccga caagccggac ttcatgatcg gcaacagcta     4440

cggtaagttc atccagcgcg acaccctgca caagggcaag gagttcgagg ttccgctgat     4500

ccgtatcggc ttcccgatct tcgaccgtca tcacctgcat cgctccacca ccctgggtta     4560

cgagggcgcc atgcagatcc tgaccaccct ggtgaactcg atcctggaac gtctggacga     4620

ggaaacccgc ggtatgcagg ccaccgacta caaccacgac ctggtacgct aagtcgtcgg     4680

ttcaagtggt atcggccgga gcggcgcaag ctgctctccc ttggcggcgg ccgcaggtgg     4740

tcgggccttt tgcccgcgat ctgcggcaac cgccaaaccc gtctaaggag caagcccatg     4800

cccagcgtca tgattcgccg caacgacgaa ggccaactga ccttctatat cgccaagaaa     4860

gaccaggaag agatcgtggt gtccctggag catgacagcc ccgaactctg gggtggcgaa     4920

gtcaccctcg gcgacggttc gacctatttc atcgagccga taccgcaacc caagctgccg     4980

atc                                                                   4983


<210>  11
<211>  150
<212>  DNA
<213>  Nicotiana tabacum


<220>
<221>  promoter
<222>  (1)..(150)
<223>  Chloroplast Prrn promoter

<400>  11
gctctagttg gatttgctcc cccgccgtcg ttcaatgaga atggataaga ggctcgtggg       60

attgacgtga gggggcaggg atggctatat ttctgggagc gaactccggg cgaatttgaa      120

gcgcttggat acagttgtag ggagggatcc                                       150


<210>  12
<211>  210
<212>  DNA
<213>  Cauliflower Mosaic Virus


<220>
<221>  promoter
<222>  (1)..(210)
<223>  Cauliflower Mosaic Virus 35S promoter

<400>  12
tccactgacg taagggatga cgcacaatcc cactatcctt cgcaagaccc ttcctctata       60

taaggaagtt catttcattt ggagaggaca cgctgaaatc accagtctct ctctacaaat      120

ctatctctct ctattttctc cataataatg tgtgagtagt tcccagataa gggaattagg      180

gttcttatag ggtttcgctc acgtgttgag                                       210


<210>  13
<211>  395
<212>  DNA
<213>  Nicotiana tabacum


<220>
<221>  terminator
<222>  (1)..(395)
<223>  Chloroplast psbA terminator

<400>  13
gatcctggcc tagtctatag gaggttttga aaagaaagga gcaataatca ttttcttgtt       60

ctatcaagag ggtgctattg ctcctttctt tttttctttt tatttattta ctagtatttt      120

acttacatag acttttttgt ttacattata gaaaaagaag gagaggttat tttcttgcat      180

ttattcatga ttgagtattc tattttgatt ttgtatttgt ttaaaattgt agaaatagaa      240

cttgtttctc ttcttgctaa tgttactata tctttttgat tttttttttc caaaaaaaaa      300

atcaaatttt gacttcttct tatctcttat ctttgaatat ctcttatctt tgaaataata      360

atatcattga aataagaaag aagagctata ttcga                                 395


<210>  14
<211>  210
<212>  DNA
<213>  Cauliflower Mosaic Virus


<220>
<221>  terminator
<222>  (1)..(210)
<223>  Cauliflower Mosaic Virus 35S terminator

<400>  14
gtccgcaaaa atcaccagtc tctctctaca aatctatctc tctctatttt tctccagaat       60

aatgtgtgag tagttcccag ataagggaat tagggttctt atagggtttc gctcatgtgt      120

tgagcatata agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa      180

tttctaattc ctaaaaccaa aatccagtga                                       210


<210>  15
<211>  792
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Drug resistance gene


<220>
<221>  gene
<222>  (1)..(792)
<223>  Spectinomycin resistance gene aadA

<400>  15
atgggggaag cggtgatcgc cgaagtatcg actcaactat cagaggtagt tggcgtcatc       60

gagcgccatc tcgaaccgac gttgctggcc gtacatttgt acggctccgc agtggatggc      120

ggcctgaagc cacacagtga tattgatttg ctggttacgg tgaccgtaag gcttgatgaa      180

acaacgcggc gagctttgat caacgacctt ttggaaactt cggcttcccc tggagagagc      240

gagattctcc gcgctgtaga agtcaccatt gttgtgcacg acgacatcat tccgtggcgt      300

tatccagcta agcgcgaact gcaatttgga gaatggcagc gcaatgacat tcttgcaggt      360

atcttcgagc cagccacgat cgacattgat ctggctatct tgctgacaaa agcaagagaa      420

catagcgttg ccttggtagg tccagcggcg gaggaactct ttgatccggt tcttgaacag      480

gatctatttg aggcgctaaa tgaaacctta acgctatgga actcgccgcc cgactgggct      540

ggcgatgagc gaaatgtagt gcttacgttg tcccgcattt ggtacagcgc agtaaccggc      600

aaaatcgcgc cgaaggatgt cgctgccgac tgggcaatgg agcgcctgcc ggcccagtat      660

cagcccgtca tacttgaagc tagacaggct tatcttggac aagaagaaga tcgcttggcc      720

tcgcgcgcag atcagttgga agaatttgtc cactacgtga aaggcgagat caccaaggta      780

gtcggcaaat aa                                                          792


<210>  16
<211>  2686
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid sequence


<220>
<221>  misc_feature
<222>  (1)..(2686)
<223>  pUC19

<400>  16
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt acccggggat      420

cctctagagt cgacctgcag gcatgcaagc ttggcgtaat catggtcata gctgtttcct      480

gtgtgaaatt gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt      540

aaagcctggg gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc      600

gctttccagt cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg      660

agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg      720

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca      780

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac      840

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac      900

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg      960

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac     1020

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat     1080

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag     1140

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac     1200

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt     1260

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt     1320

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc     1380

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga     1440

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac     1500

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc     1560

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct     1620

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca     1680

tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct     1740

ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca     1800

ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc     1860

atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg     1920

cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct     1980

tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa     2040

aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta     2100

tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc     2160

ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg     2220

agttgctctt gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa     2280

gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg     2340

agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc     2400

accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg     2460

gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat     2520

cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata     2580

ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc     2640

atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtc                    2686


<210>  17
<211>  224
<212>  DNA
<213>  Nicotiana tabacum


<220>
<221>  promoter
<222>  (1)..(224)
<223>  Chloroplast psbA promoter

<400>  17
gggcaaccca ctagcatatc gaaattctaa ttttctgtag agaagtccgt atttttccaa       60

tcaacttcat taaaaatttg aatagatcta catacacctt ggttgacacg agtatataag      120

tcatgttata ctgttgaata acaagccttc cattttctat tttgatttgt agaaaactag      180

tgtgcttggg agtccctgat gattaaataa accaagattt tacc                       224


<210>  18
<211>  1627
<212>  DNA
<213>  Nicotiana tabacum


<220>
<221>  misc_feature
<222>  (1)..(1627)
<223>  Nicotiana tabacum TrnI chloroplast genome locus

<400>  18
cttcgggaac gcggacacag gtggtgcatg gctgtcgtca gctcgtgccg taaggtgttg       60

ggttaagtcc cgcaacgagc gcaaccctcg tgtttagttg ccatcgttga gtttggaacc      120

ctgaacagac tgccggtgat aagccggagg aaggtgagga tgacgtcaag tcatcatgcc      180

ccttatgccc tgggcgacac acgtgctaca atggccggga caaagggtcg cgatcccgcg      240

agggtgagct aaccccaaaa acccgtcctc agttcggatt gcaggctgca actcgcctgc      300

atgaagccgg aatcgctagt aatcgccggt cagccatacg gcggtgaatt cgttcccggg      360

ccttgtacac accgcccgtc acactatggg agctggccat gcccgaagtc gttaccttaa      420

ccgcaaggag ggggatgccg aaggcagggc tagtgactgg agtgaagtcg taacaaggta      480

gccgtactgg aaggtgcggc tggatcacct ccttttcagg gagagctaat gcttgttggg      540

tattttggtt tgacactgct tcacaccccc aaaaaaaaga agggagctac gtctgagtta      600

aacttggaga tggaagtctt ctttcctttc tcgacggtga agtaagacca agctcatgag      660

cttattatcc taggtcggaa caagttgata ggaccccctt ttttacgtcc ccatgttccc      720

cccgtgtggc gacatggggg cgaaaaaagg aaagagaggg atggggtttc tctcgctttt      780

ggcatagcgg gcccccagtg ggaggctcgc acgacgggct attagctcag tggtagagcg      840

cgcccctgat aattgcgtcg ttgtgcctgg gctgtgaggg ctctcagcca catggatagt      900

tcaatgtgct catcggcgcc tgaccctgag atgtggatca tccaaggcac attagcatgg      960

cgtactcctc ctgttcgaac cggggtttga aaccaaactc ctcctcagga ggatagatgg     1020

ggcgattcgg gtgagatcca atgtagatcc aactttcgat tcactcgtgg gatccgggcg     1080

gtccgggggg gaccaccacg gctcctctct tctcgagaat ccatacatcc cttatcagtg     1140

tatggacagc tatctctcga gcacaggttt agcaatggga aaataaaatg gagcacctaa     1200

caacgcatct tcacagacca agaactacga gatcgcccct ttcattctgg ggtgacggag     1260

ggatcgtacc attcgagccg tttttttctt gactcgaaat gggagcaggt ttgaaaaagg     1320

atcttagagt gtctagggtt gggccaggag ggtctcttaa cgccttcttt tttcttctca     1380

tcggagttat ttcacaaaga cttgccaggg taaggaagaa ggggggaaca agcacacttg     1440

gagagcgcag tacaacggag agttgtatgc tgcgttcggg aaggatgaat cgctcccgaa     1500

aaggaatcta ttgattctct cccaattggt tggaccgtag gtgcgatgat ttacttcacg     1560

ggcgaggtct ctggttcaag tccaggatgg cccagctgcg ccagggaaaa gaatagaaga     1620

agcatct                                                               1627


<210>  19
<211>  1625
<212>  DNA
<213>  Nicotiana tabacum


<220>
<221>  misc_feature
<222>  (1)..(1625)
<223>  Nicotiana tabacum TrnA chloroplast genome locus

<400>  19
actacttcat gcatgctcca cttggctcgg ggggatatag ctcagttggt agagctccgc       60

tcttgcaatt gggtcgttgc gattacgggt tggatgtcta attgtccagg cggtaatgat      120

agtatcttgt acctgaaccg gtggctcact ttttctaagt aatggggaag aggaccgaaa      180

cgtgccactg aaagactcta ctgagacaaa gatgggctgt caagaacgta gaggaggtag      240

gatgggcagt tggtcagatc tagtatggat cgtacatgga cggtagttgg agtcggcggc      300

tctcccaggg ttccctcatc tgagatctct ggggaagagg atcaagttgg cccttgcgaa      360

cagcttgatg cactatctcc cttcaaccct ttgagcgaaa tgcggcaaaa gaaaaggaag      420

gaaaatccat ggaccgaccc catcatctcc accccgtagg aactacgaga tcaccccaag      480

gacgccttcg gcatccaggg gtcacggacc gaccatagaa ccctgttcaa taagtggaac      540

gcattagctg tccgctctca ggttgggcag tcagggtcgg agaagggcaa tgactcattc      600

ttagttagaa tgggattcca actcagcacc ttttgagtga gattttgaga agagttgctc      660

tttggagagc acagtacgat gaaagttgta agctgtgttc gggggggagt tattgtctat      720

cgttggcctc tatggtagaa tcagtcgggg gacctgagag gcggtggttt accctgcggc      780

ggatgtcagc ggttcgagtc cgcttatctc caactcgtga acttagccga tacaaagctt      840

tatgatagca cccaattttt ccgattcggc ggttcgatct atgatttatc attcatggac      900

gttgataaga tccatccatt tagcagcacc ttaggatggc atagccttaa aagtgaaggg      960

cgaggttcaa acgaggaaag gcttacggtg gatacctagg cacccagaga cgaggaaggg     1020

cgtagtaatc gacgaaatgc ttcggggagt tgaaaataag catagatccg gagattcccg     1080

aatagggcaa cctttcgaac tgctgctgaa tccatgggca ggcaagagac aacctggcga     1140

actgaaacat cttagtagcc agaggaaaag aaagcaaaag cgattcccgt agtagcggcg     1200

agcgaaatgg gagcagccta aaccgtgaaa acggggttgt gggagagcaa tacaagcgtc     1260

gtgctgctag gcgaagcagc ccgaatgctg caccctagat ggcgaaagtc cagtagccga     1320

aagcatcact agcttatgct ctgacccgag tagcatgggg cacgtggaat cccgtgtgaa     1380

tcagcaagga ccaccttgca aggctaaata ctcctgggtg accgatagcg aagtagtacc     1440

gtgagggaag ggtgaaaaga acccccatcg gggagtgaaa tagaacatga aaccgtaagc     1500

tcccaagcag tgggaggagc cagggctctg accgcgtgcc tgttgaagaa tgagccggcg     1560

actcataggc agtggcttgg ttaagggaac ccaccggagc cgtagcgaaa gcgagtcttc     1620

atagg                                                                 1625


<210>  20
<211>  6743
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid sequence


<220>
<221>  misc_feature
<222>  (1)..(6743)
<223>  Chloroplast transformation vector pCTV

<400>  20
gtttaaaccg gtcttcggga acgcggacac aggtggtgca tggctgtcgt cagctcgtgc       60

cgtaaggtgt tgggttaagt cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt      120

gagtttggaa ccctgaacag actgccggtg ataagccgga ggaaggtgag gatgacgtca      180

agtcatcatg ccccttatgc cctgggcgac acacgtgcta caatggccgg gacaaagggt      240

cgcgatcccg cgagggtgag ctaaccccaa aaacccgtcc tcagttcgga ttgcaggctg      300

caactcgcct gcatgaagcc ggaatcgcta gtaatcgccg gtcagccata cggcggtgaa      360

ttcgttcccg ggccttgtac acaccgcccg tcacactatg ggagctggcc atgcccgaag      420

tcgttacctt aaccgcaagg agggggatgc cgaaggcagg gctagtgact ggagtgaagt      480

cgtaacaagg tagccgtact ggaaggtgcg gctggatcac ctccttttca gggagagcta      540

atgcttgttg ggtattttgg tttgacactg cttcacaccc ccaaaaaaaa gaagggagct      600

acgtctgagt taaacttgga gatggaagtc ttctttcctt tctcgacggt gaagtaagac      660

caagctcatg agcttattat cctaggtcgg aacaagttga taggaccccc ttttttacgt      720

ccccatgttc cccccgtgtg gcgacatggg ggcgaaaaaa ggaaagagag ggatggggtt      780

tctctcgctt ttggcatagc gggcccccag tgggaggctc gcacgacggg ctattagctc      840

agtggtagag cgcgcccctg ataattgcgt cgttgtgcct gggctgtgag ggctctcagc      900

cacatggata gttcaatgtg ctcatcggcg cctgaccctg agatgtggat catccaaggc      960

acattagcat ggcgtactcc tcctgttcga accggggttt gaaaccaaac tcctcctcag     1020

gaggatagat ggggcgattc gggtgagatc caatgtagat ccaactttcg attcactcgt     1080

gggatccggg cggtccgggg gggaccacca cggctcctct cttctcgaga atccatacat     1140

cccttatcag tgtatggaca gctatctctc gagcacaggt ttagcaatgg gaaaataaaa     1200

tggagcacct aacaacgcat cttcacagac caagaactac gagatcgccc ctttcattct     1260

ggggtgacgg agggatcgta ccattcgagc cgtttttttc ttgactcgaa atgggagcag     1320

gtttgaaaaa ggatcttaga gtgtctaggg ttgggccagg agggtctctt aacgccttct     1380

tttttcttct catcggagtt atttcacaaa gacttgccag ggtaaggaag aaggggggaa     1440

caagcacact tggagagcgc agtacaacgg agagttgtat gctgcgttcg ggaaggatga     1500

atcgctcccg aaaaggaatc tattgattct ctcccaattg gttggaccgt aggtgcgatg     1560

atttacttca cgggcgaggt ctctggttca agtccaggat ggcccagctg cgccagggaa     1620

aagaatagaa gaagcatctg gcgcgccgcg aaattaatac gactcactat agggagacca     1680

cgccgtcgtt caatgagaat ggataagagg ctcgtgggat tgacgtgagg gggcagggat     1740

ggctatattt ctgggagcga actccgggcg aatttgaagc gcttggatac gcatgcagga     1800

ggtatttatg ggggaagcgg tgatcgccga agtatcgact caactatcag aggtagttgg     1860

cgtcatcgag cgccatctcg aaccgacgtt gctggccgta catttgtacg gctccgcagt     1920

ggatggcggc ctgaagccac acagtgatat tgatttgctg gttacggtga ccgtaaggct     1980

tgatgaaaca acgcggcgag ctttgatcaa cgaccttttg gaaacttcgg cttcccctgg     2040

agagagcgag attctccgcg ctgtagaagt caccattgtt gtgcacgacg acatcattcc     2100

gtggcgttat ccagctaagc gcgaactgca atttggagaa tggcagcgca atgacattct     2160

tgcaggtatc ttcgagccag ccacgatcga cattgatctg gctatcttgc tgacaaaagc     2220

aagagaacat agcgttgcct tggtaggtcc agcggcggag gaactctttg atccggttcc     2280

tgaacaggat ctatttgagg cgctaaatga aaccttaacg ctatggaact cgccgcccga     2340

ctgggctggc gatgagcgaa atgtagtgct tacgttgtcc cgcatttggt acagcgcagt     2400

aaccggcaaa atcgcgccga aggatgtcgc tgccgactgg gcaatggagc gcctgccggc     2460

ccagtatcag cccgtcatac ttgaagctag acaggcttat cttggacaag aagaagatcg     2520

cttggcctcg cgcgcagatc agttggaaga atttgtccac tacgtgaaag gcgagatcac     2580

caaggtagtc ggcaaataat aggatcgttt atttacaacg gaatggtata caaagtcaac     2640

agatctcaac tcgagacctc aatgaattca ttggaccgcg gatcaaggta ccatagatat     2700

cattagctag cactaactag tagtagtcga catcaagagc tcattccaca tatgactgga     2760

ggatccacaa ggcctatcaa ggcgccatta attaaaggcc ggccaattta aatacaagct     2820

tgatcctggc ctagtctata ggaggttttg aaaagaaagg agcaataatc attttcttgt     2880

tctatcaaga gggtgctatt gctcctttct ttttttcttt ttatttattt actagtattt     2940

tacttacata gacttttttg tttacattat agaaaaagaa ggagaggtta ttttcttgca     3000

tttattcatg attgagtatt ctattttgat tttgtatttg tttaaaattg tagaaataga     3060

acttgtttct cttcttgcta atgttactat atctttttga tttttttttt ccaaaaaaaa     3120

aatcaaattt tgacttcttc ttatctctta tctttgaata tctcttatct ttgaaataat     3180

aatatcattg aaataagaaa gaagagctat attcgacctg cagactactt catgcatgct     3240

ccacttggct cggggggata tagctcagtt ggtagagctc cgctcttgca attgggtcgt     3300

tgcgattacg ggttggatgt ctaattgtcc aggcggtaat gatagtatct tgtacctgaa     3360

ccggtggctc actttttcta agtaatgggg aagaggaccg aaacgtgcca ctgaaagact     3420

ctactgagac aaagatgggc tgtcaagaac gtagaggagg taggatgggc agttggtcag     3480

atctagtatg gatcgtacat ggacggtagt tggagtcggc ggctctccca gggttccctc     3540

atctgagatc tctggggaag aggatcaagt tggcccttgc gaacagcttg atgcactatc     3600

tcccttcaac cctttgagcg aaatgcggca aaagaaaagg aaggaaaatc catggaccga     3660

ccccatcatc tccaccccgt aggaactacg agatcacccc aaggacgcct tcggcatcca     3720

ggggtcacgg accgaccata gaaccctgtt caataagtgg aacgcattag ctgtccgctc     3780

tcaggttggg cagtcagggt cggagaaggg caatgactca ttcttagtta gaatgggatt     3840

ccaactcagc accttttgag tgagattttg agaagagttg ctctttggag agcacagtac     3900

gatgaaagtt gtaagctgtg ttcggggggg agttattgtc tatcgttggc ctctatggta     3960

gaatcagtcg ggggacctga gaggcggtgg tttaccctgc ggcggatgtc agcggttcga     4020

gtccgcttat ctccaactcg tgaacttagc cgatacaaag ctttatgata gcacccaatt     4080

tttccgattc ggcggttcga tctatgattt atcattcatg gacgttgata agatccatcc     4140

atttagcagc accttaggat ggcatagcct taaaagtgaa gggcgaggtt caaacgagga     4200

aaggcttacg gtggatacct aggcacccag agacgaggaa gggcgtagta atcgacgaaa     4260

tgcttcgggg agttgaaaat aagcatagat ccggagattc ccgaataggg caacctttcg     4320

aactgctgct gaatccatgg gcaggcaaga gacaacctgg cgaactgaaa catcttagta     4380

gccagaggaa aagaaagcaa aagcgattcc cgtagtagcg gcgagcgaaa tgggagcagc     4440

ctaaaccgtg aaaacggggt tgtgggagag caatacaagc gtcgtgctgc taggcgaagc     4500

agcccgaatg ctgcacccta gatggcgaaa gtccagtagc cgaaagcatc actagcttat     4560

gctctgaccc gagtagcatg gggcacgtgg aatcccgtgt gaatcagcaa ggaccacctt     4620

gcaaggctaa atactcctgg gtgaccgata gcgaagtagt accgtgaggg aagggtgaaa     4680

agaaccccca tcggggagtg aaatagaaca tgaaaccgta agctcccaag cagtgggagg     4740

agccagggct ctgaccgcgt gcctgttgaa gaatgagccg gcgactcata ggcagtggct     4800

tggttaaggg aacccaccgg agccgtagcg aaagcgagtc ttcatagggc ggccgcccgg     4860

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc     4920

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc     4980

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga     5040

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc     5100

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat     5160

agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg     5220

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc     5280

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga     5340

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact     5400

agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt     5460

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag     5520

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg     5580

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa     5640

aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata     5700

tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg     5760

atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata     5820

cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg     5880

gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct     5940

gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt     6000

tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc     6060

tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga     6120

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt     6180

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc     6240

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa     6300

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca     6360

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca     6420

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct     6480

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc     6540

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa     6600

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt     6660

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc     6720

taagaaacca ttattatcat gac                                             6743


<210>  21
<211>  830
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(830)
<223>  sdnL protein

<400>  21

Met Ala Leu Pro Gln Thr Glu Leu Arg Pro Met Gly Lys Pro Ile Leu 
1               5                   10                  15      


Arg Lys Glu Asp Pro Arg Leu Ile Arg Gly Lys Gly Arg Phe Val Asp 
            20                  25                  30          


Asp Ile Leu Leu Pro Asn Met Leu His Leu Cys Ile Leu Arg Ser Pro 
        35                  40                  45              


Tyr Ala His Ala Arg Ile Arg Arg Ile Asp Thr Ser Lys Ala Glu Ala 
    50                  55                  60                  


Ala Pro Gly Val Lys Leu Val Leu Thr Gly Glu Asp Leu Ala Lys Met 
65                  70                  75                  80  


Asn Leu Ala Trp Met Pro Thr Leu Ala Gly Asp Val Gln Met Val Leu 
                85                  90                  95      


Ala Thr Gly Lys Val Leu Phe Gln Tyr Gln Glu Val Ala Ala Val Val 
            100                 105                 110         


Ala Glu Thr Arg Ala Gln Ala Glu Asp Ala Ile Gln Leu Ile Glu Val 
        115                 120                 125             


Asp Tyr Glu Pro Leu Pro Val Val Val Asp Pro Phe Lys Ala Leu Glu 
    130                 135                 140                 


Pro Asp Ala Pro Ile Leu Arg Glu Asp Lys Glu Lys Lys Ser Asn His 
145                 150                 155                 160 


Ile Trp His Trp Glu Ala Gly Asp Arg Glu Glu Thr Asp Ala Ile Phe 
                165                 170                 175     


Arg Glu Ala Pro Val Val Val Lys Gln Asp Val Arg Phe Gln Arg Val 
            180                 185                 190         


His Pro Ser Pro Leu Glu Pro Cys Gly Cys Val Ala Asp Tyr Asn Pro 
        195                 200                 205             


Ala Thr Gly Lys Leu Val Val Tyr Val Thr Ser Gln Ala Pro His Val 
    210                 215                 220                 


His Arg Thr Ala Ile Ala Leu Thr Thr Gly Phe Pro Glu His Met Ile 
225                 230                 235                 240 


Gln Val Ile Ser Pro Asp Val Gly Gly Gly Phe Gly Asn Lys Val Pro 
                245                 250                 255     


Leu Tyr Pro Gly Tyr Val Val Ala Ile Val Ala Ser Leu Lys Leu Gly 
            260                 265                 270         


Val Pro Val Lys Trp Ile Glu Thr Arg Thr Glu Asn Ile Ala Ser Thr 
        275                 280                 285             


His Phe Ala Arg Asp Tyr His Met Thr Ala Glu Ile Ala Ala Thr Glu 
    290                 295                 300                 


Asp Gly Lys Met Leu Ala Leu Arg Val Lys Thr Ile Ala Asp His Gly 
305                 310                 315                 320 


Ala Phe Asp Ala Thr Ala Asn Pro Thr Lys Tyr Pro Ala Gly Leu Tyr 
                325                 330                 335     


Ser Ile Val Thr Gly Ser Tyr Asp Phe Lys Ala Ala Phe Val Glu Val 
            340                 345                 350         


Asp Gly Val His Thr Asn Lys Pro Pro Gly Gly Val Ala Tyr Arg Cys 
        355                 360                 365             


Ser Phe Arg Val Thr Glu Ala Ser Tyr Leu Ile Glu Arg Val Val Asp 
    370                 375                 380                 


Val Leu Ala Arg Arg Leu Lys Met Asp Pro Ala Glu Leu Arg Leu Arg 
385                 390                 395                 400 


Asn Phe Ile Arg Lys Glu Gln Phe Pro Tyr Arg Ser Pro Thr Gly Trp 
                405                 410                 415     


Val Tyr Asp Ser Gly Asp Tyr Glu Lys Thr Phe Lys Leu Ala Leu Glu 
            420                 425                 430         


Arg Ile Gly Tyr Glu Glu Leu Arg Lys Glu Gln Lys Glu Lys Trp Ala 
        435                 440                 445             


Arg Gly Glu Phe Met Gly Ile Gly Ile Ser Thr Phe Thr Glu Ile Val 
    450                 455                 460                 


Gly Ala Gly Pro Ala His Ser Phe Asp Ile Leu Gly Ile Lys Met Phe 
465                 470                 475                 480 


Asp Ser Ala Glu Ile Arg Val His Pro Thr Gly Lys Val Ile Ala Arg 
                485                 490                 495     


Leu Gly Val Arg His Gln Gly Gln Gly His Glu Thr Thr Phe Ala Gln 
            500                 505                 510         


Ile Ile Ala Glu Glu Leu Gly Leu Ser Val Asp Asp Val Val Val Glu 
        515                 520                 525             


Glu Gly Asp Thr Asp Thr Ala Pro Tyr Gly Leu Gly Thr Tyr Ala Ser 
    530                 535                 540                 


Arg Ser Thr Pro Thr Ala Gly Ala Ala Ala Ala Leu Cys Ala Arg Arg 
545                 550                 555                 560 


Ile Arg Asp Lys Ala Arg Lys Ile Ala Ala His Leu Leu Glu Val Asn 
                565                 570                 575     


Glu Asp Asp Val Val Trp Asp Gly Ala Ala Phe Ser Val Lys Gly Leu 
            580                 585                 590         


Pro Gly Arg Ser Val Thr Met Lys Asp Val Ala Phe Ala Ala Tyr Thr 
        595                 600                 605             


Asn Val Pro Asp Gly Ile Glu Pro Gly Leu Glu Ala Ser Tyr Tyr Tyr 
    610                 615                 620                 


Asn Pro Pro Asn Leu Thr Phe Pro Tyr Gly Ala Tyr Ile Ala Val Val 
625                 630                 635                 640 


Asp Ile Asp Lys Gly Thr Gly Ala Val Lys Val Arg Arg Phe Leu Ala 
                645                 650                 655     


Val Asp Asp Cys Gly Asn Val Ile Asn Pro Met Ile Val Glu Gly Gln 
            660                 665                 670         


Val His Gly Gly Leu Thr Glu Gly Phe Ala Ile Ala Phe Met Gln Asp 
        675                 680                 685             


Ile Pro Tyr Asp Ala Asp Gly Asn Cys Leu Ala Pro Asn Trp Met Asp 
    690                 695                 700                 


Tyr Leu Val Pro Thr Ala Trp Asp Thr Pro Gln Leu Glu Thr Asp Arg 
705                 710                 715                 720 


Thr Val Thr Pro Ser Pro His His Pro Leu Gly Ala Lys Gly Val Gly 
                725                 730                 735     


Glu Ser Pro Asn Val Gly Ser Pro Ala Ala Phe Val Asn Ala Val Leu 
            740                 745                 750         


Asp Ala Leu Ser Pro Leu Gly Val Glu His Ile Asp Met Pro Ile Tyr 
        755                 760                 765             


Pro Trp Lys Val Trp Lys Ile Leu Arg Asp Thr Ala Leu Arg Ser Asp 
    770                 775                 780                 


Ser Met Ala Ile Pro Ala Ser Phe Gln Ser Ala Arg Arg Glu Lys Pro 
785                 790                 795                 800 


Gly Gly Gly Ile Ala Ser Gly Pro Ile Lys Trp Thr Thr Ser Gly Arg 
                805                 810                 815     


Gln Arg Gly Arg Trp Met Asn Ala Arg Ser Leu Thr Ser Gly 
            820                 825                 830 


<210>  22
<211>  171
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(171)
<223>  sdnS protein

<400>  22

Met Lys Ile Arg Val Lys Val Asn Gly Thr Leu Tyr Glu Ala Asp Val 
1               5                   10                  15      


Glu Pro Arg Thr Leu Leu Ala Tyr Phe Leu Arg Glu Glu Leu Lys Leu 
            20                  25                  30          


Thr Gly Thr His Ile Gly Cys Asp Thr Thr Thr Cys Gly Ala Cys Thr 
        35                  40                  45              


Val Leu Leu Asp Gly Lys Ala Val Lys Ser Cys Thr Val Leu Ala Val 
    50                  55                  60                  


Gln Ala Asn Gly Arg Glu Val Met Thr Val Glu Gly Leu Glu Lys Asp 
65                  70                  75                  80  


Gly Gln Leu His Pro Leu Gln Val Ala Phe Trp Glu Glu His Ala Leu 
                85                  90                  95      


His Cys Gly Tyr Cys Thr Pro Gly Met Leu Met Ala Ser Tyr Ala Leu 
            100                 105                 110         


Leu Gln Glu Asn Pro Met Pro Thr Glu Glu Glu Ile Arg Phe Gly Leu 
        115                 120                 125             


Ser Gly Asn Val Cys Arg Cys Thr Gly Tyr Met Asn Ile Val Lys Ala 
    130                 135                 140                 


Val Gln Ser Ala Ala Arg Arg Leu Ser Gly Ala Ser Gly Glu Ala Val 
145                 150                 155                 160 


Gly Glu Val Ala Thr Ser Gly Thr Ala Ala Asp 
                165                 170     


<210>  23
<211>  294
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(294)
<223>  sdnM protein

<400>  23

Met Phe Pro Asn Ala Phe Lys Tyr Glu Ala Pro Ala Ser Val Asp Glu 
1               5                   10                  15      


Ala Val Arg Leu Leu Ala Glu Tyr Gly Tyr Asp Gly Lys Val Leu Ala 
            20                  25                  30          


Gly Gly Gln Ser Leu Leu Pro Met Met Lys Leu Arg Val Ala Ala Pro 
        35                  40                  45              


Ala Val Leu Ile Asp Ile Asn Gly Ile Asp Ala Leu Gln Gly Trp Arg 
    50                  55                  60                  


Glu Val Asp Gly Lys Leu Arg Val Gly Ala Met Thr Arg His Ala Glu 
65                  70                  75                  80  


Leu Glu His Ala Lys Glu Leu Arg Asp Thr Tyr Pro Leu Phe Phe Gln 
                85                  90                  95      


Thr Ala Arg Trp Ile Ala Asp Pro Leu Ile Arg Asn Arg Gly Thr Ile 
            100                 105                 110         


Gly Gly Ser Leu Ala His Ala Asp Pro Gly Ser Asp Trp Gly Ala Ala 
        115                 120                 125             


Met Ile Ala Leu Arg Ala Glu Val Glu Ala Arg Gly Pro Gln Gly Ser 
    130                 135                 140                 


Arg Leu Ile Pro Ile Asp Glu Phe Phe Val Asp Thr Phe Ala Thr Ala 
145                 150                 155                 160 


Leu Asn Glu Asp Glu Leu Ala Val Ala Val His Val Pro Thr Pro Lys 
                165                 170                 175     


Gly Pro Ala Ala Ser Arg Tyr Met Lys Leu Glu Arg Arg Ala Gly Asp 
            180                 185                 190         


Phe Ala Ile Ala Ala Leu Ala Val His Val Ala Leu Gly Thr Asp Gly 
        195                 200                 205             


Arg Val Ser Glu Ala Gly Ile Gly Ile Cys Ala Cys Gly Pro Ile Pro 
    210                 215                 220                 


Leu Arg Ala Ala Lys Ala Glu Ala Ala Leu Ile Gly Arg Pro Leu Thr 
225                 230                 235                 240 


Glu Glu Val Ile Val Glu Ala Ser Arg Leu Val Pro Glu Asp Ala Glu 
                245                 250                 255     


Pro Ala Asp Asp Leu Arg Gly Ser Ala Glu Tyr Lys Arg Asp Val Leu 
            260                 265                 270         


Arg Val Phe Ala Ala Arg Ala Leu Arg Asp Ile Ala Lys Glu Leu Gln 
        275                 280                 285             


Gly Lys Val Gly Ile Gln 
    290                 


<210>  24
<211>  207
<212>  PRT
<213>  Streptomyces thermoautotrophicus


<220>
<221>  MISC_FEATURE
<222>  (1)..(207)
<223>  sdnO protein

<400>  24

Met Phe Glu Leu Pro Pro Leu Pro Tyr Pro Tyr Asp Ala Leu Glu Pro 
1               5                   10                  15      


Tyr Phe Asp Ala Lys Thr Met Glu Ile His Tyr Asn Gly His His Gly 
            20                  25                  30          


Ala Tyr Val Lys Asn Leu Asn Ala Ala Leu Glu Lys Tyr Pro Ala Trp 
        35                  40                  45              


Gln Asn Lys Pro Ile Glu Glu Leu Leu Gln Ser Leu Asp Gln Leu Pro 
    50                  55                  60                  


Glu Asp Ile Arg Thr Ala Val Arg Asn Asn Gly Gly Gly His Tyr Asn 
65                  70                  75                  80  


His Ser Phe Trp Trp Pro Met Leu Lys Lys Asn Glu Gly Gly Gln Pro 
                85                  90                  95      


Val Gly Lys Phe Ala Glu Ala Ile Asn Arg Asp Phe Gly Ser Phe Glu 
            100                 105                 110         


Ala Phe Lys Asp Ala Phe Ser Lys Ala Ala Ala Gly Arg Phe Gly Ser 
        115                 120                 125             


Gly Trp Ala Trp Val Val Val Glu Pro Asp Gly Lys Leu Thr Val Thr 
    130                 135                 140                 


Thr Thr Pro Asn Gln Asp Asn Pro Val Met Glu Gly Lys Thr Val Val 
145                 150                 155                 160 


Phe Gly Leu Asp Val Trp Glu His Ala Tyr Tyr Leu Lys Tyr Gln Asn 
                165                 170                 175     


Arg Arg Pro Glu Tyr Ile Gln Ala Phe Trp Asn Val Val Asn Trp Asp 
            180                 185                 190         


Val Val Asn Glu Arg Tyr Glu Glu Ala Leu Lys Lys Phe Gly Arg 
        195                 200                 205         


<210>  25
<211>  2566
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificial sequence


<220>
<221>  misc_feature
<222>  (1)..(2566)
<223>  DNA segment containing sdnL gene optimized for expression in 
       chloroplasts

<400>  25
ggtaccagga ggtatttatg gctttgcctc aaactgaact acgacctatg gggaaaccca       60

tattaaggaa agaggaccca cgattaatcc gaggtaaggg tcgttttgtt gatgatatat      120

tattaccaaa tatgttacac ttatgtattt taaggtcccc ctatgctcac gctaggatac      180

gacgtatcga tacctcaaaa gcagaggcag ctcctggcgt taaattagtt cttactggtg      240

aagatttagc taaaatgaat cttgcctgga tgcccacttt ggctggcgat gtccaaatgg      300

tcttagccac aggtaaggta ctttttcaat accaagaagt tgcagcagta gttgctgaaa      360

ctagagcgca ggcagaggat gctattcaat taatagaagt agattatgaa cctttgcctg      420

tggtagtaga tccctttaaa gctcttgaac cagacgctcc aatcttacgt gaagataaag      480

aaaaaaaatc aaatcatatc tggcattggg aggccggtga tagagaagaa acagatgcta      540

tatttcgaga ggcccctgtg gttgtaaaac aagatgtacg atttcaaaga gttcatccct      600

ccccacttga accttgtgga tgtgtcgctg attacaatcc agctactgga aaacttgtag      660

tatatgttac gtcacaagcg ccacatgtac atagaacagc aattgcattg accacaggat      720

ttccagaaca catgatacag gttattagtc cggatgtagg gggtggattc ggaaataaag      780

ttcctcttta tcctggttat gttgtggcta ttgtagcatc tttaaaatta ggtgttcctg      840

ttaaatggat tgagaccaga acggaaaata ttgcttctac acattttgcc agagactatc      900

acatgaccgc tgaaattgcc gctacggaag atggtaaaat gttagccctt cgtgttaaaa      960

caattgctga tcatggtgcc tttgacgcta cagctaatcc taccaaatat cctgctggac     1020

tttactctat agttacagga agttacgact ttaaggcagc ctttgttgaa gtagatggtg     1080

tacacactaa caaacctccg ggaggcgtag cctaccgatg ctcctttaga gttacagaag     1140

cgagttattt gatagaacga gtggttgatg tcttggctag acgattaaaa atggaccccg     1200

ctgaattaag actaaggaac ttcattcgta aggagcaatt tccttataga agtcccactg     1260

gctgggtata cgattcaggt gattatgaaa aaacgttcaa attagctctt gagagaatag     1320

ggtatgaaga actacgtaaa gagcaaaaag aaaaatgggc tagaggagaa tttatgggta     1380

tcggcatcag tacttttaca gaaattgtgg gagcaggacc agcccattca ttcgatatat     1440

tagggataaa aatgttcgat tcagcagaaa tcagagtgca tcctaccgga aaggttattg     1500

ctcgtttagg tgttagacat cagggccaag gtcatgagac aacttttgca caaattattg     1560

cagaagaact tggcctttca gttgatgatg ttgtagtaga ggagggtgat acggatacag     1620

cgccttatgg acttggaacc tatgcctctc gaagtacacc aactgccggg gcagctgcgg     1680

ctttgtgtgc tcgaagaatt agagataaag caagaaaaat cgcagctcat cttcttgagg     1740

taaacgaaga cgatgtagta tgggatggcg cagctttttc tgtgaaaggt ttaccaggac     1800

gttctgtcac tatgaaggat gtagcatttg ctgcctatac caatgtgcca gatggcatcg     1860

aaccgggtct agaggctagt tattattata atccgccaaa cttaactttt ccttatggtg     1920

cctacatagc agtcgttgac attgataaag gaactggagc ggttaaagta cgaagatttt     1980

tagctgtaga tgattgcgga aatgtaataa atccgatgat agtagaagga caagtccatg     2040

ggggtttaac agaaggtttt gcaatagcgt ttatgcaaga tataccttat gatgcagatg     2100

ggaactgtct agctcctaat tggatggatt accttgtacc aacggcatgg gatactccgc     2160

aattagagac agatagaact gtgaccccta gtcctcatca tcctttggga gcaaaaggag     2220

ttggagagtc tcccaatgtc ggatctcccg ccgcattcgt aaatgctgtt ctagatgccc     2280

tatctccact aggtgtagaa catattgata tgcctattta tccttggaaa gtctggaaaa     2340

tattacgaga caccgccctt cgttctgatt ctatggctat tccagcttct ttccaaagtg     2400

cacgacgaga gaaacctggc ggaggtattg catctggacc cattaagtgg actacatctg     2460

gacgtcaacg agggagatgg atgaatgctc gttctttaac ttctggctaa taggatcgtt     2520

tatttacaac ggaatggtat acaaagtcaa cagatctcaa gctagc                    2566


<210>  26
<211>  2220
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificial sequence


<220>
<221>  misc_feature
<222>  (1)..(2220)
<223>  DNA segment containing sdnS - sdnM - sdnO genes optimized for 
       expression in chloroplasts

<400>  26
gctagcagga ggtatttatg aaaattagag taaaggttaa cggaacctta tatgaagctg       60

atgttgaacc gcgtacctta ttggcttatt tcttacgtga agaacttaaa ttaacgggca      120

ctcatattgg atgtgatacg acaacttgcg gggcttgtac tgtactactt gatggaaaag      180

cggttaaatc ttgcactgta ctagccgtac aagctaacgg cagagaggtt atgacagtgg      240

aaggacttga aaaggatggt caacttcatc ctttacaggt tgctttttgg gaggaacatg      300

ccctacattg tggatactgt acacccggta tgttgatggc tagttatgct ttgttacagg      360

aaaatccgat gccgaccgag gaagagatta gattcggact ttcagggaat gtttgtcgat      420

gtactggcta tatgaatata gtcaaagctg tacaatcagc agcaagacgt cttagtggag      480

cttctggtga agctgttgga gaggtagcaa cttctggcac tgctgctgac taataggatc      540

gtttatttac aacggaatgg tatacaaagt caacagatct caaaggaggt atttatgttt      600

cccaatgcct tcaaatatga ggctccagct tcagtagatg aagcagtacg tctattagcc      660

gagtatggat atgatggtaa ggttttagct ggcggtcaat ccttgctacc tatgatgaaa      720

ctacgagtcg ctgctcctgc cgtacttatt gatataaatg gtattgatgc gttacaagga      780

tggcgtgaag ttgatgggaa attacgtgtc ggagccatga cacgtcatgc ggaattagaa      840

catgcaaaag agcttaggga tacttatcct ttgttcttcc aaactgcgcg ttggattgct      900

gatccgttaa tccgaaatag aggaacaatt ggaggaagtc tagctcatgc tgatccaggg      960

tctgactggg gggcagcaat gattgcttta cgagctgagg tggaagcccg tggtcctcaa     1020

gggtctcgtt taattcccat tgacgaattt tttgttgata cttttgccac cgctttaaat     1080

gaggatgaat tggccgttgc cgtacatgta ccgacaccta aagggcctgc tgcatcacga     1140

tacatgaaac tagaacgtcg agcaggtgat tttgctatag ccgctttggc agtacatgtc     1200

gcattaggta cagatggtcg tgtctctgaa gctggtattg ggatatgtgc ttgtggtccc     1260

attccgctaa gagccgccaa agctgaagcg gctttgatcg gacgtccctt aactgaagaa     1320

gtaatagtag aagcgtctag attggttcca gaagatgctg aacctgccga tgacttacga     1380

ggttctgccg aatataaacg agatgtactt agggtattcg ccgcccgagc tttaagagat     1440

atagcaaaag aacttcaggg caaggttgga atacaataat aggatcgttt atttacaacg     1500

gaatggtata caaagtcaac agatctcaaa ggaggtattt atgtttgaat taccaccttt     1560

accatatccg tacgacgctt tggaaccgta tttcgatgca aagactatgg aaattcatta     1620

taatggtcat cacggtgcat acgtcaagaa tctaaatgct gctttagaaa agtatcctgc     1680

ctggcaaaat aagcccattg aagaattatt gcaatcttta gatcagttac cggaagatat     1740

tcgtactgct gttcgaaata acggaggcgg acattataac catagttttt ggtggcctat     1800

gttgaaaaag aatgaggggg gtcaacctgt aggaaaattt gccgaagcta taaatcgtga     1860

ttttggtagt tttgaagcgt ttaaggatgc tttttccaaa gccgcagctg ggcgttttgg     1920

atctggctgg gcttgggttg tagttgagcc ggatggaaaa ttaacggtca ccacaactcc     1980

caatcaagat aatcctgtta tggaagggaa gactgtagtg tttggtttgg atgtttggga     2040

acatgcttat tatttaaaat atcaaaatag acgtccggaa tacatacagg ctttttggaa     2100

tgtcgtaaat tgggatgtag taaatgaacg atatgaagaa gctctaaaaa aattcggccg     2160

ttaataggat cgtttattta caacggaatg gtatacaaag tcaacagatc tcaacatatg     2220


<210>  27
<211>  11455
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid sequence


<220>
<221>  misc_feature
<222>  (1)..(11455)
<223>  pCTV-StNitrogenase vector

<400>  27
gtttaaaccg gtcttcggga acgcggacac aggtggtgca tggctgtcgt cagctcgtgc       60

cgtaaggtgt tgggttaagt cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt      120

gagtttggaa ccctgaacag actgccggtg ataagccgga ggaaggtgag gatgacgtca      180

agtcatcatg ccccttatgc cctgggcgac acacgtgcta caatggccgg gacaaagggt      240

cgcgatcccg cgagggtgag ctaaccccaa aaacccgtcc tcagttcgga ttgcaggctg      300

caactcgcct gcatgaagcc ggaatcgcta gtaatcgccg gtcagccata cggcggtgaa      360

ttcgttcccg ggccttgtac acaccgcccg tcacactatg ggagctggcc atgcccgaag      420

tcgttacctt aaccgcaagg agggggatgc cgaaggcagg gctagtgact ggagtgaagt      480

cgtaacaagg tagccgtact ggaaggtgcg gctggatcac ctccttttca gggagagcta      540

atgcttgttg ggtattttgg tttgacactg cttcacaccc ccaaaaaaaa gaagggagct      600

acgtctgagt taaacttgga gatggaagtc ttctttcctt tctcgacggt gaagtaagac      660

caagctcatg agcttattat cctaggtcgg aacaagttga taggaccccc ttttttacgt      720

ccccatgttc cccccgtgtg gcgacatggg ggcgaaaaaa ggaaagagag ggatggggtt      780

tctctcgctt ttggcatagc gggcccccag tgggaggctc gcacgacggg ctattagctc      840

agtggtagag cgcgcccctg ataattgcgt cgttgtgcct gggctgtgag ggctctcagc      900

cacatggata gttcaatgtg ctcatcggcg cctgaccctg agatgtggat catccaaggc      960

acattagcat ggcgtactcc tcctgttcga accggggttt gaaaccaaac tcctcctcag     1020

gaggatagat ggggcgattc gggtgagatc caatgtagat ccaactttcg attcactcgt     1080

gggatccggg cggtccgggg gggaccacca cggctcctct cttctcgaga atccatacat     1140

cccttatcag tgtatggaca gctatctctc gagcacaggt ttagcaatgg gaaaataaaa     1200

tggagcacct aacaacgcat cttcacagac caagaactac gagatcgccc ctttcattct     1260

ggggtgacgg agggatcgta ccattcgagc cgtttttttc ttgactcgaa atgggagcag     1320

gtttgaaaaa ggatcttaga gtgtctaggg ttgggccagg agggtctctt aacgccttct     1380

tttttcttct catcggagtt atttcacaaa gacttgccag ggtaaggaag aaggggggaa     1440

caagcacact tggagagcgc agtacaacgg agagttgtat gctgcgttcg ggaaggatga     1500

atcgctcccg aaaaggaatc tattgattct ctcccaattg gttggaccgt aggtgcgatg     1560

atttacttca cgggcgaggt ctctggttca agtccaggat ggcccagctg cgccagggaa     1620

aagaatagaa gaagcatctg gcgcgccgcg aaattaatac gactcactat agggagacca     1680

cgccgtcgtt caatgagaat ggataagagg ctcgtgggat tgacgtgagg gggcagggat     1740

ggctatattt ctgggagcga actccgggcg aatttgaagc gcttggatac gcatgcagga     1800

ggtatttatg ggggaagcgg tgatcgccga agtatcgact caactatcag aggtagttgg     1860

cgtcatcgag cgccatctcg aaccgacgtt gctggccgta catttgtacg gctccgcagt     1920

ggatggcggc ctgaagccac acagtgatat tgatttgctg gttacggtga ccgtaaggct     1980

tgatgaaaca acgcggcgag ctttgatcaa cgaccttttg gaaacttcgg cttcccctgg     2040

agagagcgag attctccgcg ctgtagaagt caccattgtt gtgcacgacg acatcattcc     2100

gtggcgttat ccagctaagc gcgaactgca atttggagaa tggcagcgca atgacattct     2160

tgcaggtatc ttcgagccag ccacgatcga cattgatctg gctatcttgc tgacaaaagc     2220

aagagaacat agcgttgcct tggtaggtcc agcggcggag gaactctttg atccggttcc     2280

tgaacaggat ctatttgagg cgctaaatga aaccttaacg ctatggaact cgccgcccga     2340

ctgggctggc gatgagcgaa atgtagtgct tacgttgtcc cgcatttggt acagcgcagt     2400

aaccggcaaa atcgcgccga aggatgtcgc tgccgactgg gcaatggagc gcctgccggc     2460

ccagtatcag cccgtcatac ttgaagctag acaggcttat cttggacaag aagaagatcg     2520

cttggcctcg cgcgcagatc agttggaaga atttgtccac tacgtgaaag gcgagatcac     2580

caaggtagtc ggcaaataat aggatcgttt atttacaacg gaatggtata caaagtcaac     2640

agatctcaac tcgagacctc aatgaattca ttggaccgcg gatcaaggta ccaggaggta     2700

tttatggctt tgcctcaaac tgaactacga cctatgggga aacccatatt aaggaaagag     2760

gacccacgat taatccgagg taagggtcgt tttgttgatg atatattatt accaaatatg     2820

ttacacttat gtattttaag gtccccctat gctcacgcta ggatacgacg tatcgatacc     2880

tcaaaagcag aggcagctcc tggcgttaaa ttagttctta ctggtgaaga tttagctaaa     2940

atgaatcttg cctggatgcc cactttggct ggcgatgtcc aaatggtctt agccacaggt     3000

aaggtacttt ttcaatacca agaagttgca gcagtagttg ctgaaactag agcgcaggca     3060

gaggatgcta ttcaattaat agaagtagat tatgaacctt tgcctgtggt agtagatccc     3120

tttaaagctc ttgaaccaga cgctccaatc ttacgtgaag ataaagaaaa aaaatcaaat     3180

catatctggc attgggaggc cggtgataga gaagaaacag atgctatatt tcgagaggcc     3240

cctgtggttg taaaacaaga tgtacgattt caaagagttc atccctcccc acttgaacct     3300

tgtggatgtg tcgctgatta caatccagct actggaaaac ttgtagtata tgttacgtca     3360

caagcgccac atgtacatag aacagcaatt gcattgacca caggatttcc agaacacatg     3420

atacaggtta ttagtccgga tgtagggggt ggattcggaa ataaagttcc tctttatcct     3480

ggttatgttg tggctattgt agcatcttta aaattaggtg ttcctgttaa atggattgag     3540

accagaacgg aaaatattgc ttctacacat tttgccagag actatcacat gaccgctgaa     3600

attgccgcta cggaagatgg taaaatgtta gcccttcgtg ttaaaacaat tgctgatcat     3660

ggtgcctttg acgctacagc taatcctacc aaatatcctg ctggacttta ctctatagtt     3720

acaggaagtt acgactttaa ggcagccttt gttgaagtag atggtgtaca cactaacaaa     3780

cctccgggag gcgtagccta ccgatgctcc tttagagtta cagaagcgag ttatttgata     3840

gaacgagtgg ttgatgtctt ggctagacga ttaaaaatgg accccgctga attaagacta     3900

aggaacttca ttcgtaagga gcaatttcct tatagaagtc ccactggctg ggtatacgat     3960

tcaggtgatt atgaaaaaac gttcaaatta gctcttgaga gaatagggta tgaagaacta     4020

cgtaaagagc aaaaagaaaa atgggctaga ggagaattta tgggtatcgg catcagtact     4080

tttacagaaa ttgtgggagc aggaccagcc cattcattcg atatattagg gataaaaatg     4140

ttcgattcag cagaaatcag agtgcatcct accggaaagg ttattgctcg tttaggtgtt     4200

agacatcagg gccaaggtca tgagacaact tttgcacaaa ttattgcaga agaacttggc     4260

ctttcagttg atgatgttgt agtagaggag ggtgatacgg atacagcgcc ttatggactt     4320

ggaacctatg cctctcgaag tacaccaact gccggggcag ctgcggcttt gtgtgctcga     4380

agaattagag ataaagcaag aaaaatcgca gctcatcttc ttgaggtaaa cgaagacgat     4440

gtagtatggg atggcgcagc tttttctgtg aaaggtttac caggacgttc tgtcactatg     4500

aaggatgtag catttgctgc ctataccaat gtgccagatg gcatcgaacc gggtctagag     4560

gctagttatt attataatcc gccaaactta acttttcctt atggtgccta catagcagtc     4620

gttgacattg ataaaggaac tggagcggtt aaagtacgaa gatttttagc tgtagatgat     4680

tgcggaaatg taataaatcc gatgatagta gaaggacaag tccatggggg tttaacagaa     4740

ggttttgcaa tagcgtttat gcaagatata ccttatgatg cagatgggaa ctgtctagct     4800

cctaattgga tggattacct tgtaccaacg gcatgggata ctccgcaatt agagacagat     4860

agaactgtga cccctagtcc tcatcatcct ttgggagcaa aaggagttgg agagtctccc     4920

aatgtcggat ctcccgccgc attcgtaaat gctgttctag atgccctatc tccactaggt     4980

gtagaacata ttgatatgcc tatttatcct tggaaagtct ggaaaatatt acgagacacc     5040

gcccttcgtt ctgattctat ggctattcca gcttctttcc aaagtgcacg acgagagaaa     5100

cctggcggag gtattgcatc tggacccatt aagtggacta catctggacg tcaacgaggg     5160

agatggatga atgctcgttc tttaacttct ggctaatagg atcgtttatt tacaacggaa     5220

tggtatacaa agtcaacaga tctcaagcta gcaggaggta tttatgaaaa ttagagtaaa     5280

ggttaacgga accttatatg aagctgatgt tgaaccgcgt accttattgg cttatttctt     5340

acgtgaagaa cttaaattaa cgggcactca tattggatgt gatacgacaa cttgcggggc     5400

ttgtactgta ctacttgatg gaaaagcggt taaatcttgc actgtactag ccgtacaagc     5460

taacggcaga gaggttatga cagtggaagg acttgaaaag gatggtcaac ttcatccttt     5520

acaggttgct ttttgggagg aacatgccct acattgtgga tactgtacac ccggtatgtt     5580

gatggctagt tatgctttgt tacaggaaaa tccgatgccg accgaggaag agattagatt     5640

cggactttca gggaatgttt gtcgatgtac tggctatatg aatatagtca aagctgtaca     5700

atcagcagca agacgtctta gtggagcttc tggtgaagct gttggagagg tagcaacttc     5760

tggcactgct gctgactaat aggatcgttt atttacaacg gaatggtata caaagtcaac     5820

agatctcaaa ggaggtattt atgtttccca atgccttcaa atatgaggct ccagcttcag     5880

tagatgaagc agtacgtcta ttagccgagt atggatatga tggtaaggtt ttagctggcg     5940

gtcaatcctt gctacctatg atgaaactac gagtcgctgc tcctgccgta cttattgata     6000

taaatggtat tgatgcgtta caaggatggc gtgaagttga tgggaaatta cgtgtcggag     6060

ccatgacacg tcatgcggaa ttagaacatg caaaagagct tagggatact tatcctttgt     6120

tcttccaaac tgcgcgttgg attgctgatc cgttaatccg aaatagagga acaattggag     6180

gaagtctagc tcatgctgat ccagggtctg actggggggc agcaatgatt gctttacgag     6240

ctgaggtgga agcccgtggt cctcaagggt ctcgtttaat tcccattgac gaattttttg     6300

ttgatacttt tgccaccgct ttaaatgagg atgaattggc cgttgccgta catgtaccga     6360

cacctaaagg gcctgctgca tcacgataca tgaaactaga acgtcgagca ggtgattttg     6420

ctatagccgc tttggcagta catgtcgcat taggtacaga tggtcgtgtc tctgaagctg     6480

gtattgggat atgtgcttgt ggtcccattc cgctaagagc cgccaaagct gaagcggctt     6540

tgatcggacg tcccttaact gaagaagtaa tagtagaagc gtctagattg gttccagaag     6600

atgctgaacc tgccgatgac ttacgaggtt ctgccgaata taaacgagat gtacttaggg     6660

tattcgccgc ccgagcttta agagatatag caaaagaact tcagggcaag gttggaatac     6720

aataatagga tcgtttattt acaacggaat ggtatacaaa gtcaacagat ctcaaaggag     6780

gtatttatgt ttgaattacc acctttacca tatccgtacg acgctttgga accgtatttc     6840

gatgcaaaga ctatggaaat tcattataat ggtcatcacg gtgcatacgt caagaatcta     6900

aatgctgctt tagaaaagta tcctgcctgg caaaataagc ccattgaaga attattgcaa     6960

tctttagatc agttaccgga agatattcgt actgctgttc gaaataacgg aggcggacat     7020

tataaccata gtttttggtg gcctatgttg aaaaagaatg aggggggtca acctgtagga     7080

aaatttgccg aagctataaa tcgtgatttt ggtagttttg aagcgtttaa ggatgctttt     7140

tccaaagccg cagctgggcg ttttggatct ggctgggctt gggttgtagt tgagccggat     7200

ggaaaattaa cggtcaccac aactcccaat caagataatc ctgttatgga agggaagact     7260

gtagtgtttg gtttggatgt ttgggaacat gcttattatt taaaatatca aaatagacgt     7320

ccggaataca tacaggcttt ttggaatgtc gtaaattggg atgtagtaaa tgaacgatat     7380

gaagaagctc taaaaaaatt cggccgttaa taggatcgtt tatttacaac ggaatggtat     7440

acaaagtcaa cagatctcaa catatgactg gaggatccac aaggcctatc aaggcgccat     7500

taattaaagg ccggccaatt taaatacaag cttgatcctg gcctagtcta taggaggttt     7560

tgaaaagaaa ggagcaataa tcattttctt gttctatcaa gagggtgcta ttgctccttt     7620

ctttttttct ttttatttat ttactagtat tttacttaca tagacttttt tgtttacatt     7680

atagaaaaag aaggagaggt tattttcttg catttattca tgattgagta ttctattttg     7740

attttgtatt tgtttaaaat tgtagaaata gaacttgttt ctcttcttgc taatgttact     7800

atatcttttt gatttttttt ttccaaaaaa aaaatcaaat tttgacttct tcttatctct     7860

tatctttgaa tatctcttat ctttgaaata ataatatcat tgaaataaga aagaagagct     7920

atattcgacc tgcagactac ttcatgcatg ctccacttgg ctcgggggga tatagctcag     7980

ttggtagagc tccgctcttg caattgggtc gttgcgatta cgggttggat gtctaattgt     8040

ccaggcggta atgatagtat cttgtacctg aaccggtggc tcactttttc taagtaatgg     8100

ggaagaggac cgaaacgtgc cactgaaaga ctctactgag acaaagatgg gctgtcaaga     8160

acgtagagga ggtaggatgg gcagttggtc agatctagta tggatcgtac atggacggta     8220

gttggagtcg gcggctctcc cagggttccc tcatctgaga tctctgggga agaggatcaa     8280

gttggccctt gcgaacagct tgatgcacta tctcccttca accctttgag cgaaatgcgg     8340

caaaagaaaa ggaaggaaaa tccatggacc gaccccatca tctccacccc gtaggaacta     8400

cgagatcacc ccaaggacgc cttcggcatc caggggtcac ggaccgacca tagaaccctg     8460

ttcaataagt ggaacgcatt agctgtccgc tctcaggttg ggcagtcagg gtcggagaag     8520

ggcaatgact cattcttagt tagaatggga ttccaactca gcaccttttg agtgagattt     8580

tgagaagagt tgctctttgg agagcacagt acgatgaaag ttgtaagctg tgttcggggg     8640

ggagttattg tctatcgttg gcctctatgg tagaatcagt cgggggacct gagaggcggt     8700

ggtttaccct gcggcggatg tcagcggttc gagtccgctt atctccaact cgtgaactta     8760

gccgatacaa agctttatga tagcacccaa tttttccgat tcggcggttc gatctatgat     8820

ttatcattca tggacgttga taagatccat ccatttagca gcaccttagg atggcatagc     8880

cttaaaagtg aagggcgagg ttcaaacgag gaaaggctta cggtggatac ctaggcaccc     8940

agagacgagg aagggcgtag taatcgacga aatgcttcgg ggagttgaaa ataagcatag     9000

atccggagat tcccgaatag ggcaaccttt cgaactgctg ctgaatccat gggcaggcaa     9060

gagacaacct ggcgaactga aacatcttag tagccagagg aaaagaaagc aaaagcgatt     9120

cccgtagtag cggcgagcga aatgggagca gcctaaaccg tgaaaacggg gttgtgggag     9180

agcaatacaa gcgtcgtgct gctaggcgaa gcagcccgaa tgctgcaccc tagatggcga     9240

aagtccagta gccgaaagca tcactagctt atgctctgac ccgagtagca tggggcacgt     9300

ggaatcccgt gtgaatcagc aaggaccacc ttgcaaggct aaatactcct gggtgaccga     9360

tagcgaagta gtaccgtgag ggaagggtga aaagaacccc catcggggag tgaaatagaa     9420

catgaaaccg taagctccca agcagtggga ggagccaggg ctctgaccgc gtgcctgttg     9480

aagaatgagc cggcgactca taggcagtgg cttggttaag ggaacccacc ggagccgtag     9540

cgaaagcgag tcttcatagg gcggccgccc gggtaatacg gttatccaca gaatcagggg     9600

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     9660

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     9720

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     9780

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     9840

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     9900

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     9960

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac    10020

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt    10080

tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc    10140

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca    10200

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat    10260

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac    10320

gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt    10380

aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc    10440

aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg    10500

cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg    10560

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc    10620

cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta    10680

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg    10740

ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct    10800

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta    10860

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg    10920

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga    10980

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt    11040

gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca    11100

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt    11160

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt    11220

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga    11280

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt    11340

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc    11400

gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgac         11455


<210>  28
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oligo


<220>
<221>  misc_feature
<222>  (1)..(22)
<223>  oligo

<400>  28
caaaatagaa tactcaatca tg                                                22


<210>  29
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oligo


<220>
<221>  misc_feature
<222>  (1)..(20)
<223>  oligo

<400>  29
agatatagca aaagaacttc                                                   20


<210>  30
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oligo


<220>
<221>  misc_feature
<222>  (1)..(20)
<223>  oligo

<400>  30
cgtggtgatt gatgaaactg                                                   20


