SEQUENCE LISTING
<110> PHYCAL, INC.
<120> CHLOROPHYTE GENES FOR THE OPTIMIZATION OF PRODUCTIVITY
<140> PCT/US2012/060202
<141> 15 OCT 2012
<151> US PROVISIONAL APPLICATION NO. 61/547,416

<210> SEQ ID NO: 1
<211> 463 
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

<400> 
Met	Thr 	Gln 	Ser 	Arg 	Val 	Glu 	Gln 	Asn	Leu  	Gln 	Arg	Val	Asn	Glu
1				5					10					15
Leu	Lys	Ala	Glu	Ala	Thr	Thr	Leu	Glu	Arg	Met	Arg	Lys	Ala	Ser
				20					25					30
Asp	Leu	Asp	Ile	Lys	Glu	Arg	Glu	Arg	Ile	Ala	Ile	Ser	Thr	Val
				35					40					45
Ala	Ala	Lys	Gly	Pro	Ala	Ser	Ser	Ser	Ser	Ser	Ala	Ala	Ala	Val
				50					55					60
Ser	Ala	Pro	Ala	Thr	Ser	Ala	Thr	Leu	Thr	Val	Glu	Arg	Pro	Ala
				65					70					75
Ala	Thr	Thr	Val	Thr	Gln	Glu	Val	Pro	Ser	Thr	Ser	Tyr	Gly	Thr
				80					85					90
Pro	Val	Asp	Arg	Ala	Pro	Arg	Arg	Ser	Lys	Ala	Ala	Ile	Arg	Arg
				95					100					105
Ser	Arg	Gly	Leu	Glu	Ser	Ser	Met	Glu	Ile	Glu	Glu	Gly	Leu	Arg
				110					115					120
Asn	Phe	Trp	Tyr	Pro	Ala	Glu	Phe	Ser	Ala	Arg	Leu	Pro	Lys	Asp
				125					130					135
Thr	Leu	Val	Pro	Phe	Glu	Leu	Phe	Gly	Glu	Pro	Trp	Val	Met	Phe
				140					145					150
Arg	Asp	Glu	Lys	Gly	Gln	Pro	Ser	Tyr	Ile	Arg	Asp	Glu	Cys	Ala
				155					160					165
His	Arg	Gly	Cys	Pro	Leu	Ser	Leu	Gly	Lys	Val	Val	Glu	Gly	Gln
				170					175					180
Val	Met	Cys	Pro	Tyr	His	Gly	Trp	Glu	Phe	Asn	Gly	Asp	Gly	Ala
				185					190					195
Cys	Thr	Lys	Met	Pro	Ser	Thr	Pro	Phe	Cys	Arg	Asn	Val	Gly	Val
				200					205					210
Ala	Ala	Leu	Pro	Cys	Ala	Glu	Lys	Asp	Gly	Phe	Ile	Trp	Val	Trp
				215					220					225
Pro	Gly	Asp	Gly	Leu	Pro	Ala	Glu	Thr	Leu	Pro	Asp	Phe	Ala	Gln
				230					235					240
Pro	Pro	Glu	Gly	Phe	Leu	Ile	His	Ala	Glu	Ile	Met	Val	Asp	Val
				245					250					255
Pro	Val	Glu	His	Gly	Leu	Leu	Ile	Glu	Asn	Leu	Leu	Asp	Leu	Ala
				260					265					270
Val	Pro	Asp	Phe	Val	Lys	Phe	His	Ala	Asn	Lys	Ala	Leu	Ser	Gly
				290					295					300
Phe	Trp	Asp	Pro	Tyr	Pro	Ile	Asp	Met	Ala	Phe	Gln	Pro	Pro	Cys
				305					310					315
Met	Thr	Leu	Ser	Thr	Ile	Gly	Leu	Ala	Gln	Pro	Gly	Lys	Ile	Met
				320					325					330
Arg	Gly	Val	Thr	Ala	Ser	Gln	Cys	Lys	Asn	His	Leu	His	Gln	Leu
				335					340					345
His	Val	Cys	Met	Pro	Ser	Lys	Lys	Gly	His	Thr	Arg	Leu	Leu	Tyr
				350					355					360
Arg	Met	Ser	Leu	Asp	Phe	Leu	Pro	Trp	Met	Arg	His	Val	Pro	Phe
				365					370					375
Ile	Asp	Arg	Ile	Trp	Lys	Gln	Val	Ala	Ala	Gln	Val	Leu	Gly	Glu
				380					385					390
Asp	Leu	Val	Leu	Val	Leu	Gly	Gln	Gln	Asp	Arg	Met	Leu	Arg	Gly
				395					400					405
Gly	Ser	Asn	Trp	Ser	Asn	Pro	Ala	Pro	Tyr	Asp	Lys	Leu	Ala	Val
				410					415					420
Arg	Tyr	Arg	Arg	Trp	Arg	Asn	Gly	Val	Asn	Ala	Glu	Val	Ala	Arg
				425					430					435
Val	Arg	Ala	Gly	Glu	Pro	Pro	Ser	Asn	Pro	Val	Ala	Met	Ser	Ala
				440					445					450
Gly	Glu	Met	Phe	Ser	Val	Asp	Glu	Asp	Asp	Met	Asp	Asn		
				455					460					

210> SEQ ID NO: 2
<211>  1600
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400> 
GCC CGG CTG TGT GAT GAA CGA GCG GCT GAT GCA CGG CGA CGA CGC GCG 48
CGC GGC CGC CAT CCG CCA GCG CCT GGA CTA CCT GCG CCG CAG GCG GCT 96
GGC CTG GGA GAT GGT TTA CGA CGT GGT GAT CAG GGA CGA CGC GCT GTG 144
CAC GCT GAG CGT GAT CGA GGA GGC CAA CAG CCG CGT GCA GCG CAT CCT 192
CAG CGA AGA GCA CCG CGA GCG GGC GGG GGT GGC CTC ACT GCG TCG CCA 240
GAT GGT GGA GCT GCA GGG GGA GGT TGC GAG CGC ACG GGC GCG CCT AGA 288
CGC CAC CCA GGA GGC TCT GGC CAG GAA CAT GGC CGC CAT GGA GCG CCT 336
GAG GGT GGA GGC CGA GGC TCT GCA GGG CCT GGT AGC TGG CGG CAA CCT 384
GGT TGA GGC GGG CGA GCC AAG CCA AGC CAC ACC CCT GCT CCA GCA TGG 432
ATT GGG TGA AGT GCA GGG CAG CGG AGC GGG AAC ACC TGG ACG GCC AGA 480
GCC ACC CAT CGC CCT GCC TTT TAC GAG CTC CGC CGC CAT TGG TTT GAC 528
GAG TGA GGA GCG GGC ACT CGC CCG ACC TTC CTC GCA TGC AGC ACC ACC 576
CAT GGG TGG CCA TGC GGC AGC CCC GCC TCC GCA GGT CAA GCG GCG GCG 624
TGG CCT CAA GTC CAG CCT GTT CCT GGA GCC AGG GCT GAA GGA GCA CTG 672
GTA TGG CGT CGC GTT TTG CAG CCA GCT CCA GCC GGG CTC TAC GCT TTC 720
GTT TGA ACT TTT TGG AGA GAT GTG GGA GCT GCG CCG CCA GGA TGG TGG 768
GGA CGT TGC GTG CGT GCC TGC TGG CAA CCC TGC AAG CCC ATC GCG CAG 816
TCG GGG CGT GCC GTG CAC TGA GGT GGA CGG CTT CGT GTG GGT CTG GCC 864
CGG GTC GCA GGC GCC CAC CCC GCC GCC CAC CGC CTC GGC CGT GCC GCT 912
GGG CTT CCG CTG CCA TGC TGA GAT CGA GGT GGA GGT GCC GGT GGA GCA 960
CGG CCT GCT GCT GGA GAA CCT GCT GGA CCT GGC GCA TGC CCC CTT CAC 1008
CCA TAC CAG CAC CTT TGC CAA GGG GTG GCC GAT CCC AGA CGT GGT CAA 1056
GTT CCA GGC CAC CAA GCT GCT AAC GGG CAA CTG GCA GCC CTA CCC CAT 1104
CAC CAT GTC CTT CGC CCC GCC AAA CAT GGT GGT GTC GCT CAT CGG CTT 1152
ATC GCG CCC AGG CGT GGT GGA AAG GGG TCT GAG CGC GGA GTC GTG CAA 1200
GCG CCA CCT GCA CCA GCT CCA CGT CTG CCT GCC CAG CCG CCC CGG CCA 1248
CAC TCG CCT GCT GTA CCG CAT GAG CAT GGA CTT TGC TGG GTG GTT GCG 1296
CTT CGT ACC CGG CAT CGC TGC CTT GCG GCG CAG CAT CGC CGG GCA GGT 1344
GCT GGG CGA GGA CCT GGT GCT CGT CCG CGG CCA GCA GGA CCG CAT GCT 1392
GCG AGG GGC TGA CAC CTG GCA GAC GCC GGT GTC GTA CGA CAA GCT GGC 1440
GGT GCG GTA CCG GCG CTG GCG CAA CCG GTT GGA GGG CGG GGA GCT GGA 1488
CAG GGA GGT GGT GGC GCA GGC CGC GGC GGC GCT GGC CAT GTC GGC TGG 1536
GGA GAT GTT TGC GCT GTC GGA GGA GAG GTG CGG GGA CGA CAA CGG CAC 1584
CTG CGT CGC GGA GTG A                                           1600 

<210> SEQ ID NO: 3
<211> 389
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Ala	Met	Met	Met	Arg	Gln	Lys	Val	Ala	Gly	Ala	Ile	Ala
1				5					10					15
Gly	Glu	Arg	Arg	Ser	Ala	Val	Ala	Pro	Lys	Met	Gly	Arg	Ala	Ala
				20					25					30
Thr	Ala	Pro	Val	Val	Val	Ala	Ser	Ala	Asn	Ala	Ser	Ala	Phe	Lys
				35					40					45
Gly	Ala	Ala	Val	Thr	Ala	Arg	Val	Lys	Arg	Ser	Thr	Arg	Ala	Ala
				50					55					60
Arg	Val	Gln	Ser	Arg	Arg	Thr	Ala	Val	Leu	Thr	Gln	Ala	Lys	Ile
				65					70					75
Gly	Asp	Ser	Leu	Ala	Glu	Phe	Leu	Val	Glu	Ala	Thr	Pro	Asp	Pro
				80					85					90
Lys	Leu	Arg	Gln	Leu	Met	Met	Ser	Met	Ala	Glu	Ala	Thr	Arg	Thr
				95					100					105
Ile	Ala	His	Lys	Val	Arg	Thr	Ala	Ser	Cys	Ala	Gly	Thr	Ala	Cys
				110					115					120
Val	Asn	Ser	Phe	Gly	Asp	Glu	Gln	Leu	Ala	Val	Asp	Met	Val	Ala
				125					130					135
Asp	Lys	Leu	Leu	Phe	Glu	Ala	Leu	Lys	Tyr	Ser	His	Val	Cys	Lys
				140					145					150
Leu	Ala	Cys	Ser	Glu	Glu	Val	Pro	Glu	Pro	Val	Asp	Met	Gly	Gly
				155					160					165
Glu	Gly	Phe	Cys	Val	Ala	Phe	Asp	Pro	Leu	Asp	Gly	Ser	Ser	Ile
				170					175					180
Val	Asp	Thr	Asn	Phe	Ala	Val	Gly	Thr	Ile	Phe	Gly	Val	Trp	Pro
				185					190					195
Gly	Asp	Lys	Leu	Thr	Asn	Ile	Thr	Gly	Arg	Glu	Gln	Val	Ala	Ala
				200					205					210
Gly	Met	Gly	Ile	Tyr	Gly	Pro	Arg	Thr	Val	Phe	Cys	Ile	Ala	Leu
				215					220					225
Lys	Asp	Ala	Pro	Gly	Cys	His	Glu	Phe	Leu	Leu	Met	Asp	Asp	Gly
				230					235					240
Lys	Trp	Met	His	Val	Lys	Glu	Thr	Thr	His	Ile	Gly	Glu	Gly	Lys
				245					250					255
Met	Phe	Ala	Pro	Gly	Asn	Leu	Arg	Ala	Thr	Phe	Asp	Asn	Pro	Ala
				260					265					270
Tyr	Glu	Arg	Leu	Ile	Asn	Phe	Tyr	Leu	Gly	Glu	Lys	Tyr	Thr	Leu
				275					280					285
Arg	Tyr	Thr	Gly	Gly	Met	Val	Pro	Asp	Val	Phe	Gln	Ile	Ile	Val
				290					295					300
Lys	Glu	Lys	Gly	Val	Phe	Thr	Asn	Val	Thr	Ser	Pro	Thr	Thr	Lys
				305					310					315
Ala	Lys	Leu	Arg	Ile	Leu	Phe	Glu	Val	Ala	Pro	Leu	Ala	Leu	Leu
				320					325					330
Ile	Glu	Lys	Ala	Gly	Gly	Ala	Ser	Ser	Cys	Asp	Gly	Lys	Ala	Val
				335					340					345
Ser	Ala	Leu	Asp	Ile	Pro	Ile	Leu	Val	Cys	Asp	Gln	Arg	Thr	Gln
				350					355					360
Ile	Cys	Tyr	Gly	Ser	Ile	Gly	Glu	Val	Arg	Arg	Phe	Glu	Glu	Tyr
				365					370					375
Met	Tyr	Gly	Thr	Ser	Pro	Arg	Phe	Ser	Glu	Lys	Val	Ala	Ala	
				380					385					

<210> SEQ ID NO: 4
<211> 1313 
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
GCG GGG GGG GGA CCC TTA CGA GCC CGA CCC TGT ATC AAT TTC CTT CTT 48
TTT CAA TCA AAA AGT GCG ACG CAG GTA GTG TGT AGG CTC GAT GGC GTC 96
CAT GTC CAT GAT ACG GCA ACC TTG CCG TGG AGT CGA GAG GTC CCT CCA 144
GGC CAT GCC TGC CCC CAT CAG GGT GGC TGG CCG CTG CGC AGG AAG GGG 192
CGC CAG GAA GAC GGG CGT CAG CCT CTT CCC ACG CCG TCA GGC TCC CAT 240
GAC CAT CAC ATC GGC GCT GGG TGA CTC CTT GGA GGA GTA CCT GCA GAC 288
CGC GAC GTC TGA CCC CAA CCT ACG CCG GCT GAT GAC GGC TAT GTC GGA 336
GGC CAT CCG CAC CAT CGC CTA CAA GGT GCG CAC CGC GTC GTG CAG CAG 384
CAC CGC CTG CAT CAA CAC CTT TGG TGA CGA GCA GCT GGC GGT GGA CCT 432
GCT GGC GGA CAA GCT GCT GTT CGA GGC GCT GCG CTA CTC CCA CGC CTG 480
CAA GTA CGC CTG CTC GGA GGA GAA CCC CGA GCC CCT GGA TGT GGG TGG 528
TGA GGG CTT CTC CGT GGC CTT TGA TCC CCT GGA TGG CTC CTC CAT CGT 576
GGA CAC CAA CTT CTC CGT GGG CAC CAT CTT TGG GGT GTG GCC CGG GGA 624
CAA GCT GAA GGA CAT CAC CGG CCG GCA GCA GGC GGC GGC GGG CAT GGG 672
CAT CTA CGG CCC CCG CAC TGT CTA CTG CAT CGC CCT GGC TGA GGC GCC 720
CGG CTG CCA GGA GTT CCT GCT GCA GGA CGA CGG CAA GTG GCT GCA CGT 768
CAA GGA GAC GGA ACG CAT TGA GGA GGG CAA GAT CTT CTC CCC CGG CAA 816
CCT GAG GGC CAC CTT TGA CAA CGC AGA GTA CCA GAA GCT GGT GCA GCA 864
CTA CGT GCA GGA GCA GTA CAC CCT GCG GTA CAC GGG TGG CAT GGT GCC 912
TGA CGT CTT CCA GAT CAT CGT CAA GGA GCA GGG CGT GTT CAC CAA CGT 960
CAT CTC CCC CAC CAC CAA GGC CAA GCT GCG CAT CCT GTT CGA GGT CGC 1008
CCC GCT GGC GCT GCT GGT GGA GAA GGC GGG CGG GCA CTC GTC GGC GGA 1056
CGG CAA GTG TAT CTC CGG CCT GGA TGT CGA GAT CAA GCA GTA CGA CCA 1104
GCG CAC CCA GAT CTG CTT TGG CTC GAT CAA GGA GGT GGC CAG GTT CGA 1152
GAA GAT GCT GTA CGG CAA GTC GGA GCG CTT TGC CAA GGA GGA GCA GCT 1200
CGT GGC ATG AGC CTG CGC TGG GGG TTG TAC CTG GAG GCG CTG CGG CAC 1248
AGT CGT GGC AGG TTG GAC GGC CGA GAA GAT AGC TTT TTC TTG TTT GTA 1296
ATG TCT GGA ACG GTC AA          				1313

<210> SEQ ID NO: 5
<211> 446
<212> Amino Acid Sequence
<213> Coccomyxa subellipsoidea C-169

Met	Cys	Leu	Gln	Ala	Gly	Leu	Ser	Leu	Ala	Ser	Ile	Gly	Leu	Ala
1				5					10					15
Ala	Ala	Phe	Ala	Phe	Ser	Thr	Trp	Glu	His	Ala	Leu	Pro	Ala	His
				20					25					30
Ala	Val	Thr	Pro	Glu	Gln	Leu	Leu	Phe	Leu	Glu	Ala	Trp	Arg	Ala
				35					40					45
Val	Asp	Arg	Ala	Tyr	Val	Asp	Lys	Ser	Phe	Asn	Gly	Gln	Ser	Trp
				50					55					60
Phe	Arg	Leu	Arg	Glu	Arg	Tyr	Met	Lys	Glu	Glu	Ala	Met	Asn	Ser
				65					70					75
Thr	Lys	Glu	Thr	Tyr	Ala	Ala	Ile	Arg	Lys	Ala	Leu	Ala	Thr	Leu
				80					85					90
Asp	Asp	Pro	Phe	Thr	Arg	Phe	Leu	Glu	Pro	Thr	Gln	Tyr	Ala	Ala
				95					100					105
Leu	Arg	Arg	Gly	Thr	Ala	Gly	Ser	Val	Thr	Gly	Val	Gly	Leu	Glu
				110					115					120
Val	Gly	Phe	Asp	Thr	Lys	Thr	Ser	Gly	Ser	Gly	Asn	Ser	Leu	Val
				125					130					135
Val	Ile	Thr	Pro	Ser	Ala	Gly	Gly	Pro	Ala	Glu	Arg	Ala	Gly	Ile
				140					145					150
Glu	Pro	Arg	Asp	Gly	Val	Val	Ala	Ile	Asn	Asp	Arg	Gln	Thr	Gln
				155					160					165
Gly	Leu	Ser	Leu	Tyr	Glu	Ala	Gly	Asp	Leu	Leu	Gln	Gly	Thr	Glu
				170					175					180
Gly	Ser	Glu	Val	Thr	Leu	Thr	Val	Arg	Lys	His	Gly	Gln	Asp	Thr
				185					190					195
Thr	Lys	Gln	Leu	Thr	Leu	Val	Arg	Glu	Lys	Ile	Asn	Phe	Asn	Pro
				200					205					210
Val	Ser	Ser	Gln	Leu	Cys	Ser	Gly	Ala	Ser	Ser	Ser	Thr	Ile	Ser
				215					220					225
Asp	Gly	Ala	Gly	Glu	Ala	Ala	Ala	Ser	Ser	Ser	Gly	Ser	Gly	Lys
				230					235					240
Val	Gly	Tyr	Ile	Arg	Val	Ala	Thr	Phe	Ser	Lys	Gln	Thr	Ala	Glu
				245					250					255
Asn	Ala	Arg	Asn	Ala	Ile	Gln	Lys	Leu	Lys	Ser	Glu	Gly	Ala	Asp
				260					265					270
Arg	Phe	Val	Leu	Asp	Val	Arg	Asn	Asn	Gly	Gly	Gly	Leu	Phe	Pro
				275					280					285
Ala	Gly	Val	Asp	Val	Ala	Arg	Met	Trp	Leu	Asp	Ser	Gly	Glu	Ile
				290					295					300
Val	Leu	Ile	Ala	Asp	Ser	Gln	Gly	Val	Arg	Asp	Ser	Tyr	Glu	Ala
				305					310					315
Asp	Gly	Gly	Ala	Leu	Asp	Ala	Thr	Ser	Pro	Leu	Ser	Val	Leu	Val
				320					325					330
Asn	Arg	Gly	Thr	Ala	Ser	Ala	Ser	Glu	Val	Leu	Ala	Gly	Ala	Leu
				335					340					345
Lys	Asp	Asn	Gly	Arg	Ala	Arg	Ile	Val	Gly	Glu	Arg	Thr	Phe	Gly
				350					355					360
Lys	Gly	Leu	Ile	Gln	Thr	Ile	Val	Glu	Leu	Ser	Asp	Gly	Ser	Gly
				365					370					375
Val	Ala	Val	Thr	Val	Ala	Arg	Tyr	Gln	Thr	Pro	Ala	Gly	Thr	Asp
				380					385					390
Ile	Asn	Lys	Val	Gly	Ile	Gln	Pro	Asp	Val	Thr	Leu	Gly	Pro	Asp
				395					400					405
Thr	Met	Pro	Pro	Ala	Asp	Gly	Pro	Gly	Phe	Cys	Lys	Phe	Val	Ala
				410					415					420
Ser	Ala	Asp	Ala	Pro	Gln	Leu	Phe	Gly	Pro	Val	Arg	Ser	Lys	Ala
				425					430					435
Ser	Val	Ala	Ala	Asp	Leu	Val	Val	Ser	Thr	Arg				
				440					445					

<210> SEQ ID NO: 6
<211> 606
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CTC TCA TGT GAC ACA GAG GTT TCA ATG CCT GGC AGT CCA GGC ATG CAC 48
CCT TGT GTC GTG CCG GGA CCC GCA TCT CCC ATT CCG CGA TGT ACA TAC 96
GCC CAT GCA CAG GCC CTG TGG TGG GTA CGA AGT CGA TTT CAA AAA CAT 144
GTA GCA TGC GCA CGT AAG CCT GGC ATT CTA CTT GAT CAT TGC CAA ACA 192
ATA TCA AAG CGT GAT AAG TCG TCG GGA CAG ATC CTG CTC AGT AGC GTT 240
CAC CGA GCA TTG CAG CGA GGC ATG CTG GGC ATC GCA GGC CTG GCC AGC 288
GTC ATC CTG CTG AGC GGT GCC AGC CCT GCA CAT GCC GTG AAC AAG AAC 336
CAG CTC CTC TAC CTG GAG GCA TGG AAG GCC GTG GAT CGT GCC TAC GTT 384
GAC AAG ACA TTC AAT GGC CAG AAC TGG TAC AAG ATT AGG GAG TCG CTC 432
CTG CAG TCA GAG TCT TTT GGT GAC AGG GAG CAG GCG TAT GCC GCC ATC 480
CGG AAG CTG CTG GGC ACC CTG GGG GAC CCC TTC ACG CGC CAC CTC GGC 528
CCT GAC CAG TAT GAA GCA CTC AAA CGG GCC TCC AGC GGC CAG CTG TCC 576
GGC ATC GGG GTG GAG GTG GGG CTC CGC AGC 			606

<210> SEQ ID NO: 7
<211> 162
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Gln	Ala	Leu	Thr	Ala	Ser	Val	Cys	Ala	Ala	Ala	Gly	Ser	Ala
1				5					10					15
Ala	Arg	Ala	Gly	Ser	Arg	Arg	Arg	Ala	Phe	Ala	Gly	Arg	Gln	Leu
				20					25					30
Arg	Ala	Ala	Val	Ser	Asn	Gly	Ser	Arg	Cys	Arg	Ala	Phe	Phe	Lys
				35					40					45
Phe	Gly	Asn	Ser	Lys	Asp	Lys	Glu	Ala	Asn	Gly	Ala	Pro	Gln	Asp
				50					55					60
Ala	Glu	Tyr	Phe	Gln	Gly	Gln	Lys	Arg	Gly	Asp	Tyr	Gln	Ala	Ser
				65					70					75
Asp	Val	Gln	Asp	Tyr	Phe	Met	Tyr	Met	Gly	Met	Leu	Ala	Ser	Glu
				80					85					90
Gly	Asn	Tyr	Glu	Arg	Cys	Glu	Ala	Met	Leu	Ala	Thr	Gly	Thr	Ala
				95					100					105
Pro	Val	Asp	Leu	Leu	Leu	Leu	Met	Ala	Cys	Ser	Glu	Asn	Asp	Asp
				110					115					120
Gly	Lys	Val	Glu	Glu	Leu	Leu	Glu	Ser	Gly	Ala	Asp	Pro	Ser	Ile
				125					130					135
Lys	Asp	Leu	Asp	Gly	Arg	Thr	Pro	Met	Glu	Leu	Thr	Thr	Lys	Glu
				140					145					150
Glu	Val	Arg	Glu	Leu	Leu	Ser	Lys	Tyr	Val	Thr	Ala			
				155					160					

<210> SEQ ID NO: 8
<211> 525
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG TCG CTG GTC CAC GCG GCT GCT CGG GGC ATG CCC ATC GGC TCA TGC 48
ACT CTC ATT ACA TGC ACA GCA CAG CGG TCG ATT GGT TGC CAC CAA AGG 96
AAC CTA CGC TTT CCC CCC GCG GTC CAG ACA TAC GTA GGG TCT ATG AGC 144
AAG GCT CAA ACT GTA ACG GTA TCC CGC AGG CTA TCG TGC AAC GCA TTC 192
TTC GGG TTT GGG AAG CAG AAT GAC ACA TTG CCG ACA CTG CAA CGG GGG 240
GAC TAT ACC AGG AGT GAG GTG GAA GAC TAC TAC AAC TAC ATG GGG ATG 288
CTG GCC GCA GAG GGC AAC TAT GAC AAA CTG GAA GAG CTG TTC GAC AGT 336
GGC GTG GAA CCT GTG GAT CTC CTG GTT CTC CTT GCC AGC ACC GAG AAC 384
GAC CTG CCC AAG CTG GAG GAG CTG GTG GCG GCG GGG GCA GAC CTG AGC 432
GTC AAG GAT CCA GAG GGA AGG GGC TGC ATG GAC CTG TGC ACG AAA CCT 480
GCG ATC AAG GAA TAC TTG AAA TCT GCT TCC AAC TCC ATG GCA TAG     525

<210> SEQ ID NO: 9
<211> 718
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Gln	Thr	Met	Leu	Lys	Gln	Arg	Cys	Gln	Pro	Ala	Val	Gly	Lys
1				5					10					15
Gln	Ala	Lys	Ala	Val	Pro	Ala	Val	Ala	Pro	Lys	Val	Gly	Arg	Ala
				20					25					30
Arg	Asn	Val	Val	Val	Ala	Gln	Ala	Ala	Pro	Ala	Ala	Ala	Lys	Ala
				35					40					45
Ala	Ala	Pro	Ser	Ile	Ser	Arg	Asp	Glu	Val	Glu	Lys	Cys	Ile	Asn
				50					55					60
Ala	Ile	Arg	Phe	Leu	Ala	Ile	Asp	Ala	Ile	Asn	Lys	Ser	Lys	Ser
				65					70					75
Gly	His	Pro	Gly	Met	Pro	Met	Gly	Cys	Ala	Pro	Met	Gly	Tyr	Val
				80					85					90
Leu	Trp	Asn	Glu	Val	Met	Lys	Tyr	Asn	Pro	Lys	Asn	Pro	Asp	Phe
				95					100					105
Phe	Asn	Arg	Asp	Arg	Phe	Val	Leu	Ser	Ala	Gly	His	Gly	Ser	Met
				110					115					120
Phe	Gln	Tyr	Ser	Met	Met	His	Leu	Thr	Gly	Tyr	Asp	Ser	Val	Pro
				125					130					135
Leu	Asp	Gln	Ile	Lys	Gln	Phe	Arg	Gln	Trp	Asn	Ser	Leu	Thr	Pro
				140					145					150
Gly	His	Pro	Glu	Asn	Phe	Val	Thr	Pro	Gly	Val	Glu	Val	Thr	Thr
				155					160					165
Gly	Pro	Leu	Gly	Gln	Gly	Ile	Cys	Asn	Ala	Val	Gly	Leu	Ala	Val
				170					175					180
Ala	Glu	Ala	His	Leu	Ala	Ala	Arg	Phe	Asn	Lys	Pro	Asp	Val	Lys
				185					190					195
Pro	Ile	Val	Asp	His	Tyr	Thr	Tyr	Cys	Ile	Leu	Gly	Asp	Gly	Cys
				200					205					210
Met	Met	Glu	Gly	Ile	Ser	Asn	Glu	Ala	Cys	Ser	Leu	Ala	Gly	His
				215					220					225
Trp	Gly	Leu	Gly	Lys	Leu	Ile	Ala	Leu	Tyr	Asp	Asp	Asn	Lys	Ile
				230					235					240
Ser	Ile	Asp	Gly	His	Thr	Asp	Ile	Ser	Phe	Thr	Glu	Asp	Val	Ala
				245					250					255
Lys	Arg	Tyr	Glu	Ala	Leu	Gly	Trp	His	Val	Ile	His	Val	Ile	Asn
				260					265					270
Gly	Asn	Thr	Asp	Val	Asp	Gly	Leu	Arg	Ala	Ala	Ile	Ala	Gln	Ala
				275					280					285
Lys	Ala	Val	Lys	Asp	Lys	Pro	Thr	Leu	Ile	Lys	Val	Ser	Thr	Leu
				290					295					300
Ile	Gly	Tyr	Gly	Ser	Pro	Asn	Lys	Ala	Asp	Ser	His	Asp	Val	His
				305					310					315
Gly	Ala	Pro	Leu	Gly	Pro	Asp	Glu	Thr	Ala	Ala	Thr	Arg	Lys	Asn
				320					325					330
Leu	Asn	Trp	Pro	Tyr	Gly	Glu	Phe	Glu	Val	Pro	Gln	Asp	Val	Tyr
				335					340					345
Asp	Val	Phe	Arg	Gly	Ala	Ile	Lys	Arg	Gly	Ala	Glu	Glu	Glu	Ala
				350					355					360
Asn	Trp	His	Lys	Ala	Cys	Ala	Glu	Tyr	Lys	Ala	Lys	Tyr	Pro	Lys
				365					370					375
Glu	Trp	Ala	Glu	Phe	Glu	Ala	Leu	Thr	Ser	Cys	Lys	Leu	Pro	Glu
				380					385					390
Asn	Trp	Glu	Ala	Ala	Leu	Pro	His	Phe	Lys	Pro	Glu	Asp	Lys	Gly
				395					400					405
Leu	Ala	Thr	Arg	Gln	His	Ser	Gln	Thr	Met	Ile	Asn	Ala	Leu	Ala
				410					415					420
Pro	Ala	Leu	Pro	Gly	Leu	Ile	Gly	Gly	Ser	Ala	Asp	Leu	Ala	Pro
				425					430					435
Ser	Asn	Leu	Thr	Leu	Met	Lys	Ile	Ser	Gly	Asp	Phe	Gln	Lys	Gly
				440					445					450
Ser	Tyr	Ala	Glu	Arg	Asn	Leu	Arg	Phe	Gly	Val	Arg	Glu	His	Ala
				455					460					465
Met	Gly	Ala	Ile	Cys	Asn	Gly	Ile	Ala	Leu	His	Lys	Ser	Gly	Leu
				470					475					480
Ile	Pro	Tyr	Cys	Ala	Thr	Phe	Tyr	Ile	Phe	Thr	Asp	Tyr	Met	Arg
				485					490					495
Asn	Ala	Met	Arg	Met	Ser	Ala	Leu	Ser	Glu	Ala	Gly	Val	Val	Tyr
				500					505					510
Val	Met	Thr	His	Asp	Ser	Ile	Gly	Leu	Gly	Glu	Asp	Gly	Pro	Thr
				515					520					525
His	Gln	Pro	Ile	Glu	His	Leu	Ala	Ser	Phe	Arg	Ala	Met	Pro	Asp
				530					535					540
Met	Leu	Met	Ile	Arg	Pro	Ala	Gly	Gly	Asn	Glu	Thr	Ala	Gly	Ala
				545					550					555
Tyr	Lys	Val	Ala	Ile	Ala	Asn	Arg	Lys	Arg	Pro	Thr	Thr	Ile	Ala
				560					565					570
Leu	Ser	Arg	Gln	Asn	Met	Pro	Asn	Ile	Pro	Asn	Cys	Ser	Val	Glu
				575					580					585
Gly	Val	Ala	Lys	Gly	Ala	Tyr	Thr	Ile	His	Asp	Thr	Lys	Ala	Gly
				590					595					600
Val	Lys	Pro	Asp	Val	Ile	Leu	Met	Gly	Thr	Gly	Ser	Glu	Leu	Glu
				605					610					615
Leu	Ala	Thr	Ala	Ala	Ala	Gly	Ile	Leu	Glu	Lys	Glu	Gly	Lys	Asn
				620					625					630
Val	Arg	Val	Val	Ser	Phe	Pro	Cys	Trp	Glu	Leu	Phe	Glu	Glu	Gln
				635					640					645
Ser	Ala	Glu	Tyr	Lys	Glu	Ser	Val	Leu	Pro	Ser	Asp	Val	Thr	Ala
				650					655					660
Arg	Val	Ser	Val	Glu	Ala	Ala	Thr	Ser	Phe	Gly	Trp	Ala	Lys	Tyr
				665					670					675
Ile	Gly	Leu	Lys	Gly	Lys	His	Val	Gly	Ile	Asp	Thr	Phe	Gly	Ala
				680					685					690
Ser	Ala	Pro	Ala	Pro	Thr	Leu	Tyr	Glu	Lys	Phe	Gly	Ile	Thr	Val
				695					700					705
Asn	His	Val	Val	Glu	Ala	Ala	Lys	Ala	Thr	Leu	Gln	His		
				710					715					

<210> SEQ ID NO: 10
<211> 1784
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
AGT TTG TGG CAT AAG CTT CCT TTG AAC GCG CAA GTA TAG CTT GTA TCC 48
ACA GCG TTG CTT CAT TAG GGA AGA ACA GAG CTA CAT CCG CCA TGG TTG 96
CTG CAC AGT GTC TGC TGA GCG TGG CCC CTA CCC GGC CCG GGC TCT GCA 144
GCG TGC GTC AGG GAG TGG CCG CGC CGT TCT CGG CTC CTC GCT TCA CTC 192
GTC GCA CAC TGC GGA CAT GCA GGC CCC TCA CTC GCG CTG CCC TTG CGG 240
TGG AGA TCC CCC CGG CGA CCG GTG ACC TGA AGC CCA AGG ACA AGA ACG 288
CGG AGC TGG CCA TCA ATG CCA TCC GCT TCC TCT CCA TCG ACG GCG TCA 336
ACG CGG CCA AGT CTG GCC ACC CGG GGC TGC CCA TGG GCT GCG CCC CCA 384
TGA CCT ACG TCA TCT GGA AGG ACT TCA TGA ATG TCA ACC CCA AGG ACC 432
CCA AGT GGC CCA ACA GGG ATC GCT TCG TGC TGA GCG CGG GCC ACG GCT 480
CCA TGC TCC AGT ACG CCA TCC TGC ACT TGA TGG GCT TCA ACC TGC CGA 528
TCT CCG AGC TGA AGC AGT TCC GCC AGT GGG GCA GCA AGA CCC CCG GCC 576
ACC CCG AGA ACT TTG AGA CCG AGG GCG TGG AGG TGA CCA CAG GCC CCC 624
TGG GCC AGG GCA TCG CCA ACG CGG TGG GGC TGG CTG TGG CTG AGA CGC 672
ACC TGG CCG CGC GCT TCA ACA AGC CCG ACG CCA AGC TGG TGG ACC ACT 720
ACA CCT ACT GCA TCA TGG GCG ACG GCT GCA ACA TGG AGG GCA TCT CCA 768
ACG AGG CCG CCT CCC TGG CGG GGC ACT GGA AGC TGG GCA AGC TCA TCG 816
CCT TCT ACG ACG ACA ACC GCA TCT CCA TCG ACG GCC ACA CCT CCA TCT 864
CCT TCA CCG AGG ATG TGG TGG CGC GGT ACG AGG CTC TGG GCT GGC ACA 912
CCA TCC ATG TCA AGG ATG GAA ACC ACG ACC TGG CGG GCC TGC GCG ACG 960
CCA TCA ACG AGG CCA AGT CCG TGA CGG ATA AGC CCA CCC TGA TCA AGG 1008
TGT CCA CCA TCA TCG GAT ACG GCT CGC CCA ACA AGG CCG ACT CGC ACG 1056
ACG TGC ACG GCT CCG CCC TGG GCG CGG CGG AGG CCC AGG CCA CGC GCG 1104
ACA ACT TGG GCT GGC CCT ACG GCG AGT TCG AGG TGC CCC AGG AGG CGT 1152
ACG ACG AGT TCG GCA AGG CAG CCA AAC GCG GCG AGG AGG CGG AGG CGG 1200
CGT GGC AGG CCA CCC GCC GCG AGT ACG CGG AGA AGT ACC CCG AGG AGT 1248
ACG CGG AGT ACG AGG CCA TCA CCA GCG GGC ACC TGC CCG CGG GCT GGG 1296
CGG ACG TGC TGC CCT CCT TCA CCA GCG CCG ACA AGG GCC TGG CCA CGC 1344
GCC TGC ACA GCC AGA CCA TGC TCA ACG CGC TGA GCC CGG TGC TGC CCG 1392
GCC TGC TGG GCG GGT CCG CCG ACC TCG CCG GCT CCT GCA TGA CCC TGG 1440
TGA AGA GCT CCG GCG ACT ACC AGG CTG ACT CCC CCG CCG AGC GCA ACT 1488
TCC GCT TCG GCG TGC GCG AGC ACG CCA TGG GCG CCA TCG GCA ACG GCA 1536
TGG CCC TGC ACT CCC CCG GCC TCC TGC CCT ACA CAG CCA CCT TCT TCA 1584
TCT TCA CGG ACT ACA TGC GCA ACG CCA TCC GCA TGG CCG CCC TGT CCC 1632
AGG CCG GCC AGC TCT TCG TCA TGA CCC ACG ACT CCA TCG GCC TGG GCG 1680
AGG ACG GGC CCA CCC ACC AGC CCG TGG AGC AGC TGG CCT CCT TCC GCG 1728
CCA TGC CCA ACA TCC TCA TGC TCC GCC CCG GCG ACG GCA ACG AGA CCG 1776
CAG GCG CG              					1784

<210> SEQ ID NO: 11
<211> 267
<212> Amino Acid Sequence
<213> Arabidopsis thaliana

Met	Ala	Ala	Ser	Thr	Met	Ala	Leu	Ser	Ser	Pro	Ala	Phe	Ala	Gly
1				5					10					15
Lys	Ala	Val	Asn	Leu	Ser	Pro	Ala	Ala	Ser	Glu	Val	Leu	Gly	Ser
				20					25					30
Gly	Arg	Val	Thr	Met	Arg	Lys	Thr	Val	Ala	Lys	Pro	Lys	Gly	Pro
				35					40					45
Ser	Gly	Ser	Pro	Trp	Tyr	Gly	Ser	Asp	Arg	Val	Lys	Tyr	Leu	Gly
				50					55					60
Pro	Phe	Ser	Gly	Glu	Ser	Pro	Ser	Tyr	Leu	Thr	Gly	Glu	Phe	Pro
				65					70					75
Gly	Asp	Tyr	Gly	Trp	Asp	Thr	Ala	Gly	Leu	Ser	Ala	Asp	Pro	Glu
				80					85					90
Thr	Phe	Ala	Arg	Asn	Arg	Glu	Leu	Glu	Val	Ile	His	Ser	Arg	Trp
				95					100					105
Ala	Met	Leu	Gly	Ala	Leu	Gly	Cys	Val	Phe	Pro	Glu	Leu	Leu	Ala
				110					115					120
Arg	Asn	Gly	Val	Lys	Phe	Gly	Glu	Ala	Val	Trp	Phe	Lys	Ala	Gly
				125					130					135
Ser	Gln	Ile	Phe	Ser	Asp	Gly	Gly	Leu	Asp	Tyr	Leu	Gly	Asn	Pro
				140					145					150
Ser	Leu	Val	His	Ala	Gln	Ser	Ile	Leu	Ala	Ile	Trp	Ala	Thr	Gln
				155					160					165
Val	Ile	Leu	Met	Gly	Ala	Val	Glu	Gly	Tyr	Arg	Val	Ala	Gly	Asn
				170					175					180
Gly	Pro	Leu	Gly	Glu	Ala	Glu	Asp	Leu	Leu	Tyr	Pro	Gly	Gly	Ser
				185					190					195
Phe	Asp	Pro	Leu	Gly	Leu	Ala	Thr	Asp	Pro	Glu	Ala	Phe	Ala	Glu
				200					205					210
Leu	Lys	Val	Lys	Glu	Leu	Lys	Asn	Gly	Arg	Leu	Ala	Met	Phe	Ser
				215					220					225
Met	Phe	Gly	Phe	Phe	Val	Gln	Ala	Ile	Val	Thr	Gly	Lys	Gly	Pro
				230					235					240
Ile	Glu	Asn	Leu	Ala	Asp	His	Leu	Ala	Asp	Pro	Val	Asn	Asn	Asn
				245					250					255
Ala	Trp	Ala	Phe	Ala	Thr	Asn	Phe	Val	Pro	Gly	Lys	His	Val	Ile
				260					265					270
His	Val	Ile	Asn	Gly	Asn	Thr	Asp	Val	Asp	Gly	Leu	Arg	Ala	Ala
				275					280					285
Ile	Ala	Gln	Ala	Lys	Ala	Val	Lys	Asp	Lys	Pro	Thr	Leu	Ile	Lys
				290					295					300
Val	Ser	Thr	Leu	Ile	Gly	Tyr	Gly	Ser	Pro	Asn	Lys	Ala	Asp	Ser
				305					310					315
His	Asp	Val	His	Gly	Ala	Pro	Leu	Gly	Pro	Asp	Glu	Thr	Ala	Ala
				320					325					330
Thr	Arg	Lys	Asn	Leu	Asn	Trp	Pro	Tyr	Gly	Glu	Phe	Glu	Val	Pro
				335					340					345
Gln	Asp	Val	Tyr	Asp	Val	Phe	Arg	Gly	Ala	Ile	Lys	Arg	Gly	Ala
				350					355					360
Glu	Glu	Glu	Ala	Asn	Trp	His	Lys	Ala	Cys	Ala	Glu	Tyr	Lys	Ala
				365					370					375
Lys	Tyr	Pro	Lys	Glu	Trp	Ala	Glu	Phe	Glu	Ala	Leu	Thr	Ser	Cys
				380					385					390
Lys	Leu	Pro	Glu	Asn	Trp	Glu	Ala	Ala	Leu	Pro	His	Phe	Lys	Pro
				395					400					405
Glu	Asp	Lys	Gly	Leu	Ala	Thr	Arg	Gln	His	Ser	Gln	Thr	Met	Ile
				410					415					420
Asn	Ala	Leu	Ala	Pro	Ala	Leu	Pro	Gly	Leu	Ile	Gly	Gly	Ser	Ala
				425					430					435
Asp	Leu	Ala	Pro	Ser	Asn	Leu	Thr	Leu	Met	Lys	Ile	Ser	Gly	Asp
				440					445					450
Phe	Gln	Lys	Gly	Ser	Tyr	Ala	Glu	Arg	Asn	Leu	Arg	Phe	Gly	Val
				455					460					465
Arg	Glu	His	Ala	Met	Gly	Ala	Ile	Cys	Asn	Gly	Ile	Ala	Leu	His
				470					475					480
Lys	Ser	Gly	Leu	Ile	Pro	Tyr	Cys	Ala	Thr	Phe	Tyr	Ile	Phe	Thr
				485					490					495
Asp	Tyr	Met	Arg	Asn	Ala	Met	Arg	Met	Ser	Ala	Leu	Ser	Glu	Ala
				500					505					510
Gly	Val	Val	Tyr	Val	Met	Thr	His	Asp	Ser	Ile	Gly	Leu	Gly	Glu
				515					520					525
Asp	Gly	Pro	Thr	His	Gln	Pro	Ile	Glu	His	Leu	Ala	Ser	Phe	Arg
				530					535					540
Ala	Met	Pro	Asp	Met	Leu	Met	Ile	Arg	Pro	Ala	Gly	Gly	Asn	Glu
				545					550					555
Thr	Ala	Gly	Ala	Tyr	Lys	Val	Ala	Ile	Ala	Asn	Arg	Lys	Arg	Pro
				560					565					570
Thr	Thr	Ile	Ala	Leu	Ser	Arg	Gln	Asn	Met	Pro	Asn	Ile	Pro	Asn
				575					580					585
Cys	Ser	Val	Glu	Gly	Val	Ala	Lys	Gly	Ala	Tyr	Thr	Ile	His	Asp
				590					595					600
Thr	Lys	Ala	Gly	Val	Lys	Pro	Asp	Val	Ile	Leu	Met	Gly	Thr	Gly
				605					610					615
Ser	Glu	Leu	Glu	Leu	Ala	Thr	Ala	Ala	Ala	Gly	Ile	Leu	Glu	Lys
				620					625					630
Glu	Gly	Lys	Asn	Val	Arg	Val	Val	Ser	Phe	Pro	Cys	Trp	Glu	Leu
				635					640					645
Phe	Glu	Glu	Gln	Ser	Ala	Glu	Tyr	Lys	Glu	Ser	Val	Leu	Pro	Ser
				650					655					660
Asp	Val	Thr	Ala	Arg	Val	Ser	Val	Glu	Ala	Ala	Thr	Ser	Phe	Gly
				665					670					675
Trp	Ala	Lys	Tyr	Ile	Gly	Leu	Lys	Gly	Lys	His	Val	Gly	Ile	Asp
				680					685					690
Thr	Phe	Gly	Ala	Ser	Ala	Pro	Ala	Pro	Thr	Leu	Tyr	Glu	Lys	Phe
				695					700					705
Gly	Ile	Thr	Val	Asn	His	Val	Val	Glu	Ala	Ala	Lys	Ala	Thr	Leu
				710					715					720
Gln	His													

<210> SEQ ID NO: 12
<211> 1226
<212> Genomic Sequence 
<213> Auxenochlorella protothecoides

<400>
AGAGTGCGAT GAAACGGCAT GTAAGCAAAG CATGCATGGA CTGGGCCCCT 50
GGACAGGGGG TCGAGGAGCG GTTGGGTTGG CTGCTCAGCA AACTGGATGT 100
GCTGCATGTG GTATGCCCCC TCCTCGCCCC CCTCGTCCTC CTCGGGCCCA 150
CCAGAGTCCC CTCATCCCAC CCAGGTATGG CCCCGACCGC GCGCTGTACC 200
TGCCTGACGG CCTACTGGAC CGGGACGAGG TCTCCCCCGT GCTCAACGGC 250
ACCCTGCCTG GCGACTACGG CTACGACCCG CTGGGCCTGG CCAAGAACTC 300
CGAGACGCTG GAGAAGTACC GCGCCAACGA GCTGCTGCAC GCGCGCTGGG 350
CCATGCTGGC CGCGGCCGGG GCCATCATCC CCGAGGGCCT GGAGGCCAAC 400
GGCGCCAACA TCCACGGCGG CACCTGGTTC GAGACGGGGG CGGAGATGCT 450
CAACGGCGGC ACGCTCAACT ACTTCGCGGT GCCCTGGGGC ATCGTGGGCA 500
ACCCACTGCC CCTGGCGGGG GCCATCCTCA TTGAGCTGGT CCTGCTCTTC 550
CAGGTGGAGA CCTTCCGCGG CAAAGGTGCG AGGAGCGGAG GGGGGTGCCT 600
CTGGTGTCGT CTGATGGGGA GGTGTGTGTG ATGAGGGGCA ATCTACTTGT 650
TTGTACTCCA TGTGGCATGG GGGGGGAAAT TCCTGTTGCT TGGCCTGAGG 700
CAATGCTGTC GACCCGGCTT GTACTGCAGG CAGACCTCAA CATGATGTTT 750
CCATTTCTTG TTTCAAATAA TCAAAGTACA GTCGGTGTAT GCCAACACTC 800
TCACAATCAG CAACATGCCA GATGCTTCGA CCACCCCCAC ACCACACAGG 850
CACTGGCCCC CCTGGCTACT CCCCCGGCGT GGGTAAGTTT GAGTCGTCGG 900
ACCTGCAGGG GCTGGACCCC CTGTACCCTG GCGGTCCCTT CGACCCCCTG 950
GGATCAAGAA CGGGCGGCTG GCCATGTTCT CCATGTTTGG CTTCTTCATC 1000
CAGGCCATCG TAACGGGGAA GGGCCCCATC CAGAACCTGA ACGACCACCT 1050
TGCCGACCCC GGCGCAAACA ACGCCTGGGC CTTCGCCACG AAGTTTACCC 1100
CCTGATCTGG GATGGGCGCG AGAAGAGACC CCCCCCGCTC GCCTTCGATT 1150
TTTAGTAGCG CAATCTTTTG ATGCTGGGGC TGTGCAGCCT GTGCCGCCTG 1200
TGCCACACCC GCCGATTCCT GATGTA                           1226

<210> SEQ ID NO: 13
<211> 251
<212> Amino Acid Sequence
<213> Arabidopsis thaliana
Met	Ala	Thr	Val	Thr	Thr	His	Ala	Ser	Ala	Ser	Ile	Phe	Arg	Pro
1				5					10					15
Cys	Thr	Ser	Lys	Pro	Arg	Phe	Leu	Thr	Gly	Ser	Ser	Gly	Arg	Leu
				20					25					30
Asn	Arg	Asp	Leu	Ser	Phe	Thr	Ser	Ile	Gly	Ser	Ser	Ala	Lys	Thr
				35					40					45
Ser	Ser	Phe	Lys	Val	Glu	Ala	Lys	Lys	Gly	Glu	Trp	Leu	Pro	Gly
				50					55					60
Leu	Ala	Ser	Pro	Asp	Tyr	Leu	Thr	Gly	Ser	Leu	Ala	Gly	Asp	Asn
				65					70					75
Gly	Phe	Asp	Pro	Leu	Gly	Leu	Ala	Glu	Asp	Pro	Glu	Asn	Leu	Lys
				80					85					90
Trp	Phe	Val	Gln	Ala	Glu	Leu	Val	Asn	Gly	Arg	Trp	Ala	Met	Leu
				95					100					105
Gly	Val	Ala	Gly	Met	Leu	Leu	Pro	Glu	Val	Phe	Thr	Lys	Ile	Gly
				110					115					120
Ile	Ile	Asn	Val	Pro	Glu	Trp	Tyr	Asp	Ala	Gly	Lys	Glu	Gln	Tyr
				125					130					135
Phe	Ala	Ser	Ser	Ser	Thr	Leu	Phe	Val	Ile	Glu	Phe	Ile	Leu	Phe
				140					145					150
His	Tyr	Val	Glu	Ile	Arg	Arg	Trp	Gln	Asp	Ile	Lys	Asn	Pro	Gly
				155					160					165
Ser	Val	Asn	Gln	Asp	Pro	Ile	Phe	Lys	Gln	Tyr	Ser	Leu	Pro	Lys
				170					175					180
Gly	Glu	Val	Gly	Tyr	Pro	Gly	Gly	Ile	Phe	Asn	Pro	Leu	Asn	Phe
				185					190					195
Ala	Pro	Thr	Gln	Glu	Ala	Lys	Glu	Lys	Glu	Leu	Ala	Asn	Gly	Arg
				200					205					210
Leu	Ala	Met	Leu	Ala	Phe	Leu	Gly	Phe	Val	Val	Gln	His	Asn	Val
				215					220					225
Thr	Gly	Lys	Gly	Pro	Phe	Glu	Asn	Leu	Leu	Gln	His	Leu	Ser	Asp
				230					235					240
Pro	Trp	His	Asn	Thr	Ile	Val	Gln	Thr	Phe	Asn				
				245					250					

<210> SEQ ID NO: 14
<211>  969
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
GCT GGC TGT TCT TGT CGT ATC TCA GCT TGG CTG TGC TCC GTG CTG AAG 48
ACC ATC ATG CTG GCG ACA GCT CCC ACA ACC GCA TTT GGG CAG GTG GCT 96
CCA AGC AGG GTC TGC GGC CTG TCG GGG GCA GCC CGG AGG GCC AGC GTC 144
ATC GCC CGG GCC GAG AAC AAC AAG ATC CAG AAG GTA GAC CGC ATC CAG 192
AAG AAT GGC CCG CTG TAC CTG AAC TTT GCC AGC GAC CAG TCC CTG ACC 240
TAC CTG GAC GGC ACG CTC CCC GCC GAC TTC GGC TTC GAC CCC CTT GGC 288
CTG TCC GAC CCC GAG GGC GCG GGG GGA TTC GTC ACC CCC GAA TGG CTG 336
GCT TAC GGC GAG GTG TAC AAC GGC CGC TGG GCC ATG CTG GGC GCA GCG 384
GGT GTG CTG ATC CCG GAC GTG CTG TCC CAC GCG GGC CTG ATT CCG CAG 432
ACC CCG GAG GAG ATC AAG TGG TGG AAG ACG GGG GTC ATC CCG CCC GCG 480
GGC CAG TAC GAC AAG CTT TGG CTG GAC CCC TAC TCC CTC TTC TGG ATC 528
GAG GCC ATC CTG ATG AAC TTC GTG GAG CTG CGG CGG TAC CAG GAC TAC 576
CGC AAC CCG GGG TCA ATG GGC CGC CAG TAC TTC CTG GGG CTG GAG GGC 624
GGC TTC AAG GGC TCC GGC GAG CCC GCC TAC CCC GGC GGC CCT TTC TTC 672
AAC CCG CTT GGC TTC GGC ACC AAG AGC GCC GAC GAC CTC AAG GTC TGG 720
AAG ACC AAG GAG CTG CGG AAC GGG CGC CTG GCC ATG ATC GCC ATG CTG 768
GGC TTC GCG GGG CAG GCT GTT GCG ACA GAC CGT GGC CCA GTG GAG GGC 816
CTG CTC GCC CAC CTC AGC GAT CCC TTT GGC AAC AAT GTC ATC GGC AAC 864
CTG TCC CAC TTC CTG CGC TGA GCA TGC ACC TGC ATC CCC CTA GTA CAA 912
TGG CAG TAC CGC TGC AGA GAA GAA TTC CTG GCG TGT AAT CTT CCC CCC 960
TCC AAA ACA 							969

<210> SEQ ID NO: 15
<211> 145
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Lys	Ala	Thr	Leu	Arg	Ala	Pro	Ala	Ser	Arg	Ala	Ser	Ala	Val
1				5					10					15
Arg	Pro	Val	Ala	Ser	Leu	Lys	Ala	Ala	Ala	Gln	Arg	Val	Ala	Ser
				20					25					30
Val	Ala	Gly	Val	Ser	Val	Ala	Ser	Leu	Ala	Leu	Thr	Leu	Ala	Ala
				35					40					45
His	Ala	Asp	Ala	Thr	Val	Lys	Leu	Gly	Ala	Asp	Ser	Gly	Ala	Leu
				50					55					60
Glu	Phe	Val	Pro	Lys	Thr	Leu	Thr	Ile	Lys	Ser	Gly	Glu	Thr	Val
				65					70					75
Asn	Phe	Val	Asn	Asn	Ala	Gly	Phe	Pro	His	Asn	Ile	Val	Phe	Asp
				80					85					90
Glu	Asp	Ala	Ile	Pro	Ser	Gly	Val	Asn	Ala	Asp	Ala	Ile	Ser	Arg
				95					100					105
Asp	Asp	Tyr	Leu	Asn	Ala	Pro	Gly	Glu	Thr	Tyr	Ser	Val	Lys	Leu
				110					115					120
Thr	Ala	Ala	Gly	Glu	Tyr	Gly	Tyr	Tyr	Cys	Glu	Pro	His	Gln	Gly
				125					130					135
Ala	Gly	Met	Val	Gly	Lys	Ile	Ile	Val	Gln					
				140					145					

<210> SEQ ID NO: 16
<211>  766
<212> Nucleic Acid Sequence 
<213> Auxenochlorella protothecoides

<400>
GATCTACCGA GTCTCTCTCC GAGGCAGCTT ATCACCCTTG CACTACTCCT   50
 AGATCACCGC CTAGATAACA TGCAGACCTC AGTCAGCGCC ATGCGTGGCA  100
 GCGCCCTGGT GGCGACCACC CACACTGCCA GGACCTCCAC CCGGGCAACC  150
 CGCTGCGTGG CTGCCTTCGA TGCTCGTCGC TGGGCCAGCG TGGCGGGAGT  200
 CGGTCTGGCC TCCATTGGCC TGACTCTCAC CGCCAATGCT GCGACCGTGA  250
 AGCTTGGAAC CGATTCCGGC GGCCTGCAGT TCGACCCCGA GACGGTGACC  300
 ATCTCCAAGG GCGACTCCGT CACCTGGCAG AACAACGCCG GCTTCCCCCA  350
 CAACATCGTC TTCGACGAGG ATGGGGTGCC GTCTGGCGTG AACGTGTCGA  400
 GCCTGAACCA TGAGGACTAC CTCAACGCCC CCGGCCAGTC TGTGACCTCC  450
 AAGTTTGATG TGGCTGGCGA GTACAACTAC TACTGCGAGC CCCACCAAGG  500
 TGCCGGCATG CAGGGCAAGG TCATCGTCAA CTAGGGGCCT GCACAAACCG  550
 ACCTCCCTGT GCCAAGCCTG GCGGCCTGAG ATGGTTTGAT TCTACTTGCA  600
 TGGGAGACCA GAGCTTGCCT TGGCCCGCGT CGACACATCG TGCTTGTGCA  650
 GGACTGGAGT AGCCACAGGC TGGCACCAGG TTGGAAACAC CTCGTCCCCC  700
 TGCACATGCG CCGCCGTGTG CACGATCGTT TTGCATCCTG CTTTGATTTT  750
 GAGAGGAAGA CGTATA                                       766
<210> SEQ ID NO: 17
<211> 408
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Gln	Val	Thr	Met	Lys	Ser	Ser	Ala	Val	Ser	Gly	Gln	Arg	Val
1				5					10					15
Gly	Gly	Ala	Arg	Val	Ala	Thr	Arg	Ser	Val	Arg	Arg	Ala	Gln	Leu
				20					25					30
Gln	Val	Val	Ala	Ser	Ser	Arg	Lys	Gln	Met	Gly	Arg	Trp	Arg	Ser
				35					40					45
Ile	Asp	Ala	Gly	Val	Asp	Ala	Ser	Asp	Asp	Gln	Gln	Asp	Ile	Thr
				50					55					60
Arg	Gly	Arg	Glu	Met	Val	Asp	Asp	Leu	Phe	Gln	Gly	Gly	Phe	Gly
				65					70					75
Ala	Gly	Gly	Thr	His	Asn	Ala	Val	Leu	Ser	Ser	Gln	Glu	Tyr	Leu
				80					85					90
Ser	Gln	Ser	Arg	Ala	Ser	Phe	Asn	Asn	Ile	Glu	Asp	Gly	Phe	Tyr
				95					100					105
Ile	Ser	Pro	Ala	Phe	Leu	Asp	Lys	Met	Thr	Ile	His	Ile	Ala	Lys
				110					115					120
Asn	Phe	Met	Asp	Leu	Pro	Lys	Ile	Lys	Val	Pro	Leu	Ile	Leu	Gly
				125					130					135
Ile	Trp	Gly	Gly	Lys	Gly	Gln	Gly	Lys	Thr	Phe	Gln	Cys	Ala	Leu
				140					145					150
Ala	Tyr	Lys	Lys	Leu	Gly	Ile	Ala	Pro	Ile	Val	Met	Ser	Ala	Gly
				155					160					165
Glu	Leu	Glu	Ser	Gly	Asn	Ala	Gly	Glu	Pro	Ala	Lys	Leu	Ile	Arg
				170					175					180
Thr	Arg	Tyr	Arg	Glu	Ala	Ser	Asp	Ile	Ile	Lys	Lys	Gly	Arg	Met
				185					190					195
Cys	Ser	Leu	Phe	Ile	Asn	Asp	Leu	Asp	Ala	Gly	Ala	Gly	Arg	Met
				200					205					210
Gly	Asp	Thr	Thr	Gln	Tyr	Thr	Val	Asn	Asn	Gln	Met	Val	Asn	Ala
				215					220					225
Thr	Leu	Met	Asn	Ile	Ala	Asp	Asn	Pro	Thr	Asn	Val	Gln	Leu	Pro
				230					235					240
Gly	Val	Tyr	Lys	Asn	Glu	Glu	Ile	Pro	Arg	Val	Pro	Ile	Val	Cys
				245					250					255
Thr	Gly	Asn	Asp	Phe	Ser	Thr	Leu	Tyr	Ala	Pro	Leu	Ile	Arg	Asp
				260					265					270
Gly	Arg	Met	Glu	Lys	Tyr	Tyr	Trp	Asn	Pro	Thr	Arg	Glu	Asp	Arg
				275					280					285
Ile	Gly	Val	Cys	Met	Gly	Ile	Phe	Gln	Glu	Asp	Asn	Val	Gln	Arg
				290					295					300
Arg	Glu	Val	Glu	Asn	Leu	Val	Asp	Thr	Phe	Pro	Gly	Gln	Ser	Ile
				305					310					315
Asp	Phe	Phe	Gly	Ala	Leu	Arg	Ala	Arg	Val	Tyr	Asp	Asp	Met	Val
				320					325					330
Arg	Gln	Trp	Ile	Thr	Asp	Thr	Gly	Val	Asp	Lys	Ile	Gly	Gln	Gln
				335					340					345
Leu	Val	Asn	Ala	Arg	Gln	Lys	Val	Ala	Met	Pro	Lys	Val	Ser	Met
				350					355					360
Asp	Leu	Asn	Val	Leu	Ile	Lys	Tyr	Gly	Lys	Ser	Leu	Val	Asp	Glu
				365					370					375
Gln	Glu	Asn	Val	Lys	Arg	Val	Gln	Leu	Ala	Asp	Ala	Tyr	Leu	Ser
				380					385					390
Gly	Ala	Glu	Leu	Ala	Gly	His	Gly	Gly	Ser	Ser	Leu	Pro	Glu	Ala
				395					400					405
Tyr	Ser	Arg												

<210> SEQ ID NO: 18
<211> 1629
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
GCC CGG GGT GTA TCT CAC ATG TGC GGC GTG AAG GGC ATT AAA GAC GTG 48
ATG GTG TCA ACA TTA TGG AGC GTG CAA TCG ATG CAT CAC CTG TTC TTC 96
TCC TCA AAG AAT TTG GGC AAG ACA CTG CCA TCC CAT GGC ACA CAG ATC 144
CAC GCA AGC ATG CTT GGT ACA GAT AGG TGT AGC CTC CAT GGC CTC CTG 192
AGC GGG GGC ACC CCC TCT GCT TAC GTT CTG TCC CCT CCC CCT GCG GGA 240
CAA CTC CCT CCC CCG CTC ACT CGG CGT GGG AGG AGC CGG TGG AGC CGG 288
CCA GGG CAG CTC CTG ACA TGT AGC TCT CCG CCA GCT GAA CAC GCT TCA 336
CAT TCT CCT GCT CGC CCA CCA GCG CAT TGC CAT ACT TCA TCA GCA CGT 384
CCA GGG TCA TCA TGG GCT TGT CGA ACG TCA CCT TGC CCT CGC GCG AGT 432
TCA CCA GCC GCT TGC CGA TGG CCT CGA TGC CGG TGC CGG AGA TCC ACT 480
CGC GGA CCT TGT CGT CAT AAA CAC GGG CGC GCA GGG CGC CAA AGA AGT 528
CGA TGG ACT GAC CGG GGA AGG CGT CCA CCA GCT TCT CGA TCT CCT TCA 576
GAT TGA CGT CGT CGT GCT GGA AGA TGC CGT TGC ACA CAC CAA CAC GGT 624
CCT CCC GTG TGG GGG CCC AGT AAA ACT TCT CCA TGC GCC CAT CGC GGA 672
TGA GGG GGG CAT ACA GGG TGG AGA AGT CAT TGC CCG TGC ACA CAA CGG 720
GCA CAC GGG GGA TCT CCT CCT GCT TAT ACA CAC CCG GGA GCT GCA CAT 768
TGG TGG GGT TGT CGG CGA TGT TCA TGA GGG TGG CAT TCA CCA TCT GGT 816
TGT TCA CGG TGT ACT GCG TGG CAG AGC CCA TAC GTC CAG CAC CTG CGT 864
CCA GAT CGT TGA TGA AGA GCG AGG ACA TTT TCC CCT TCT TGA TGG TGT 912
CGG AAG CCT CGC GGT AGC GCT GGC GGA TGA GCT TGG CGG GCT CAC CCG 960
CAT TCC CGC TCT CCA GCT CAC CCG CCG ACA TGA CGA TGG GCG CGA TGC 1008
CCA GCT TCT TGA AGG CCA GGT TGC ACT GGA AGG TCT TAC CCT GGC CCT 1056
TGC CTC CCC AGA TTC CCA GGA TCA GGG GCA CCT TGA TCT TGG GCA GGT 1104
CCA GGA AGT TCT TGG CCA CAT GGA TGG ACA GCT TGT CTA GGA AGG CTG 1152
GGG AGA TGT AGA AGC CGT CCT CGA TGT TGC CCA GAG TGC GCT GGG CGG 1200
TGG ACA GGT ACT CCT CGG AGG AAA GGA TGG CGT TGT GAG TGC CGC CAA 1248
TCG CCT GGC CTC CCT GGA ACA GGC TGT CCA CCA TGT CGC GCC CGC GCT 1296
GGA TGT CTT GCT GGT CAT CAG AGG GCG CCT TGC CAG CAT CCA TGT GGG 1344
ACC AGC GGC CCT TGT GGT TGG GCT TGG CCA TGC AGC GAA CGC TGG GAC 1392
GGG CCT GAG ACA TCA TGA CCG CCG AGG GAG GGC GCT GCA CAC CAC ATT 1440
GGG CGG TCC TGA TCC CGT TCC CGG CAA CAG AGG GCT GAG TGA CGC GTG 1488
CAC ACT GCA TGG CTC CGA TCT GAT CTG CGG ATG GCA CGA GCT GTG TCA 1536
AAA GAA GTT CGA TGA GGA TGC GTG TGG CGA CAG CGG CTG TGG TTG ACA 1584
CCA CGC CCT CTA CCT TTC CCC CCA CCC TTC AGA ACG TAC TGG ATC 	1629

<210> SEQ ID NO: 19
<211> 377
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Arg	Thr	Gly	Ala	Leu	Leu	Leu	Val	Ala	Leu	Ala	Leu	Ala
1				5					10					15
Gly	Cys	Ala	Gln	Ala	Cys	Ile	Tyr	Lys	Phe	Gly	Thr	Ser	Pro	Asp
				20					25					30
Ser	Lys	Ala	Thr	Val	Ser	Gly	Asp	His	Trp	Asp	His	Gly	Leu	Asn
				35					40					45
Gly	Glu	Asn	Trp	Glu	Gly	Lys	Asp	Gly	Ala	Gly	Asn	Ala	Trp	Val
				50					55					60
Cys	Lys	Thr	Gly	Arg	Lys	Gln	Ser	Pro	Ile	Asn	Val	Pro	Gln	Tyr
				65					70					75
Gln	Val	Leu	Asp	Gly	Lys	Gly	Ser	Lys	Ile	Ala	Asn	Gly	Leu	Gln
				80					85					90
Thr	Gln	Trp	Ser	Tyr	Pro	Asp	Leu	Met	Ser	Asn	Gly	Thr	Ser	Val
				95					100					105
Gln	Val	Ile	Asn	Asn	Gly	His	Thr	Ile	Gln	Val	Gln	Trp	Thr	Tyr
				110					115					120
Asn	Tyr	Ala	Gly	His	Ala	Thr	Ile	Ala	Ile	Pro	Ala	Met	His	Asn
				125					130					135
Gln	Thr	Asn	Arg	Ile	Val	Asp	Val	Leu	Glu	Met	Arg	Pro	Asn	Asp
				140					145					150
Ala	Ala	Asp	Arg	Val	Thr	Ala	Val	Pro	Thr	Gln	Phe	His	Phe	His
				155					160					165
Ser	Thr	Ser	Glu	His	Leu	Leu	Ala	Gly	Lys	Ile	Tyr	Pro	Leu	Glu
				170					175					180
Leu	His	Ile	Val	His	Gln	Val	Thr	Glu	Lys	Leu	Glu	Ala	Cys	Lys
				185					190					195
Gly	Gly	Cys	Phe	Ser	Val	Thr	Gly	Ile	Leu	Phe	Gln	Leu	Asp	Asn
				200					205					210
Gly	Pro	Asp	Asn	Glu	Leu	Leu	Glu	Pro	Ile	Phe	Ala	Asn	Met	Pro
				215					220					225
Ser	Arg	Glu	Gly	Thr	Phe	Ser	Asn	Leu	Pro	Ala	Gly	Thr	Thr	Ile
				230					235					240
Lys	Leu	Gly	Glu	Leu	Leu	Pro	Ser	Asp	Arg	Asp	Tyr	Val	Thr	Tyr
				245					250					255
Glu	Gly	Ser	Leu	Thr	Thr	Pro	Pro	Cys	Ser	Glu	Gly	Leu	Leu	Trp
				260					265					270
His	Val	Met	Thr	Gln	Pro	Gln	Arg	Ile	Ser	Phe	Gly	Gln	Trp	Asn
				275					280					285
Arg	Tyr	Arg	Leu	Ala	Val	Gly	Leu	Lys	Glu	Cys	Asn	Ser	Thr	Glu
				290					295					300
Thr	Ala	Ala	Asp	Ala	Gly	His	His	His	His	His	Arg	Arg	Leu	Leu
				305					310					315
His	Asn	His	Ala	His	Leu	Glu	Glu	Val	Pro	Ala	Ala	Thr	Ser	Glu
				320					325					330
Pro	Lys	His	Tyr	Phe	Arg	Arg	Val	Met	Leu	Ala	Glu	Ser	Ala	Asn
				335					340					345
Pro	Asp	Ala	Tyr	Thr	Cys	Lys	Ala	Val	Ala	Phe	Gly	Gln	Asn	Phe
				350					355					360
Arg	Asn	Pro	Gln	Tyr	Ala	Asn	Gly	Arg	Thr	Ile	Lys	Leu	Ala	Arg
				365					370					375
Tyr	His													

<210> SEQ ID NO: 20
<211> 1289
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
GCG AGG TGA TGG CTC GGC ACA TAA GAC TTG CAA AGT CTT TCC GGG TTC 48
ACA GAA GTC AGC CCG GTG ACC TTG AGT TGG CAT CAT GGT ACT CTC ATG 96
CAT CAT CCT ATG TAT TGC CTT TGC GCT GGG CCA TGT CCT ACC GCT GAC 144
CGC CGG CTG CGG TCA GCA GCA CTT CCA TAA GGG CAG GCG CAT CGC TGA 192
GGA GTC CGA CAC CTC CGC CTG GAA CTA TGA CCT CAA CGG CGC CGA CTG 240
GCC CGG GAC CTG CAA GGA GGG CAG GGC ACA GTC CCC GAT TGC CAT CCA 288
GCT GAC TGC TGC TGC CCC ATT TGC CGG CCC CCC AAT CTC GTT CGA GTT 336
TGG CAA GGC AAA GGG ACT CCG CGT ATT CAA CAT CAG GAC CGC AAT CCA 384
GGT GGA ATG GGA CGA CCT CGA GGG CAG CCA AAC CGT GAT CCC CAC GAA 432
CGG CAT GTG GGG GTC CGA CGC GGA GAA CAT CTC CTC CGT TCC CGT CAA 480
ACC TTT CCA GTT CCA TTG GCA TTC TAC ATG CGA ACA CAT AGT GGA CGG 528
CTT CAT GTG CCC CTT GGA GCT GCA CCT GGT CAC GAA GGT GGA CAA CGA 576
GAC GGA CGC ACC GGT GCC AAG GTA CTG CCA GAA AAA CAC CTG CCT GGC 624
TGT GTT CGG CGT CAC GTT TGA GTA TAA CGC AGA CGA TGT CTT CAC AGG 672
AAC TGT GCC TTT CAT GGA GGG CAT CGT AGA AAG TCT TCC CTC CCA GGA 720
TGA CGT AGT CGC ACA GGA CGC GAC ATT CTT GGA CAC CAC CTT TGA CCT 768
GAA CAC GCT CAT ACC CTC CAA CTC GAG CTA TGC CTG GTA CCA GGG ATC 816
ACT CAC GGC GCC GCC TTG CTG GGA AGG CGT GAG CTG GCA TGT CTT CAC 864
CGA TCC CAT GGA GGG TCT GTC GGT GCC ACA GCT GGA GGC CCT GCA ATC 912
GGC GTT GGC CTC GCA TCC CGA GGA GCG CGA GCT CAC ATG CTC CCC AGA 960
TGA AGA CAG TGG ATG TGG AGT GAC CGC AGC AGA CTG CAC CAC CAC GCA 1008
CGG CAG CAG GAC CAA CAA CCG TGC CCT GCA ACC GAC CAA TGG ACG CCA 1056
AGT GTT CCT GGC CTC GGC GGC GTG AGC TGG TGC GCC GCT GCT GCT AGC 1104
ACC CCA CCT TGT CAG GCC ACG GTG TGA ACC GCT TTC CTT GCA GCT GCC 1152
AGA GAG TTG ATG TCC TTG AAG AGT TAT ATG ATC TGT CGC CCA TTA CCC 1200
CTT GCC ATG CCA GTT TTG CCC TCG GAT CTG CTA TAA TAA CGA GTG CTG 1248
TGT TAT TAT CGT GCC ATA AGG GGA ATC CGA GCG AAT CCC AG 		1289

<210> SEQ ID NO: 21
<211> 280
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Val	Phe	Lys	Phe	Pro	Thr	Pro	Pro	Gly	Thr	Gln	Lys	Lys	Ala
1				5					10					15
Gly	Thr	Thr	Ala	Thr	Lys	Pro	Ala	Pro	Lys	Ala	Thr	Thr	Lys	Lys
				20					25					30
Val	Ala	Thr	Ser	Thr	Gly	Thr	Arg	Ser	Gly	Gly	Val	Gly	Tyr	Arg
				35					40					45
Lys	Tyr	Gln	Gly	Asp	Ala	Leu	Trp	Leu	Pro	Asn	Thr	Thr	Arg	Pro
				50					55					60
Glu	Trp	Leu	Asp	Gly	Ser	Leu	Pro	Gly	Asp	Arg	Gly	Phe	Asp	Pro
				65					70					75
Leu	Gly	Leu	Ser	Lys	Pro	Ser	Glu	Phe	Val	Val	Ile	Gly	Val	Asp
				80					85					90
Glu	Asn	Asp	Gln	Asn	Ala	Ala	Lys	Asn	Asn	Lys	Gly	Ser	Val	Glu
				95					100					105
Ala	Ile	Val	Gln	Ala	Thr	Pro	Asp	Glu	Val	Ser	Ser	Glu	Asn	Arg
				110					115					120
Leu	Ala	Pro	Tyr	Ser	Glu	Val	Phe	Gly	Leu	Ala	Arg	Phe	Arg	Glu
				125					130					135
Cys	Glu	Leu	Ile	His	Gly	Arg	Trp	Ala	Met	Leu	Ala	Cys	Leu	Gly
				140					145					150
Ala	Leu	Val	Ala	Glu	Ala	Thr	Thr	Gly	Val	Ser	Trp	Val	Glu	Ala
				155					160					165
Gly	Lys	Val	Glu	Leu	Asp	Gly	Ala	Ser	Tyr	Ala	Gly	Leu	Ser	Leu
				170					175					180
Pro	Phe	Ser	Ile	Thr	Gln	Leu	Ile	Trp	Ile	Glu	Val	Ile	Leu	Val
				185					190					195
Gly	Gly	Ala	Glu	Phe	Tyr	Arg	Asn	Ser	Glu	Thr	Asn	Pro	Glu	Lys
				200					205					210
Arg	Cys	Tyr	Pro	Gly	Gly	Val	Phe	Asp	Pro	Leu	Lys	Leu	Ala	Ser
				215					220					225
Glu	Asp	Glu	Glu	Arg	Ala	Phe	Arg	Leu	Lys	Thr	Ala	Glu	Ile	Lys
				230					235					240
His	Ala	Arg	Leu	Ala	Met	Val	Ser	Phe	Phe	Gly	Tyr	Gly	Val	Gln
				245					250					255
Ala	Leu	Ser	Thr	Gly	Glu	Gly	Ala	Leu	Gly	Ser	Leu	Ala	Lys	Phe
				260					265					270
Ala	Asp	Gly	Leu	Asn	Asn	Gly	Lys	Gly	Leu					
				275					280					

<210> SEQ ID NO: 22
<211> 866 
<212> Nucleic Sequence 
<213> Auxenochlorella protothecoides

<400>
TTC CGA TCT CCC GCC CCG AGT GGC TGG ACG GCT CCC TCC CCG GTG ACC 48
GCG GCT TCG ACC CCC TGG GCC TGT CCA AGC CCA CCT CCT ACA TCC AGA 96
TGG ACC TGG ACG CGC TGG ACC AGA ACT CGG CGG TGA ACA AGG CGG GCG 144
GCG TGG TGG GGT CCT TCG CCT CCA ACG CCG ATG AGG TGT CCC CCG ACG 192
CGC TGG CGC CTT ACA GCG AGG TCT TCG GCC TCG CCC GCT TCC GCG AGA 240
ACG AGG TCA TCC ACG GGC GCT GGG CCA TGC TGG CCA CGC TGG GCG TGA 288
TCG TGG CCG AGG CTT CCA CGG GCG TGG CCT GGC AGG ACG CGG GCA AGG 336
TGG AGC TGG ACG GCG CAC AGT ACC TCG GCT TCC CCC TGC CCT TCA CCC 384
TCA CCC AGC TGG TCT GGA TCG AGG CGC TGC TGG TGG GCG GCG CCG AGG 432
TGT ACC GCA ACA CCG AGC TGG ACA CGG AGA AGC GGG CCT ACC CCG GCG 480
GCT GGT TCG ACC CCC TGC GCC TCG CCT CGG GCG ACG ACA ACC GCG CCT 528
TCA AGC TCA AGG AGG CCG AGC TCA AAC ACG GGC GCC TGG CCA TGA TTG 576
CGT TCC TGG GTT TCA GCG TGC AGG CCT GGA CCA CAG GCG AGG GAG CCC 624
TGG GGT CGC TGG CCA AGT TCG CCA CCT CCT TTG CCG GAT GAG GCA GGG 672
TGC GGT GGA ATC TTG AGA AGA TCC AGT CGA ATT TTG TTG TGG AAG TTC 720
CTG GCC CAT GTA GAG GAG CTT GCA CCG GAG TGT CCG ACA TGA GGA CGG 768
CAT GCG GTG CAC CGC TCC TGT GCT CAA GTA AAT CTC TTG TAA TCT TCC 816
CAA CAA CGG AAT GGC CCG GTC AGC CGC TGT TAC CAA TTT GAG GTT CTG 864
GC 								866

<210> SEQ ID NO: 23
<211> 231
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Thr	Lys	Asn	Cys	Ser	Val	Glu	Pro	Ala	Ala	Leu	Tyr	Lys	Ile
1				5					10					15
Leu	Leu	His	Ser	Leu	Lys	His	Thr	Ala	Gly	Val	Asn	Gly	Val	Leu
				20					25					30
Leu	Gly	Thr	Val	Ser	Val	Gly	Thr	Ala	Thr	Gly	Ala	Gly	Pro	Ala
				35					40					45
Thr	Ala	Val	Arg	Val	Val	Asp	Ala	Val	Pro	Val	Gly	His	Gly	Phe
				50					55					60
Val	Thr	Leu	Thr	Pro	Val	Leu	Glu	Met	Ala	Leu	Ser	Gln	Ile	Glu
				65					70					75
Gly	Tyr	Val	His	Glu	Gln	Ala	Gly	Ser	Thr	Cys	Pro	Leu	Gly	Leu
				80					85					90
Arg	Ile	Val	Gly	Tyr	Tyr	Gln	Cys	Asn	Glu	Arg	Leu	Gly	Asp	Ser
				95					100					105
Glu	Leu	Gly	Gly	Gly	Arg	Arg	Val	Ala	Asp	Arg	Ile	Glu	Ala	Ala
				110					115					120
Phe	Pro	Asp	Ser	Val	Ala	Val	Val	Leu	Asp	Ser	Thr	Val	Met	Asp
				125					130					135
Thr	Ala	Leu	Gln	Ala	Ala	Val	Ala	Gln	Gln	Gln	Gln	Asp	Gln	Ala
				140					145					150
Gly	Lys	Gln	Gln	Val	Glu	Glu	Glu	Pro	Val	Leu	Ala	Leu	Phe	Val
				155					160					165
Lys	Asp	Gly	Met	Arg	Gly	Trp	Val	Arg	Ala	Ser	Ala	Ser	Asp	Gly
				170					175					180
Lys	Thr	Arg	Leu	His	Cys	Pro	Thr	Gln	Gly	Val	Ala	Ala	Gln	Leu
				185					190					195
Ala	Gln	Tyr	Ala	Ala	Glu	Gly	Arg	His	Arg	Ala	Leu	Val	Asp	Phe
				200					205					210
Glu	Gln	His	Leu	Asp	Asp	Ile	Asn	Ala	Asn	Trp	Leu	Asn	Thr	Gly
				215					220					225
Leu	Leu	Glu	Gly	Pro	Ala									
				230										

<210> SEQ ID NO: 24
<211>  1311
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GAT CAT ACG CTC GAG CGT ACC GCT CTG CTC AAG ATC CTC CTT CAC GCG 48
ATC AAG TAC CCC GCA TTC GGT GTC AAC GGG GTT TTG CTG GGC AGC ATA 96
GCC AAT GGC ACC GTC ACA GCC ACG GAT GCT GTG CCC TTT GTA CAC AAC 144
TTC ACA ACA CTT CTT CCG GCC TTT GAG ACG GCA CTG TGC CAG GTG GAT 192
AAC CAC ACG AAA ACA CAG AGC GGA GTC CAG GTG GTG GGC TAC TAC CAG 240
GCC AAC GAG CAC AAC AAG GAC ACA CAG CTT TCC GGT GCC GGC AAG CTG 288
CTC GCC ACA AAA GTC GCG ACG CTG TGG CCC TCC GCC ATC GCC GTT GTG 336
CTG GAC CCT GAG GGC CTG AGG CAG CTG TTG CAA TCC AGT GGA GGC GAG 384
TCA CCC AGC GAC GCA CGG CCC TTG TTC ACC CTC CAC ATC AAG GAT GGC 432
CGG AAT TGG GTG GAG TAT GGA GGC AGG CTG AAG GTA CAC ACA GCT GGG 480
ATT CCA TCA GCC CTG GAG CAG CTG AGT GCG GAG GGA AAG CAC AGA CAG 528
CTC GCA GAT TTT GAG GAT CAC ATG GAG GAT GTC AGC CTG GAC TGG CTC 576
AAT GCC GGG CTG ATC AAG GCG TCA TGA GGG GTA GAA GAT AAC TGT TGA 624
GAC GCT GGC AAG GTG TTG ATC GTA GGT ATG AGA CCT GAA GAT CTG TCC 672
TGG TGA CTG CCC TCG TCA CAC CAG ACA ATG AAG CCC AGT TTA CCT CTC 720
CAG CTT CCA GTA CAA ATG GCC GTG AGG TCC ACT GAT GGA GGG ACT CGC 768
AGC TGT TCT GGT CTC CTC GAC AAT CTG GAG CAC TCC CGA GGT CTT TGC 816
AAT CCT GCC TCT CGC GCC TTT GCT TGG CGT CAC ACC CCT GCC CAC GCC 864
TCA CCC CTC CAC TCC CTT GCT CTC ACT TCT TGC GAG CTC TCC CTG CTC 912
CTC AGC ATG CCC TCG CAT GTA TGC TTC AAA GTT ACA CAA GGG AAT GTC 960
GTG AGT GGG TAT GAT CGG AAA TGC ATG TGG GCA TTG CAC AGT CAT CGG 1008
GTT TCC CAA CTT GGG TGT GGG GTG GGG AGA AGG TCT CTC AAG GGG CAA 1056
GTT GAG ATG TGG AAT GCA GGA GGT TCA GGG GGC TGC CAG GCT GGG CTC 1104
CAG GGG ACC TGC GCC GTT GAG TTC AGG GCC CGT GAC GGC CCG CAG CTG 1152
CTC TCC CAC CAC TTC GTA CCC CAG GCT CGT CCT GCT CCT CCG CGC CAC 1200
ATC CTG CAG CTC CGA CAG GTG CGT CCG CCT CAG GTC CCG GAT GGC TTC 1248
GGC CAC ATC GCC CTC GGG GAT GTC CAG CTC TGT CAG GAC GGC GGC CGC 1296
CAG CTG CAG GCT CGG 						1311

<210> SEQ ID NO: 25
<211> 358
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Ala	Met	Leu	Ala	Ser	Lys	Gln	Gly	Ala	Phe	Met	Gly	Arg
1				5					10					15
Ser	Ser	Phe	Ala	Pro	Ala	Pro	Lys	Gly	Val	Ala	Ser	Arg	Gly	Ser
				20					25					30
Leu	Gln	Val	Val	Ala	Gly	Leu	Lys	Glu	Val	Arg	Asp	Arg	Ile	Ala
				35					40					45
Ser	Val	Lys	Asn	Thr	Gln	Lys	Ile	Thr	Asp	Ala	Met	Lys	Leu	Val
				50					55					60
Ala	Ala	Ala	Lys	Val	Arg	Arg	Ala	Gln	Glu	Ala	Val	Val	Asn	Gly
				65					70					75
Arg	Pro	Phe	Ser	Glu	Asn	Leu	Val	Lys	Val	Leu	Tyr	Gly	Val	Asn
				80					85					90
Gln	Arg	Val	Arg	Gln	Glu	Asp	Val	Asp	Ser	Pro	Leu	Cys	Ala	Val
				95					100					105
Arg	Pro	Val	Lys	Ser	Val	Leu	Leu	Val	Val	Leu	Thr	Gly	Asp	Arg
				110					115					120
Gly	Leu	Cys	Gly	Gly	Tyr	Asn	Asn	Phe	Ile	Ile	Lys	Lys	Thr	Glu
				125					130					135
Ala	Arg	Tyr	Arg	Glu	Leu	Thr	Ala	Met	Gly	Val	Lys	Val	Asn	Leu
				140					145					150
Val	Cys	Val	Gly	Arg	Lys	Gly	Ala	Gln	Tyr	Phe	Ala	Arg	Arg	Lys
				155					160					165
Gln	Tyr	Asn	Ile	Val	Lys	Ser	Phe	Ser	Leu	Gly	Ala	Ala	Pro	Ser
				170					175					180
Thr	Lys	Glu	Ala	Gln	Gly	Ile	Ala	Asp	Glu	Ile	Phe	Ala	Ser	Phe
				185					190					195
Ile	Ala	Gln	Glu	Ser	Asp	Lys	Val	Glu	Leu	Val	Phe	Thr	Lys	Phe
				200					205					210
Ile	Ser	Leu	Ile	Asn	Ser	Asn	Pro	Thr	Ile	Gln	Thr	Leu	Leu	Pro
				215					220					225
Met	Thr	Pro	Met	Gly	Glu	Leu	Cys	Asp	Val	Asp	Gly	Lys	Cys	Val
				230					235					240
Asp	Ala	Ala	Asp	Asp	Glu	Ile	Phe	Lys	Leu	Thr	Thr	Lys	Gly	Gly
				245					250					255
Glu	Phe	Ala	Val	Glu	Arg	Glu	Lys	Thr	Thr	Ile	Glu	Thr	Glu	Ala
				260					265					270
Leu	Asp	Pro	Ser	Leu	Ile	Phe	Glu	Gln	Glu	Pro	Ala	Gln	Ile	Leu
				275					280					285
Asp	Ala	Leu	Leu	Pro	Leu	Tyr	Met	Ser	Ser	Cys	Leu	Leu	Arg	Ser
				290					295					300
Leu	Gln	Glu	Ala	Leu	Ala	Ser	Glu	Leu	Ala	Ala	Arg	Met	Asn	Ala
				305					310					315
Met	Asn	Asn	Ala	Ser	Asp	Asn	Ala	Lys	Glu	Leu	Lys	Lys	Gly	Leu
				320					325					330
Thr	Val	Gln	Tyr	Asn	Lys	Gln	Arg	Gln	Ala	Lys	Ile	Thr	Gln	Glu
				335					340					345
Leu	Ala	Glu	Ile	Val	Gly	Gly	Ala	Ala	Ala	Thr	Ser	Gly		
				350					355					

<210> SEQ ID NO: 26
<211> 1515
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CGG CTC ATT AGG GTT GAG GAT CGT CGA ATC ACG GCC AGC ATT CGC GCG 48
AGA TAG GGT AGG GGC AAC ACC ATC ATG GAG GCT CTC TCC GGC ACC CGT 96
GCA TCC TTT GCA GGA GCC ACG CAT GCC TTC ACC GCG CGC AAG CTC CCC 144
GGT ACT TGT CGC CGG ATC AGC CTG CAG GTT CGG AAC GCG GGT ACC AGG 192
GAG ATC AGG GAC CGC ATC TCT TCC GTG AAG AAC ACC CAG AAG ATT ACG 240
GAG GCC ATG AAG CTG GTG GCG GCA GCT AAA GTG CGG CGA GCG CAG GAC 288
GCT GTG GTG CGG GGC CGG CCC TTT GCG GAG AAT CTC GTC AAG GTG CTG 336
TAT GGA GTC AAT CAG CGT CTG CGC GTG GAG GAC ATC GAC TCC CCC CTG 384
GTC GAG AAC CGG CCT GTG AAG ACG GTC CTG CTC CTC TGC GTC ACG GGC 432
GAC CGT GGG CTG TGC GGC GGG TAC AAC GCC ATT GTC ATC AAG AAA ACC 480
GAG GCT CGG TAC AAC GAG CTG ACC AAG CTG GGG GTG AAG GTG CGG CTG 528
CTC CTC ATC GGG AAG AAG GGT TCC GCC GCC TTC AAG CGT CGG CCC AAC 576
TAC AAG AAT GCC ATC GGC AAT GTG TAC GAC ATG GGC CAG GCT CCC TCC 624
GTC AAG GAG GCC CAG GTG GTG GCC GAC GAG ATC TTC AGC GAC TTT GTG 672
TCG CAG GAG GTG GAC AAG GTG GAG ATC ATC TAC ACA AAG TTC GTG TCG 720
CTC ATC GCA TCG GAG CCT GTG GTC CAG ACG CTG CTC CCG CTG TCG CCC 768
ATC GGC GAG GTA TGC GAC GTG GAT GGC AAC TGC GTG GAC GCC GCG GAG 816
GAC GAG GTC TTC AAG CTG ACC ACG CGG GAG GGG CAG CTG GTG GTG GAG 864
TCT GAG AAG ACG GCC AAG GCG GAC CCC AGC GAC TTT GAG GCC GGC CTC 912
ATC TTC GAG CAG GAC CCG GTG CAG ATC GTG GAC GCG CTG CTG CCG CTG 960
TAC CTC AAC TCC ACG CTG CTG CGC AGC CTG CAG GAG GCC CTG GCC TCG 1008
GAG CTG GCG GCG CGC ATG AAT GCC ATG AGC TCC GCC TCG GAC AAC GCG 1056
GCG GAG CTG CGC AAG GGG CTC AAC CAG GTG TAC AAC CGC AAG CGC CAG 1104
GCC AAG ATC ACG ACC GAG CTG AGC GAG ATC GTG GCC GGC GCG AGC TCG 1152
GTG TGA GGT GGC CAG AGG CTG GCG TGG GGC CTG GTT GGC TGC CCC TAT 1200
GGG GGA CTT GGG GGC ACT GCT GGG CGA GTG AAT GCT GCT GAA CAG AGA 1248
CCC AAA TGG GCC TGC AGC CGG CCG GCT GCA TGC GTG CAC AGG CCA GGC 1296
CGT CGG CAG CCT CCC CTG CCG CCC GCG CGA GCC CCC GTC ACT GGC TTT 1344
TGC ATC GCC TGG TCC CAT TCT TTC CTG CAC TCC ATT TCC GTC CTT TGC 1392
CTT GCT TGA GAA GAT GCG TAA CGT CAA GGA CTT GGT ACA GGT GTA GGG 1440
AGA GGC AAA GAT GTG TGG CTT GGG GGA TAG AAT GCC GAG GCT GGG TAG 1488
CGG GCA GCC TGA TTG AAG CTG CAG GGA 				1515

<210> SEQ ID NO: 27
<211> 327
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Tyr	Asn	Phe	Lys	Ser	Leu	Glu	Leu	Asn	Ser	Trp	Asn	Glu	Lys
1				5					10					15
Ile	Ser	Ile	Phe	Ser	Thr	Val	Ala	Ser	Lys	Trp	Leu	Leu	Ile	Val
				20					25					30
Val	Ser	Ser	Phe	Phe	Leu	Thr	Ser	Ser	Ser	Asp	Ala	Tyr	Pro	Ile
				35					40					45
Phe	Ala	Gln	Gln	Asn	Tyr	Glu	Asn	Pro	Arg	Glu	Ala	Asn	Gly	Arg
				50					55					60
Ile	Val	Cys	Ala	Asn	Cys	His	Leu	Ala	Glu	Lys	Pro	Val	Glu	Ile
				65					70					75
Glu	Ile	Pro	Gln	Ala	Val	Leu	Pro	Asp	Thr	Val	Phe	Glu	Ala	Val
				80					85					90
Val	Lys	Ile	Pro	Tyr	Asp	Lys	Gln	Ile	Lys	Gln	Val	Leu	Gly	Asn
				95					100					105
Gly	Lys	Lys	Gly	Gly	Leu	Asn	Val	Gly	Ala	Val	Leu	Ile	Leu	Pro
				110					115					120
Glu	Gly	Phe	Glu	Leu	Ala	Pro	Pro	Asp	Arg	Ile	Pro	Glu	Glu	Met
				125					130					135
Lys	Ala	Lys	Val	Gly	Lys	Leu	Tyr	Phe	Gln	Pro	Tyr	Ser	Ala	Glu
				140					145					150
Gln	Lys	Asn	Ile	Tyr	Val	Val	Gly	Pro	Val	Pro	Gly	Lys	Lys	Tyr
				155					160					165
Ser	Glu	Met	Val	Phe	Pro	Ile	Leu	Ser	Pro	Asp	Pro	Thr	Lys	Asn
				170					175					180
Lys	Ser	Val	Ser	Tyr	Leu	Lys	Tyr	Pro	Ile	Tyr	Val	Gly	Gly	Asn
				185					190					195
Arg	Gly	Arg	Gly	Gln	Val	Tyr	Pro	Asp	Gly	Ser	Lys	Ser	Asn	Asn
				200					205					210
Thr	Ile	Tyr	Thr	Ala	Ser	Ala	Ala	Gly	Lys	Ile	Thr	Ala	Ile	Glu
				215					220					225
Pro	Val	Glu	Lys	Lys	Gly	Gly	Tyr	Val	Val	Thr	Ile	Glu	Thr	Ser
				230					235					240
Asn	Gly	Glu	Ser	Val	Ser	Asp	Thr	Leu	Pro	Pro	Gly	Pro	Glu	Leu
				245					250					255
Ile	Val	Lys	Pro	Gly	Asp	Thr	Val	Ala	Val	Asp	Gln	Ala	Leu	Thr
				260					265					270
Thr	Asn	Pro	Asn	Val	Gly	Gly	Phe	Gly	Gln	Gly	Glu	Thr	Glu	Ile
				275					280					285
Val	Leu	Gln	Asn	Pro	Thr	Arg	Ile	Gln	Gly	Leu	Leu	Val	Phe	Phe
				290					295					300
Leu	Phe	Val	Leu	Leu	Ala	Gln	Val	Phe	Leu	Val	Leu	Lys	Lys	Lys
				305					310					315
Gln	Phe	Glu	Lys	Val	Gln	Leu	Ala	Glu	Met	Asn	Phe			
				320					325					

<210> SEQ ID NO: 28
<211> 960
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG AAG TAT TCC ATT TAT CTA TTT AAA CTA CCA ATT AAT TTT TGT ATT 48
AAA TTA ACA TTT CTT TTG GGT ATT ACT TTT CTA GTT AGT GCT CCA AGT 96
AAT GCC TAT CCT ATT TTT GCT CAA CAA AAT TAC AAC AAC CCT CGG GAA 144
GCA AAT GGG CGA ATT GTG TGT GCG AAT TGT CAT TTA GCC CAA AAG CCA 192
ATC GAA ATT GAA GTA CCA CAA GCT GTT TTA CCA AAT ACA GTA TTT GAG 240
GCA GTT GCA AAA ATA CCT TAC GAT AAA CAA ATT AAA CAA GTT TTA GGG 288
AAC GGT AAA AAA GGT GAT TTA AAT GTA GGC GCT GTG CTT ATT TTA CCA 336
GAA AAT TTT CAG TTA GCA CCT CCG GAA AGA ATA CCA GAA GAA ATT AAA 384
GCG AAA ATG GGG AAA TTA TAT TTT TCA TCA TAT AAT GCG GAA AAA CCT 432
AAT ATT TTA GTT GTT GGG CCA GTT CCA GGT AAA AAA TAT AAA GAA CTT 480
GTT TTT CCT ATT TTA TCA CCC GAT CCG ACT AAA AAT AAA TCA GTA TCG 528
TAT TTA AAA TAC CCT ATT TAT TTT GGT GGA AAT CGT GGT CGT GGT CAA 576
ATT TAT CCA GAC GGC TCA AAA AGT AAT AAT ACT ATT TAT ACT GCA TCT 624
AGC GCC GGA AAA ATC ACT AGT ATA GAA CCA ATT GAG AAA AAA GGT GGA 672
TTT TTA ATT AGT ATT CAA ACG GCA AAT GGT GAT ATT ATT ACA GAT ACA 720
GTA CCG CCA GGT CCC GAG TTA ATT GTA AAA ACT GGT AAT GAA GTT AAG 768
CTA GAT CAA GCA TTA ACA AGT AAC CCG AAT GTC GGT GGT TTT GGG CAA 816
GGT GAA ACA GAA ATT GTT TTA CAA AAC CCA ACT CGA ATT ATC GGT TTA 864
TTA GTT TTT TTT ATT TTA GTT TTA TTT GCC CAA GTA TTT TTT GTA TTA 912
AAA AAG AAA CAA TAC GAA AAA GTT CAA TTA TCT GAA ATG AAT TTT TAA 960

<210> SEQ ID NO: 29
<211> 423
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ala	Arg	Ala	Val	Arg	Ser	Leu	Glu	Ala	Pro	His	Ala	Ala	His
1				5					10					15
Ser	Ser	Thr	Pro	Leu	Ser	Leu	Ala	Arg	Pro	Cys	Cys	Leu	Pro	Ala
				20					25					30
Arg	Ser	Ala	Pro	Ser	Ala	Phe	His	Ser	Leu	Ala	Ala	His	Leu	Thr
				35					40					45
Thr	Thr	Leu	Phe	Leu	Leu	Gly	Asp	Ala	Ala	Asp	Ser	Leu	Ala	Asp
				50					55					60
Ala	Ser	Gly	Thr	Ala	Ala	Ala	Ala	Val	Asp	Ala	Ala	Ser	Asn	Ala
				65					70					75
Ala	Ser	Ala	Val	Ala	Asp	Ala	Ala	Pro	Ala	Ala	Ala	Glu	Glu	Ala
				80					85					90
Ala	Lys	Asn	Ser	Gly	Phe	Phe	Gly	Ala	Phe	Ala	Gly	Ala	Phe	Glu
				95					100					105
Ala	Phe	Leu	Lys	Val	Leu	Asp	Asp	Gly	Leu	Glu	Lys	Val	Gly	Val
				110					115					120
Pro	Tyr	Ser	Tyr	Gly	Phe	Ala	Ile	Ile	Leu	Leu	Thr	Val	Leu	Val
				125					130					135
Lys	Val	Ala	Thr	Tyr	Pro	Leu	Thr	Lys	Gln	Gln	Val	Glu	Ser	Thr
				140					145					150
Leu	Ser	Met	Gln	Ala	Met	Gln	Pro	Arg	Val	Lys	Glu	Leu	Gln	Ala
				155					160					165
Lys	Tyr	Ala	Asn	Asp	Pro	Glu	Arg	Leu	Gln	Val	Glu	Thr	Ala	Lys
				170					175					180
Met	Tyr	Gln	Thr	Ala	Gly	Val	Asn	Pro	Leu	Ala	Gly	Cys	Leu	Pro
				185					190					195
Ser	Leu	Ala	Thr	Ile	Pro	Val	Phe	Ile	Gly	Leu	Tyr	Lys	Ala	Leu
				200					205					210
Ser	Asn	Val	Ala	Ser	Glu	Gly	Leu	Leu	Thr	Asp	Gly	Phe	Phe	Trp
				215					220					225
Ile	Pro	Ser	Leu	Ala	Gly	Pro	Thr	Thr	Val	Asn	Gly	Gly	Leu	Asp
				230					235					240
Trp	Leu	Phe	Lys	Trp	Gln	Asp	Gly	Ala	Pro	Leu	Leu	Gly	Tyr	Gly
				245					250					255
Gln	Thr	Ala	Ala	Tyr	Leu	Val	Leu	Pro	Ile	Leu	Leu	Val	Val	Ser
				260					265					270
Gln	Ala	Ile	Ser	Gln	Lys	Val	Ile	Ser	Pro	Pro	Gln	Gln	Ser	Asn
				275					280					285
Asp	Pro	Ala	Gln	Gln	Gln	Thr	Gln	Ala	Ile	Leu	Lys	Phe	Leu	Pro
				290					295					300
Leu	Met	Ile	Gly	Trp	Phe	Ser	Leu	Asn	Val	Pro	Ser	Gly	Leu	Thr
				305					310					315
Leu	Tyr	Trp	Phe	Thr	Asn	Asn	Leu	Ile	Thr	Thr	Ala	Gln	Gln	Leu
				320					325					330
Tyr	Leu	Arg	Arg	Gly	Phe	Thr	Ala	Ala	Gln	Ala	Ala	Ala	Ala	Gly
				335					340					345
Pro	Ala	Ser	Thr	Ala	Ile	Val	Asn	Val	Glu	Val	Gln	Glu	Ala	Glu
				350					355					360
Lys	Arg	Pro	Ser	Gly	Lys	Glu	Leu	Asn	Ala	Arg	Arg	Ser	Ala	Lys
				365					370					375
Gln	Leu	Glu	Ala	Pro	Val	Ala	Ala	Pro	Ser	Gly	Gly	Gly	Gly	Gly
				380					385					390
Gly	Gly	Gly	Gly	Gly	Ser	Arg	Gly	Glu	Lys	Phe	Arg	Ala	Ile	Lys
				395					400					405
Ala	Arg	Glu	Ala	Val	Ala	Arg	Ala	Ser	Glu	Gln	Val	Gly	Arg	Ala
				410					415					420
Ser	Arg	Gly												
														
<210> SEQ ID NO: 30
<211> 1128
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG ATG CAT CCT TGC CAT GCT GGA CCC CCT GCC TGC CCG CCC TCG TCA 48
CCA TCC CCA TCT TCA TCG GCC TTT ACA GGG CAC TGT CAA ATG TTG CGG 96
ACG AGG GGC TCC TGG GCG AGG GCT TCT TCT GGA TTC CAT CCC TGG CAG 144
GCC CAA CCA CCA ACC AGG GTG GCC TGG GGT GGC TGA AGC TGG CCG GCG 192
GGG CGC CGC CTC TGG GCT GGC ATG ACA CCA TTG CCT ACC TGG TCC TGC 240
CAG TGC TGC TGG TTG TGT CGC AGT ACA TCA GCC AGA AGG TAC TGC AGC 288
CGC CCT CGG CCG ATC CCT CGC AAG CCG CGC AAT CCA ATG CCA TCC TCA 336
AGT TCC TGC CCC TCA TGA TCG GCT GGT TCT CTC TCA ACG TGC CCT CTG 384
GGC TTG GTC TCT ACT GGC TGA CAA ACA ACC TTG TGA CCA CTG GGC AGC 432
AGC TCT ACT TCC GCC GGC AGT TTG CGG GCG CAT CCG CAG GGT CTG GGA 480
GCG AGA TCC AGA GTG GGA CCT CCA GTT CAC AGC CGT CCT CGT CGC AAG 528
TGT ACG ATG TCG AGG CCA CAG AGG TCA AGC CCA GCG GCA AGG AGC TGA 576
ATG CGC GGC GGT CGT CAA ACG CGT CCT CCT CCA GCC GGG GAA AGA AGT 624
TCA AGG CAC TGC GGG AAC GTG AGG CCG CCC AAC GGA CAC AGG GCG GGG 672
CAT CGG CGG CCG AGG CGC CTG AGC GCA GCA AGT CTG GGG TGG CCA CCA 720
TCA GCG CAG AGC GGG GGG AGG CGG AGA GGC TGA AGG TGC CTG ATG CGC 768
CCC CCG CCA AGA ATA ACA ATA GGG ATT GAG GTG GAG GAA GAA GTC TGG 816
TAC CCA GTA TGT CTG TGA CGC ACC CCT GCG CCT CGG ATC TCC CGT CTC 864
ATA CTT TTG TGT TGA TAA CTT GGC CTT GAA TCT CCT GCT CAC ACC TTC 912
TCG TGC ATA TCA GTG GGC TCC CCC TCG ATC CGA GGT GAT GCC TGT TCA 960
TCG CAT CAT GCA AGC AAT CTG CTG TCA TCA GGG ATA TCC CCA AGG TGG 1008
TGC CTC CCA GGC AGG CTG GGA CTG GGC GAA TTG GTG TTG GGA GGT TTG 1056
CCT GTT TGA TAG GAA CCA ATG GTG CAT TCA GTA AGA GTG TCC TGA ATC 1104
CAC AGC TCC TGA GAG TGT GCT TGG     				1128

<210> SEQ ID NO: 31
<211> 257
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Phe	Ala	Leu	Ser	Phe	Ser	Arg	Lys	Ala	Leu	Gln	Val	Ser
1				5					10					15
Ala	Lys	Ala	Thr	Gly	Lys	Lys	Gly	Thr	Gly	Lys	Thr	Ala	Ala	Lys
				20					25					30
Gln	Ala	Pro	Ala	Ser	Ser	Gly	Ile	Glu	Phe	Tyr	Gly	Pro	Asn	Arg
				35					40					45
Ala	Lys	Trp	Leu	Gly	Pro	Tyr	Ser	Glu	Asn	Ala	Thr	Pro	Ala	Tyr
				50					55					60
Leu	Thr	Gly	Glu	Phe	Pro	Gly	Asp	Tyr	Gly	Trp	Asp	Thr	Ala	Gly
				65					70					75
Leu	Ser	Ala	Asp	Pro	Glu	Thr	Phe	Lys	Arg	Tyr	Arg	Glu	Leu	Glu
				80					85					90
Leu	Ile	His	Ala	Arg	Trp	Ala	Met	Leu	Gly	Ala	Leu	Gly	Cys	Ile
				95					100					105
Thr	Pro	Glu	Leu	Leu	Ala	Lys	Ser	Gly	Thr	Gln	Phe	Gly	Glu	Ala
				110					115					120
Val	Trp	Phe	Lys	Ala	Gly	Ala	Gln	Ile	Phe	Ser	Glu	Gly	Gly	Leu
				125					130					135
Asp	Tyr	Leu	Gly	Asn	Pro	Ser	Leu	Val	His	Ala	Gln	Asn	Ile	Val
				140					145					150
Ala	Thr	Leu	Ala	Val	Gln	Val	Ile	Leu	Met	Gly	Leu	Val	Glu	Gly
				155					160					165
Tyr	Arg	Val	Asn	Gly	Gly	Pro	Ala	Gly	Glu	Gly	Leu	Asp	Pro	Leu
				170					175					180
Tyr	Pro	Gly	Glu	Ser	Phe	Asp	Pro	Leu	Gly	Leu	Ala	Asp	Asp	Pro
				185					190					195
Asp	Thr	Phe	Ala	Glu	Leu	Lys	Val	Lys	Glu	Ile	Lys	Asn	Gly	Arg
				200					205					210
Leu	Ala	Met	Phe	Ser	Met	Phe	Gly	Phe	Phe	Val	Gln	Ala	Ile	Val
				215					220					225
Thr	Gly	Lys	Gly	Pro	Ile	Gln	Asn	Leu	Asp	Asp	His	Leu	Ser	Asn
				230					235					240
Pro	Thr	Val	Asn	Asn	Ala	Phe	Ala	Phe	Ala	Thr	Lys	Phe	Thr	Pro
				245					250					255
Ser	Ala													

<210> SEQ ID NO: 32
<211> 792
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG CAG GCT TCC ACC TTC TTC GGC AAT GCC ACC GCG CTG CGG CCC CAG 48
GTC GCC GCG CAG GCC AGC ACC GGC AAG CGT GCG GTG ACC ACC ATG AGG 96
CGC ACG CTG AAG CAG CAG AAG CCA AAG GTG TCG GAG GGT GTC TGG TAC 144
GGC CCC GAC CGC CCC AAG TTC CTG GGC CCC TTT TCT GAT GGC ATC ACC 192
CCC AGC TAC CTG AAC GGA GAG TTT GCT GGC GAC TAC GGC TGG GAC ACG 240
GCC GGT CTG TCT GCC GAC CCC GAG ACC TTT GCC CGC TAC AGG GTG ATC 288
GAG GTC ATT CAC TGC CGC TGG GCC ACC CTG GGA ACG CTC GGC ATC GTC 336
TTC CCT GAG ATC CTG CAG AAG TAC AGT GGC GTG CAG TTC CAG GAG GCC 384
GTG TGG TTC AAG GCC GGC GCC CAA ATC TTC TCC CCT GAC GGG CTC AAC 432
TAC CTG GGC AAC CCC TCC CTG GTG CAC GCC CAA TCC ATC GTG GCC ACC 480
CTC ATC TCC CAG GTC ATT CTC ATG GGC CTG GTG GAG GGC TAC CGC GTG 528
AAC GGC GGG CCC GCT GGC GAA GGC CTG GAC CCC CTG TAC CCC GGT GAG 576
GCC TTC GAC CCC CTG GGC CTG GCC GAC GAC CCC GAC GCG CTG GCG GAG 624
CTG AAG GTG AAG GAG CTG AAG AAC GGC CGT CTG GCC ATG TTT GCC AAC 672
TTT GGT TTC TTC GTC CAG GCC ATC GTC ACC GGC AAG GGA CCC CTG GAG 720
AAC CTG AGC GAC CAC CTG GCC GAC CCC TGG ACC AAC AAC GGC TTC GCG 768
GCT GCC ACC AAG TTC ACC CCC TAG					792	

<210> SEQ ID NO: 33
<211> 286
<212> Amino Acid Sequence
<213> Coccomyxa subellipsoidea

Met	Thr	Glu	Val	Ile	Lys	His	Leu	Phe	Thr	Ala	Glu	Asn	Thr	Lys
1				5					10					15
Leu	Val	Val	Leu	Ser	Ile	Leu	Ala	Leu	Val	Leu	Val	Pro	Leu	Leu
				20					25					30
Ser	Trp	Ile	Leu	Val	Phe	Arg	Gly	Lys	Lys	Arg	Gln	Pro	Phe	Leu
				35					40					45
Asn	Pro	Asp	Val	Trp	Gln	Glu	Leu	Pro	Leu	Ala	Glu	Lys	Glu	Val
				50					55					60
Ile	Thr	His	Asn	Thr	Arg	Arg	Phe	Arg	Phe	Ala	Leu	Pro	Tyr	Lys
				65					70					75
Asp	Gln	Pro	Ile	Gly	Leu	Pro	Ile	Gly	Gln	His	Ile	Ser	Leu	Lys
				80					85					90
Ala	Leu	Lys	Pro	Ala	Ala	Asp	Gly	Thr	Glu	Ile	Phe	Lys	Pro	Tyr
				95					100					105
Thr	Pro	Val	Ser	Asp	Asp	Asp	Leu	Leu	Gly	Tyr	Val	Asp	Phe	Val
				110					115					120
Ile	Lys	Val	Tyr	Glu	Gln	Gly	Arg	Met	Thr	Lys	His	Met	Asp	Glu
				125					130					135
Leu	Ala	Ile	Gly	Asp	Lys	Leu	Leu	Phe	Lys	Gly	Pro	Lys	Gly	Arg
				140					145					150
Phe	Lys	Tyr	Ser	Cys	Asn	Ala	Lys	Arg	Ser	Leu	Gly	Met	Ile	Ala
				155					160					165
Gly	Gly	Thr	Gly	Ile	Thr	Pro	Met	Tyr	Gln	Val	Ala	Thr	Gln	Leu
				170					175					180
Leu	Lys	Asp	His	Gln	Asp	His	Thr	Lys	Met	Ser	Leu	Ile	Phe	Gly
				185					190					195
Asn	Val	Ser	His	Asp	Asp	Ile	Leu	Ile	Lys	Glu	Glu	Leu	Glu	Ala
				200					205					210
Leu	Ala	Ala	Ala	His	Pro	Thr	Arg	Phe	Lys	Val	Tyr	His	Val	Leu
				215					220					225
Asn	Gln	Ala	Pro	Pro	Gly	Trp	Thr	Gln	Gly	Val	Gly	Phe	Ile	Thr
				230					235					240
Ala	Asp	Ile	Ile	Lys	Gln	His	Ile	Asp	Pro	Pro	Ala	Glu	Asp	Val
				245					250					255
Met	Val	Leu	Arg	Cys	Gly	Pro	Gly	Pro	Met	Asn	Val	Ala	Val	Lys
				260					265					270
Lys	Ala	Leu	Asp	Gly	Leu	Gly	Tyr	Thr	Arg	Glu	Met	Gln	Phe	Glu
				275					280					285
Phe														

<210> SEQ ID NO: 34
<211> 1,278
<212> cDNA
<213> Chlorella protothecoides

<400>
 CGCAGGTTCA TCTCGCTGTG AAGCAGCCTC GCGCATCAAA ATCCTGCAAC    50
 AGCGCACTTT GCCTGACCAG GACCCGATTG GCAGGACGTG GTGGGCCTCC   100
 AGGGACACCT TATGTCCTCC GGCGTAGGAT TCGAGAGCAT GGTGGACTCC   150
 GTCTTCCTGG ACGTGATCCG GGTGGTTCCA CAGCTCCTGC TGGTGGTCCT   200
 TGTCGTCATC GTAGCAGTAC TGATCGGGAA AGATATCTTG AATGGCATGC   250
 GGCGGCCCTT CCTCAAGCCC GACCAGTGGC AAGCGCTCCC CCTGACTGAG   300
 GTGAAGAAGA CAACGCATAA CACGCGCCTT TTCCGCTTCG CGCTACCCCA   350
 TGTCGATCAG CAGCTGGGAC TGCCCGCGGG TCAGCACATC ACGCTCAAGG   400
 TTACAGGACC CCATGGCGAC GAGATCCTCA GGCCCTACAC ACCAGTCTCC   450
 GACATCTCCC AGCGTGGGAC CGTCGATTTC CTGATCAAGG TCTATCCCGA   500
 GGGCCGCATG TCCCAGGCGT TGGATGCCCT GGCTGTGGGA GACAAGGTGC   550
 TCTTCAAGGG GCCAAAGGGG CGATTCGTGT ACAAGCCGGA CACCTGGAAT   600
 GCCATCGGCA TGCTGGCTGG GGGCACGGGG ATCACGCCCA TGTACCAGCT   650
 GCTGCAGAAG ATCCTGAAGG ACCCGAATGA CACGACAGAG ATCTCGCTGA   700
 TCTTTGGAAA TGTCACCGAG GATGACATAC TGCTGCAGAA GGAGCTGGAG   750
 GAGATGGCGA GTAAGCACAA GCGATTCAAG GTATTCCACG TCCTCAACAA   800
 CCCTCCAGCT CAGTGGACTG GAGGTGTTGG GTTCATCTCC GAAAGCATGA   850
 TCAGGGAGCA CTTCCCGGCA CCGGCTGCTG ATGTCCTCAT TCTGCGGTGT   900
 GGGCCTCTGC CCATGATGAA GGCCATGGAA ACCCACCTGG ACAGGCTCGG   950
 GTACACCCGC GAAGTCCAGT TTCAGTTCTG AGGGCCTCTT GCAGTGGCTT  1000
 GCATGCGACT ATCTCACAAT GTTGCAATGC ACAGCAAGCA GCATGGGTAA  1050
 GAATGCAAGG GATGGACAGG TGAGCAAGCA TCATCCTCGT CGCCTGCCCT  1100
 GTGAGCGTGG GAACAGGCCG ACAGACTGCC TTCTGCCAAA TTCGACCCTG  1150
 TTTGGCACAG CCAGGTCCAC ATTTGCCAGG CTCAGCCTGG AGAGACTCTC  1200
 TTCCACGCCC ACCGGTCCTC CCTCTGCATC CATCCTCACA ACGTCTGGCT  1250
 GGGACAGGTC TCCACGTGTG GGCGGTCC                          1278

<210> SEQ ID NO: 35
<211> 634
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ala	Ser	Pro	Thr	Val	Ala	Thr	Pro	Val	Glu	Ala	Val	Ala	Ala
1				5					10					15
Gly	Thr	Pro	Pro	Ala	Pro	Thr	Ala	Ala	Ala	Pro	Gly	Ala	Gly	Thr
				20					25					30
Ala	Ala	Ala	Ala	Thr	His	Asn	Ser	Ser	Leu	Tyr	Val	Gly	Asp	Leu
				35					40					45
Asp	Arg	Asp	Val	Thr	Glu	Ala	Gln	Leu	Phe	Glu	Val	Phe	Ser	Gln
				50					55					60
Ile	Gly	Pro	Val	Ala	Ser	Ile	Arg	Val	Cys	Arg	Asp	Ala	Val	Thr
				65					70					75
Arg	Arg	Ser	Leu	Gly	Tyr	Ala	Tyr	Val	Asn	Tyr	Asn	Ser	Val	Leu
				80					85					90
Asp	Pro	Ala	Ala	Ala	Glu	Arg	Ala	Leu	Asp	Gln	Leu	Asn	Tyr	Thr
				95					100					105
Pro	Leu	Val	Gly	Arg	Pro	Met	Arg	Ile	Met	Trp	Ser	His	Arg	Asp
				110					115					120
Pro	Ala	Phe	Arg	Lys	Ser	Gly	Val	Gly	Asn	Ile	Phe	Ile	Lys	Asn
				125					130					135
Leu	Asp	Arg	Ser	Val	Asp	Asn	Lys	Ala	Leu	His	Asp	Thr	Phe	Ser
				140					145					150
Ala	Phe	Gly	Asn	Ile	Leu	Ser	Cys	Lys	Val	Ala	Gln	Asp	Leu	Lys
				155					160					165
Gly	Glu	Ser	Lys	Gly	Tyr	Gly	Phe	Val	His	Phe	Glu	Lys	Asp	Glu
				170					175					180
Ser	Ala	Arg	Leu	Ala	Ile	Glu	Lys	Val	Asn	Gly	Met	Leu	Leu	Glu
				185					190					195
Gly	Lys	Lys	Val	Tyr	Val	Gly	Pro	Phe	Leu	Arg	Arg	Ser	Glu	Arg
				200					205					210
Ser	Ser	Asp	Ser	Glu	Val	Lys	Phe	Thr	Asn	Val	Phe	Val	Lys	Asn
				215					220					225
Leu	Asp	Glu	Ala	Val	Ser	Asp	Asp	Glu	Val	Lys	Ala	Met	Phe	Ala
				230					235					240
Glu	His	Gly	Thr	Val	Asn	Ser	Cys	Ile	Ile	Met	Arg	Asp	Asp	Glu
				245					250					255
Gly	Lys	Ser	Lys	Gly	Phe	Gly	Phe	Ile	Asn	Phe	Glu	Glu	Pro	Glu
				260					265					270
Gln	Ala	Ala	Ser	Ala	Val	Gln	Ala	Leu	Asn	Gly	Lys	Asp	Val	Asn
				275					280					285
Cys	Lys	Glu	Leu	Tyr	Val	Gly	Arg	Ala	Gln	Lys	Lys	Ala	Glu	Arg
				290					295					300
Glu	Ala	Met	Leu	Arg	Ala	Lys	Phe	Glu	Glu	Leu	Arg	Ser	Glu	Arg
				305					310					315
Ile	Ala	Lys	Tyr	Gln	Gly	Met	Asn	Leu	Tyr	Val	Lys	Asn	Leu	His
				320					325					330
Asp	Asp	Ile	Asp	Asp	Glu	Thr	Leu	Arg	Thr	Glu	Phe	Ser	Gln	Phe
				335					340					345
Gly	Thr	Ile	Thr	Ser	Ala	Lys	Val	Met	Val	Asp	Ser	Ala	Gly	Lys
				350					355					360
Ser	Arg	Gly	Phe	Gly	Phe	Val	Cys	Tyr	Ala	Ser	Pro	Glu	Glu	Ala
				365					370					375
Thr	Arg	Ala	Val	Thr	Glu	Met	Asn	Gly	Arg	Met	Ile	Lys	Gly	Lys
				380					385					390
Pro	Ile	Tyr	Val	Ala	Leu	Ala	Gln	Arg	Arg	Asp	Val	Arg	Arg	Ala
				395					400					405
Gln	Leu	Glu	Gln	Gln	Tyr	Gln	Gln	Arg	Val	Ala	Met	Pro	Pro	Gly
				410					415					420
Pro	Arg	Gly	Pro	Met	Ala	Pro	Gly	Met	Phe	Pro	Pro	Gly	Gly	Ala
				425					430					435
Pro	Pro	Met	Phe	Tyr	Pro	Pro	Gly	Pro	Arg	Gly	Pro	Met	Gly	Pro
				440					445					450
Gly	Met	Met	Asn	Pro	Tyr	Met	Met	Ala	Gly	Pro	Gly	Arg	Gly	Met
				455					460					465
Pro	Gly	Arg	Gly	Pro	Arg	Gly	Met	Met	Met	Pro	Pro	Gln	Met	Met
				470					475					480
Gly	Pro	Gly	Gly	Ala	Arg	Ser	Gly	Arg	Gly	Gly	Arg	Gly	Gly	Arg
				485					490					495
Gly	Gly	Arg	Gly	Glu	Phe	Met	Gly	Arg	Gly	Arg	Gly	Pro	Ala	Ser
				500					505					510
Gly	Arg	Gly	Gly	Arg	Gly	Arg	Gly	Glu	Gly	Ile	Pro	Ala	Pro	Pro
				515					520					525
Val	Gly	Ala	Pro	Ala	Ala	Pro	Ala	Ala	Ala	Val	Thr	Pro	Thr	Glu
				530					535					540
Gly	Gly	Gln	Leu	Thr	Ala	Ala	Met	Leu	Ala	Ala	Ala	Pro	Ala	Glu
				545					550					555
Gln	Gln	Lys	Gln	Leu	Leu	Gly	Glu	Arg	Leu	Phe	Pro	Leu	Val	Ala
				560					565					570
Asn	Val	Gln	Pro	Asp	Leu	Ala	Gly	Lys	Ile	Thr	Gly	Met	Leu	Leu
				575					580					585
Glu	Met	Asp	Asn	Ser	Glu	Leu	Leu	Leu	Leu	Leu	Glu	Asp	Pro	Thr
				590					595					600
Ala	Leu	Asp	Ala	Lys	Val	Glu	Glu	Ala	Val	Ser	Val	Leu	Lys	Gln
				605					610					615
														
His	Asn	Ala	Ile	Pro	Asp	Gly	Ala	Ile	Val	Arg	Glu	Lys	Gly	Gly
				620					625					630
Ala	Gly	Gln	Ala											

<210> SEQ ID NO: 36
<211> 1093
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TGG CGC AGC TCT ACG AGG TCT TCG CCC AGA TTG GGC CCG TGG CTT CCA 48
TCC GCG TGT GCC GCG ACG CCG TGA CCC GTC GCT CCT TGG GCT ACG CCT 96
ACG TCA ACT ACA ACT CTG GCG TCG ATG CCG GCG CAG CCG AGC GTG CCC 144
TGG AGC AGC TCA ACT ATG CCC CCC TGG CGG GCC GCC CCA TGC GGC TGA 192
TGT GGA GCC ACC GTG ACC CCT CCT ACC GCA AGT CCG GCG TGG GCA ACA 240
TCT TCA TCA AGA ACC TGG ACA AGT CCA TCG ACA ACA AGG CCC TGC ACG 288
ACA CCT TCT CCG CCT TTG GCA ACA TCC TGT CCT GCA AGG TGG CCC TGG 336
ACG CCG CGG GCG AGT CCA AGG GCT ACG GCT TCG TGC ACT ACG AGG CGG 384
ACG AGG CGG CGC GCC TGG CCA TCG CCA AGG TAA ACG GCA TGC TGC TGG 432
AGG ACA AGA AGG TGT ACG TGG GCC CCT TCC TGC GTC GCA GCG AGC GTG 480
CGG CCG AGT CTG GCG GCC GCT TCA CCA ACG TGT TTG TCA AGA ACC TGG 528
ACG AAT CGG TGG ACG ACG AGG CAC TGA AGG CCG CCT TTG CCG CCC ACG 576
GCG CGG TCT CCT CCG CCG TGG TCA TGC GCG GCG AGG ACG GCG CCT CCC 624
GCG GCT TTG GCT TCG TGT CGT TCG AGG AGC CCG AGG CGG CGG CCG CGG 672
CGG TGG AGG CGC TCA ACG GCG CGC CCC TGG CCG GCA AGG AGC TGT TCG 720
TGG GGC GTG CGC AGA AGA AGG CGG AGC GCG AGG CCA TGC TCC GCG CGC 768
GGT TTG AGG ATG TGC GCG CGG AGC GCA TCG CGC GCT ACC AGG GCA TGA 816
ACC TGT ACG TGA AGA ACC TGC CCG ACG ACG CCG ACG ACG AGT TCC TGC 864
GCG CGA CGT TTG CGC CCT ACG GCA CCA TCA CCT CCG CCA AGG TCA TGG 912
TGG ATG CGG CCA GCG GCA AGC CCC GCG GCT TTG GCT TCG TCT GCT ATG 960
CCT CGC CGG AGG ACG CCA CCC GCG CCG TCA GCG AGC TGA ACA ACC ACA 1008
TGC TGC GCG GCA AGC CTG TTT ACG TGG CCC TGG CCC AGC GGC GCG AGG 1056
TGC GCC GCG CGC AGC TGG AGC AGC AGC ACG CGC AGC G 		1093 
<210> SEQ ID NO: 37
<211> 182
<212> Amino Acid Sequence
<213> Nannochloris bacillaris

Met	Ala	Ser	Thr	Ile	Ala	Phe	Ser	Thr	Ala	Ala	Val	Arg	Val	Ala
1				5					10					15
Pro	Ile	Thr	Lys	Val	Asn	Ala	Thr	Ser	Thr	Lys	Ala	Arg	Thr	Ala
				20					25					30
Phe	Val	Ser	Asn	Gly	Thr	Val	Lys	Lys	Thr	Thr	Ala	Met	Leu	Val
				35					40					45
Trp	Thr	Pro	Ile	Asn	Asn	Lys	Phe	Phe	Glu	Thr	Phe	Ser	Phe	Leu
				50					55					60
Pro	Pro	Leu	Ser	Asp	Ser	Glu	Ile	Thr	Lys	Gln	Val	Asp	Tyr	Ile
				65					70					75
Val	Arg	Asn	Gly	Trp	Thr	Pro	Cys	Leu	Glu	Phe	Ser	Asp	Pro	Glu
				80					85					90
Gly	Ala	Tyr	Ile	Gly	Asp	Thr	Asn	Thr	Val	Arg	Leu	Gln	Gly	Thr
				95					100					105
Ser	Ala	Gly	Tyr	Gln	Asp	Asn	Arg	Tyr	Trp	Ser	Met	Trp	Lys	Leu
				110					115					120
Pro	Met	Phe	Gly	Cys	Thr	Asp	Gly	Ser	Gln	Val	Leu	Lys	Glu	Ile
				125					130					135
Ala	Gly	Cys	Thr	Lys	Ala	Phe	Pro	Gly	Ala	Tyr	Ile	Arg	Leu	Val
				140					145					150
Ala	Phe	Asp	Asn	Gln	Lys	Gln	Val	Gln	Cys	Thr	Gly	Phe	Leu	Val
				155					160					165
His	Arg	Pro	Val	Gly	Ala	Lys	Glu	Phe	Gln	Pro	Ala	Asp	Lys	Arg
				170					175					180
Ser	Val													

<210> SEQ ID NO: 38
<211> 564
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG GCC ACC CTC TCC ATC TTC GCC TCT GCG GCC GTG TGC CTG CCC GTG 48
GGC CGC ACT GCC AGC ACC ACC CGC AGC CAG CCT CGC GTG GTC TCC AGG 96
CGC GCT CCT TTT GTC AGC AAT GGC AAC ATC AAG CGC ACC ACC GCC ATG 144
CTG GTG TGG ACC CCC ACC AAC AAC AAG ATG TTC GAG ACC TTC TCG TTC 192
CTG CCC CCC CTG TCC GAC GAC CAG ATC TCC CGC CAG GTC GAC TAC ATC 240
ATC TCC AAC AAG TGG ATC CCC TGC CTG GAA TTC ATC GAC GAC CCT TTC 288
CCG TAC GTG GCG CAG GAG AAC ACC ATC CGC ATG GGT GCC GTG GCC TCC 336
AAC TAC TAC GAT AAC CGC TAC TGG ACC ATG TGG AAG CTG CCC ATG TTC 384
GGG TGC AAC GAC GGT AGC CAG GTG CTG ACC GAG GTG GGC AAC TGC CGC 432
AAG GCC TTC CCC GAT GCG TAC ATC CGT CTG GTC GCC TTC GAC AAC GTC 480
CGC CAG GTG CAG ATC GGC GGG TTC CTT GTC CAC AGG CCC CCC TCC GCC 528
AAC GAC TTC CAG CCC ACC GAC AAG CGC TCC GTG TGA 		564

<210> SEQ ID NO: 39
<211> 115
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Leu	Val	Gly	Trp	Glu	Thr	Ala	Ala	Asn	Pro	Thr	Gly	Cys	Gly
1				5					10					15
His	His	Gly	Ser	Arg	Ala	His	Pro	Ser	Ile	Ala	Pro	Cys	Leu	Val
				20					25					30
Ser	Cys	Arg	Ala	Gly	Arg	Arg	Ser	Leu	Ala	Val	Val	Ala	Gly	Asn
				35					40					45
Lys	Gly	Ala	Gly	Gly	Pro	Phe	Ala	Pro	Ile	Val	Val	Val	Thr	Arg
				50					55					60
Asn	Ala	Met	Gly	Thr	Lys	Glu	Phe	Asn	Gln	Phe	Arg	Gly	Lys	Ala
				65					70					75
Ile	Ser	Leu	His	Ser	Gln	Val	Ile	Lys	Asp	Phe	Cys	Lys	Gln	Leu
				80					85					90
Gly	Ala	Asp	Thr	Lys	Gln	Val	Gln	Gly	Leu	Ile	Arg	Leu	Ala	Lys
				95					100					105
Lys	Asn	Gly	Glu	Lys	Leu	Gly	Phe	Leu	Ala					
				110					115					

<210> SEQ ID NO: 40
<211> 817 
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CCA AGA CCA GCC TGG GAT ACA AAA AGC CCT TGC ATG TCA GTG CAG AAC 48
GGC TTT GCC CCA CAG CCC TTA ACT GGC CCC TGC ACC CAT CTG ATG CAA 96
TAC CTC ATG TGT TGA TGG TGC CTC CCG TTG GTA AAG GAT TTA CAA GCT 144
TGA GAC ACT GCT CTG GAT GTA TAC TGA AAC ATT GAT GAT CGA AAA GCA 192
GTA CCA CGA TGG GGT CAG CCC TGG CTT TGA CCG CTG GCC ATG CTC TGA 240
CAC TCA GAA ACA CGG GAG CTC GCA CAC ATC TGG CCG TGC TGG AGT CCC 288
CAC GCC ATG CTC CCA AAT TAA GCC AGG AAA CCA AGC TTC TCT CCG TTT 336
TTC TTT GCT AGG CGG ATC AGG CCT TGA CTT TGA GAA TTG CTC ACT CCA 384
ATC TTT TGA CAG AAA CCC TTG ATG ACT TGC GAA TGG AGA GAG ATG GCC 432
TTG CCA CGG AGC TGG TTA AAT TTT TTG GTT CCC AGC CTG TCT CTG ACG 480
ATG CGC ACT ATG GGG GCA AAG GGC CCG GTC GTG GCT TTG TTC CCG GCA 528
ACG ACG GAG AGA GGT GTC CTC CCT GCG TGT GTT GGT CGA GCC GGC CCA 576
TGC TGG GAT GCC CAG GGG GCG GCG AAG GTA GAT GCT GTG GAG GTC AGG 624
GAG CAG TAC TGT TTT GTC CTG GAG GTT GTC CTG ATG GAG CCG TTG AAT 672
CGT TGG TTG CTT GGG CTG CAC CCA ACG CTG CAC CTC GAG CAC GCA AGA 720
GTC TGC ATG GGA ATC TGG TCA CGC TAC GTC TCC AAC TTA TAA CTC TGC 768
ATG CGC AAT TAT GCA ATA AAA CAA AGC CTG CGC GAA GTC AGT AAC TTT 816
G 								817	

<210> SEQ ID NO: 41
<211> 750
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Gly	Val	Pro	Ala	Pro	Ala	Ser	Gln	Leu	Thr	Lys	Val	Leu
1				5					10					15
Ala	Gly	Leu	Arg	His	Thr	Phe	Val	Val	Ala	Asp	Ala	Thr	Leu	Pro
				20					25					30
Asp	Cys	Pro	Leu	Val	Tyr	Ala	Ser	Glu	Gly	Phe	Tyr	Ala	Met	Thr
				35					40					45
Gly	Tyr	Gly	Pro	Asp	Glu	Val	Leu	Gly	His	Asn	Cys	Arg	Phe	Leu
				50					55					60
Gln	Gly	Glu	Gly	Thr	Asp	Pro	Lys	Glu	Val	Gln	Lys	Ile	Arg	Asp
				65					70					75
Ala	Ile	Lys	Lys	Gly	Glu	Ala	Cys	Ser	Val	Arg	Leu	Leu	Asn	Tyr
				80					85					90
Arg	Lys	Asp	Gly	Thr	Pro	Phe	Trp	Asn	Leu	Leu	Thr	Val	Thr	Pro
				95					100					105
Ile	Lys	Thr	Pro	Asp	Gly	Arg	Val	Ser	Lys	Phe	Val	Gly	Val	Gln
				110					115					120
Val	Asp	Val	Thr	Ser	Lys	Thr	Glu	Gly	Lys	Ala	Leu	Ala	Asp	Asn
				125					130					135
Ser	Gly	Val	Pro	Leu	Leu	Val	Lys	Tyr	Asp	His	Arg	Leu	Arg	Asp
				140					145					150
Asn	Val	Ala	Arg	Thr	Ile	Val	Asp	Asp	Val	Thr	Ile	Ala	Val	Glu
				155					160					165
Lys	Ala	Glu	Gly	Val	Glu	Pro	Gly	Gln	Ala	Ser	Ala	Val	Ala	Ala
				170					175					180
Ala	Ala	Pro	Leu	Gly	Ala	Lys	Gly	Pro	Arg	Gly	Thr	Ala	Pro	Lys
				185					190					195
Ser	Phe	Pro	Arg	Val	Ala	Leu	Asp	Leu	Ala	Thr	Thr	Val	Glu	Arg
				200					205					210
Ile	Gln	Gln	Asn	Phe	Cys	Ile	Ser	Asp	Pro	Thr	Leu	Pro	Asp	Cys
				215					220					225
Pro	Ile	Val	Phe	Ala	Ser	Asp	Ala	Phe	Leu	Glu	Leu	Thr	Gly	Tyr
				230					235					240
Ser	Arg	Glu	Glu	Val	Leu	Gly	Arg	Asn	Cys	Arg	Phe	Leu	Gln	Gly
				245					250					255
Ala	Gly	Thr	Asp	Arg	Gly	Thr	Val	Asp	Gln	Ile	Arg	Ala	Ala	Ile
				260					265					270
Lys	Glu	Gly	Ser	Glu	Leu	Thr	Val	Arg	Ile	Leu	Asn	Tyr	Thr	Lys
				275					280					285
Ala	Gly	Lys	Ala	Phe	Trp	Asn	Met	Phe	Thr	Leu	Ala	Pro	Met	Arg
				290					295					300
Asp	Gln	Asp	Gly	His	Ala	Arg	Phe	Phe	Val	Gly	Val	Gln	Val	Asp
				305					310					315
Val	Thr	Ala	Gln	Ser	Thr	Ser	Pro	Asp	Lys	Ala	Pro	Val	Trp	Asn
				320					325					330
Lys	Thr	Pro	Glu	Glu	Glu	Val	Ala	Lys	Ala	Lys	Met	Gly	Ala	Glu
				335					340					345
Ala	Ala	Ser	Leu	Ile	Ser	Ser	Ala	Leu	Gln	Gly	Met	Ala	Ala	Pro
				350					355					360
Thr	Thr	Ala	Asn	Pro	Trp	Ala	Ala	Ile	Ser	Gly	Val	Ile	Met	Arg
				365					370					375
Arg	Lys	Pro	His	Lys	Ala	Asp	Asp	Lys	Ala	Tyr	Gln	Ala	Leu	Leu
				380					385					390
Gln	Leu	Gln	Glu	Arg	Asp	Gly	Lys	Met	Lys	Leu	Met	His	Phe	Arg
				395					400					405
Arg	Val	Lys	Gln	Leu	Gly	Ala	Gly	Asp	Val	Gly	Leu	Val	Asp	Leu
				410					415					420
Val	Gln	Leu	Gln	Gly	Ser	Glu	Leu	Lys	Phe	Ala	Met	Lys	Thr	Leu
				425					430					435
Asp	Lys	Phe	Glu	Met	Gln	Glu	Arg	Asn	Lys	Val	Ala	Arg	Val	Leu
				440					445					450
Thr	Glu	Ser	Ala	Ile	Leu	Ala	Ala	Val	Asp	His	Pro	Phe	Leu	Ala
				455					460					465
Thr	Leu	Tyr	Cys	Thr	Ile	Gln	Thr	Asp	Thr	His	Leu	His	Phe	Val
				470					475					480
Met	Glu	Tyr	Cys	Asp	Gly	Gly	Glu	Leu	Tyr	Gly	Leu	Leu	Asn	Ser
				485					490					495
Gln	Pro	Lys	Lys	Arg	Leu	Lys	Glu	Glu	His	Val	Arg	Phe	Tyr	Ala
				500					505					510
Ser	Glu	Val	Leu	Thr	Ala	Leu	Gln	Tyr	Leu	His	Leu	Leu	Gly	Tyr
				515					520					525
Val	Tyr	Arg	Asp	Leu	Lys	Pro	Glu	Asn	Ile	Leu	Leu	His	His	Thr
				530					535					540
Gly	His	Val	Leu	Leu	Thr	Asp	Phe	Asp	Leu	Ser	Tyr	Ser	Lys	Gly
				545					550					555
Ser	Thr	Thr	Pro	Arg	Ile	Glu	Lys	Ile	Gly	Gly	Ala	Gly	Ala	Ala
				560					565					570
Gly	Gly	Ser	Ala	Pro	Lys	Ser	Pro	Lys	Lys	Ser	Ser	Ser	Lys	Ser
				575					580					585
Gly	Gly	Ser	Ser	Ser	Gly	Ser	Ala	Leu	Gln	Leu	Glu	Asn	Tyr	Leu
				590					595					600
Leu	Leu	Ala	Glu	Pro	Ser	Ala	Arg	Ala	Asn	Ser	Phe	Val	Gly	Thr
				605					610					615
Glu	Glu	Tyr	Leu	Ala	Pro	Glu	Val	Ile	Asn	Ala	Ala	Gly	His	Gly
				620					625					630
Pro	Ala	Ala	Val	Asp	Trp	Trp	Ser	Leu	Gly	Ile	Leu	Ile	Phe	Glu
				635					640					645
Leu	Leu	Tyr	Gly	Thr	Thr	Pro	Phe	Arg	Gly	Ala	Arg	Arg	Asp	Glu
				650					655					660
Thr	Phe	Glu	Asn	Ile	Ile	Lys	Ser	Pro	Leu	Lys	Phe	Pro	Ser	Lys
				665					670					675
Pro	Ala	Val	Ser	Glu	Glu	Cys	Arg	Asp	Leu	Ile	Glu	Lys	Leu	Leu
				680					685					690
Val	Lys	Asp	Val	Gly	Ala	Arg	Leu	Gly	Ser	Arg	Thr	Gly	Ala	Asn
				695					700					705
Glu	Ile	Lys	Ser	His	Pro	Trp	Phe	Lys	Gly	Ile	Asn	Trp	Ala	Leu
				710					715					720
Leu	Arg	His	Gln	Gln	Pro	Pro	Tyr	Val	Pro	Arg	Arg	Ala	Ser	Lys
				725					730					735
Ala	Ala	Gly	Gly	Ser	Ser	Thr	Gly	Gly	Ala	Ala	Phe	Asp	Asn	Tyr
				740					745					750

<210> SEQ ID NO: 42
<211>  525
<212> Genomic Sequence 
<213> Auxenochlorella protothecoides

<400>
CGCCGCCGCG CGCGACGGGG GCGCGGCTGA AACGCTGCGC GGGGCGGTGG 50
CGCGGGAGGG GGGCCTGCGC CTGCGCTCCT TCCAGCGCCT GCGCTCGCTG 100
GGCGCGGGGG ACGTGGGCAT GGTGGACCTG GTCAACCTGG TGGGCACGCC 150
CCACAACTTT GCGCTCAAGT CCCTGAACAA GCGAGAGATG GTGCAGCGCA 200	
ACAAGCTGGG CCGGGTCAAC ACGGAGCTGG CGGTGCTGAC CAGCGTGGAC 250
CACCCCTTCC TGGTCAACCT GTACGCCACC CTGCAGACGG ACACCCACGT 300
CCACTTCCTG TTGGAGTACT GCGGCGGCGG GGAGCTGTAC GCCGCGCTCT 350
CCACGCTGCC CAACAAGCGG CTGGGCGAGT GCGTGGCGCG CATGTATGCG 400
GCCGAGGTGC TGCTGGCCCT GCAGTACCTC CACGTCCAGG GGTTTGTGTA 450
CCGCGACCTC AAGCCCGAGA ACATCCTGCT GCTGGAGTCG GGGCACCTGC 500
GGCTGACCGA CTTCGACCTG TCCCA 			       525

<210> SEQ ID NO: 43
<211> 186
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Gln	Cys	Ala	Leu	Ser	Ala	Lys	Val	Ser	Leu	Ala	Gly	Arg	Gln
1				5					10					15
Val	Thr	Ala	Ala	Ala	Pro	Lys	Gln	Arg	Val	Ala	Ala	Ala	Arg	Phe
				20					25					30
Val	Val	Arg	Ala	Glu	Glu	Lys	Lys	Ala	Ala	Glu	Pro	Lys	Pro	Trp
				35					40					45
Ala	Pro	Pro	Ala	Leu	Asp	Ala	Ser	Ala	Pro	Ser	Pro	Ile	Phe	Gly
				50					55					60
Gly	Ser	Thr	Gly	Gly	Leu	Leu	Arg	Gln	Ala	Gln	Val	Glu	Glu	Phe
				65					70					75
His	Val	Ile	Thr	Trp	Glu	Ala	Lys	Lys	Glu	Gln	Ile	Phe	Glu	Met
				80					85					90
Pro	Thr	Gly	Gly	Ala	Ala	Ile	Met	Arg	Gln	Gly	Pro	Asn	Leu	Leu
				95					100					105
Lys	Leu	Ala	Arg	Lys	Glu	Gln	Cys	Leu	Ala	Leu	Leu	Thr	Gln	Leu
				110					115					120
Arg	Thr	Lys	Phe	Lys	Thr	Asp	Gly	Phe	Ile	Tyr	Arg	Val	Phe	Pro
				125					130					135
Asn	Gly	Glu	Val	Gln	Tyr	Leu	His	Pro	Lys	Asp	Gly	Val	Tyr	Pro
				140					145					150
Glu	Lys	Val	Asn	Ala	Gly	Arg	Ser	Gly	Asp	Asn	Thr	Asn	Met	Arg
				155					160					165
Arg	Ile	Gly	Gln	Asn	Thr	Asn	Pro	Ser	Gln	Ile	Lys	Phe	Thr	Gly
				170					175					180
Lys	Ile	Pro	Ala	Glu	Phe									
				185										

<210> SEQ ID NO: 44
<211> 1235
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TGA CAT CAT GCA GCT CGC TTG CTC CAG CAG TGT GCG GCT GAC GGG GCC 48
TCG GCC CTT CAG TGC CAC CCC TTC CCG TCA TGG GAG GTC TCG TGC CCC 96
ATG CCC CCG GAT CTA CGC CGA GGA TGG CGC TGC CGC CAA GAA GGT CGC 144
GGA GAA GAA GCC AGA GTC GAA GAA GCC CTT TAG CCC GCC CAC CCT CGA 192
CCC CGA CAC GCC GTC TCC CAC TTT CGG GGG AAG CAC GGG TGG CCT GCT 240
CCG CAA GGC TCA GGT GGA GGA GTT CCA TGT GAT CAC CTG GAA CTC CAA 288
GAA GGA GCA GGT GTT CGA GAT GCC CAC AGC AGG CGC GGC CAT CAT GCG 336
TGA GGG CCC CAA CCT TCT GAA GCT GTC CCG TAA GGA GCA GTG CCT GGC 384
GCT CCT CAC CCA GCT GCG CAC CAA GTT CAA GAT CGA CGG CGC CAT CTA 432
CCG TGT CTT CCC CAA CGG CGA GAC CCA GTA CCT GCA CCC CAA GGA TGG 480
CGT GTA CCC CGA GAA GGT CAA CCC TGG CCG CAA GGA GGT GAA CGC CGT 528
CTT CAG GCG AAT TGG GTC CAA CAC CAA CCC CGC AAA GGT CAA GTT CAC 576
CGG CAA GCC GAC GTA CGA GGT GGA GGC ATA GGG GGC AGT GAG TTA TCA 624
GGA ACA GGG TGG AGC CTG TAA GGG GAT GCA TCT CAG CGG CTC AAG CCG 672
AGG ACA TCT GAG GCC GGA ACC TCC ACA GTT TCT ACT CTC TCC CTG TAT 720
GAT GAG CAG CTG CAC GAC CCT TTG TCG AGA AGA ATA TGT GTA CCG AGG 768
ATG TGG TGC TGT CAA CAA CAC AAC ACT GCG AGG AAG GGG GCC AGG GGC 816
ATG TTT GTA CAG GGG CGT GCA CCG CCA GGC ATC TGC ATG GTG CCG AGT 864
CAC CAG CAT GTC AGA GGC AGC TGC TAT TGG GGA AAG GGC TTG GGA TAC 912
CCG GGG TTT GTT GTG CAT GCT TAT CGT GCA TGT GGC ACA GGG CAG CAG 960
CAA GCC GGG TCT GGT CAG GGA AGG CTT CCT GGG GTC CTT GAG ATG GGC 1008
TTT GCC ACA TCC AGT CCC CTG CAC GCC TCG ACC CCG GGC CCT GGG GCC 1056
CGT CCA CCT ACC CCT CCT CCC CAA ACT CGG TTG TGA ACT TCT TGT ACG 1104
TCT CCA GGT CGT CCT GAG ACA CTG TCG GCC GCG CCC TGA GCA GCA CCT 1152
TTT CGA AAT CCC GGA ACG TGA TCT TGG GAG GGT GCA CCA GCT GCG CCT 1200
GGC CCT TGT CTG CCA AGC TCT GGA GCG TGG CCG GG			1235

<210> SEQ ID NO: 45
<211> 245
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Met	Thr	Leu	Ser	Thr	Lys	Ala	Phe	Ala	Gln	Arg	Gly	Val
1				5					10					15
Ser	Ala	Arg	Lys	Asn	Thr	Val	Arg	Val	Tyr	Ala	Ala	Thr	Thr	Lys
				20					25					30
Val	Asn	Pro	Lys	Leu	Ala	Ser	Lys	Thr	Glu	Val	Glu	Arg	Phe	Lys
				35					40					45
Gln	Ala	Thr	Gly	Leu	Pro	Ala	Pro	Ala	Ile	Asn	Gly	Lys	Gln	Phe
				50					55					60
Pro	Leu	Lys	Leu	Gly	Phe	Thr	Lys	Thr	Asn	Glu	Leu	Phe	Val	Gly
				65					70					75
Arg	Leu	Ala	Met	Val	Gly	Phe	Ser	Ala	Ser	Leu	Ile	Gly	Glu	Ile
				80					85					90
Leu	Thr	Gly	Lys	Gly	Ala	Leu	Ala	Gln	Phe	Gly	Tyr	Glu	Thr	Gly
				95					100					105
Leu	Asn	Gly	Ile	Glu	Val	Asp	Gly	Leu	Val	Ile	Gly	Leu	Ile	Ala
				110					115					120
Phe	Asn	Leu	Ile	Ala	Ala	Val	Leu	Pro	Thr	Ser	Gln	Thr	Phe	Val
				125					130					135
Pro	Glu	Glu	Gln	Asp	Thr	Ile	Ser	Glu	Arg	Pro	Ala	Gly	Pro	Leu
				140					145					150
Gln	Asp	Pro	Arg	Ile	Thr	Leu	Leu	Glu	Pro	Lys	Lys	Phe	Phe	Gly
				155					160					165
Val	Gln	Gly	Phe	Gly	Phe	Thr	Lys	Glu	Asn	Glu	Leu	Phe	Val	Gly
				170					175					180
Arg	Ala	Ala	Gln	Leu	Gly	Phe	Ala	Phe	Ser	Leu	Ile	Gly	Glu	Ala
				185					190					195
Val	Thr	Gly	Lys	Gly	Ala	Leu	Ala	Gln	Phe	Asp	Ile	Glu	Thr	Gly
				200					205					210
Leu	Ser	Leu	Arg	Asp	Thr	Glu	Phe	Gly	Leu	Val	Val	Phe	Ile	Leu
				215					220					225
Phe	Leu	Leu	Phe	Ala	Ala	Ile	Asn	Glu	Gly	Ser	Gly	Lys	Phe	Val
				230					235					240
Asp	Glu	Glu	Ser	Ala										
				245										

<210> SEQ ID NO: 46
<211> 1086
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GGC CGT TTC AAC CAA TCA ATG CGA GGG GGT CAA AAG GGA CAT AAC ATC 48
TTC CGA ACT ATC AAA CAC CGC ATT AAT TTC CCC AAC TCC CAC CAA TTC 96
ACA TCA CAC ACA CAT TCA CAA TGA TGC AAA CCC TTG CCA CCT CGA TGA 144
CCG TCT GCG GTC CAT CAT GCA TGC CCA AGT CGC TGT CCA GGT CTG GCA 192
CTG CCC TCC AGG TCA GGG CTC CCT GCT CCA CCC TTC AGT CCC GTC GCT 240
CCA TGA GGG TGT TCG CCG CCA AGC AAG ATC AGC CTT TCC GCA CCA AGG 288
ACC CTG CCC AGT CGC TGA TTC CCC AGT TCA CTC GTG AGC GCG AGA GGG 336
CTG TCG GCC GCC TCG CCA CGC TGG GCC TGG CCT TCT CCT GGG TTG GCG 384
AGA CTC TGA CGG GTT ATG GTC CCA TCA CCC AGC TCT CCA ACG AGA CTG 432
GTG TGC CCG TTG TGT GGG GCT ATG GCG TCA CCC TAA CCC TAA TCG CCA 480
TCC AGC TGT TCT TCG GCC TCA ACT CTG GCA CTC CCA CCT ACA GCG ACG 528
AGA ACC AGC GTG ATG TTG CCA AGC GTA GCA AGG GCC TGA CTG GCA TTA 576
CCC AGA TTG AGC CTG ATG TGG ACC GTG TGA TTG ACC CCA CCA AGG ACC 624
CTG GCC AGT TCT TCC TGC GCC TGG AGC TTG CCC AGG GCC GTA CCG CCA 672
TGC TGG TGT TCC TCA CCA CTG CCA TCC TGG AGT TTG CCA ACG GTG GCA 720
TGT CCC CCC TGG GTC AGG CAG GCA TCA TTC CCA CGG GCG TGC CCT TCA 768
GCC AGG CTC CCA TCT GGC TGA TCG CCC TCA CCA TCC TCA ACT TTG TTG 816
GTG GTA TCG GCA CTT TTT CCC TGT TCG ACC AGA ACA AGG ATT CTG CTG 864
CCT ACT GAT TGG ATT GTT TCC GTA CCC TTC CCA TCT CAT CCC ACC AGG 912
CAA TGA TGT GTT TGC AAG TAG CTT GCC TCA TCT CAG AAT TCT TTT GTG 960
ATG TTG GAG AGT TTG TCA TCT GAG TGC CCA CAG GCC CTG ATT CAC ACC 1008
CCT CCC CCA CCC TCG CCC TCC CCT ATA CTC ATC ACT ACA TCA CCA TTT 1056
TTA CAA TTC ACA ATG TCG ATT GTA ATC TGA 			1086

<210> SEQ ID NO: 47
<211> 444
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Ala	Arg	Ala	Arg	Leu	Asp	Ser	Arg	Pro	Val	Ala	Ser	Ala	Arg	Ser
1				5					10					15
Ser	Ala	Arg	Ser	Val	Ala	Pro	Arg	Val	Ala	Ser	Arg	Lys	Ser	Thr
				20					25					30
Val	Arg	Ile	Ala	Ala	Val	Ala	Ala	Pro	Glu	Arg	Pro	Ile	Thr	Asp
				35					40					45
Tyr	Gln	Arg	Pro	Asp	Ala	Asn	Gly	Arg	Tyr	Gly	Gln	Phe	Gly	Gly
				50					55					60
Lys	Tyr	Val	Pro	Glu	Thr	Leu	Ile	Pro	Ala	Leu	Glu	Gln	Leu	Glu
				65					70					75
Lys	Asp	Tyr	Asn	Glu	Ala	Ile	Ala	Asp	Pro	Ala	Phe	Lys	Ala	Glu
				80					85					90
Met	Glu	Ala	Ile	Leu	Lys	Asp	Tyr	Val	Gly	Arg	Glu	Thr	Pro	Leu
				95					100					105
Tyr	His	Ala	Glu	Arg	Leu	Ser	Ala	His	Tyr	Lys	Thr	Ala	Asp	Gly
				110					115					120
Gly	His	Ala	Glu	Ile	Tyr	Leu	Lys	Arg	Glu	Asp	Leu	Asn	His	Thr
				125					130					135
Gly	Ala	His	Lys	Ile	Asn	Asn	Ser	Leu	Gly	Gln	Ala	Leu	Leu	Cys
				140					145					150
Lys	Arg	Leu	Asn	Lys	Gln	Arg	Ile	Ile	Ala	Glu	Thr	Gly	Ala	Gly
				155					160					165
Gln	His	Gly	Val	Ala	Thr	Ala	Thr	Ile	Cys	Ala	Arg	Leu	Gly	Leu
				170					175					180
Lys	Cys	Ile	Val	Tyr	Met	Gly	Ala	Lys	Asp	Met	Glu	Arg	Gln	Ala
				185					190					195
Leu	Asn	Val	Phe	Arg	Met	Arg	Leu	Cys	Gly	Ala	Glu	Val	Arg	Pro
				200					205					210
Val	Asn	Ser	Gly	Thr	Ala	Thr	Leu	Lys	Asp	Ala	Thr	Ser	Glu	Ala
				215					220					225
Ile	Arg	Asp	Trp	Val	Thr	Asn	Val	Glu	Thr	Thr	His	Tyr	Ile	Leu
				230					235					240
Gly	Ser	Ala	Ala	Gly	Pro	His	Pro	Tyr	Pro	Met	Met	Val	Arg	Glu
				245					250					255
Phe	Gln	Ser	Val	Ile	Gly	Arg	Glu	Thr	Lys	Val	Gln	Ala	Gln	Glu
				260					265					270
Lys	Trp	Gly	Gly	Leu	Pro	Asp	Ile	Val	Met	Ala	Cys	Val	Gly	Gly
				275					280					285
Gly	Ser	Asn	Ala	Ile	Gly	Ile	Phe	Asn	Glu	Phe	Ile	Asn	Asp	Thr
				290					295					300
Ser	Val	Arg	Leu	Ile	Gly	Val	Glu	Ala	Gly	Gly	Glu	Gly	Val	Asn
				305					310					315
Thr	Thr	Lys	His	Ala	Ala	Thr	Leu	Thr	Met	Gly	Thr	Pro	Gly	Val
				320					325					330
Leu	His	Gly	Ser	Tyr	Ser	Tyr	Leu	Leu	Gln	Asp	Asp	Asp	Gly	Gln
				335					340					345
Ile	Ile	Asp	Pro	His	Ser	Ile	Ser	Ala	Gly	Leu	Asp	Tyr	Pro	Gly
				350					355					360
Ile	Gly	Pro	Glu	His	Ser	Phe	Leu	Lys	Asp	Val	Lys	Arg	Ala	Glu
				365					370					375
Tyr	Tyr	Ala	Val	Thr	Asp	Ala	Glu	Ala	Leu	Glu	Gly	Phe	Gln	Leu
				380					385					390
Leu	Ser	Lys	Leu	Glu	Gly	Ile	Ile	Pro	Ala	Leu	Glu	Thr	Ser	His
				395					400					405
Ala	Ile	Ala	Tyr	Leu	Glu	Lys	Leu	Ile	Pro	Thr	Leu	Lys	Ser	Gly
				410					415					420
Thr	Arg	Val	Val	Ile	Asn	Cys	Ser	Gly	Arg	Gly	Asp	Lys	Asp	Val
				425					430					435
Asn	Asn	Ala	Met	Lys	Tyr	Ile	Asn	Pro						
				440										

<210> SEQ ID NO: 48
<211> 1197
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CAC AAG CCG GAG GAG GTG GGC ATC CAC GAC ACC CCC CTC CGC CCC GAC 48
GCC TCG GGG AGG TAT GGC AAG TTT GGA GGC AAG TAC GTG CCG GAG ACG 96
CTC ATC GTC GCC CTG ACG GAG CTG GAG CAG GCG TAC AAG GAG GCC CTC 144
GCC GAC CCG ACC TTC ATT GCC GAG CTG GAC CAC CTT CTG AAG ACG TAT 192
GTG GGC CGC CCC TCT CCC CTT TAC CAC GCG GAG CGG CTG TCG GAG CAC 240
TAC CGC CGC CCC GAC GGC TCC GGC CCC GAG ATC TGG CTG AAG CGG GAG 288
GAC CTG AAC CAC ACG GGC GCG CAC AAG ATC AAC AAC TCG CTG GGC CAG 336
GCG CTG CTG TGC CAG CGC ATG GGC AAG AAG CGC GTG ATC GCG GAG ACG 384
GGC GCT GGC CAG CAC GGC GTG GCG ACG GCC ACG GTC TGC GCC CGC GCG 432
GGG CTG CAG TGC GTG GTC TAC ATG GGC ACC AAG GAC ATG GAG CGC CAG 480
TCG CTG AAC GTG TTC CGC ATG CGC CTG CTG GGC GCG GAG GAC GCG ACC 528
AGC GAG GCC CTG CGC GAC TGG GTG ACC AAC GTG GAG GGG TCG CAC TAC 576
ATC CTG GGC TCG GTG GCG GGG CCC CAC CCC TTC CCC CAG ATC GTG CGC 624
GAC TTC CAG GCG GTC ATC GGC CGC GAG CTG CGC GGG CAG GCC GCT GAG 672
GCC TGG GGC GGG CTG CCG GAC GTG CTG CTG GCC TGC GTC GGC GGC GGC 720
AGC AAC GCC ATG GGC CTC TTC CAC GAG TTC GTG GGC GAC GCA TCC GTG 768
CGC CTC GTC GGC GTG GAG GCC GCC GGG CAC GGG CTG GCC ACC GAC AAG 816
CAC GCG GCG ACG CTG ACG CTG GGG CGC GTG GGC GTG CTG CAC GGC TCC 864
ATG TCC TAC GTC ATC CAA GAC CCG GAT GGC CAG ATC GTG GAC CCC CAC 912
TCC ATC TCC GCG GGG CTG GAC TAC CCA GGC GTG GGC CCC GAG CAC GCC 960
TAC CTG CGC GAC GCG GGC CGC GCC GAG TAC GGG TCG GTG ACC GAC GCG 1008
CAG GCG CTG GAG GCC TTC CAG CTC CTG TCG CGC CTG GAG GGC ATC ATC 1056
CCC GCC CTG GAG TCC AGC CAC GCG CTG GCC TAC CTG GAC ACG CTG TGC 1104
CCC ACG CTG GCC CAC GGC ACG CGC GTC GTT GTC AAC TGC AGC GGC CGT 1152
GGA GAC AAA GAC GTG CAG CAG GTC ATC CAG CAC CTG GAC CTC TGA 	1197 	

<210> SEQ ID NO: 49
<211> 258
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Asn	Ala	Leu	Lys	Ala	Lys	Gly	Lys	Val	Ala	Phe	Ile	Pro	Phe
1				5					10					15
Ile	Cys	Ala	Gly	Asp	Pro	Asp	Leu	Asp	Thr	Thr	Ser	Leu	Ala	Leu
				20					25					30
Arg	Lys	Leu	Asp	Glu	Val	Gly	Ala	Asp	Val	Ile	Glu	Leu	Gly	Val
				35					40					45
Pro	Tyr	Ser	Asp	Pro	Leu	Ala	Asp	Gly	Pro	Val	Ile	Gln	Gly	Ala
				50					55					60
Ala	Thr	Arg	Ala	Leu	Asp	Lys	His	Thr	Thr	Leu	Asp	Lys	Val	Ile
				65					70					75
Glu	Met	Val	Arg	Arg	Thr	Ala	Pro	Ala	Met	Lys	Ala	Pro	Leu	Val
				80					85					90
Met	Phe	Thr	Tyr	Tyr	Asn	Pro	Ile	Met	Arg	Lys	Gly	Leu	Asp	Asn
				95					100					105
Phe	Ala	Arg	Thr	Ile	Lys	Glu	Ala	Gly	Ala	Ala	Gly	Leu	Leu	Val
				110					115					120
Pro	Asp	Leu	Pro	Leu	Glu	Glu	Thr	Val	Ser	Val	Arg	Ala	Ala	Cys
				125					130					135
Glu	Lys	Ala	Gly	Ile	Glu	Leu	Val	Leu	Leu	Ala	Thr	Pro	Thr	Thr
				140					145					150
Pro	Gln	Ala	Arg	Met	Arg	Ala	Ile	Ala	Gln	Ala	Ser	Gln	Gly	Phe
				155					160					165
Val	Tyr	Leu	Val	Ser	Val	Thr	Gly	Val	Thr	Gly	Met	Lys	Glu	Gln
				170					175					180
Val	Ser	Gly	Arg	Val	Glu	Gly	Leu	Val	Ser	Glu	Leu	Lys	Ala	Val
				185					190					195
Thr	Asp	Lys	Pro	Val	Cys	Val	Gly	Phe	Gly	Val	Ser	Arg	Ala	Glu
				200					205					210
His	Ala	Lys	Gln	Ile	Val	Ser	Trp	Gly	Ala	Asp	Gly	Val	Ile	Cys
				215					220					225
Gly	Ser	Ala	Leu	Val	Lys	Ala	Leu	Gly	Glu	Ala	Ser	Ser	Pro	Ala
				230					235					240
Glu	Gly	Leu	Gln	Ala	Met	Glu	Lys	Leu	Ala	Arg	Glu	Leu	Arg	Ala
				245					250					255
Ala	Thr	Pro												
														
<210> SEQ ID NO: 50
<211> 1739
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG GAT GCC GAG GCA TTG CCA TGC ACA TGG CCG GGT CCC TGC CCA GTC 48
TCC AAA CCC AGG CCG CTT GTG ATT TTG GGA ATG GCC TCC TCC CTC CGC 96
CTG GCA GCG GCC CCA GGC CAG GCC ACA GTA CTC ACC CGG AAA GGT GGT 144
CAC CTC GCC GTC CTG CGC CCA GCA CCC AGG TTC GCA GCC TTC CAG TCC 192
AGA CCC CGG TGC GCT GCC CTT CAC CCC GTC ATG GTA GCT TCA GCC GAG 240
TCA GCT GCC CTG GAG GGC GCT CAG GCG CGT GGT GCC AGC ATC ACC GAC 288
ACC ATG CAG GCC CTG AAA CGG GAG GGC CGG TGA GCA GGG CCG TGG GCG 336
GGT GCC GGG AGC ATG GCG AGG CAT GGG CAT TGG AAT TGC CAC CCT CAT 384
GCA CAA TCT GCA CCC CAC AGA GTG GCA TTC ATC CCC TTC CTG GTG GCC 432
TGC GAC CCG GAC CGG GAC ACC ACG GTG GAG GCC ATC CGC CGG CTG GAT 480
GCC CTG GGC GCC TCT GTC ATC GAG CTG GGG GTG CCC TAT TGC GAC CCT 528
GTG GCG GAC GGA CCC GTC ATC CAG GCC GCT GCC ACG CGT GGC CTG GCC 576
CAC GGC ACC ACC CTG GAC GAC GTG CTG GGC CTG GTG GCC GAG GTG GCG 624
CCC ACC ATC AAG GCC CCC ATC CTG CTC TTC GTC TAC TTC AAC CCC ATC 672
CTC ACC CGG GGC CTG GAG AAG TTC TGC ACG CAG GCA ATT GCC GCG GGG 720
GCC AAG GGT GAG ACG GGT CTG TAG GGA GTG TGG GCA TGC ATC CGC ATG 768
GCC TCG GCA GAC CCC CCG CTC CGC ACC GTG CCA TGT CTC TCC GCC GTG 816
CCC CTC GAG CAG TAT GGG CAT GTG CTC GGC CCA TTC CAG TCT TTG CCC 864
ACC CTC TCA GTC GCC CCT CTC TGT CTC CCT CAT CCA AGG CCT CCT GGT 912
CCC CGA CAT CCC CCT GGA GGA GGC GTC GGC CGT GGC CGA CGT GGC GGC 960
GGC GCA CGG CCT GGA GCT TGT CCT GCT GAC CAC GCC CAC GAC CCC CCC 1008
CGA GCG CAT GCA GGC CAT CGC CCG TCG CTC CTC CGG CTT CGT CTA CCT 1056
CGT GTC CAT CAC CGG GGT CAC GGG GGT CAA GGA GGC CAT GGA GGC GCG 1104
CGT GGA GGG CCT CAT CGC CGA GCT GCA GCG CCA GAC CGA CAA GCC CGT 1152
CGC GGT CGG CTT CGG CGT CTC CCG CCC CGA CCA GGC GCG CCA GCT GGC 1200
GGG CTG GGG CGC GGA GGG CGT CAT CGT CGG CTC GGC CCT TGT GCG GGC 1248
GCT GGG CGA GGC CGC CAG CCC CGC AGA GGG CCT GGA CGC CAT GGC CGA 1296
GCT GGC CGC CAG CCT GGT GCA GGC GCG CCA GAA GTG AGC GGG GCG CGG 1344
GTG CGC TGG GCC GCC CAG CGC TCC GTG CGG GAG GGT GCC GGG CGC AGG 1392
CCG GCG AGG GGT CTG CTC CCG AGC CGG CCA AGG CGT GAA ACG GGG CAC 1440
GCG CCT GCC AAT CCC TCG GCG CCC AGT CGA CGA ACA CTT GAC CTC CAG 1488
TCG CTG CGT TAC GCT GCG GCC CGA TGC AGG CAG CCC TCC CTT CTT GTA 1536
CCT CCG AGT GCT CCT CTC GCT GCC TGC CCC CTG TCC CTC TGC CCC CAG 1584
GTT TTG AGT CCA CCT GCT GCT GCT TGG CAG GAA GAT GCA AGG CCA ACA 1632
TGT GGC AAC ACC CAG CCT CCC ATC CAG GAT TGG AAT GCA CAG GAA CCC 1680
ACT TCT CTC CTC CAG CTT GCT TGT CCA GGC TAT CCT GCT GGC AGT GGT 1728
TAT TAG CAT GC 							1739

<210> SEQ ID NO: 51
<211> 539
<212> Amino Acid Sequence
<213> Arabidopsis thaliana

Met	Glu	Ser	Ser	Leu	Phe	Ser	Pro	Ser	Ser	Ser	Ser	Tyr	Ser	Ser
1				5					10					15
Leu	Phe	Thr	Ala	Lys	Pro	Thr	Arg	Leu	Leu	Ser	Pro	Lys	Pro	Lys
				20					25					30
Phe	Thr	Phe	Ser	Ile	Arg	Ser	Ser	Ile	Glu	Lys	Pro	Lys	Pro	Lys
				35					40					45
Leu	Glu	Thr	Asn	Ser	Ser	Lys	Ser	Gln	Ser	Trp	Val	Ser	Pro	Asp
				50					55					60
Trp	Leu	Thr	Thr	Leu	Thr	Arg	Thr	Leu	Ser	Ser	Gly	Lys	Asn	Asp
				65					70					75
Glu	Ser	Gly	Ile	Pro	Ile	Ala	Asn	Ala	Lys	Leu	Asp	Asp	Val	Ala
				80					85					90
Asp	Leu	Leu	Gly	Gly	Ala	Leu	Phe	Leu	Pro	Leu	Tyr	Lys	Trp	Met
				95					100					105
Asn	Glu	Tyr	Gly	Pro	Ile	Tyr	Arg	Leu	Ala	Ala	Gly	Pro	Arg	Asn
				110					115					120
Phe	Val	Ile	Val	Ser	Asp	Pro	Ala	Ile	Ala	Lys	His	Val	Leu	Arg
				125					130					135
Asn	Tyr	Pro	Lys	Tyr	Ala	Lys	Gly	Leu	Val	Ala	Glu	Val	Ser	Glu
				140					145					150
Phe	Leu	Phe	Gly	Ser	Gly	Phe	Ala	Ile	Ala	Glu	Gly	Pro	Leu	Trp
				155					160					165
Thr	Ala	Arg	Arg	Arg	Ala	Val	Val	Pro	Ser	Leu	His	Arg	Arg	Tyr
				170					175					180
Leu	Ser	Val	Ile	Val	Glu	Arg	Val	Phe	Cys	Lys	Cys	Ala	Glu	Arg
				185					190					195
Leu	Val	Glu	Lys	Leu	Gln	Pro	Tyr	Ala	Glu	Asp	Gly	Ser	Ala	Val
				200					205					210
Asn	Met	Glu	Ala	Lys	Phe	Ser	Gln	Met	Thr	Leu	Asp	Val	Ile	Gly
				215					220					225
Leu	Ser	Leu	Phe	Asn	Tyr	Asn	Phe	Asp	Ser	Leu	Thr	Thr	Asp	Ser
				230					235					240
Pro	Val	Ile	Glu	Ala	Val	Tyr	Thr	Ala	Leu	Lys	Glu	Ala	Glu	Leu
				245					250					255
Arg	Ser	Thr	Asp	Leu	Leu	Pro	Tyr	Trp	Lys	Ile	Asp	Ala	Leu	Cys
				260					265					270
Lys	Ile	Val	Pro	Arg	Gln	Val	Lys	Ala	Glu	Lys	Ala	Val	Thr	Leu
				275					280					285
Ile	Arg	Glu	Thr	Val	Glu	Asp	Leu	Ile	Ala	Lys	Cys	Lys	Glu	Ile
				290					295					300
Val	Glu	Arg	Glu	Gly	Glu	Arg	Ile	Asn	Asp	Glu	Glu	Tyr	Val	Asn
				305					310					315
Asp	Ala	Asp	Pro	Ser	Ile	Leu	Arg	Phe	Leu	Leu	Ala	Ser	Arg	Glu
				320					325					330
Glu	Val	Ser	Ser	Val	Gln	Leu	Arg	Asp	Asp	Leu	Leu	Ser	Met	Leu
				335					340					345
Val	Ala	Gly	His	Glu	Thr	Thr	Gly	Ser	Val	Leu	Thr	Trp	Thr	Leu
				350					355					360
Tyr	Leu	Leu	Ser	Lys	Asn	Ser	Ser	Ala	Leu	Arg	Lys	Ala	Gln	Glu
				365					370					375
Glu	Val	Asp	Arg	Val	Leu	Glu	Gly	Arg	Asn	Pro	Ala	Phe	Glu	Asp
				380					385					390
Ile	Lys	Glu	Leu	Lys	Tyr	Ile	Thr	Arg	Cys	Ile	Asn	Glu	Ser	Met
				395					400					405
Arg	Leu	Tyr	Pro	His	Pro	Pro	Val	Leu	Ile	Arg	Arg	Ala	Gln	Val
				410					415					420
Pro	Asp	Ile	Leu	Pro	Gly	Asn	Tyr	Lys	Val	Asn	Thr	Gly	Gln	Asp
				425					430					435
Ile	Met	Ile	Ser	Val	Tyr	Asn	Ile	His	Arg	Ser	Ser	Glu	Val	Trp
				440					445					450
Glu	Lys	Ala	Glu	Glu	Phe	Leu	Pro	Glu	Arg	Phe	Asp	Ile	Asp	Gly
				455					460					465
Ala	Ile	Pro	Asn	Glu	Thr	Asn	Thr	Asp	Phe	Lys	Phe	Ile	Pro	Phe
				470					475					480
Ser	Gly	Gly	Pro	Arg	Lys	Cys	Val	Gly	Asp	Gln	Phe	Ala	Leu	Met
				485					490					495
Glu	Ala	Ile	Val	Ala	Leu	Ala	Val	Phe	Leu	Gln	Arg	Leu	Asn	Val
				500					505					510
Glu	Leu	Val	Pro	Asp	Gln	Thr	Ile	Ser	Met	Thr	Thr	Gly	Ala	Thr
				515					520					525
Ile	His	Thr	Thr	Asn	Gly	Leu	Tyr	Met	Lys	Val	Ser	Gln	Arg	
				530					535					

<210> SEQ ID NO: 52
<211> 1301
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GTA AAC TCG TCA TGA ATT CCA GAT TCC TCC ATG GCA GGC CAG GCT GTC 48
TCC TGG TGC ACC CTG GCT CCC ATA CAA GAA AAG AGG CGA GAC TGC TCA 96
AGG CCG CAC CCA TCA ACC GGC GAC GTC AGG TTT CGT GTA CGG CCG GCC 144
TAG CCT CCC AGC TCC TTC CTT GGG CCA TAT TTT TCA GTC CTG GAG CAG 192
GCA TTT TGG CGT ATG CGG CCG TGA GGG GCG GGG GAA ATA TCA AGG ACG 240
GCT TGA GCC ATG TGC TCA CCC AAG TCA GCC AGG GGT ACT TTC AAC CCC 288
AGG TCG GGG GCG ACA CCA TCC CGG CAG CGC AGG GCA ACC TCT CAG ATC 336
TTG CGG GAG ACG AGC CTC TCT TCA AAG CAC TGT TCA AAT GGT TTC AAG 384
AGT GTG GAG GAG TCT TCA AGC TGG AGT TCG GAC CCA AAG CCT TCA TTG 432
TCG TGT CAG ACC CTG TGG TCG TAC GCC ACC TGC TAA AGG AGA ATG CAT 480
ACA ACT ATG ACA AGG GAG TGC TGG CAG ACA TCC TGG AAC CGT TCA TGG 528
GCA AGG GCC TCA TAC CAG CTG ACT GGG ACA CAT GGA AAG TGA GGC GCA 576
AGA AGA TCG TTC CGG GCT TTC ACA AGG CGT ACC TCA ACG CTT CCA TGG 624
CCA TGT TTG GAC GCT GCA CGG AGC GTA TGA TCG GCA AGC TGG ATG CCC 672
ACA CTG CTT CGG CAA ATA GCG ACG ACA TGG TGG ACA TGG AGA CTG AGT 720
TTC TGA ACC TGG GCC TTG ACA TCA TTG GGT TGG GAG TCT TCA ACT ACG 768
AAT TTG GGA GCA TCA CCC GCG AGT CAC CTG TCA TCA AGG CTG TTT ACA 816
GTG TCT TGA AGG AGG CAG AGC ATC GCA GCA CAT TTT ATT TGC CGT ACT 864
GGG ACT TGC CCC TGG CGT CCC TGG TCG TGC CCC GCC AGC GGA GAT TTC 912
AGA GAG ACA TCC AGG TAC TGA ATG AGT GTC TGG ATG AGC TCA TTG AGC 960
AGG CCC GCA GGA TCA GCG AGC CCG AAG AGA TCG AAG CGC TGC AGC AGC 1008
GGG ACT ACA GCA AGG TCA AGG ATG CCA GCC TGC TGC GCT TCC TTG TGG 1056
ATG CGG GGG ATG CTG ACC TGG GTG CTC GCC AGC TGA GGG ACG ACC TGA 1104
TGA CCA TGC TGA TTG CGG GCC ACG AGA CCA CTG CGG CTG TGC TGA CTT 1152
GGA CGC TGT ACT GCC TGA CAA AGA GCC CCT CAC ACA TGG CTG AGA TCC 1200
AGG CCG AGG TGG ATG CCG CGC TGG GGG ACA GCC ACC CAG ACC TGG ACG 1248
ACA TCG GGC GTC TCC CCA AAA CAC GAG CAG CGC TGG CAG AGT CCT TGC 1296
GGC TC                                                          1301                                                                     

<210> SEQ ID NO: 53
<211> 524
<212> Amino Acid Sequence
<213> Arabidopsis thaliana

Met	Glu	Cys	Val	Gly	Ala	Arg	Asn	Phe	Ala	Ala	Met	Ala	Val	Ser
1				5					10					15
Thr	Phe	Pro	Ser	Trp	Ser	Cys	Arg	Arg	Lys	Phe	Pro	Val	Val	Lys
				20					25					30
Arg	Tyr	Ser	Tyr	Arg	Asn	Ile	Arg	Phe	Gly	Leu	Cys	Ser	Val	Arg
				35					40					45
Ala	Ser	Gly	Gly	Gly	Ser	Ser	Gly	Ser	Glu	Ser	Cys	Val	Ala	Val
				50					55					60
Arg	Glu	Asp	Phe	Ala	Asp	Glu	Glu	Asp	Phe	Val	Lys	Ala	Gly	Gly
				65					70					75
Ser	Glu	Ile	Leu	Phe	Val	Gln	Met	Gln	Gln	Asn	Lys	Asp	Met	Asp
				80					85					90
Glu	Gln	Ser	Lys	Leu	Val	Asp	Lys	Leu	Pro	Pro	Ile	Ser	Ile	Gly
				95					100					105
Asp	Gly	Ala	Leu	Asp	Leu	Val	Val	Ile	Gly	Cys	Gly	Pro	Ala	Gly
				110					115					120
Leu	Ala	Leu	Ala	Ala	Glu	Ser	Ala	Lys	Leu	Gly	Leu	Lys	Val	Gly
				125					130					135
Leu	Ile	Gly	Pro	Asp	Leu	Pro	Phe	Thr	Asn	Asn	Tyr	Gly	Val	Trp
				140					145					150
Glu	Asp	Glu	Phe	Asn	Asp	Leu	Gly	Leu	Gln	Lys	Cys	Ile	Glu	His
				155					160					165
Val	Trp	Arg	Glu	Thr	Ile	Val	Tyr	Leu	Asp	Asp	Asp	Lys	Pro	Ile
				170					175					180
Thr	Ile	Gly	Arg	Ala	Tyr	Gly	Arg	Val	Ser	Arg	Arg	Leu	Leu	His
				185					190					195
Glu	Glu	Leu	Leu	Arg	Arg	Cys	Val	Glu	Ser	Gly	Val	Ser	Tyr	Leu
				200					205					210
Ser	Ser	Lys	Val	Asp	Ser	Ile	Thr	Glu	Ala	Ser	Asp	Gly	Leu	Arg
				215					220					225
Leu	Val	Ala	Cys	Asp	Asp	Asn	Asn	Val	Ile	Pro	Cys	Arg	Leu	Ala
				230					235					240
Thr	Val	Ala	Ser	Gly	Ala	Ala	Ser	Gly	Lys	Leu	Leu	Gln	Tyr	Glu
				245					250					255
Val	Gly	Gly	Pro	Arg	Val	Cys	Val	Gln	Thr	Ala	Tyr	Gly	Val	Glu
				260					265					270
Val	Glu	Val	Glu	Asn	Ser	Pro	Tyr	Asp	Pro	Asp	Gln	Met	Val	Phe
				275					280					285
Met	Asp	Tyr	Arg	Asp	Tyr	Thr	Asn	Glu	Lys	Val	Arg	Ser	Leu	Glu
				290					295					300
Ala	Glu	Tyr	Pro	Thr	Phe	Leu	Tyr	Ala	Met	Pro	Met	Thr	Lys	Ser
				305					310					315
Arg	Leu	Phe	Phe	Glu	Glu	Thr	Cys	Leu	Ala	Ser	Lys	Asp	Val	Met
				320					325					330
Pro	Phe	Asp	Leu	Leu	Lys	Thr	Lys	Leu	Met	Leu	Arg	Leu	Asp	Thr
				335					340					345
Leu	Gly	Ile	Arg	Ile	Leu	Lys	Thr	Tyr	Glu	Glu	Glu	Trp	Ser	Tyr
				350					355					360
Ile	Pro	Val	Gly	Gly	Ser	Leu	Pro	Asn	Thr	Glu	Gln	Lys	Asn	Leu
				365					370					375
Ala	Phe	Gly	Ala	Ala	Ala	Ser	Met	Val	His	Pro	Ala	Thr	Gly	Tyr
				380					385					390
Ser	Val	Val	Arg	Ser	Leu	Ser	Glu	Ala	Pro	Lys	Tyr	Ala	Ser	Val
				395					400					405
Ile	Ala	Glu	Ile	Leu	Arg	Glu	Glu	Thr	Thr	Lys	Gln	Ile	Asn	Ser
				410					415					420
Asn	Ile	Ser	Arg	Gln	Ala	Trp	Asp	Thr	Leu	Trp	Pro	Pro	Glu	Arg
				425					430					435
Lys	Arg	Gln	Arg	Ala	Phe	Phe	Leu	Phe	Gly	Leu	Ala	Leu	Ile	Val
				440					445					450
Gln	Phe	Asp	Thr	Glu	Gly	Ile	Arg	Ser	Phe	Phe	Arg	Thr	Phe	Phe
				455					460					465
Arg	Leu	Pro	Lys	Trp	Met	Trp	Gln	Gly	Phe	Leu	Gly	Ser	Thr	Leu
				470					475					480
Thr	Ser	Gly	Asp	Leu	Val	Leu	Phe	Ala	Leu	Tyr	Met	Phe	Val	Ile
				485					490					495
Ser	Pro	Asn	Asn	Leu	Arg	Lys	Gly	Leu	Ile	Asn	His	Leu	Ile	Ser
				500					505					510
Asp	Pro	Thr	Gly	Ala	Thr	Met	Ile	Lys	Thr	Tyr	Leu	Lys	Val	
				515					520					

<210> SEQ ID NO: 54
<211> 1369
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CTC CAG TTG GAT GCC TAC GAC CCC ATG GAG GTC CAG GCT GCC GAC CTG 48
GTC GTG GTG GGC GCT GGG CCC GCA GGC CTG TCG GTG GCT GCA CGC GTG 96
TCT CAG GCC GGC TTC CGC GTG GTC CTG GTG GAC CCC GAT CCT CTG GGG 144
GAG TGG CGC AAC AAC TAT GGC GTC TGG TGC GAC GAG TTC GAG GAC ATG 192
GGC CTG GAG GAC TGC TTC GAC ACG GTG TGG GAC CGG GCC GTC GTC TAT 240
CTG GAC TCG GGT CCT GAT GGA CAA AGG AGC CTG AGC CGG CCC TAC GCC 288
CGC GTG GAC CGA CCC CGG CTG AAG CGC AAG ATG CTG TCA GAG TGC GTC 336
CGC CAT GGC GTC CAG TTC CAC AGC GTC AAG GCC GAG GAC GCG ACG CAC 384
GGG GAT GGG CGC TCG ACG CTG CGC TGC AGC GAT GGG TCC CAG GTC ACG 432
GCA AGT CTC GTC CTA GAC GCC AGT GGC CAC AGC AGG TCC CTG GTC AAG 480
TAT GAC ACA AAT TTT GAC CCT GGG TAC CAG GGC GCG TAT GGC TTC GTG 528
GCA GAG GTC GAG TGG CAT CCC TTT GAC CCT GGG GCA ATG CTC TTC ATG 576
GAC TGG CGT GAC GAC CAC CTG GCA GAC CGC CCA GAC CTG CGA GCC TCC 624
AAT GAA CGG CTG CCT TCG TTC CTG TAT GCC ATG CCC CTG TCA GCC ACA 672
TCT GTG TTT TTG GAG GAG ACG TCG CTG GTG GCA CGT CCC ATC ATC CCC 720
TTT GAG GAT TTG AAG GTC AGG CTG GAG GCC CGA CTG AAG TAT CTC GGA 768
CTC GAG ATC AAG AGC ATA TCT GAG GAG GAG TAT TGT CGC ATT CCC ATG 816
GGC GGT GCA CTC CCG ACC CTG CCT CAG CGT GTG CTG GGG TTG GGA GGC 864
ACT GCT GGG ATG GTG CAC CCC TCC ACA GGG TAC ATG ATC TCG AGG GTT 912
CTG GGT GCT GCC CCG CTC GTG GCA GAC ACC ATA ATC GAC CAG CTC TGC 960
TCC GTC TCA GAC AGA GCA CAG GAC CGC AAC ACG CCC CGC GCC CCA CGG 1008
AGC GAG GAG GAG GCT GAT GCA ATG GCT GCC GCC GTC TGG GCG GCG CTG 1056
TGG CCC AAG CGA CGC CAG AGG CAG CGA GAG TTC TTC TGG TTT GGC ATG 1104
GAG ATC CTG TTG AAG CTG GAC TTG CAT GAG ACC CGG AAC TTT TTC TCG 1152
GCA TTT TTC TCC TTG TCT GAG CAT CAC TGG CAT GGG TTC CTC TCG GCG 1200
CGA CTC ACC TTC ACT GAG CTC ATT GGC TTT GGC TTG AGC CTC TTC GCA 1248
AAG TCC AGC AGC TCT GCG CGC CTG GGC CTT ATT GCC AAG GGG CTA CCG 1296
GGC TTG GTC ACC ATG CTT GCT CGT GCG ACT CAA ATT AAG TGA TTT GGA 1344
GAT TTT GTA AGG CTC AGC AGT GCC A 				1369

<210> SEQ ID NO: 55
<211> 595
<212> Amino Acid Sequence
<213> Arabidopsis thaliana

Met	Ala	Met	Ala	Phe	Pro	Leu	Ser	Tyr	Thr	Pro	Thr	Ile	Thr	Val
1				5					10					15
Lys	Pro	Val	Thr	Tyr	Ser	Arg	Arg	Ser	Asn	Phe	Val	Val	Phe	Ser
				20					25					30
Ser	Ser	Ser	Asn	Gly	Arg	Asp	Pro	Leu	Glu	Glu	Asn	Ser	Val	Pro
				35					40					45
Asn	Gly	Val	Lys	Ser	Leu	Glu	Lys	Leu	Gln	Glu	Glu	Lys	Arg	Arg
				50					55					60
Ala	Glu	Leu	Ser	Ala	Arg	Ile	Ala	Ser	Gly	Ala	Phe	Thr	Val	Arg
				65					70					75
Lys	Ser	Ser	Phe	Pro	Ser	Thr	Val	Lys	Asn	Gly	Leu	Ser	Lys	Ile
				80					85					90
Gly	Ile	Pro	Ser	Asn	Val	Leu	Asp	Phe	Met	Phe	Asp	Trp	Thr	Gly
				95					100					105
Ser	Asp	Gln	Asp	Tyr	Pro	Lys	Val	Pro	Glu	Ala	Lys	Gly	Ser	Ile
				110					115					120
Gln	Ala	Val	Arg	Asn	Glu	Ala	Phe	Phe	Ile	Pro	Leu	Tyr	Glu	Leu
				125					130					135
Phe	Leu	Thr	Tyr	Gly	Gly	Ile	Phe	Arg	Leu	Thr	Phe	Gly	Pro	Lys
				140					145					150
Ser	Phe	Leu	Ile	Val	Ser	Asp	Pro	Ser	Ile	Ala	Lys	His	Ile	Leu
				155					160					165
Lys	Asp	Asn	Ala	Lys	Ala	Tyr	Ser	Lys	Gly	Ile	Leu	Ala	Glu	Ile
				170					175					180
Leu	Asp	Phe	Val	Met	Gly	Lys	Gly	Leu	Ile	Pro	Ala	Asp	Gly	Glu
				185					190					195
Ile	Trp	Arg	Arg	Arg	Arg	Arg	Ala	Ile	Val	Pro	Ala	Leu	His	Gln
				200					205					210
Lys	Tyr	Val	Ala	Ala	Met	Ile	Ser	Leu	Phe	Gly	Glu	Ala	Ser	Asp
				215					220					225
Arg	Leu	Cys	Gln	Lys	Leu	Asp	Ala	Ala	Ala	Leu	Lys	Gly	Glu	Glu
				230					235					240
Val	Glu	Met	Glu	Ser	Leu	Phe	Ser	Arg	Leu	Thr	Leu	Asp	Ile	Ile
				245					250					255
Gly	Lys	Ala	Val	Phe	Asn	Tyr	Asp	Phe	Asp	Ser	Leu	Thr	Asn	Asp
				260					265					270
Thr	Gly	Val	Ile	Glu	Ala	Val	Tyr	Thr	Val	Leu	Arg	Glu	Ala	Glu
				275					280					285
Asp	Arg	Ser	Val	Ser	Pro	Ile	Pro	Val	Trp	Asp	Ile	Pro	Ile	Trp
				290					295					300
Lys	Asp	Ile	Ser	Pro	Arg	Gln	Arg	Lys	Val	Ala	Thr	Ser	Leu	Lys
				305					310					315
Leu	Ile	Asn	Asp	Thr	Leu	Asp	Asp	Leu	Ile	Ala	Thr	Cys	Lys	Arg
				320					325					330
Met	Val	Glu	Glu	Glu	Glu	Leu	Gln	Phe	His	Glu	Glu	Tyr	Met	Asn
				335					340					345
Glu	Arg	Asp	Pro	Ser	Ile	Leu	His	Phe	Leu	Leu	Ala	Ser	Gly	Asp
				350					355					360
Asp	Val	Ser	Ser	Lys	Gln	Leu	Arg	Asp	Asp	Leu	Met	Thr	Met	Leu
				365					370					375
Ile	Ala	Gly	His	Glu	Thr	Ser	Ala	Ala	Val	Leu	Thr	Trp	Thr	Phe
				380					385					390
Tyr	Leu	Leu	Thr	Thr	Glu	Pro	Ser	Val	Val	Ala	Lys	Leu	Gln	Glu
				395					400					405
Glu	Val	Asp	Ser	Val	Ile	Gly	Asp	Arg	Phe	Pro	Thr	Ile	Gln	Asp
				410					415					420
Met	Lys	Lys	Leu	Lys	Tyr	Thr	Thr	Arg	Val	Met	Asn	Glu	Ser	Leu
				425					430					435
Arg	Leu	Tyr	Pro	Gln	Pro	Pro	Val	Leu	Ile	Arg	Arg	Ser	Ile	Asp
				440					445					450
Asn	Asp	Ile	Leu	Gly	Glu	Tyr	Pro	Ile	Lys	Arg	Gly	Glu	Asp	Ile
				455					460					465
Phe	Ile	Ser	Val	Trp	Asn	Leu	His	Arg	Ser	Pro	Leu	His	Trp	Asp
				470					475					480
Asp	Ala	Glu	Lys	Phe	Asn	Pro	Glu	Arg	Trp	Pro	Leu	Asp	Gly	Pro
				485					490					495
Asn	Pro	Asn	Glu	Thr	Asn	Gln	Asn	Phe	Ser	Tyr	Leu	Pro	Phe	Gly
				500					505					510
Gly	Gly	Pro	Arg	Lys	Cys	Ile	Gly	Asp	Met	Phe	Ala	Ser	Phe	Glu
				515					520					525
Asn	Val	Val	Ala	Ile	Ala	Met	Leu	Ile	Arg	Arg	Phe	Asn	Phe	Gln
				530					535					540
Ile	Ala	Pro	Gly	Ala	Pro	Pro	Val	Lys	Met	Thr	Thr	Gly	Ala	Thr
				545					550					555
Ile	His	Thr	Thr	Glu	Gly	Leu	Lys	Leu	Thr	Val	Thr	Lys	Arg	Thr
				560					565					570
Lys	Pro	Leu	Asp	Ile	Pro	Ser	Val	Pro	Ile	Leu	Pro	Met	Asp	Thr
				575					580					585
Ser	Arg	Asp	Glu	Val	Ser	Ser	Ala	Leu	Ser					
				590					595					

<210> SEQ ID NO: 56
<211> 1301
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GTA AAC TCG TCA TGA ATT CCA GAT TCC TCC ATG GCA GGC CAG GCT GTC 48
TCC TGG TGC ACC CTG GCT CCC ATA CAA GAA AAG AGG CGA GAC TGC TCA 96
AGG CCG CAC CCA TCA ACC GGC GAC GTC AGG TTT CGT GTA CGG CCG GCC 144
TAG CCT CCC AGC TCC TTC CTT GGG CCA TAT TTT TCA GTC CTG GAG CAG 192
GCA TTT TGG CGT ATG CGG CCG TGA GGG GCG GGG GAA ATA TCA AGG ACG 240
GCT TGA GCC ATG TGC TCA CCC AAG TCA GCC AGG GGT ACT TTC AAC CCC 288
AGG TCG GGG GCG ACA CCA TCC CGG CAG CGC AGG GCA ACC TCT CAG ATC 336
TTG CGG GAG ACG AGC CTC TCT TCA AAG CAC TGT TCA AAT GGT TTC AAG 384
AGT GTG GAG GAG TCT TCA AGC TGG AGT TCG GAC CCA AAG CCT TCA TTG 432
TCG TGT CAG ACC CTG TGG TCG TAC GCC ACC TGC TAA AGG AGA ATG CAT 480
ACA ACT ATG ACA AGG GAG TGC TGG CAG ACA TCC TGG AAC CGT TCA TGG 528
GCA AGG GCC TCA TAC CAG CTG ACT GGG ACA CAT GGA AAG TGA GGC GCA 576
AGA AGA TCG TTC CGG GCT TTC ACA AGG CGT ACC TCA ACG CTT CCA TGG 624
CCA TGT TTG GAC GCT GCA CGG AGC GTA TGA TCG GCA AGC TGG ATG CCC 672
ACA CTG CTT CGG CAA ATA GCG ACG ACA TGG TGG ACA TGG AGA CTG AGT 720
TTC TGA ACC TGG GCC TTG ACA TCA TTG GGT TGG GAG TCT TCA ACT ACG 768
AAT TTG GGA GCA TCA CCC GCG AGT CAC CTG TCA TCA AGG CTG TTT ACA 816
GTG TCT TGA AGG AGG CAG AGC ATC GCA GCA CAT TTT ATT TGC CGT ACT 864
GGG ACT TGC CCC TGG CGT CCC TGG TCG TGC CCC GCC AGC GGA GAT TTC 912
AGA GAG ACA TCC AGG TAC TGA ATG AGT GTC TGG ATG AGC TCA TTG AGC 960
AGG CCC GCA GGA TCA GCG AGC CCG AAG AGA TCG AAG CGC TGC AGC AGC 1008
GGG ACT ACA GCA AGG TCA AGG ATG CCA GCC TGC TGC GCT TCC TTG TGG 1056
ATG CGG GGG ATG CTG ACC TGG GTG CTC GCC AGC TGA GGG ACG ACC TGA 1104
TGA CCA TGC TGA TTG CGG GCC ACG AGA CCA CTG CGG CTG TGC TGA CTT 1152
GGA CGC TGT ACT GCC TGA CAA AGA GCC CCT CAC ACA TGG CTG AGA TCC 1200
AGG CCG AGG TGG ATG CCG CGC TGG GGG ACA GCC ACC CAG ACC TGG ACG 1248
ACA TCG GGC GTC TCC CCA AAA CAC GAG CAG CGC TGG CAG AGT CCT TGC 1296
GGC TC                                                          1301                                                                    

<210> SEQ ID NO: 57
<211> 534
<212> Amino Acid Sequence
<213> Parachlorella kessleri

Met	Ala	Gly	Gly	Gly	Val	Val	Val	Val	Ser	Gly	Arg	Gly	Leu	Ser
1				5					10					15
Thr	Gly	Asp	Tyr	Arg	Gly	Gly	Leu	Thr	Val	Tyr	Val	Val	Met	Val
				20					25					30
Ala	Phe	Met	Ala	Ala	Cys	Gly	Gly	Leu	Leu	Leu	Gly	Tyr	Asp	Asn
				35					40					45
Gly	Val	Thr	Gly	Gly	Val	Val	Ser	Leu	Glu	Ala	Phe	Glu	Lys	Lys
				50					55					60
Phe	Phe	Pro	Asp	Val	Trp	Ala	Lys	Lys	Gln	Glu	Val	His	Glu	Asp
				65					70					75
Ser	Pro	Tyr	Cys	Thr	Tyr	Asp	Asn	Ala	Lys	Leu	Gln	Leu	Phe	Val
				80					85					90
Ser	Ser	Leu	Phe	Leu	Ala	Gly	Leu	Val	Ser	Cys	Leu	Phe	Ala	Ser
				95					100					105
Trp	Ile	Thr	Arg	Asn	Trp	Gly	Arg	Lys	Val	Thr	Met	Gly	Ile	Gly
				110					115					120
Gly	Ala	Phe	Phe	Val	Ala	Gly	Gly	Leu	Val	Asn	Ala	Phe	Ala	Gln
				125					130					135
Asp	Met	Ala	Met	Leu	Ile	Val	Gly	Arg	Val	Leu	Leu	Gly	Phe	Gly
				140					145					150
Val	Gly	Leu	Gly	Ser	Gln	Val	Val	Pro	Gln	Tyr	Leu	Ser	Glu	Val
				155					160					165
Ala	Pro	Phe	Ser	His	Arg	Gly	Met	Leu	Asn	Ile	Gly	Tyr	Gln	Leu
				170					175					180
Phe	Val	Thr	Ile	Gly	Ile	Leu	Ile	Ala	Gly	Leu	Val	Asn	Tyr	Ala
				185					190					195
Val	Arg	Asp	Trp	Glu	Asn	Gly	Trp	Arg	Leu	Ser	Leu	Gly	Pro	Ala
				200					205					210
Ala	Ala	Pro	Gly	Ala	Ile	Leu	Phe	Leu	Gly	Ser	Leu	Val	Leu	Pro
				215					220					225
Glu	Ser	Pro	Asn	Phe	Leu	Val	Glu	Lys	Gly	Lys	Thr	Glu	Lys	Gly
				230					235					240
Arg	Glu	Val	Leu	Gln	Lys	Leu	Cys	Gly	Thr	Ser	Glu	Val	Asp	Ala
				245					250					255
Glu	Phe	Ala	Asp	Ile	Val	Ala	Ala	Val	Glu	Ile	Ala	Arg	Pro	Ile
				260					265					270
Thr	Met	Arg	Gln	Ser	Trp	Ala	Ser	Leu	Phe	Thr	Arg	Arg	Tyr	Met
				275					280					285
Pro	Gln	Leu	Leu	Thr	Ser	Phe	Val	Ile	Gln	Phe	Phe	Gln	Gln	Phe
				290					295					300
Thr	Gly	Ile	Asn	Ala	Ile	Ile	Phe	Tyr	Val	Pro	Val	Leu	Phe	Ser
				305					310					315
Ser	Leu	Gly	Ser	Ala	Asn	Ser	Ala	Ala	Leu	Leu	Asn	Thr	Val	Val
				320					325					330
Val	Gly	Ala	Val	Asn	Val	Gly	Ser	Thr	Leu	Ile	Ala	Val	Met	Phe
				335					340					345
Ser	Asp	Lys	Phe	Gly	Arg	Arg	Phe	Leu	Leu	Ile	Glu	Gly	Gly	Ile
				350					355					360
Gln	Cys	Cys	Leu	Ala	Met	Leu	Thr	Thr	Gly	Val	Val	Leu	Ala	Ile
				365					370					375
Glu	Phe	Ala	Lys	Tyr	Gly	Thr	Asp	Pro	Leu	Pro	Lys	Ala	Val	Ala
				380					385					390
Ser	Gly	Ile	Leu	Ala	Val	Ile	Cys	Ile	Phe	Ile	Ser	Gly	Phe	Ala
				395					400					405
Trp	Ser	Trp	Gly	Pro	Met	Gly	Trp	Leu	Ile	Pro	Ser	Glu	Ile	Phe
				410					415					420
Thr	Leu	Glu	Thr	Arg	Pro	Ala	Gly	Thr	Ala	Val	Ala	Val	Val	Gly
				425					430					435
Asn	Phe	Leu	Phe	Ser	Phe	Val	Ile	Gly	Gln	Ala	Phe	Val	Ser	Met
				440					445					450
Leu	Cys	Ala	Met	Glu	Tyr	Gly	Val	Phe	Leu	Phe	Phe	Ala	Gly	Trp
				455					460					465
Leu	Val	Ile	Met	Val	Leu	Cys	Ala	Ile	Phe	Leu	Leu	Pro	Glu	Thr
				470					475					480
Lys	Gly	Val	Pro	Ile	Glu	Arg	Val	Gln	Ala	Leu	Tyr	Ala	Arg	His
				485					490					495
Trp	Phe	Trp	Asn	Arg	Val	Met	Gly	Pro	Ala	Ala	Ala	Glu	Val	Ile
				500					505					510
Ala	Glu	Asp	Glu	Lys	Arg	Val	Ala	Ala	Ala	Ser	Ala	Ile	Ile	Lys
				515					520					525
Glu	Glu	Glu	Leu	Ser	Lys	Ala	Met	Lys						
				530										
<210> SEQ ID NO: 58
<211> 3542
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CTT CAG TCC TGC ACT TCA TAC AGG GAG GGC GGG GGT GCA GCT CGT GCA 48
CGT GCG TCG AGG GGA GAG TGT CGG GGA GGT GCT GGT GCA CCC GGG CTA 96
CGA CCC GGC CAC CAT GAT CCA GAG AGT GCA CGA GGC CAC CAT GCT CAG 144
GGT CAA CGC GGG GAA CCT GCC CAT GCG ACG AGC GGA GGT CAT CAT GCA 192
GTG CTA CCG AGA GCG CAT GGC GGG GTA CAC GTG AGT GCC AGG AGG ATG 240
ACT AGG AGG GTG ATT TTG AAT GGC GGG CTG GCA TGT GCA TAG GGG GTG 288
CAC CCA CTG CCA GAT AAA AAG TAA TGC CCA GTG ATA CCT CCT CCA CTC 336
CAC ACA CAG ATA CAT GCG CTA GAC TCT CAC AAT CAT ATG ACC GTG ATG 384
GTC GCG CGC AAC TCA AGC CGG CTA TAA CGT GCA CCA GCA GAA ACG CAT 432
TCC CCC CTT GCG ACC ACC ACT TCT CCG ACG TAC ACC AGG GAG TGC AAA 480
CAC ATC CCT CGC CAG ATT TAT CTC AGC ATG GCG CTG GTT GCC TTC TGT 528
AAA AAC TGA TAA TCT AAA TCT GGA GTT TGT GAG TGG CCT CGA CCT GTT 576
CTT GAA CTC CGC CGC TGG GAG GGA GTG GTG TGC TGC TGC CCA AAC CTA 624
ATA GAT TCT CCG TCA AAA TAT GGC GGG ACC GGG GCG ATC CAA AAT CTC 672
CGA CAC AGA TTT CGC ACC TTC CGT GGC GCT TCA CCC GCG TGC TCT CGC 720
TGG GTC TCG CAG AAT AAC GTG GGC GAC AAA CAC ACC TGC CAA AAG AAT 768
TAT AAG AAT TAA TAA ATT AAA GGG ATT ACT GAC TGT CAT CGC CCG TCA 816
CCC CGT GGT GAG GTA CCC GAG TCG AAG CTG AAC CCC GTC ATC CCG GCG 864
GCT GCG CAA GGT ACG GAT GCG AGG ATG GGG GGC CCG GAT CTA TAA ATC 912
TTG CCC GTA TTC GCA GTG CAT GTC GGC ACC ACA CGG GCT CGC ACC ATC 960
TGC CGT GGG GAT CAC GGG CCC GCG GCC TTT GGT TGA CGT AAG CGG TTT 1008
CTT TGC TGA GTC AGA CTA ACG TTC ACG AAG TGC GGC TGC GAG GTA TAC 1056
GTC TCA CGA GCC CCT GTA CGG CCA CAA AAC CTC CAC TCA GGG GAA ACA 1104
ACT TCG AGT TGT CAC ATT TAT CTA CCC TCC CGA CAG AGT AAT CAT GGC 1152
CGG AGG AGG CAT GGT GGC CGT GGG TGC GGA GCA CCT CCA CGC TGA GGC 1200
GTA TGG AGG CCG CCT CAC CGG CTA CGT CAT CTT TGT TGC AAT CCT GTC 1248
GGC ATG TGG AGG CCT GCT GTT CGG CTA CGA TAT CGG TGT CAC CGG TGG 1296
TGT GAC CGC CAT GGC CGG CTT CCA GCG GAA GTT CTT CCC CGA CGT GTA 1344
TGA GCG TAC CGT TCT GGG CAT TTC CGA CAC AAG CCC CTA CTG CAA GTA 1392
CGA CAA CCA GGT GCT GCA GAC CTT CAC CTC CAC CAT GTT CCT GGC TGG 1440
CAT GTT CTC CTC CTT CTT TGC GGG CAC CAT GTG CCG CAA GCT GGG GCG 1488
CAA GGC CAC CAT GTT CAC CTC CGC CTG CCT GTT CAT CCT GGG CGC GGG 1536
CCT GCA GGC CGG CGC CCA GAA CCT GGC CAT GCT CTT TGT GGG CCG CGC 1584
CTT CCT GGG ATT CGG CAT CGG CTT CGC CAA CAC CGT GGT CCC CCT GTA 1632
CCT CTC CGA GGT CGC GCC CTT CAA CTT CCG TGG CGG CCT GAA CAT GCT 1680
GTT CCA GCT TTT CAC CAC CAT CGG CAT CCT CAT CGC CGG CCT GGT CAA 1728
TTA TGG GGT CCA GGA CTG GTG GGC AGG CTG GCG CCT GTC CCT GGG CCT 1776
GGC CGC CGC GTT TGC CCT CGG CCT CCT GGT CGG CTC CAT CGT GCT GCC 1824
TGA GTC CCC CAA CTC GCT CAT CGA GCG TGG GCA GGT GGA GAA GGG GCG 1872
TGC CGT GCT GGA GCG CCT GCG TGG CAC CAA GGA CGT GGA GGC GGA GAT 1920
GGA GGA CAT CCT GGA GGC CAC GGA GCT GGC CAG CCT CGT CTC CCT GCG 1968
CCA GTC CTA CAC CAT GCT GCT GAC CAA GGC CTA CCG GCC CCA GCT CAT 2016
CCT GTC CTG CCT GAT CCC CTT CTT CCA GCA GTT CAC CGG TAT CAA CGC 2064
CAT CAT GTT CTA CGT CCC CGT CAT CTT CAA CTC CCT GGG GAG TGG GCG 2112
CCG CGC GTC GCT GCT GAA CAC CAT CAT CAT CAA CGC CGT CAA CTT TGT 2160
GGC CAC TTT CCT CTC CAT CCT GAC CGT GGA CCG CCT GGG TCG CCG CTT 2208
CTG GTT CCT GGA GGG CGG TGT GCA GAT GTT CCT GGC CCA GAT CGT GAC 2256
CGG TGT CGT GCT GGC CGT GCA GTT TGG CAA GTA CGA GGA CGG CGA CCT 2304
GCC CAC CGC CAC CGC CAT CGG CGT GCT CAT CGT GGT CTG CGT CTT TGT 2352
CTC CGC CTT TGC CTG GTC CTG GGG CCC ACT GGG CTG GCT GGT GCC TTC 2400
GGA GAT CCA CAA CCT GGA GAC GCG CGC GGC GGG GAT GTC GGT GGC CGT 2448
CAC CAC CAA CTT CCT CTT CTC CTT TGT CAT CGG CCA GGC CTT CCT CTC 2496
CAT GCT CTG CGC CAT GAA GTG GGG CGT GTT CAT CTT CTT TGC CTC CTT 2544
CGT GGC CAT CAT GAC CGT CTT CAT CTT CTT CAT GAT GCC CGA GAC CAA 2592
GGG CAT CCC CGT GGA GCG CGT GCC CAT CAC CTT TGC GCG GCA CAT GGC 2640
CTG GCG CCC CAT CAT GAG CGC TGA GGT GGC GCA GCA GAT CAT CGA CCG 2688
CGA CGC GAC GCG CAC CGC CTC ACG CGC CGC CAC CAA GGC TGC TCG CGA 2736
CGC AGA CGC AGG GGA GAA GGC CAA GGA GGG GCC CAT TGG CGG TAC CGC 2784
CAT CTG AGC GGC GCA GCG GCA TTC TGT CGC GTC GCG GCA GAG AGG GCA 2832
GGA TTC TTC CCT GAG CCT GAA GCC GGC GCG CTA CTG CCA GCA CGG GCT 2880
GGG ACG GTT TTA TGA TTT TCC AAG CGC GAC CGG TGA GGA TCC TTC CCC 2928
TTC TTC CCT GCG ACA CGC CCC ACA CGC GCA CAC ACC CCT TCA CCT CTC 2976
TAC TAC TAG TAT CTA AGA GAG CAT GCA CGC CCA CCG CCG CGT CAT TTT 3024
GGA CGA GTG CAG TCG TGG CCG GCG CGG CTG TGG CCT CTG CGT TTT GCC 3072
ACC AGT CAC TTT AAA AAC TCA ACT GCC TGG CCG GAC GAT GGC TTG GGT 3120
CTG GCT GGA GTG GCA TGT CAT CGC CCC ACC CCC ATC CCC TTC CGC CCC 3168
TCC CGC CAG CGC ATG AGA TTT GAA AAA TGG ACT CGA CCC ATG CTC CCA 3216
GTT GAA CCG CCT GTG TGT GCC CTT GTC TCT TGT TCT GAG CGG TAC GAG 3264
TCG AGG CGC ACT GAC CCC CCC CCT ACT AGC AAG CCC CTC CGT GCA GGC 3312
AGG GCG CCA CCC CCG CCC CTG CCT TGC CAT GCC AAA CTC GAC ACA GTA 3360
CGC CTG CCC TGC AAA AAC TCA TCC CCT GCA TGC GCC CGT CTT CCC GCT 3408
GTA CAG TCG AGC TGG ATG ATG GGT GAC TGA GAG AAG TGG GAA CAG TGA 3456
CTG GGA AGT GTT CAG AGC GGT TTC AGA GTA CGT GTG CAC CGA GAC GCT 3504
CGC TTT GTG GGG TGG CGT GAC TCG AGT CTC CAA CCA AG 		3542

<210> SEQ ID NO: 59
<211> 465
<212> Amino Acid Sequence
<213> Chlorella variabilis

<400>
Met	Glu	Ile	Arg	Ser	Leu	Ile	Val	Ser	Met	Asn	Pro	Asn	Leu	Ser
1				5					10					15
Ser	Phe	Glu	Leu	Ser	Arg	Pro	Val	Ser	Pro	Leu	Thr	Arg	Ser	Leu
				20					25					30
Val	Pro	Phe	Arg	Ser	Thr	Lys	Leu	Val	Pro	Arg	Ser	Ile	Ser	Arg
				35					40					45
Val	Ser	Ala	Ser	Ile	Ser	Thr	Pro	Asn	Ser	Glu	Thr	Asp	Lys	Ile
				50					55					60
Ser	Val	Lys	Pro	Val	Tyr	Val	Pro	Thr	Ser	Pro	Asn	Arg	Glu	Leu
				65					70					75
Arg	Thr	Pro	His	Ser	Gly	Tyr	His	Phe	Asp	Gly	Thr	Pro	Arg	Lys
				80					85					90
Phe	Phe	Glu	Gly	Trp	Tyr	Phe	Arg	Val	Ser	Ile	Pro	Glu	Lys	Arg
				95					100					105
Glu	Ser	Phe	Cys	Phe	Met	Tyr	Ser	Val	Glu	Asn	Pro	Ala	Phe	Arg
				110					115					120
Gln	Ser	Leu	Ser	Pro	Leu	Glu	Val	Ala	Leu	Tyr	Gly	Pro	Arg	Phe
				125					130					135
Thr	Gly	Val	Gly	Ala	Gln	Ile	Leu	Gly	Ala	Asn	Asp	Lys	Tyr	Leu
				140					145					150
Cys	Gln	Tyr	Glu	Gln	Asp	Ser	His	Asn	Phe	Trp	Gly	Asp	Arg	His
				155					160					165
Glu	Leu	Val	Leu	Gly	Asn	Thr	Phe	Ser	Ala	Val	Pro	Gly	Ala	Lys
				170					175					180
Ala	Pro	Asn	Lys	Glu	Val	Pro	Pro	Glu	Glu	Phe	Asn	Arg	Arg	Val
				185					190					195
Ser	Glu	Gly	Phe	Gln	Ala	Thr	Pro	Phe	Trp	His	Gln	Gly	His	Ile
				200					205					210
Cys	Asp	Asp	Gly	Arg	Thr	Asp	Tyr	Ala	Glu	Thr	Val	Lys	Ser	Ala
				215					220					225
Arg	Trp	Glu	Tyr	Ser	Thr	Arg	Pro	Val	Tyr	Gly	Trp	Gly	Asp	Val
				230					235					240
Gly	Ala	Lys	Gln	Lys	Ser	Thr	Ala	Gly	Trp	Pro	Ala	Ala	Phe	Pro
				245					250					255
Val	Phe	Glu	Pro	His	Trp	Gln	Ile	Cys	Met	Ala	Gly	Gly	Leu	Ser
				260					265					270
Thr	Gly	Trp	Ile	Glu	Trp	Gly	Gly	Glu	Arg	Phe	Glu	Phe	Arg	Asp
				275					280					285
Ala	Pro	Ser	Tyr	Ser	Glu	Lys	Asn	Trp	Gly	Gly	Gly	Phe	Pro	Arg
				290					295					300
Lys	Trp	Phe	Trp	Val	Gln	Cys	Asn	Val	Phe	Glu	Gly	Ala	Thr	Gly
				305					310					315
Glu	Val	Ala	Leu	Thr	Ala	Gly	Gly	Gly	Leu	Arg	Gln	Leu	Pro	Gly
				320					325					330
Leu	Thr	Glu	Thr	Tyr	Glu	Asn	Ala	Ala	Leu	Val	Cys	Val	His	Tyr
				335					340					345
Asp	Gly	Lys	Met	Tyr	Glu	Phe	Val	Pro	Trp	Asn	Gly	Val	Val	Arg
				350					355					360
Trp	Glu	Met	Ser	Pro	Trp	Gly	Tyr	Trp	Tyr	Ile	Thr	Ala	Glu	Asn
				365					370					375
Glu	Asn	His	Val	Val	Glu	Leu	Glu	Ala	Arg	Thr	Asn	Glu	Ala	Gly
				380					385					390
Thr	Pro	Leu	Arg	Ala	Pro	Thr	Thr	Glu	Val	Gly	Leu	Ala	Thr	Ala
				395					400					405
Cys	Arg	Asp	Ser	Cys	Tyr	Gly	Glu	Leu	Lys	Leu	Gln	Ile	Trp	Glu
				410					415					420
Arg	Leu	Tyr	Asp	Gly	Ser	Lys	Gly	Lys	Val	Ile	Leu	Glu	Thr	Lys
				425					430					435
Ser	Ser	Met	Ala	Ala	Val	Glu	Ile	Gly	Gly	Gly	Pro	Trp	Phe	Gly
				440					445					450
Thr	Trp	Lys	Gly	Asp	Thr	Ser	Asn	Thr	Pro	Glu	Leu	Leu	Lys	Gln
				455					460					465
Ala	Leu	Gln	Val	Pro	Leu	Asp	Leu	Glu	Ser	Ala	Leu	Gly	Leu	Val
				470					475					480
Pro	Phe	Phe	Lys	Pro	Pro	Gly	Leu							
				485										

<210> SEQ ID NO: 60
<211> 2001
<212> Genomic Sequence
<213> Auxenochlorella protothecoides

<400>
CCAGCAGGGA GAGGAGTGAG ATCCTTCGTC GCGCCCGCGC CCTGGCCCTC 50
CCAGATCACT TTGGCCTCAC GAGGAAGTCA CGGGGGTGTC ACACATGCGT 100
CTTCCATCGA GGTGCACCAT TACAGCCCAC GTGAAATCTT ATTTTATCGA 150
CATGATGCTC CTGAATGCCG CATGTGCGCG TCCCGCACTG TTCAATGGTC 200
TCTCCCCAAG TGTGTCAGGG TGGCGCACAC CCGAAGTGAG GTTTTGGGGA 250
GCGGGACGAT GCCCTCGACA GCGCAGGCTA GCATCCACAG CAGCACATAG 300
CAATGGATTT GATCCAGATG ATACGATTCC CCGCACCACC ACCCCCCACT 350
CCGGCTACCA TTTCGACGGA TCGGGTGAGT GGGAGAGCAT GTGTGATGCA 400
CACTGACCTC CTCAACTGGT GCGGTTGGTC CCGCCTCACC GTGCAGGTGT 450
GGCGGCCACG AAGGCAAAGT TGTACTGAAC AAGATCATCT CCAATCCCGC 500
TGCATGGCAT GCCTGCTCCC ACTTCCATCA CCCAGCCCCT CTACAGCCAC 550
CCCACCCCCT TTCCCCCTCC CCCAGACCGC CGCTTCTTCG AGGGGTGGTA 600
CTGGCGCCTG ACCCTGCCTG GCCGCGGCCA GAGCTTTGCC GTCATCTACT 650
CGGTGGAGGA CCCCGGCTCC CAGCGCCCCA CCAGCGGCGT GGGCGCCCAG 700
ATCATGGGCC CAAACGACAG CTACCTGCTG CAGTACGAGG AGGGCACCGC 750
CGGCTTCTGG GCGGACCCGC ATGAGCTGGC GCTGGGCAAG GTCTTCGCCG 800
CCAGCCCCGG CTGCGCCCCC GCGGCCGCGC CCTGCCCCCC GGAGCGCTTT 850
GAGGCCCTCG TCTCCGAGGG CTTCCAGGTG CAGGGAGGGC GGCACCAGGG 900
GCGCATCGTG TCACGGGAGG CCGGGGTCGT CCCCGGCCCC GCCCCCAGCG 950
TTGCCTCCGC GCGCTGGGAC TTCCAGGTCA CCCCCAGGCT GGGCTGGGGC 1000
AGCCCCGGGC AGCGCCAGCT CTCCACCGCC GGCTGGCTGG CCGCGGTGCC 1050
CATCTTCGAG CCGCACTGGC AGGTGCTCAT GGCGCACGGC TCCGCCTCGG 1100
GCTGGATCGA GTGGGGAGGG GAGAGGTACG CCTTCCAGGG CGTGCCCACC 1150
TACGCGGGTG AGCGCCTCGG CATCGGGAGG GTTGCTGGTG GAGGGGCGGG 1200
CTGGTGTTGC CCCCAAGAGG TTGAAGTTGG GTGGCGGTCT CGCATGGCAC 1250
GCCCCGGAGC AGGGTGCCGT TCTATTGAGT CAATGAGTGT GCAGTCATCA 1300
GTCTGCTGCT CTCTTCCCCG GCCCCTATCG CGTGCCCTCC AACACCCAAT 1350
CCCACGCACG ACCCCTTGCT CCCCACCCGC CAGAGAAGAA CTGGGGGGGC 1400
GGCTTCCCCA GCCGCTGGTG CTGGGTGCAG TGCAACAGCT TTGACGGCCA 1450
CCCAGGCACC AGCGTCACAG CCGTCTGTGA GTGCGGGGCG GGGGAAGCTT 1500
GCCTGCTCCC TTGCATTCGT CGCTCCTGAT GACAGCAGGC CTCGTGTGCC 1550
TCCCATTTAT TTCCCGATTG CCACCCCCCG TGGCCGATGA GCGGCATGCA 1600
CCAGGCTGTT TGGCTGGCCC CACCGTCTGC TCCAGGTGCC CACACTCCAC 1650
ACACCGTGGG CATGGGTCCC CCCCCACGCC GCCCCACAGG TGCCCGGCGT 1700
GGGGTCCGCC TCGTGCCGGG GCTGGAGGAG GACGTGGGCC TGATCGGGCT 1750
GCACGCGGCC GGCGACTTCC ACGAGTTCGT GCCCAACAAG GGCGACATCG 1800
GGTGGGAGGT GGAGCCGTGG GGCACATGGC GCATCCACGG CGAGAACGCG 1850
GAGCACACGG CGCTGGTGGA GGCCACCTGC CCCCCCGAGG CGGGCACCGT 1900
GCTGCGGGCC CCCACCGCCG ACCTGGGACT GGCGCCCTTC TGCCGCGACA 1950
GCTTCTCGGG CAGGGTGAGG CGCTTGCGAT CGAGAGGGTG GGGGGAGGGG 2000
G 						       2001

<210> SEQ ID NO: 61
<211> 411
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ala	Val	Ala	Arg	His	Phe	Thr	Ala	Gln	Pro	Trp	His	Ile	Ala
1				5					10					15
Pro	Ser	Arg	Tyr	Asn	Ala	Ala	Arg	Cys	Arg	Gln	Arg	Arg	His	Phe
				20					25					30
Leu	Val	Ala	Gly	Ala	Gln	Gln	Gln	Glu	Gly	Leu	Val	Ala	Ala	Ser
				35					40					45
Thr	Ser	His	Thr	Ala	Glu	Ala	Ala	Ala	His	Leu	Ala	Gln	Pro	Ile
				50					55					60
Ala	Asp	Ile	Ala	Ala	Arg	Ala	Ala	Ala	Ala	Pro	Pro	Pro	Gln	Ala
				65					70					75
Asp	Thr	Ala	Ala	Ala	Ala	Pro	Leu	Leu	Gly	Pro	Thr	Pro	Leu	Ala
				80					85					90
Ala	Leu	Leu	Leu	Gly	Gly	Ala	Leu	Leu	Ala	Gly	Tyr	Gly	Ile	Lys
				95					100					105
Lys	Val	Tyr	Asp	Thr	Pro	Ser	Arg	Ser	Tyr	Asp	Gln	Asn	Val	Gly
				110					115					120
Gln	Glu	Tyr	Asp	Ala	Trp	Thr	Glu	Glu	Gly	Val	Leu	Glu	Tyr	Tyr
				125					130					135
Trp	Gly	Glu	His	Ile	His	Leu	Gly	His	Tyr	Ser	Glu	Glu	Glu	Arg
				140					145					150
Gln	Arg	Gly	Tyr	Lys	Lys	Lys	Asn	Phe	Ile	Gln	Ala	Lys	Tyr	Asp
				155					160					165
Phe	Val	Glu	Glu	Met	Leu	Arg	Trp	Ser	Gly	Trp	Ala	Cys	Ala	Asp
				170					175					180
Val	Ser	Gly	Asp	Gly	Gly	Val	Pro	Lys	Ile	Leu	Asp	Val	Gly	Cys
				185					190					195
Gly	Ile	Gly	Gly	Thr	Ser	Arg	Tyr	Leu	Ala	Ala	Lys	Phe	Pro	Gln
				200					205					210
Ala	Ser	Val	Thr	Gly	Ile	Thr	Leu	Ser	Pro	Ser	Gln	Val	Gln	Arg
				215					220					225
Gly	Thr	Glu	Leu	Ala	Ala	Glu	Arg	Gly	Leu	Ser	Asn	Ala	Lys	Phe
				230					235					240
Gln	Val	Met	Asp	Ala	Leu	Ser	Met	Asp	Phe	Pro	Asp	Asn	Ser	Phe
				245					250					255
Asp	Leu	Val	Trp	Ala	Cys	Glu	Ser	Gly	Glu	His	Met	Pro	Asp	Lys
				260					265					270
Lys	Ala	Tyr	Val	Asp	Glu	Met	Val	Arg	Val	Leu	Lys	Pro	Gly	Gly
				275					280					285
Thr	Ile	Val	Ile	Ala	Thr	Trp	Cys	Gln	Arg	Asp	Glu	Thr	Pro	Glu
				290					295					300
Ala	Pro	Phe	Ser	Glu	Arg	Asp	Arg	Glu	Arg	Leu	Thr	Phe	Leu	Tyr
				305					310					315
Glu	Glu	Trp	Ala	His	Pro	Tyr	Phe	Val	Ser	Lys	Glu	Glu	Tyr	Gly
				320					325					330
Arg	Ile	Met	Glu	Ala	Thr	Gly	Glu	Leu	Ser	Gly	Val	Gly	Leu	Ala
				335					340					345
Asp	Trp	Thr	Pro	Gln	Thr	Ile	Asp	Ser	Trp	Arg	His	Ser	Ile	Trp
				350					355					360
Val	Gly	Val	Trp	Asp	Pro	Trp	Ile	Val	Val	Leu	Lys	Gly	Pro	Arg
				365					370					375
Met	Trp	Tyr	Lys	Val	Thr	Arg	Glu	Ile	Val	Thr	Leu	Glu	Arg	Met
				380					385					390
His	Arg	Ala	Phe	Ala	Asp	Gly	Leu	Met	Glu	Tyr	Gly	Met	Met	Lys
				395					400					405
Ala	Thr	Lys	Lys	Lys	Ala									
				410										

<210> SEQ ID NO: 62
<211> 786
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CGG TAA CAT GTC GAG TCA CGG GGT CAG AGC AGC ATA GCG TGT CAC ACA 48
TGG AGA GAT CGA GCA AGA GGA GTA GTG ATC GGG TGC CCT TTG CCG CGG 96
TGA CGC CCA CTG CCA GGG CGG GGC CTG TCC TGC TTG GCA CTG AGG TCC 144
CCA CTC CCC TGG TGG CCG CTG TGG CCC TTG GGG GAG CCG CTC TGG GCC 192
TTC GGG CCA TAA AAA AGG TGT TCG ACA CCC CGT CTC GAA CCT ACG ATG 240
GCC AGA ACG TGG GCA AGG AGT ACG ATG CAT GGA CCA GGG AGG GCA TCC 288
TGG AGC ACT ACT GGG GCG AGC ACA TCC ACC TTG GGT ACT ACA CCG AGG 336
AGG AGC AAG CAG CGG GGT ACA AGA AGA AGG ACT TCA TCC AGG CCA AGC 384
ACG ACT TTG TGG ACC GCA TGA TCC AAT TCG CGG GGG TGT CCA ACC CCG 432
GCA GCA TCC TGG ATG TGG GCT GCG GCA TTG GCG GCA CCA CGC GGA TGC 480
TGG CTT CCC GCT TTC CCG GTG CCA AGG TCG CTG GCA TCA CGC TGT CGC 528
CCA ACC AAG TCG CCC GGG GTA CAG CCC TGG CCG CCG AGA AGG GGC TCG 576
CAA ACT GCG AAT TCA AGG TGA TGG ATG CGC TGA AGA TGG ACT ACC CCG 624
ACA ACT CCT TCG ATG TGG TGT GGG CTT GCG AGT CCG GGG AGC ACA TGC 672
CCG ACA AGG GGG CCT ACG TGC GTG AGA TGG TGC GTG TGC TGA AGC CCG 720
GGG GCA CCC TCG TCA TCG CCA CCT GGT GCC AGC GGG AGG TGA CGC CGG 768
CCA ACC CCT TCA CCG CCA						786

<210> SEQ ID NO: 63
<211> 332
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ala	Cys	Cys	Ser	Trp	Ile	Ala	Ser	Gln	Arg	Ser	Ser	Leu	Val
1				5					10					15
Gln	Gln	Pro	Ala	Gly	Pro	Gln	Gln	Arg	Ser	Arg	Arg	Arg	Arg	Gly
				20					25					30
Leu	Ala	Ala	Arg	Ala	Gly	Leu	Leu	Asp	Thr	Leu	Ile	Lys	Pro	Ile
				35					40					45
Thr	Ser	Ser	Gly	Glu	Arg	Lys	Pro	Leu	Lys	Glu	Gly	Ile	Ala	Asn
				50					55					60
Phe	Tyr	Asp	Glu	Ser	Ser	Gln	Leu	Trp	Glu	Ser	Met	Trp	Gly	Glu
				65					70					75
His	Met	His	His	Gly	Tyr	Tyr	Pro	Lys	Gly	Gly	Ala	Pro	Lys	Ser
				80					85					90
Asn	Gln	Gln	Ala	Gln	Leu	Asp	Met	Ile	Glu	Glu	Ser	Leu	Arg	Trp
				95					100					105
Ala	Gly	Ala	Glu	Gly	Ala	Thr	Lys	Met	Val	Asp	Val	Gly	Cys	Gly
				110					115					120
Ile	Gly	Gly	Ser	Ser	Arg	His	Ile	Ala	Arg	Lys	Phe	Gly	Cys	Glu
				125					130					135
Ser	Arg	Gly	Ile	Thr	Leu	Ser	Pro	Val	Gln	Ala	Ala	Arg	Ala	Asn
				140					145					150
Glu	Ile	Ser	Arg	Gln	Gln	Gly	Phe	Gly	Asp	Arg	Leu	Ser	Phe	Gln
				155					160					165
Val	Ala	Asp	Ala	Leu	Asp	Gln	Pro	Phe	Pro	Asp	Gly	Glu	Phe	Asp
				170					175					180
Leu	Val	Trp	Ser	Met	Glu	Ser	Gly	Glu	His	Met	Pro	Asp	Lys	Pro
				185					190					195
Arg	Phe	Val	Gly	Glu	Leu	Ala	Arg	Val	Cys	Ala	Pro	Gly	Gly	Arg
				200					205					210
Ile	Ile	Val	Val	Thr	Trp	Cys	His	Arg	Val	Leu	Ala	Pro	Gly	Glu
				215					220					225
Ala	Gly	Leu	Ser	Gly	Asp	Glu	Gln	Ala	Leu	Leu	Asp	Arg	Ile	Cys
				230					235					240
Glu	Ala	Tyr	Tyr	Leu	Pro	Ala	Trp	Cys	Ser	Val	Ala	Asp	Tyr	Glu
				245					250					255
Gln	Leu	Phe	Arg	Glu	Gln	Gly	Leu	Thr	Asp	Ile	Arg	Thr	Thr	Asp
				260					265					270
Trp	Ser	Glu	Glu	Val	Ala	Pro	Phe	Trp	Gly	Glu	Val	Ile	Lys	Ser
				275					280					285
Ala	Phe	Ser	Ala	Glu	Gly	Val	Ser	Gly	Leu	Leu	Lys	Ala	Gly	Trp
				290					295					300
Thr	Thr	Ile	Lys	Gly	Ala	Leu	Val	Met	Pro	Leu	Met	Ala	Gln	Gly
				305					310					315
Phe	Arg	Met	Gly	Leu	Val	Lys	Phe	Val	Leu	Ile	Thr	Gly	Arg	Lys
				320					325					330
Pro	Glu													

<210> SEQ ID NO: 64
<211> 1156
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CGC GCG AGG CGG GCC TGG GGG ACC GCG CCG CAT TCC AGG TGG CGG ACG 48
CCC TGA GCC AGC CCT TCC CGG ATG GCT CCT TTG ACC TGG TGT GGT CGC 96
TGG AGA GCG GGG AGC ACA TGC CCG AGA AGG AGA CCT TCG TGA GGG AGC 144
TGG CGC GCG TGG CGC TGC CGG GGG GGC GCA TCA TCA TCG TGA CCT GGT 192
GCC ACA GGA ACC TGG CAC CTG GAG AAA CTG CTT TGA CCC CGG AGG AAC 240
AGA CGC TGT TGG ACA GGC TGT GCG AGG CCT ACT ACC TCC CAG CCT GGT 288
GCT CTC TGG CCG ACT ACG AGC GCC TCT TTG CTG CAC ACG GAA TCA GGG 336
ATG TGA AGA CGG CGG ACT GGT CCG TGG AGG TGC AGC CCT TCT GGG GCC 384
AGG TGA TCA AGA GCG CGC TGA CCA CGC AAG GCG TCG CCG GCC TGT TGA 432
AAG CGG GGT GGA CCA CGA TCA AGG GGG CAC TGG TCA TGC CGC TCA TGG 480
CCA GGG GCC TGT CTA TGG GCC TCA TCA AAT TCG TGC TGA TCA CTG GCG 528
TCA AGG CAG AGA CCT GAA CTG GTG CGG GGA CAG TGT TTC CCA CCT GTC 576
AGT CTG CCC TGA TTC TGG TTG CCG TTG CTT TTC TAC CCA GGT TTG CAC 624
CCC ACG TTT CTG CAA AAA CTT GTC GAG TAG CAT TCT TGG GCT GTA AGA 672
TTG ATC CTT GTA CTG CAT GCA CCG CTG GCA TTT CAG TCC AAA CAG GTG 720
CAG CTG GAT GCA AGC AGG GCA CTC CTC TGC ATG ATG ATG CAA CCG GCG 768
CGG TGC AAC TGG GAG GAT ACC AGC ACA TGG GAG TAC CTT GAT GTC CAA 816
CAC ATG CTG AAG GAG TTC TGT TTC CGA TGA CGG TAC CCG AGG CTC GAC 864
AGC CAG CCG GTT TAT GCA GTG CAG CCT GCA CGC CTC TGC ACC TAC TTT 912
ATA TTC ATG ATC ACA GCT ATG CAG CGT CAT CCG CCA TTG ATC TAC TCG 960
TCC GTC CTC GTC TGG TGT GCC TCC TTC CTT GCC AGA AAG GAT GAC AGC 1008
ATA TAG GCA ATG CTC TCC ACC GTG ACC ATG GGG TTC ACG CCT GTG GCT 1056
GTG GGG AAG GTG GAG GCG TCC ATC ACA TAC AGT CCT GCC ACG TCC CAG 1104
CTC TGG CCC TCG GCA TCC ACC ACC GAG GCG GCG GGG TCG GTG CCC ATC 1152
CGC G 								1156

<210> SEQ ID NO: 65
<211> 499
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Thr	Ala	Ser	Thr	Leu	Ala	Cys	Ser	Gly	Tyr	Ala	Arg	Gly	Thr
1				5					10					15
Ser	Ser	Leu	Ala	Ser	Glu	Arg	Arg	Pro	Arg	Ser	Phe	Ala	Arg	Ser
				20					25					30
Arg	Pro	Ser	Arg	Cys	Val	Ala	Asp	Ala	Ala	Gly	Glu	Gly	Gly	Ser
				35					40					45
Leu	Ser	Asn	Gly	Thr	Ser	Asn	Gly	Phe	Pro	Ala	Ala	Ala	Ala	Ala
				50					55					60
Gly	Ala	Gly	Gly	Leu	Arg	Ala	Lys	Ala	Thr	Gly	Pro	Ala	Val	Cys
				65					70					75
Asp	Pro	Cys	Ser	Ser	Asp	Pro	Ala	Ala	Ala	Pro	Arg	Ser	Ala	Val
				80					85					90
Gln	Trp	Pro	Gln	Lys	Ser	Glu	Leu	Tyr	Ile	Leu	Arg	Ser	Asp	Gly
				95					100					105
His	Ser	Cys	Thr	Arg	Glu	Thr	Val	Gln	Pro	Ser	Gly	Asn	Leu	Gln
				110					115					120
Phe	Ala	Cys	Pro	Ser	Leu	Gln	Gln	Gln	Leu	Leu	Val	Trp	Lys	Thr
				125					130					135
Arg	Pro	Gln	Arg	Val	Met	Val	Leu	Lys	Lys	Leu	Gly	Asp	Glu	Leu
				140					145					150
Met	Glu	Glu	Tyr	Val	Asp	Val	Leu	Arg	Tyr	Leu	Gly	Glu	Glu	Leu
				155					160					165
Gly	Met	Arg	Val	Val	Val	Glu	Pro	His	Asp	His	Ala	Val	Leu	Lys
				170					175					180
Gly	Leu	Cys	Met	Gly	Trp	Val	Asp	Thr	Tyr	Gln	Glu	Arg	Asp	Leu
				185					190					195
Gly	Glu	Leu	His	Ser	Cys	Val	Asp	Phe	Ile	Val	Cys	Leu	Gly	Gly
				200					205					210
Asp	Gly	Leu	Leu	Leu	His	Ala	Ala	Ser	Leu	Phe	Gly	Asn	Ala	Leu
				215					220					225
Pro	Pro	Ile	Ile	Ser	Phe	Lys	Leu	Gly	Ser	Leu	Gly	Phe	Leu	Thr
				230					235					240
Thr	His	Asn	Tyr	Val	Asp	Tyr	Arg	Arg	His	Leu	Arg	Asn	Val	Val
				245					250					255
His	Gly	Cys	Arg	Glu	Leu	Ala	Ser	Cys	Glu	Leu	Val	Ser	Ser	Ala
				260					265					270
Asp	Gly	Arg	Pro	Leu	Arg	Gly	Val	His	Ile	Thr	Leu	Arg	Met	Arg
				275					280					285
Leu	Gln	Cys	Glu	Ile	Trp	Arg	Cys	Ala	Ala	Arg	Glu	Gly	Arg	Gly
				290					295					300
Gly	Ala	Gly	Trp	Arg	Ala	Gly	Cys	Pro	Glu	Ala	Phe	Glu	Val	Leu
				305					310					315
Asn	Glu	Val	Val	Leu	Ser	Arg	Gly	Ala	Asn	Pro	Tyr	Leu	Ser	Lys
				320					325					330
Ile	Glu	Val	Ser	Glu	Ala	Gly	Arg	Leu	Ile	Thr	Lys	Val	Gln	Ala
				335					340					345
Asp	Gly	Val	Met	Leu	Ala	Thr	Pro	Thr	Gly	Ser	Thr	Ala	Tyr	Asn
				350					355					360
Val	Ala	Ala	Gly	Gly	Ser	Met	Val	His	Pro	Ser	Val	Pro	Ala	Ile
				365					370					375
Leu	Phe	Thr	Pro	Ile	Cys	Pro	His	Ser	Leu	Asn	Phe	Arg	Pro	Val
				380					385					390
Ile	Leu	Pro	Asp	Tyr	Ala	Glu	Leu	Asp	Leu	Arg	Ile	Ala	Asp	Asp
				395					400					405
Ala	Arg	Cys	Ser	Ala	Val	Val	Cys	Phe	Asp	Gly	Arg	Asp	Ser	Arg
				410					415					420
Glu	Leu	Ala	Arg	Gly	Asp	Ser	Ile	Lys	Val	Arg	Met	Ser	Pro	Asn
				425					430					435
Pro	Val	Pro	Thr	Ile	Asn	Asn	Ala	Asp	Gln	Thr	Thr	Asp	Trp	Phe
				440					445					450
Ala	Ser	Ile	Gln	Arg	Cys	Phe	His	Trp	Ser	Glu	Arg	Ile	Glu	Gln
				455					460					465
Leu	Pro	Ile	Asn	Gly	Gln	Leu	Leu	Ala	Ala	Arg	Glu	Gln	Glu	Leu
				470					475					480
Ser	Ala	Met	Ser	Ser	Gly	Leu	Ala	Ser	Val	Gly	Ser	Ser	Met	Asp
				485					490					495
Ser	Thr	Glu	Ala											
														

<210> SEQ ID NO: 66
<211> 750
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG ACT GGG GAG CTT CTC TAC TCC GAG ACG ACC CTG CTG GGG CGT GCC 48
TCC CCC CGG CAC ACC CGT CAT GGG CCA CAT TGG ACG GGA TGC TTC ACC 96
TCC AGG AGG CCA AAG AGG GGC TGC AAT GCG TGT GTG TCC GCC AGA ACG 144
CTC AAC GAC GTG CAG CAG CCT GTC AAG CCC CAT GCC CAC CTC GCT CCC 192
AAG AGC GAT GTG TAT CTG TTG CGC TCA GAT GGA CTC TCT TGT TCC CGT 240
GAG ATC GTG GAA GCA AAC GGA AGC CTG AGC TTC GCC TGC ACT AGC TCT 288
CAG CAG CAC CTC TTA GTC TGG AAG CAG CGC CCC AAA TGT GTC ATG GTC 336
CTC AAG AAG ATC GGG GAG GAG CTG GAG GAG GAG TTT GCT GCG GTG GTG 384
GAC TTC CTC GGG CGG GAG CAG GGC TTG CTT GTC GTC GTG GAA GAT TCC 432
TGC CAT GAG TGC CTG GTG CGC CAG GGC CTG GGC AAG TGG ATC CTG CCC 480
TTC CAC CCC AGC GAG GCC TTT GGC CTG CAC CGG GCG GTG GAC TTC ATC 528
GTC AGC CTG GGG GGC GAT GGC CTG ATT CTG CAC GCT GCG CAT CTG TTT 576
GGC ACC ACC ATC CCG CCT ATC ATC TCC TTC AAG CTG GGC TCC CTG GGG 624
TTC CTC ACC TGC CAC GAC CAC CGC GAC TAT CGC CGA CAC TTG CAC GAC 672
GTC ATC CAT GGC TGC ACA GAG CTC ACC GAG TGC GGC CCC ATC AAG AAC 720
TCG TCG GGC GTG GCG CTG CAG GGC ATC CCC 			750

<210> SEQ ID NO: 67
<211> 504
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ser	Arg	Pro	Pro	Arg	Pro	Ala	Pro	Ala	Arg	Cys	Cys	Met	Leu
1				5					10					15
Met	Ala	Thr	Pro	Gly	Phe	Gly	Ser	Cys	Ala	Val	Arg	Gly	Gly	Gln
				20					25					30
Gly	Cys	Glu	Ala	Ala	Ala	Ser	Ser	Arg	Arg	Val	Glu	Glu	Ser	Cys
				35					40					45
Pro	Ala	Ser	Leu	Cys	Ala	Gly	Cys	Cys	Tyr	Val	Leu	Ser	Asp	Arg
				50					55					60
Glu	Gln	Asp	Arg	Gln	Pro	Arg	Gly	Glu	Val	Cys	Ser	Ser	Ile	Ile
				65					70					75
Leu	Gly	Gly	Gly	Ala	Gly	Thr	Arg	Leu	Phe	Pro	Leu	Thr	Lys	Ser
				80					85					90
Arg	Ala	Lys	Pro	Ala	Val	Pro	Ile	Gly	Gly	Ala	Tyr	Arg	Leu	Ile
				95					100					105
Asp	Val	Pro	Met	Ser	Asn	Cys	Ile	Asn	Ser	Gly	Ile	Ser	Lys	Ile
				110					115					120
Tyr	Ile	Leu	Thr	Gln	Phe	Asn	Ser	Thr	Ser	Leu	Asn	Arg	His	Leu
				125					130					135
Gly	Arg	Ala	Tyr	Asn	Met	Gly	Ser	Gly	Val	Arg	Phe	Gly	Gly	Asp
				140					145					150
Gly	Phe	Val	Glu	Val	Leu	Ala	Ala	Thr	Gln	Thr	Pro	Thr	Asp	Lys
				155					160					165
Glu	Trp	Phe	Gln	Gly	Thr	Ala	Asp	Ala	Val	Arg	Gln	Tyr	Ser	Trp
				170					175					180
Leu	Leu	Glu	Asp	Thr	Lys	Asn	Arg	Ala	Ile	Glu	Asp	Val	Leu	Ile
				185					190					195
Leu	Ser	Gly	Asp	His	Leu	Tyr	Arg	Met	Asp	Tyr	Met	Lys	Phe	Val
				200					205					210
Asn	Tyr	His	Arg	Glu	Thr	Asn	Ala	Asp	Ile	Thr	Ile	Gly	Cys	Ile
				215					220					225
Ala	Tyr	Gly	Ser	Asp	Arg	Ala	Lys	Glu	Phe	Gly	Leu	Met	Lys	Ile
				230					235					240
Asp	Glu	Lys	Arg	Arg	Val	Thr	Ser	Phe	Ala	Glu	Lys	Pro	Lys	Thr
				245					250					255
Gln	Glu	Ala	Leu	Asp	Ala	Met	Lys	Val	Asp	Thr	Thr	Val	Leu	Gly
				260					265					270
Leu	Thr	Pro	Glu	Glu	Ala	Ala	Glu	Lys	Pro	Tyr	Ile	Ala	Ser	Met
				275					280					285
Gly	Ile	Tyr	Val	Phe	Lys	Lys	Ser	Val	Leu	Leu	Gln	Leu	Leu	Asn
				290					295					300
Asp	Ser	Tyr	Ala	Lys	Ala	Asn	Asp	Phe	Gly	Gly	Glu	Ile	Ile	Pro
				305					310					315
Ser	Ala	Ala	Lys	Asp	His	Asn	Val	Val	Ala	Tyr	Pro	Phe	Tyr	Gly
				320					325					330
Tyr	Trp	Glu	Asp	Ile	Gly	Thr	Ile	Lys	Ser	Phe	Phe	Glu	Glu	Asn
				335					340					345
Leu	Lys	Leu	Cys	Arg	His	Pro	Ala	Thr	Phe	Glu	Phe	Tyr	Asp	Pro
				350					355					360
Gln	Ser	Pro	Ile	Tyr	Thr	Ser	Pro	Arg	Val	Leu	Pro	Pro	Ala	Thr
				365					370					375
Val	Arg	Asn	Cys	Lys	Val	Thr	Asp	Ala	Ile	Ile	Ala	Gln	Gly	Ser
				380					385					390
Phe	Val	Ser	Asp	Cys	Thr	Ile	Asn	Asn	Ala	Val	Ile	Gly	Ile	Arg
				395					400					405
Ser	Ile	Ile	Gly	Gln	Asn	Cys	Thr	Ile	Gln	Asp	Ala	Leu	Val	Met
				410					415					420
Gly	Ala	Asp	Tyr	Tyr	Glu	Ser	Asp	Asp	Gln	Arg	Ala	Thr	Leu	Leu
				425					430					435
Lys	Lys	Gly	Gly	Val	Pro	Val	Gly	Ile	Gly	Ala	Asn	Ser	Val	Ile
				440					445					450
Thr	Asn	Ala	Ile	Ile	Asp	Lys	Asn	Ala	Arg	Val	Gly	Lys	Asn	Val
				455					460					465
Lys	Ile	Val	Asn	Lys	Glu	Gly	Val	Thr	Glu	Gly	Thr	Arg	Glu	Ala
				470					475					480
Glu	Gly	Ile	Tyr	Ile	Arg	Ser	Gly	Ile	Val	Val	Ile	Asp	Lys	Gly
				485					490					495
Ala	Leu	Val	Pro	Asp	Asn	Thr	Thr	Ile						
				500										

<210> SEQ ID NO: 68
<211> 1340
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
AGC CCC GCC CGC ACG CAT CAG ATG CGG GTG CCT GCC TTG ATG GTG GCG 48
TTG CGG GCC ACC ACC ACG ATG CCA GAG CGG ATG AAG TAG CCC TCC TCC 96
TCC CGC GAA GCC TCC TGG ACC CCC TCC GGG TTT GTG ATG TGC ACA TCC 144
TCC GCA ATG CGG GCG TTC TTG TCC AGG ATC ACT CGA GTC AGC TTG GAA 192
TTG GCC CCA ACG CCC ATG GGC ACC TGG CCG GAG GCA ATC AGC TCC GCA 240
CGC TGC TCG TTG GAC TCA AAG TAG TCC TGC CCC ATC ATG AGG ACC TCC 288
TTC ATC TCC ACG CCC TTG TCG ATG CGA GAC CGG ATG CCG ATG ATG GAA 336
GTG TCC ACA AAG CAG TCC CTG AGG AGC GAG CCG TGG CTC ACC ACC GCA 384
TCC GTC ACG CGG GAT CGG ATG ATC TTT GCG GGG GGG AGG AAG CGC GGC 432
GAT GTG TAG ATG GGG CCC TTG GGG TCG TGG AAC TCG AAC TGG GGG GGA 480
TTC CGG GTC AGG TTG ATG TTG GAC TCG AAG AAG GAC TCG ATG GTC CCA 528
ATG TCC TCC CAG TAG CCG TTG AAG AGG TAC GCC ATC ACC TTG TTG TCC 576
TTG GCC GAC TGC GGT ATG ATC TCC CCC CCA AAG TCG TTG CGG CTG GGG 624
TCG TCC TTG AGC AGC TTG ATC ATG ACA TCC TTG CGG AAG ACG TAG ATG 672
CCC ATG GAG GCG ATG AAG GGG TTG GCG GCC GCC GCC TCT GCA TCC AGC 720
CCC AGC ACG GTC GTG TCC ACC GCC ATG GCC TCC AAC GCC GCG CCG CGC 768
GGC TTC TCC GCA AAG TCC ACG ATG CGG CCC TCG CCA TCT ATC TTC ATG 816
AGA CCA AAG TCG GAG GCG CGA GTG TGG TCC ACG GGA AGG CAG CCG ATG 864
CTG ATG TCG GCG TTG CTG GCG CGG TGC GCC TCC ACG AAC TTC ATG TAG 912
TCC ATG CGG TAC AGG TGG TCG CCG GAA AGG ACG ACG ATG TCC TGC ACG 960
TTG CGG TTC TTC ACG TCT TCA AAC AGC CAG CTG TAC TGC CGC ACC GCA 1008
TCG GCG GTG CCC TGG AAC CAG GCG TCT GTG GTG GGC GTC TGG TTG GCC 1056
GCC AGG ATC TCG ACG AAG GAG TCC CCT CCA AAG CGC ACA CCC GAG GTG 1104
CCC ATG TTG TAG GTG CGT GCC AGA TGG CGG TTG AGA GAG GTG GAG TTG 1152
AAC TGG GTG AGG ATG TAG ATC TTG GAG ATG CCG GAG TTG ATG CAG TTG 1200
CTC ATG GGC ACG TCG ATG AGG CGA TAG GCG CCT CCA ATG GGC ACA GCC 1248
GGC TTG GCA CGG TTC TTG GTG AGG GGG TAG AGA CGG GTG CCT GCC CCG 1296
CCG CCG AGG ATC ACG GAC AGG ACG GAG CTC CTC TGC GCG CGG GG 	1340

<210> SEQ ID NO: 69
<211> 514
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Leu	Lys	Met	Arg	Val	Ser	Gln	Arg	Gln	Ala	Leu	Gly	Ser
1				5					10					15
Gln	Thr	Phe	Val	Cys	Pro	His	Gly	Ser	Val	Val	Arg	Lys	Ala	Val
				20					25					30
Ser	Ser	Lys	Ala	Arg	Ala	Val	Ser	Arg	Gln	Ala	Gln	Val	Val	Arg
				35					40					45
Ala	Gln	Ala	Val	Ser	Thr	Pro	Val	Glu	Thr	Lys	Val	Ala	Asn	Gly
				50					55					60
Val	Ala	Ala	Ser	Ser	Ala	Ala	Gly	Thr	Gly	Gln	Asn	Asp	Pro	Ala
				65					70					75
Gly	Asp	Ile	Ser	Lys	Thr	Val	Leu	Gly	Ile	Ile	Leu	Gly	Gly	Gly
				80					85					90
Ala	Gly	Thr	Arg	Leu	Tyr	Pro	Leu	Thr	Lys	Lys	Arg	Ala	Lys	Pro
				95					100					105
Ala	Val	Pro	Leu	Gly	Ala	Asn	Tyr	Arg	Leu	Ile	Asp	Ile	Pro	Val
				110					115					120
Ser	Asn	Cys	Leu	Asn	Ser	Asn	Val	Thr	Lys	Ile	Tyr	Cys	Leu	Thr
				125					130					135
Gln	Phe	Asn	Ser	Ala	Ser	Leu	Asn	Arg	His	Leu	Ser	Gln	Ala	Tyr
				140					145					150
Asn	Ser	Ser	Val	Gly	Gly	Tyr	Asn	Ser	Arg	Gly	Phe	Val	Glu	Val
				155					160					165
Leu	Ala	Ala	Ser	Gln	Ser	Ser	Ala	Asn	Lys	Ser	Trp	Phe	Gln	Gly
				170					175					180
Thr	Ala	Asp	Ala	Val	Arg	Gln	Tyr	Met	Trp	Leu	Phe	Glu	Glu	Ala
				185					190					195
Val	Arg	Glu	Gly	Val	Glu	Asp	Phe	Leu	Ile	Leu	Ser	Gly	Asp	His
				200					205					210
Leu	Tyr	Arg	Met	Asp	Tyr	Arg	Asp	Phe	Val	Arg	Lys	His	Arg	Asn
				215					220					225
Ser	Gly	Ala	Ala	Ile	Thr	Ile	Ala	Ala	Leu	Pro	Cys	Ala	Glu	Lys
				230					235					240
Glu	Ala	Ser	Ala	Phe	Gly	Leu	Met	Lys	Ile	Asp	Glu	Glu	Gly	Arg
				245					250					255
Val	Ile	Glu	Phe	Ala	Glu	Lys	Pro	Lys	Gly	Glu	Ala	Leu	Thr	Lys
				260					265					270
Met	Arg	Val	Asp	Thr	Gly	Ile	Leu	Gly	Val	Asp	Pro	Ala	Thr	Ala
				275					280					285
Ala	Ala	Lys	Pro	Tyr	Ile	Ala	Ser	Met	Gly	Ile	Tyr	Val	Met	Ser
				290					295					300
Ala	Lys	Ala	Leu	Arg	Glu	Leu	Leu	Leu	Asn	Arg	Met	Pro	Gly	Ala
				305					310					315
Asn	Asp	Phe	Gly	Asn	Glu	Val	Ile	Pro	Gly	Ala	Lys	Asp	Ala	Gly
				320					325					330
Phe	Lys	Val	Gln	Ala	Phe	Ala	Phe	Asp	Gly	Tyr	Trp	Glu	Asp	Ile
				335					340					345
Gly	Thr	Val	Glu	Ala	Phe	Tyr	Asn	Ala	Asn	Leu	Ala	Leu	Thr	Asp
				350					355					360
Pro	Glu	Lys	Ala	Gln	Phe	Ser	Phe	Tyr	Asp	Lys	Asp	Ala	Pro	Ile
				365					370					375
Tyr	Thr	Met	Ser	Arg	Phe	Leu	Pro	Pro	Ser	Lys	Val	Met	Asp	Cys
				380					385					390
Asp	Val	Asn	Met	Ser	Ile	Ile	Gly	Asp	Gly	Cys	Val	Ile	Lys	Ala
				395					400					405
Gly	Ser	Lys	Ile	His	Asn	Ser	Ile	Ile	Gly	Ile	Arg	Ser	Leu	Ile
				410					415					420
Gly	Ser	Asp	Cys	Ile	Ile	Asp	Ser	Ala	Met	Met	Met	Gly	Ser	Asp
				425					430					435
Tyr	Tyr	Glu	Thr	Leu	Glu	Glu	Cys	Glu	Tyr	Val	Pro	Gly	Cys	Leu
				440					445					450
Pro	Met	Gly	Val	Gly	Asp	Gly	Ser	Ile	Ile	Arg	Arg	Ala	Ile	Val
				455					460					465
Asp	Lys	Asn	Ala	Arg	Ile	Gly	Pro	Lys	Cys	Gln	Ile	Ile	Asn	Lys
				470					475					480
Asp	Gly	Val	Lys	Glu	Ala	Asn	Arg	Glu	Asp	Gln	Gly	Phe	Val	Ile
				485					490					495
Lys	Asp	Gly	Ile	Val	Val	Val	Ile	Lys	Asp	Ser	His	Ile	Pro	Ala
				500					505					510
Gly	Thr	Ile	Ile											
														
<210> SEQ ID NO: 70
<211> 2036
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CCC CGC GCG CAG AGG AGC TCC GTC CTG TCC GTG ATC CTC GGC GGC GGG 48
GCA GGC ACC CGT CTC TAC CCC CTC ACC AAG AAC CGT GCC AAG CCG GCT 96
GTG CCC ATT GGA GGC GCC TAT CGC CTC ATC GAC GTG CCC ATG AGC AAC 144
TGC ATC AAC TCC GGC ATC TCC AAG ATC TAC ATC CTC ACC CAG TTC AAC 192
TCC ACC TCT CTC AAC CGC CAT CTG GCA CGC ACC TAC AAC ATG GGC ACC 240
TCG GGT GTG CGC TTT GGA GGG GAC TCC TTC GTC GAG ATC CTG GCG GCC 288
AAC CAG ACG CCC ACC ACA GAC GCC TGG TTC CAG GGC ACC GCC GAT GCG 336
GTG CGG CAG TAC AGC TGG CTG TTT GAA GAC GTG AAG AAC CGC AAC GTG 384
CAG GAC ATC GTC GTC CTT TCC GGC GAC CAC CTG TAC CGC ATG GAC TAC 432
ATG AAG TTC GTG GAG GCG CAC CGC GCC AGC AAC GCC GAC ATC AGC ATC 480
GGC TGC CTT CCC GTG GAC CAC ACT CGC GCC TCC GAC TTT GGT CTC ATG 528
AAG ATA GAT GGC GAG GGC CGC ATC GTG GAC TTT GCG GAG AAG CCG CGC 576
GGC GCG GCG TTG GAG GCC ATG GCG GTG GAC ACG ACC GTG CTG GGG CTG 624
GAT GCA GAG GCG GCG GCC GCC AAC CCC TTC ATC GCC TCC ATG GGC ATC 672
TAC GTC TTC CGC AAG GAT GTC ATG ATC AAG CTG CTC AAG GAC GAC CCC 720
AGC CGC AAC GAC TTT GGG GGG GAG ATC ATA CCG CAG TCG GCC AAG GAC 768
AAC AAG GTG ATG GCG TAC CTC TTC AAC GGC TAC TGG GAG GAC ATT GGG 816
ACC ATC GAG TCC TTC TTC GAG TCC AAC ATC AAC CTG ACC CGG AAT CCC 864
CCC CAG TTC GAG TTC CAC GAC CCC AAG GGC CCC ATC TAC ACA TCG CCG 912
CGC TTC CTC CCC CCC GCA AAG ATC ATC CGA TCC CGC GTG ACG GAT GCG 960
GTG GTG AGC CAC GGC TCG CTC CTC AGG GAC TGC TTT GTG GAC ACT TCC 1008
ATC ATC GGC ATC CGG TCT CGC ATC GAC AAG GGC GTG GAG ATG AAG GAG 1056
GTC CTC ATG ATG GGG CAG GAC TAC TTT GAG TCC AAC GAG CAG CGT GCG 1104
GAG CTG ATT GCC TCC GGC CAG GTG CCC ATG GGC GTT GGG GCC AAT TCC 1152
AAG CTG ACT CGA GTG ATC CTG GAC AAG AAC GCC CGC ATT GCG GAG GAT 1200
GTG CAC ATC ACA AAC CCG GAG GGG GTC CAG GAG GCT TCG CGG GAG GAG 1248
GAG GGC TAC TTC ATC CGC TCT GGC ATC GTG GTG GTG GCC CGC AAC GCC 1296
ACC ATC AAG GCA GGC ACC CGC ATC GCG TGC GTG CGG GCG GGG CTG GGC 1344
CGG CTA TTC GTG TGC CAC CTT TGC TGT CCC GGT GCT CCC TCA CCG CTT 1392
ACG CTC CTC ACT TGG GCC GCC GAG TTC TCT CTA AAT CGA ATC CGT ACA 1440
CAC ACA CCT TCT CTG TCT GCT CCT GCG TCC CTG CTC TCT TGG GTG TGC 1488
CCT TGT GGT CAT CGC TGA TGC AGC ACG CGG TGT GCA TGG ACC CTC CCT 1536
CGG TCC CTG CTG CAC GGC GTC AGC CCA AGC GCC TCC TGT TGC CTC CAC 1584
CTC CCG GCT GAG ATT TTG TCA ATG CTG ACC GAG CAA GGA GGG TTA ACA 1632
AGA GAA AGC AAC CCG TCA CCT GGG CTT GAG CCT ATA CCC TGG TGG GAA 1680
TGA GAA GTC ATC GTG CCA CCT CCG AGG GGG TCT TGG TCT TCA GGG CCA 1728
GGG CAT GCA GCC CCG CAT CCA GCT CCT CCT TCA GCA GGG TGT ACA CCA 1776
GCC GGT GCC TCG CCA CCA GGG GTT TGC CCT GGA AGG CCT CCG ACA CGA 1824
TCT CCA CGT TGA AGT GAG TTT CTG CAT CAG GGG CGC CTG TGG GGT TTC 1872
CTG TGT GGC CCG CAT GCT TGT GGC TCT CGT TGG TCA GCT TGA GGG TGA 1920
CGG GCT GGA GCC CAG CCG TGA GCT TCT CCT TGA TGG AGA TCT CGA TCG 1968
GTC CAC CTG CCA TGC CCG AGA ATG TCC GCA CAC TCC TCC TGA GAA TGA 2016
GCG CAG GGC GCA TGC GGC CC 					2036	

<210> SEQ ID NO: 71
<211> 601
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ala	Ser	Thr	Ala	Gly	Pro	Ala	Ala	Ala	Ala	Val	Glu	Pro	Arg
1				5					10					15
Ser	Leu	Gly	Gln	His	Leu	Ala	Ser	Arg	Leu	Val	Gln	Ile	Gly	Cys
				20					25					30
Ser	Arg	Phe	Phe	Gly	Val	Pro	Gly	Asp	Tyr	Asn	Leu	Thr	Leu	Leu
				35					40					45
Asp	Glu	Leu	Glu	Lys	Glu	Pro	Gly	Leu	Lys	Gly	Ala	Trp	Cys	Cys
				50					55					60
Asn	Glu	Leu	Asn	Ala	Gly	Lys	Gln	Leu	Trp	Cys	Val	Thr	Gly	Arg
				65					70					75
His	Arg	Ile	Ser	Pro	Asn	Ser	Tyr	Ala	Ala	Asp	Gly	Tyr	Gly	Arg
				80					85					90
Leu	Lys	Gly	Val	Gly	Cys	Ala	Val	Val	Thr	Phe	Thr	Val	Gly	Gly
				95					100					105
Leu	Ser	Ile	Ile	Asn	Ala	Ile	Ala	Gly	Ala	Phe	Ala	Glu	Ser	Leu
				110					115					120
Pro	Val	Ile	Cys	Ile	Thr	Gly	Gly	Pro	Asn	Thr	Asn	Asp	Phe	Ala
				125					130					135
Ser	Asn	Arg	Leu	Ile	His	His	Thr	Leu	Gly	Arg	Lys	Phe	Asp	Phe
				140					145					150
Met	Gln	Glu	Leu	Glu	Ala	Phe	Lys	Gln	Val	Thr	Cys	Glu	Gln	Val
				155					160					165
Val	Ile	His	Ser	Leu	Asp	Asp	Ala	His	Glu	Glu	Ile	Asp	Lys	Ala
				170					175					180
Ile	Ser	Ala	Ala	Leu	Leu	His	Ser	Lys	Pro	Ala	Tyr	Ile	Cys	Val
				185					190					195
Cys	Cys	Asn	Leu	Ala	Gly	Lys	Arg	Gln	Arg	Arg	Arg	Arg	Arg	Arg
				200					205					210
Gln	Gln	Arg	Arg	Met	His	His	Pro	Ser	Phe	Asp	Thr	Ser	Pro	Ile
				215					220					225
Pro	Tyr	Ser	Leu	Ser	Thr	Lys	Gln	Ser	Asn	Lys	Arg	Ser	Leu	Glu
				230					235					240
Ala	Ala	Val	Glu	Ala	Ala	Ala	Ala	Phe	Leu	Glu	Ser	Lys	Gln	Lys
				245					250					255
Pro	Val	Ala	Leu	Ala	Gly	Pro	Gln	Leu	Arg	Ile	Gly	Gly	Ala	Ser
				260					265					270
Gln	Gln	Phe	Met	Lys	Cys	Val	Glu	Ala	Ser	Gly	Tyr	Pro	Tyr	Ala
				275					280					285
Asn	Met	Ala	Ala	Ala	Lys	Ser	Leu	Val	Pro	Glu	Ser	His	Arg	Gln
				290					295					300
Tyr	Met	Gly	Thr	Tyr	Trp	Gly	Gln	Ile	Ser	Ala	Pro	Cys	Val	Ser
				305					310					315
Glu	Val	Val	Glu	Ser	Ala	Asp	Ala	Tyr	Leu	Val	Ala	Gly	Pro	Val
				320					325					330
Phe	Ser	Asp	Tyr	Ala	Ser	Val	Gly	Tyr	Thr	Leu	Gly	Leu	Ser	Glu
				335					340					345
Ser	Lys	Met	Val	Arg	Val	Asp	Pro	Tyr	Arg	Val	Thr	Ile	Ala	Gly
				350					355					360
Gly	Lys	Gly	Gly	Gln	Val	Phe	Gly	Cys	Val	Asn	Met	Arg	Asp	Phe
				365					370					375
Leu	Ala	Ala	Leu	Ala	Ala	Arg	Arg	Leu	Lys	Pro	Asn	Ala	Thr	Ser
				380					385					390
Met	Asp	Ile	Tyr	Arg	Arg	Leu	Tyr	Ala	Pro	Pro	Pro	Glu	Val	Ala
				395					400					405
Pro	Ser	Pro	Ala	Gly	Ser	Pro	Leu	Gln	Thr	Lys	Val	Leu	Phe	Lys
				410					415					420
His	Ile	Gln	Gly	Leu	Leu	Gln	Pro	Ser	Thr	Met	Leu	Leu	Gly	Glu
				425					430					435
Thr	Gly	Asp	Ala	Ile	Phe	Asn	Cys	Gln	Lys	Leu	Ala	Leu	Pro	Asp
				440					445					450
Gly	Cys	Arg	Tyr	Asp	Trp	Ser	Gln	Gln	Tyr	Gly	Ser	Ile	Gly	Trp
				455					460					465
Ser	Val	Gly	Ala	Thr	Leu	Gly	Leu	Ala	Met	Ala	Gly	Arg	Asp	Ala
				470					475					480
Gly	Arg	Arg	Glu	Val	Ser	Thr	Met	Leu	Arg	Tyr	Asn	Leu	Asn	Pro
				485					490					495
Ile	Ile	Phe	Leu	Ile	Asn	Asn	Gly	Gly	Tyr	Thr	Ile	Glu	Val	Glu
				500					505					510
Ile	His	Asp	Gly	Pro	Tyr	Asn	Val	Ile	Lys	Asn	Trp	Asp	Tyr	Val
				515					520					525
Gly	Leu	Val	Gln	Ala	Met	Gln	Asn	Gly	Gln	Gly	Gln	Leu	Phe	Ala
				530					535					540
Thr	Arg	Val	Arg	Thr	Glu	Ala	Glu	Leu	Ala	Asp	Ala	Val	Lys	Val
				545					550					555
Val	Arg	Arg	Glu	Ala	Lys	Asp	Arg	Leu	Cys	Phe	Ile	Glu	Cys	Ile
				560					565					570
Ile	His	Arg	Arg	Ala	Thr	Asp	Asp	Cys	Ser	Lys	Glu	Leu	Leu	Glu
				575					580					585
Trp	Gly	Ala	Arg	Val	Ala	Ala	Ala	Asn	Ser	Arg	Pro	Pro	Lys	Pro
				590					595					600
Ser														
														
<210> SEQ ID NO: 72
<211> 1342
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TGC GCA GGG GGG CGG GCT CCT GCG CCT GGC TGG CGC AGA GGA GGA GTT 48
TGT GCG CCT GGT GGA GGC CGC GCA GTA CCC CTT TGC CGT CAT GGC GGC 96
GGG GAA GGG GCT GGT GCC GGA GGA CCA CCC GCA GTG GAT GGG CAC CTA 144
CTG GGG GCA GAT CAG CAG CCC GTT CGT CGC GGA GGT GGT GGA GAG CGC 192
AGA TGC CGC CCT CTT TGT CGG TGC ACA GTT CAA CGA CTA TGC CAC GGC 240
CGG GAA CTC GCT CAA CCT GCA GCA GTC ACG CAT GAT CAA GGT GGA GCC 288
CTA TCG CGT AGT GAT TGC AGG AGG AAA AGG AGG GCA GGT CTT TGG CTG 336
TGT CCG GAT GGA CGA CTT CCT GTG TGG ACT TGC GGA GGC AAT CAC GCC 384
CAA TGA CAC CGC CCA TCA GAT CTT TAA GCG CCT CTG GGT GCC CAC GCC 432
CGT CGT AGA GCC CAG CGA GGA GGG CTC CCC CTT GCG CAC CAA GCA TCT 480
CTT TGC CCA TGT GCA AAT GCT GCT TGA TGA CAA GAC TAT CCT TAT CGC 528
CGA GAC CGG GGA CTC GAT CTT TAA TTG CCA GAA GCT CAA ACT GCC ACG 576
GGG ATG CAC GTA CGA ATG GTC GCA GCA GTA CGG GAG TAT TGG CTG GAG 624
CGT CGG TGC TGT GCT GGG TGC CGC CAT TGG AGG CCA GAA GGA TGG GCG 672
GCG CGT GGT GGC GTG CAT CGG GGA TGG CAG CTT CCA GGT GAC TGC CCA 720
GGA TGT GTC CAC CAT GAT GCG GTA CGG ACA GAA TCC CAT CAT CAT CCT 768
CGT CAA CAA CGG CGG CTA TAC GAT TGA GGT GGA GAT CCA CGA TGG ACC 816
CGC TGA CAA CAA CTA CAA CAT CAT CAA GAA CTG GGA CTA TGT GGC CCT 864
TTT CAA AGC CAT GCA GAA CAA GGA GGG CCA GCT TTT TGC CAC CAG GGC 912
GAA GAC CGA GAA GGA CCT GGA GGA TGC CAT CGA ATT TGC GAA GAA CGA 960
GGC TGC TGA TTC GCT GTG CCT CAT CGA GTG CAT TGT GCA CAG GGA TGA 1008
CTG CAG CAG CGA GTT GCT GGA GTG GGG GGG CAG GGT GGC AGC GGC CAA 1056
CAG CCG CCC TCC ACA GTC TGA TTG ATT GTG CGC AAT TTC AGA ATG ACC 1104
CGG CTG ACA GTA CAC TCC AGA AGC CCT GAC CAT GTG TGA TCT GCC ATG 1152
GTG GCA CGC GAT CAG CTT GCT ACA CAT GTA CAT TGC AAG GAT GAA CGT 1200
GTA TAG TAC ATG CAT TCT GCG CAG GAG CAC AGG CGT CTG AGA CAT TGG 1248
GGA AGG GGG CTG GCG GGG TCA ATG GAG CAA GTG TCG TGA GTG CTG TAA 1296
GTA AGT CGA GAG GCA AGG ATC ACT GCG CGG ATG GGT GTG CGT TTC T 	1342 

<210> SEQ ID NO: 73
<211> 420
<212> Amino Acid Sequence
<213> Ostreococcus tauri

Met	Ser	Gly	Ser	Ala	Ser	Arg	Ala	Leu	Ser	Gln	Ala	Leu	Val	Asp
1				5					10					15
Arg	Ile	Lys	Ser	His	Val	Gly	Glu	Arg	Val	Val	Ile	Lys	Thr	Arg
				20					25					30
Asp	Val	Gly	Ala	Thr	Gln	Pro	Pro	Ser	Phe	Val	Asn	Glu	Gln	Trp
				35					40					45
Ile	Gly	Ala	Ser	Phe	Thr	Pro	Glu	Glu	Asn	Arg	Thr	Asp	Ala	Gln
				50					55					60
Arg	Glu	Thr	Leu	Arg	Glu	Ser	Asp	Gly	Leu	Ile	Glu	Glu	Leu	Arg
				65					70					75
Ser	Ser	Asp	Val	Val	Val	Val	Gly	Ala	Ala	Met	Tyr	Asn	Phe	Gly
				80					85					90
Val	Pro	Ala	Ala	Leu	Lys	Ala	Tyr	Phe	Asp	Gln	Val	Ala	Arg	Ala
				95					100					105
Gly	Val	Thr	Phe	Lys	Tyr	Asn	Asp	Gln	Gly	Val	Pro	Glu	Gly	Leu
				110					115					120
Leu	Lys	Gly	Lys	Lys	Ala	Phe	Ile	Val	Val	Thr	Ser	Gly	Gly	Val
				125					130					135
Pro	Met	Asn	Ala	Ser	Gly	Met	Asp	Phe	Met	Thr	Pro	His	Val	Val
				140					145					150
Thr	Phe	Leu	Gly	Leu	Leu	Gly	Ile	Thr	Asp	Val	Ser	Val	Ile	Asp
				155					160					165
Ala	Ser	Ser	Gln	Met	Lys	Arg	Asp	Asp	Ala	Asn	Asp	Val	Ala	Lys
				170					175					180
Ala	Ala	Ile	Asp	Ala	Ile	Asp	Leu	Asp	Ala	Val	Leu	Ala	Thr	Phe
				185					190					195
Ala	Arg	Ser	Arg	Ile	Ala	Arg	Thr	Arg	Thr	Asn	Gly	Arg	Gln	Arg
				200					205					210
Glu	Pro	Arg	Trp	Met	Thr	Ala	Ser	Trp	Thr	Arg	Ala	Arg	Ala	Tyr
				215					220					225
Ala	Thr	Glu	Asp	Gly	Lys	Ser	Gly	Glu	Glu	Gly	Ala	Asp	Val	Ala
				230					235					240
Ser	Glu	Glu	Gly	Glu	Glu	Ile	Glu	Gly	Glu	Glu	Asn	Asp	Glu	Ile
				245					250					255
Ala	Lys	Leu	Arg	Gly	Glu	Leu	Glu	Glu	Lys	Asp	Ala	Ala	Val	Ala
				260					265					270
Asp	Leu	Lys	Asp	Arg	Ile	Leu	Arg	Thr	Met	Ala	Glu	Met	Glu	Asn
				275					280					285
Leu	Arg	Glu	Arg	Thr	Arg	Arg	Gln	Ala	Glu	Asp	Ala	Lys	Lys	Phe
				290					295					300
Ala	Val	Gln	Gly	Phe	Cys	Lys	Asp	Leu	Leu	Asp	Val	Ala	Asp	Asn
				305					310					315
Leu	Asp	Arg	Ala	Ile	Ser	Thr	Val	Pro	Glu	Glu	Glu	Ile	Glu	Thr
				320					325					330
Asp	Val	Glu	Lys	Ile	Lys	Ala	Lys	Leu	Lys	Ser	Phe	Arg	Glu	Gly
				335					340					345
Val	Val	Leu	Thr	Glu	Lys	Gln	Leu	Ser	Ser	Thr	Phe	Asn	Lys	His
				350					355					360
Gly	Val	Ala	Lys	Phe	Asn	Pro	Glu	Gly	Glu	Glu	Phe	Asp	Ala	Asn
				365					370					375
Leu	His	Met	Ala	Leu	Phe	Asn	Val	Pro	Ile	Pro	Glu	Gly	Ser	Asp
				380					385					390
Ala	Lys	Ala	Gly	Thr	Val	Ala	Ala	Val	Thr	Lys	Thr	Gly	Tyr	Thr
				395					400					405
Leu	His	Glu	Arg	Val	Ile	Arg	Ala	Ala	Glu	Val	Gly	Val	Tyr	Gln
				410					415					420

<210> SEQ ID NO: 74
<211> 824
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GGG GGC CCC ACC CCA GCC GCA GGC CCC GAG CCT GCC GCC GCG TCT GCT 48
GGT GAG CCC GCC ACA GAG GCC GCT GCA TCC GAT CTC ACT CCA GAC GAG 96
CTG AAG CAG GCA CTG CAA GCC GCC AAG GAG CAG CTG GAA CTC GCT CAG 144
AAG GAG GCT GCA GAA TCC AAG GAC CGG CTG GTA AGG ACA CTG GCA GAC 192
ATG CAG AAT CTG CGA GAA CGG ACG GCC CGA CAG ATT GCC GAC ACC AAG 240
CAG TTT GCC GTC CAG GGA GTG GTG AAG TCC TTC ATC GAG GTG GCT GAT 288
AAC CTG GAT CTG GCA ATA GGG TCT GTT CCT GAG AAG GAG CTG GAA GGG 336
GGT GAG GAC GTC GAT GTG GAG AGG GCG CTC ACC CTC CTG AAG CGG CTT 384
CGG GAT GGA GTG GTC ATG ACG GAG AGC ATC ATG CTC AAG CTG CTG GAG 432
AAG GAG GGT GTG AGG AAG TAT GAT CCC CAG GGC GAG CCC TTC GAC CCC 480
AAC CTT CAC AAC GCC ATG TTC CGT GTG CCC AAC TCC GGG GTC AAG TCT 528
GGG CAT GTG GCC CAC GTC ATT AAG AAA GGA TAC ATG CTG CAT GAG AGA 576
GCC GTC CGA GCC GCC GAC GTT GGC GTG GCT GAG TGA GGT CGC CAG TGC 624
TGC CGT GTC ACC GTG GTG CCA CAG CGC ACC CTC TGA ACG TAT CTT CTG 672
CTC CAT GTG ACA ACT CTC GTG TAA GAC CGG ACG CAT GGC AGT TTC AAA 720
CCC TGA AGA GCA CAT CAG GAT GGT GTT GTA AAC AGG CTT ACT TAG AGT 768
GCC CAA TCC CTA TGC ACG CCC CGT GTG CCA TCC TCT ACC CCA CAA GCT 816
GGG CAT GA 							824

<210> SEQ ID NO: 75
<211>347
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii
Met	Thr	Ser	Leu	Pro	Ala	Leu	Val	Pro	Phe	Ala	Ala	Leu	Pro	Ala
1				5					10					15
Leu	Val	Pro	Phe	Ala	Ala	Leu	Ala	Ser	Thr	Gly	Arg	Leu	Leu	Gly
				20					25					30
Ser	Met	Ser	Gly	Leu	Val	Cys	Gly	Ala	Gln	Arg	Arg	Leu	Pro	Ala
				35					40					45
His	Thr	Ala	Phe	Ala	Arg	Ser	His	Gly	Thr	Ala	Thr	Gly	His	Ala
				50					55					60
Gly	Ile	Val	Gly	Gly	Ala	Gly	Leu	Ser	His	Val	Lys	Asp	Ala	Ala
				65					70					75
Gln	Ala	Phe	Gly	Ala	Asn	Gln	Ser	Ser	Ser	Ser	Pro	Ser	Phe	Ala
				80					85					90
Thr	Ser	Gly	Val	Ala	Pro	His	Pro	Gly	Met	Lys	Ala	Pro	Ser	Pro
				95					100					105
Pro	Thr	Asp	Asp	Glu	Val	Glu	Ala	Cys	Trp	Arg	Pro	Val	Tyr	Asp
				110					115					120
Thr	Ala	Tyr	Leu	Glu	Lys	Val	Lys	Pro	Phe	His	Ile	Thr	Pro	Glu
				125					130					135
Arg	Leu	Tyr	Gln	Arg	Ile	Gly	Phe	Arg	Ala	Ile	Met	Ala	Ala	Arg
				140					145					150
Trp	Thr	Phe	Asp	Lys	Leu	Thr	Gly	Tyr	Gly	Pro	Asn	Met	Thr	Glu
				155					160					165
Ala	Lys	Trp	Leu	Gln	Arg	Met	Ile	Phe	Leu	Glu	Thr	Ile	Ala	Gly
				170					175					180
Val	Pro	Gly	Met	Val	Ala	Gly	Val	Leu	Arg	His	Leu	Lys	Ser	Leu
				185					190					195
Arg	Ser	Met	Lys	Arg	Asp	His	Gly	Trp	Ile	His	Thr	Leu	Leu	Gln
				200					205					210
Glu	Ala	Glu	Asn	Glu	Arg	Met	His	Leu	Leu	Thr	Phe	Phe	Glu	Leu
				215					220					225
Arg	Lys	Pro	Gly	Pro	Leu	Phe	Arg	Ala	Ser	Ile	Ile	Val	Ala	Gln
				230					235					240
Gly	Val	Phe	Trp	Asn	Leu	Tyr	Phe	Ile	Gly	Tyr	Leu	Val	Ser	Pro
				245					250					255
Arg	Thr	Cys	His	Ala	Ala	Val	Gly	Phe	Leu	Glu	Glu	Glu	Ala	Val
				260					265					270
Lys	Thr	Tyr	Thr	His	Ala	Leu	Gln	Glu	Ile	Asp	Ala	Gly	Arg	Leu
				275					280					285
Trp	Lys	Gly	Lys	Val	Ala	Pro	Pro	Ile	Ala	Cys	Glu	Tyr	Trp	Gly
				290					295					300
Leu	Lys	Pro	Gly	Ala	Ser	Met	Arg	Asp	Leu	Ile	Leu	Ala	Val	Arg
				305					310					315
Ala	Asp	Glu	Ala	Cys	His	Ala	His	Val	Asn	His	Thr	Leu	Ser	Gly
				320					325					330
Leu	Pro	Ala	Thr	Ala	Pro	Asn	Pro	Phe	Ala	Tyr	Gly	Ala	Ser	Gln
				335					340					345
Leu	Pro													

<210> SEQ ID NO: 76
<211> 745 
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CAT TCA AGA GCC GCT CCC CTG CAG GCG TTC CAG GCT TCG TCG CAG GCA 48
TGC TGC GCC ACA TGC GCT CCC TGC GCT CCA TGA AGC GCG ACC GGG GCT 96
GGA TCC ACA CGC TGC TGG AAG AAG CAG AGA ATG AGC GCA TGC ACC TCC 144
TGA CCT TCC TGC AGT TCA GGG AGC CTG GGC CCT TGA TGA GGG GCA CCG 192
TGC TCC TCG GCC AGG GCG TGT TCC TCA ACT TTT ACC TCC TTG CCT ACA 240
CCA TCT CAC CCC GCA CCT GCC ACA GCT TTG TCG GGT ATC TGG AGG AAG 288
AGG CAG TCA AGA CCT ACA CCC GCT GCA TCA AGG ACC TTG ACG CTG GCT 336
TAA TCC CGC AGT GGG AGA AAA AGG CCT TGC CTG AAA TCG CCA TCA AGT 384
ACT GGC AGC TGA GCC CTG ATG CCA CCA TGA GGG ACC TCC TGC TGG CTG 432
TCA GGG CCG ATG AGG CCT GCC ACA GCC ATG TGA ACC ACG CCT TCT CCC 480
ACA TGA AGC CGT CAG AGG AGA ATC CCT TCC TTC CAG GGA CGG TGC ACG 528
TGC CCT AAA GGA CTT GGC AAG GGT ACT GTG CCT AAG ATT GTG AGC GAC 576
TAC GGA TTG AAG AGT TTG TCG AGA CTA GGC TTC CAA GAT TCC TTT ACG 624
GCC TCA TCT ATT TCC CCA CCT CCC TGC TTT ACT AGA TAG CTT GGC TCT 672
GGC CCC ACT ATC CGG CTT ACT CAC CAC CCA CTC AGG ACG TGT GGA AGA 720
ATG AGA TGT AAT CTC CTA TCA CGC A 				745

<210> SEQ ID NO: 77
<211> 445
<212> Amino Acid Sequence
<213> Arabidopsis thaliana

Met	Gly	His	Gln	Asn	Ala	Ala	Val	Ser	Glu	Asn	Gln	Asn	His	Asp
1				5					10					15
Asp	Gly	Ala	Ala	Ser	Ser	Pro	Gly	Phe	Lys	Leu	Val	Gly	Phe	Ser
				20					25					30
Lys	Phe	Val	Arg	Lys	Asn	Pro	Lys	Ser	Asp	Lys	Phe	Lys	Val	Lys
				35					40					45
Arg	Phe	His	His	Ile	Glu	Phe	Trp	Cys	Gly	Asp	Ala	Thr	Asn	Val
				50					55					60
Ala	Arg	Arg	Phe	Ser	Trp	Gly	Leu	Gly	Met	Arg	Phe	Ser	Ala	Lys
				65					70					75
Ser	Asp	Leu	Ser	Thr	Gly	Asn	Met	Val	His	Ala	Ser	Tyr	Leu	Leu
				80					85					90
Thr	Ser	Gly	Asp	Leu	Arg	Phe	Leu	Phe	Thr	Ala	Pro	Tyr	Ser	Pro
				95					100					105
Ser	Leu	Ser	Ala	Gly	Glu	Ile	Lys	Pro	Thr	Thr	Thr	Ala	Ser	Ile
				110					115					120
Pro	Ser	Phe	Asp	His	Gly	Ser	Cys	Arg	Ser	Phe	Phe	Ser	Ser	His
				125					130					135
Gly	Leu	Gly	Val	Arg	Ala	Val	Ala	Ile	Glu	Val	Glu	Asp	Ala	Glu
				140					145					150
Ser	Ala	Phe	Ser	Ile	Ser	Val	Ala	Asn	Gly	Ala	Ile	Pro	Ser	Ser
				155					160					165
Pro	Pro	Ile	Val	Leu	Asn	Glu	Ala	Val	Thr	Ile	Ala	Glu	Val	Lys
				170					175					180
Leu	Tyr	Gly	Asp	Val	Val	Leu	Arg	Tyr	Val	Ser	Tyr	Lys	Ala	Glu
				185					190					195
Asp	Thr	Glu	Lys	Ser	Glu	Phe	Leu	Pro	Gly	Phe	Glu	Arg	Val	Glu
				200					205					210
Asp	Ala	Ser	Ser	Phe	Pro	Leu	Asp	Tyr	Gly	Ile	Arg	Arg	Leu	Asp
				215					220					225
His	Ala	Val	Gly	Asn	Val	Pro	Glu	Leu	Gly	Pro	Ala	Leu	Thr	Tyr
				230					235					240
Val	Ala	Gly	Phe	Thr	Gly	Phe	His	Gln	Phe	Ala	Glu	Phe	Thr	Ala
				245					250					255
Asp	Asp	Val	Gly	Thr	Ala	Glu	Ser	Gly	Leu	Asn	Ser	Ala	Val	Leu
				260					265					270
Ala	Ser	Asn	Asp	Glu	Met	Val	Leu	Leu	Pro	Ile	Asn	Glu	Pro	Val
				275					280					285
His	Gly	Thr	Lys	Arg	Lys	Ser	Gln	Ile	Gln	Thr	Tyr	Leu	Glu	His
				290					295					300
Asn	Glu	Gly	Ala	Gly	Leu	Gln	His	Leu	Ala	Leu	Met	Ser	Glu	Asp
				305					310					315
Ile	Phe	Arg	Thr	Leu	Arg	Glu	Met	Arg	Lys	Arg	Ser	Ser	Ile	Gly
				320					325					330
Gly	Phe	Asp	Phe	Met	Pro	Ser	Pro	Pro	Pro	Thr	Tyr	Tyr	Gln	Asn
				335					340					345
Leu	Lys	Lys	Arg	Val	Gly	Asp	Val	Leu	Ser	Asp	Asp	Gln	Ile	Lys
				350					355					360
Glu	Cys	Glu	Glu	Leu	Gly	Ile	Leu	Val	Asp	Arg	Asp	Asp	Gln	Gly
				365					370					375
Thr	Leu	Leu	Gln	Ile	Phe	Thr	Lys	Pro	Leu	Gly	Asp	Arg	Pro	Thr
				380					385					390
Ile	Phe	Ile	Glu	Ile	Ile	Gln	Arg	Val	Gly	Cys	Met	Met	Lys	Asp
				395					400					405
Glu	Glu	Gly	Lys	Ala	Tyr	Gln	Ser	Gly	Gly	Cys	Gly	Gly	Phe	Gly
				410					415					420
Lys	Gly	Asn	Phe	Ser	Glu	Leu	Phe	Lys	Ser	Ile	Glu	Glu	Tyr	Glu
				425					430					435
Lys	Thr	Leu	Glu	Ala	Lys	Gln	Leu	Val	Gly					
				440					445					

<210> SEQ ID NO: 78
<211> 892
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GTT GGA TCT GAT CAC AGG CTA CGT AGC CAT CGT GCT CAT GAT CCA GCT 48
CTT ACC TAA GCC GCA ACT TTA TAC CGT TCC GAC TGA CTG AAT GTA ATC 96
CAG ACA AAT TCC ATC AAG AAT GGG GAT GCC GCA TGT CTT TTT CAG CTG 144
ACA AGA AAA CCA GGT CTC CGC CTT GTC GAC GTC AGC CCC GCG GTG CCA 192
TAG ATT TTG GTT TCG TTC CGC GGC CCC ATT TGA TTG CTG TTA CTC GTC 240
GTC CAT GCC AGT CTT GGG TCA CAG TCA TCT TTG CTT GAC ACA TAT AAG 288
ACG CGC TTT CGC AAC CTG CAG CTC TCG CCC ATT GCG TGT CCC CTC GGT 336
CGC ATC GCG CCG GTC CCC CAA ATC TTC CTG GGT GTC CAT GGG AGG TGC 384
TGT CGA GAC TGC TGC CGC CAA CCC CCC TTC TGA GTC CAT AGC TCG CAA 432
ACT GGT CGG CGC CAA GGG CTT TCA GCG TCA CAA CCC GCT GAC CGA CAA 480
GTT CCC CAT CCA CCG CTT CCA CCA CTT TGA GTT CTA CTG CGG GGA TGC 528
GAC AAA CAC CAG CCG CAG GTT TGG CCT GGG CCT GGG GCT GTC CCA GGT 576
AGC CAA GTC TGA CCA GGG CAC AGG GAA CCA GAA GTT TGC CAG CTA TGT 624
CAT GCG CTC CAA CCA GCT CAT CTT CAC CTT CAC TGC GCC CTA CAA CGG 672
CGC GGG GGG CGA GGC CTC CGG CCC GGA CGC CGG CTC CCC GGT CCC CTG 720
GTA CGA CGT CGA CGC CGC GCA CAC CTT CAA CAG GAA CCA CGG CCT GGG 768
CGT CCG CGC AGT GGG CAT CGT GGT GGA GGA CGC GGC GGA GGC CTT CCG 816
CAT CTC CGC CGC CAA TGG CGG CAT CCC CGT GCA GCC CCC TAC GCG GCT 864
GGC CGA CGC GAG CGG GTC GCT GAC GGC G 				892

<210> SEQ ID NO: 79
<211> 645
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Ser	Gly	Pro	Ala	Ile	Gly	Ile	Asp	Leu	Gly	Thr	Thr	Tyr	Ser
1				5					10					15
Cys	Val	Gly	Leu	Trp	Gln	His	Asp	Arg	Val	Glu	Ile	Ile	Pro	Asn
				20					25					30
Glu	Gln	Gly	Asn	Arg	Thr	Thr	Pro	Ser	Tyr	Val	Ala	Phe	Thr	Asp
				35					40					45
Thr	Glu	Arg	Leu	Ile	Gly	Asp	Ala	Ala	Lys	Asn	Gln	Val	Ala	Met
				50					55					60
Asn	Pro	Ile	Asn	Thr	Val	Phe	Asp	Ala	Lys	Arg	Leu	Ile	Gly	Arg
				65					70					75
Lys	Phe	Ser	Asp	Pro	Thr	Ile	Gln	His	Asp	Val	Ser	His	Trp	Pro
				80					85					90
Phe	Lys	Val	Val	Ala	Gly	Pro	Gly	Asp	Lys	Pro	Met	Ile	Gln	Val
				95					100					105
Ala	Tyr	Lys	Gly	Glu	Glu	Lys	Thr	Phe	Ala	Pro	Glu	Glu	Ile	Ser
				110					115					120
Ser	Met	Val	Leu	Val	Lys	Met	Lys	Glu	Ile	Ala	Gln	Ala	Tyr	Val
				125					130					135
Gly	Ala	Asp	Lys	Glu	Val	Lys	Lys	Ala	Val	Val	Thr	Val	Pro	Ala
				140					145					150
Tyr	Phe	Asn	Asp	Ser	Gln	Arg	Gln	Ala	Thr	Lys	Asp	Ala	Gly	Val
				155					160					165
Ile	Ala	Gly	Leu	Glu	Val	Met	Arg	Ile	Ile	Asn	Glu	Pro	Thr	Ala
				170					175					180
Ala	Ala	Ile	Ala	Tyr	Gly	Leu	Asp	Lys	Lys	Gly	Thr	Thr	Ser	Gly
				185					190					195
Glu	Gln	Asn	Val	Leu	Ile	Phe	Asp	Leu	Gly	Gly	Gly	Thr	Phe	Asp
				200					205					210
Val	Ser	Leu	Leu	Thr	Ile	Glu	Glu	Gly	Ile	Phe	Glu	Val	Lys	Ala
				215					220					225
Thr	Ala	Gly	Asp	Thr	His	Leu	Gly	Gly	Glu	Asp	Phe	Asp	Asn	Arg
				230					235					240
Leu	Val	Asn	Phe	Phe	Val	Gln	Ala	Ser	Phe	Lys	Arg	Lys	His	Arg
				245					250					255
Lys	Asp	Ile	Ser	Ser	Asn	Pro	Arg	Ala	Leu	Arg	Arg	Leu	Arg	Thr
				260					265					270
Ser	Cys	Glu	Arg	Ala	Lys	Arg	Thr	Leu	Ser	Ser	Ser	Thr	Gln	Ala
				275					280					285
Ser	Ile	Glu	Ile	Asp	Ser	Leu	Tyr	Glu	Gly	Ile	Asp	Phe	Tyr	Ser
				290					295					300
Ser	Ile	Thr	Arg	Ala	Arg	Phe	Glu	Glu	Leu	Asn	Met	Asp	Leu	Phe
				305					310					315
Arg	Lys	Cys	Met	Glu	Pro	Val	Glu	Lys	Val	Leu	Arg	Asp	Ala	Lys
				320					325					330
Met	Asp	Lys	Gly	Gln	Val	Asn	Glu	Val	Val	Leu	Val	Gly	Gly	Ser
				335					340					345
Thr	Arg	Ile	Pro	Lys	Val	Gln	Gln	Leu	Leu	Gln	Asp	Phe	Phe	Asn
				350					355					360
Gly	Lys	Glu	Leu	Cys	Lys	Ser	Ile	Asn	Pro	Asp	Glu	Ala	Val	Ala
				365					370					375
Tyr	Gly	Ala	Ala	Val	Gln	Ala	Ala	Ile	Leu	Asn	Gly	Glu	Thr	His
				380					385					390
Glu	Lys	Val	Gln	Asp	Leu	Leu	Leu	Leu	Asp	Val	Ile	Pro	Leu	Ser
				395					400					405
Leu	Gly	Leu	Glu	Thr	Ala	Gly	Gly	Val	Met	Thr	Val	Leu	Ile	Ala
				410					415					420
Arg	Asn	Thr	Thr	Ile	Pro	Thr	Lys	Lys	Glu	Gln	Val	Phe	Ser	Thr
				425					430					435
Tyr	Ser	Asp	Asn	Gln	Pro	Gly	Val	Leu	Ile	Gln	Val	Tyr	Glu	Gly
				440					445					450
Glu	Arg	Ser	Arg	Thr	Lys	Asp	Asn	Asn	Leu	Leu	Gly	Lys	Phe	Glu
				455					460					465
Leu	Thr	Gly	Ile	Pro	Pro	Ala	Pro	Arg	Gly	Val	Pro	Gln	Ile	Asn
				470					475					480
Val	Thr	Phe	Asp	Val	Asp	Ala	Asn	Gly	Ile	Leu	Asn	Val	Ser	Ala
				485					490					495
Glu	Asp	Lys	Thr	Thr	Gly	Asn	Lys	Asn	Lys	Ile	Thr	Ile	Thr	Asn
				500					505					510
Asp	Lys	Gly	Arg	Leu	Ser	Lys	Asp	Glu	Ile	Glu	Arg	Met	Val	Gln
				515					520					525
Glu	Ala	Glu	Lys	Tyr	Lys	Ala	Glu	Asp	Glu	Thr	His	Arg	Thr	Arg
				530					535					540
Val	Glu	Ala	Arg	Asn	Gly	Leu	Glu	Asn	Ala	Ala	Tyr	Gly	Leu	Arg
				545					550					555
Asn	Thr	Val	Arg	Asp	Thr	Asn	Leu	Ala	Asp	Lys	Leu	Ser	Ala	Glu
				560					565					570
Asp	Lys	Glu	Ala	Ile	Glu	Lys	Ala	Val	Asp	Lys	Val	Val	Asp	Trp
				575					580					585
Leu	Asp	His	Asn	Gln	Leu	Ala	Glu	Glu	Glu	Glu	Ile	Thr	His	Gln
				590					595					600
Arg	Glu	Glu	Met	Glu	Ala	Val	Cys	Asn	Pro	Ile	Ile	Thr	Lys	Leu
				605					610					615
Tyr	Gln	Gly	Ala	Pro	Pro	Pro	Pro	Glu	Ala	Gly	Gly	Ala	Ser	Ala
				620					625					630
Gly	Gly	Ala	Ala	Ser	Gly	Pro	Gly	Pro	Lys	Ile	Glu	Glu	Val	Asp
				635					640					645

<210> SEQ ID NO: 80
<211> 3064
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG ACC GTG TGG AGA TCA TCC CCA CCA TTC TGC ACA CAA TTA ACT CTT 48
CTG CGT ACC CTC AGT AAT ACC CCT ATT TTC AAT CAC ACA TCG CAA ACA 96
TGT CGA ACG GAC CTG CCA TCG GCA TTG ACC TGG GCA CAA CCT ACT CCT 144
GCG TCG GCA TCT GGC AGC ATG ACC GTG TGG AGA TCA TCC CCA ACG AGC 192
AGG GTA ACC GTA CCA CGC CCT CGT ACG TGG CCT TCA CCG ACA CGG AGC 240
GCC TGA TCG GCG ATG CCG CCA AGA ACC AGG TCG CCA TGA ACC CCA TCA 288
ACA CTG TGT TCG ATG CCA AGC GCC TGA TCG GCC GCA AGT TCA ACG ACG 336
GCA ACA TCC AGC ACG ACA TCT CCC ACT GGC CCT TCA AGG TCG TGT CTG 384
GTG CTG GCG AGA AGC CCA TGA TCC AGG TCG AGT ACA AGG GCG AGA CCA 432
AGA CCT TTG CCC CCG AGG AGA TCT CCT CCA TGG TGC TGG TGA AGA TGA 480
AGG AGG TGG CCC AGT CCT TTG TGG GCG CCG ACA AGG AGG TGA AGA AGG 528
CGG TGG TCA CCG TGC CCG CCT ACT TCA ACG ACT CCC AGC GCC AGG CGA 576
CCA AGG ACG CCG GCG TGA TTG CGG GCC TGG ACG TCA TGC GCA TCA TCA 624
ACG AGC CCA CCG CCG CCG CCA TCG CGT ACG GCC TGG ACA AGA AGC ACG 672
CCA CCA CCG GCG AGC AGA ACG TGC TCA TCT TTG ACC TGG GCG GCG GCA 720
CCT TTG ATG TGT CCC TGC TGA CGA TCG AGG AGG GCA TCT TCG AGG TGA 768
AGG CCA CCG CGG GCG ACA CCC ACC TGG GCG GCG AGG ACT TTG ACA ACC 816
GCC TGG TCA GCT TCT TCG TCC AGG AGT TCA AGC GCA AGA ACA AGA AGG 864
ACA TCA GCG GCA ACC CGC GCG CGC TGC GCC GCC TGC GCA CTG CCT GCG 912
AGC GCG CCA AGC GCC AGC TCT CCT CCT CCA CCC AGG CCT CCA TCG AGA 960
TCG ACT CCC TGT ACG AGG GCA TCG ACT TCT ACT CCG CCA TCA CCC GCG 1008
CCC GCT TCG AGG AGC TGA ACA TGG ACC TGT TCC GCA AGT GCA TGG ACC 1056
CGG TGG AGA AGG TCA TCC GCG ACG CCA AGA TGG ACA AGG GCC AGA TCC 1104
ACG AGG TGG TGC TGG TCG GCG GCT CCA CCC GTA TCC CCA AGG TCC AGA 1152
CCC TGC TCC AGG ACT TTT TCA ACG GCA AGG AGC TGT GCA AGA GCA TCA 1200
ACC CCG ACG AGG CCG TCG CCT ACG GCG CCG CCG TCC AGG CGG CCA TCC 1248
TGA ACG GCG AGA CGC ATG AGA AGG TGC AGG ACC TGC TGC TCC TGG ACG 1296
TCA TCC CCC TCT CCC TGG GCC TGG GCG CCG CGG GCG GCG TGA TGA CGA 1344
CGC TGA TCG CGC GCA ACA CCA CCA TCC CCA CCA AGA AGG AGC AGG TGT 1392
TCT CCA CCT ACT CCG ACA ACC AGC CCG GCG TGC TGA TTC AGG TGT ACG 1440
AGG GTG AGC GCA CGC GCA CCA AGG ACA ACA ACC TGC TGG GCA AGT TCG 1488
AGC TGA CGG GCA TCC CTC CCG CGC CGC GCG GCG TGC CCC AGA TCA CCG 1536
TCA CCT TTG ACG TGG ACG CCA ACG GCA TCC TGA ACG TCA GTG CGG AGG 1584
ACA AGA CGA CGG GGA TCA AGA ACA AGA TCA CCA TCA CCA ACG ACA AGG 1632
GGC GCC TGT CCA AGG AGG AGA TCG AGC GCA TGG TGC AGG AGG CGG AGA 1680
AGT ACA AGG GCG AGG ACG AGG CGC ACT CCA AGA AGG TGG AGG CGC GCA 1728
ACG GGC TGG AGA ACT ACG CCT ACT CCA TGC GCA ACA CGC TCA AGG AGG 1776
GCT CGG TGG CCG AGA AAC TGG AGG CCA GCG ACA AGG CCG CCA TGG AGG 1824
CCG CCA TCG ACA AGG CCA TCG AGT GGC TGG ACC ACA ACC AGC TGG CCG 1872
AGG AGG AGG AGA TCT CCC ACC AGC GCG AGG AGC TGG AGG GCG TGT GCA 1920
GCC CCA TCA TCT CCA AGC TGT ACG CGG CCG GCG GCG CCC CTG CAG GCG 1968
GCG CCC CCG CCG GCC CCG GCG CCC CCG AGG GCA CCG GCC CAG GCC CCA 2016
AGA TCG AGG AGG TCG ACT GAG CAG CGG TGT TGG TCG GTG GGG TCC AGG 2064
GAG TTG GCG CTT GAG CGA GCC CAA GGC GGT CGC CCC TGT CCA TGC TCC 2112
CGT CTC CCC AGC CTC CCC CCT GTC TAT TTG CTT CAT TGC TGC GTT CTG 2160
CGC TTT CCC CCT TTG CTC CCT CCT TCC CGT GCT GAG AAC CTT GAA AGG 2208
TGT CGT AAG CAT ACT CAC AGC CTA AGC AGC CAG GGA ACA AGG AGA GGG 2256
GAG GGA AAG ACG GCC GGG GAG GGG AGG ACG AGC CCG CTA TGC GAG GTA 2304
GTT GTA GAT CTG CGC GTC ACG CAC GTC GCG TTC CAA GTA CTC GCG CTC 2352
GAT CAG CGA CTC GAT CCG CTT CTT CAG GTC GCT CGC CTT GAT GGG GAA 2400
CTT GAG CTG TAC CAT GAG CTC GTT CAC CAG CAG CTT GTG CGA CAG CGT 2448
GCG CCG CGT CTT CAT GAT GCG CAC CAC GGC GGC GTC CAC CTG GTA CTG 2496
CCG GTC CTG CAG CAC CTG GTC GTT GGT CTT GGC GCT CTC CTC CGC GCT 2544
CTC GCG CAG CTG GAT GGA GTT GAT CTT CAC GCG GAA CAG GCG GCT GGT 2592
GTA GCC CTC GTT GTA GGT GAA GCG ATC GCC GTC CTC CAC CTC GCG CCC 2640
CTT GGG CTC CTT GAG GAG CAC CCT CTC CTT CCC ACA CGC CAG CGA CTG 2688
CAG GGT TCT TCG CAG CTC CCG GTC CTC GAT GTT GGT GGC GGC CGC CAG 2736
CTC CTC CAG GCT CAG CGT GTC TGT GCC TCG CTC AAA CTG CAT GAG CAC 2784
GGT CGC CTG GAA GAG CGA CAC GGA GAG CTC CTT GGA CCC TGC GCG GAA 2832
GCT GGC CCG CAA GAC GCA GGT GCC CAG CGA GTT GTA CCA GAC GAG TCG 2880
CCG CCC CGA GTG CTT TTT CAG GTA AAA GTC CTT GAA TGT CTG CTG GCT 2928
GGC GGT GAG GAC CTC GGG CAG GGA GCA CTC GTC GGC AGG GTA GGA GGG 2976
CCA GAA GCC CAG AGT CAG GAC GTG CAC GCC CAT CTC CTG CAC CGC GGC 3024
GCC CCC CAC CGC CAG CTG CCC AGA GTC AGG ACG TGC ACG C 		3064   

<210> SEQ ID NO: 74
<211> 763
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Leu	Ala	Ser	Thr	Tyr	Thr	Pro	Cys	Gly	Val	Arg	Gln	Val	Ala
1				5					10					15
Gly	Arg	Thr	Val	Ala	Val	Pro	Ser	Ser	Leu	Val	Ala	Pro	Val	Ala
				20					25					30
Val	Ala	Arg	Ser	Leu	Gly	Leu	Ala	Pro	Tyr	Val	Pro	Val	Cys	Glu
				35					40					45
Pro	Ser	Ala	Ala	Leu	Pro	Ala	Cys	Gln	Gln	Pro	Ser	Gly	Arg	Arg
				50					55					60
His	Val	Gln	Thr	Ala	Ala	Thr	Leu	Arg	Ala	Asp	Asn	Pro	Ser	Ser
				65					70					75
Val	Ala	Gln	Leu	Val	His	Gln	Asn	Gly	Lys	Gly	Met	Lys	Val	Ile
				80					85					90
Ile	Ala	Gly	Ala	Gly	Ile	Gly	Gly	Leu	Val	Leu	Ala	Val	Ala	Leu
				95					100					105
Leu	Lys	Gln	Gly	Phe	Gln	Val	Gln	Val	Phe	Glu	Arg	Asp	Leu	Thr
				110					115					120
Ala	Ile	Arg	Gly	Glu	Gly	Lys	Tyr	Arg	Gly	Pro	Ile	Gln	Val	Gln
				125					130					135
Ser	Asn	Ala	Leu	Ala	Ala	Leu	Glu	Ala	Ile	Asp	Pro	Glu	Val	Ala
				140					145					150
Ala	Glu	Val	Leu	Arg	Glu	Gly	Cys	Ile	Thr	Gly	Asp	Arg	Ile	Asn
				155					160					165
Gly	Leu	Cys	Asp	Gly	Leu	Thr	Gly	Glu	Trp	Tyr	Val	Lys	Phe	Asp
				170					175					180
Thr	Phe	His	Pro	Ala	Val	Ser	Lys	Gly	Leu	Pro	Val	Thr	Arg	Val
				185					190					195
Ile	Ser	Arg	Leu	Thr	Leu	Gln	Gln	Ile	Leu	Ala	Lys	Ala	Val	Glu
				200					205					210
Arg	Tyr	Gly	Gly	Pro	Gly	Thr	Ile	Gln	Asn	Gly	Cys	Asn	Val	Thr
				215					220					225
Glu	Phe	Thr	Glu	Arg	Arg	Asn	Asp	Thr	Thr	Gly	Asn	Asn	Glu	Val
				230					235					240
Thr	Val	Gln	Leu	Glu	Asp	Gly	Arg	Thr	Phe	Ala	Ala	Asp	Val	Leu
				245					250					255
Val	Gly	Ala	Asp	Gly	Ile	Trp	Ser	Lys	Ile	Arg	Lys	Gln	Leu	Ile
				260					265					270
Gly	Glu	Thr	Lys	Ala	Asn	Tyr	Ser	Gly	Tyr	Thr	Cys	Tyr	Thr	Gly
				275					280					285
Ile	Ser	Asp	Phe	Thr	Pro	Ala	Asp	Ile	Asp	Ile	Val	Gly	Tyr	Arg
				290					295					300
Val	Phe	Leu	Gly	Asn	Gly	Gln	Tyr	Phe	Val	Ser	Ser	Asp	Val	Gly
				305					310					315
Asn	Gly	Lys	Met	Gln	Trp	Tyr	Gly	Phe	His	Lys	Glu	Pro	Ser	Gly
				320					325					330
Gly	Thr	Asp	Pro	Glu	Gly	Ser	Arg	Lys	Ala	Arg	Leu	Leu	Gln	Ile
				335					340					345
Phe	Gly	His	Trp	Asn	Asp	Asn	Val	Val	Asp	Leu	Ile	Lys	Ala	Thr
				350					355					360
Pro	Glu	Glu	Asp	Val	Leu	Arg	Arg	Asp	Ile	Phe	Asp	Arg	Pro	Pro
				365					370					375
Ile	Phe	Thr	Trp	Ser	Lys	Gly	Arg	Val	Ala	Leu	Leu	Gly	Asp	Ser
				380					385					390
Ala	His	Ala	Met	Gln	Pro	Asn	Leu	Gly	Gln	Gly	Gly	Cys	Met	Ala
				395					400					405
Ile	Glu	Asp	Ala	Tyr	Glu	Leu	Ala	Ile	Asp	Leu	Ser	Arg	Ala	Val
				410					415					420
Ser	Asp	Lys	Ala	Gly	Asn	Ala	Ala	Ala	Val	Asp	Val	Glu	Gly	Val
				425					430					435
Leu	Arg	Ser	Tyr	Gln	Asp	Ser	Arg	Ile	Leu	Arg	Val	Ser	Ala	Ile
				440					445					450
His	Gly	Met	Ala	Gly	Met	Ala	Ala	Phe	Met	Ala	Ser	Thr	Tyr	Lys
				455					460					465
Cys	Tyr	Leu	Gly	Glu	Gly	Trp	Ser	Lys	Trp	Val	Glu	Gly	Leu	Arg
				470					475					480
Ile	Pro	His	Pro	Gly	Arg	Val	Val	Gly	Arg	Leu	Val	Met	Leu	Leu
				485					490					495
Thr	Met	Pro	Ser	Val	Leu	Glu	Trp	Val	Leu	Gly	Gly	Asn	Thr	Asp
				500					505					510
His	Val	Ala	Pro	His	Arg	Thr	Ser	Tyr	Cys	Ser	Leu	Gly	Asp	Lys
				515					520					525
Pro	Lys	Ala	Phe	Pro	Glu	Ser	Arg	Phe	Pro	Glu	Phe	Met	Asn	Asn
				530					535					540
Asp	Ala	Ser	Ile	Ile	Arg	Ser	Ser	His	Ala	Asp	Trp	Leu	Leu	Val
				545					550					555
Ala	Glu	Arg	Asp	Ala	Ala	Thr	Ala	Ala	Ala	Ala	Asn	Val	Asn	Ala
				560					565					570
Ala	Thr	Gly	Ser	Ser	Ala	Ala	Ala	Ala	Ala	Ala	Ala	Asp	Val	Asn
				575					580					585
Ser	Ser	Cys	Gln	Cys	Lys	Gly	Ile	Tyr	Met	Ala	Asp	Ser	Ala	Ala
				590					595					600
Leu	Val	Gly	Arg	Cys	Gly	Ala	Thr	Ser	Arg	Pro	Ala	Leu	Ala	Val
				605					610					615
Asp	Asp	Val	His	Val	Ala	Glu	Ser	His	Ala	Gln	Val	Trp	Arg	Gly
				620					625					630
Leu	Ala	Gly	Leu	Pro	Pro	Ser	Ser	Ser	Ser	Ala	Ser	Thr	Ala	Ala
				635					640					645
Ala	Ser	Ala	Ser	Ala	Ala	Ser	Ser	Ala	Ala	Ser	Gly	Thr	Ala	Ser
				650					655					660
Thr	Leu	Gly	Ser	Ser	Glu	Gly	Tyr	Trp	Leu	Arg	Asp	Leu	Gly	Ser
				665					670					675
Gly	Arg	Gly	Thr	Trp	Val	Asn	Gly	Lys	Arg	Leu	Pro	Asp	Gly	Ala
				680					685					690
Thr	Val	Gln	Leu	Trp	Pro	Gly	Asp	Ala	Val	Glu	Phe	Gly	Arg	His
				695					700					705
Pro	Ser	His	Glu	Val	Phe	Lys	Val	Lys	Met	Gln	His	Val	Thr	Leu
				710					715					720
Arg	Ser	Asp	Glu	Leu	Ser	Gly	Gln	Ala	Tyr	Thr	Thr	Leu	Met	Val
				725					730					735
Gly	Lys	Ile	Arg	Asn	Asn	Asp	Tyr	Val	Met	Pro	Glu	Ser	Arg	Pro
				740					745					750
Asp	Gly	Gly	Ser	Gln	Gln	Pro	Gly	Arg	Leu	Val	Thr	Ala		
				755					760					

<210> SEQ ID NO: 81
<211> 763
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Leu	Ala	Ser	Thr	Tyr	Thr	Pro	Cys	Gly	Val	Arg	Gln	Val	Ala
1				5					10					15
Gly	Arg	Thr	Val	Ala	Val	Pro	Ser	Ser	Leu	Val	Ala	Pro	Val	Ala
				20					25					30
Val	Ala	Arg	Ser	Leu	Gly	Leu	Ala	Pro	Tyr	Val	Pro	Val	Cys	Glu
				35					40					45
Pro	Ser	Ala	Ala	Leu	Pro	Ala	Cys	Gln	Gln	Pro	Ser	Gly	Arg	Arg
				50					55					60
His	Val	Gln	Thr	Ala	Ala	Thr	Leu	Arg	Ala	Asp	Asn	Pro	Ser	Ser
				65					70					75
Val	Ala	Gln	Leu	Val	His	Gln	Asn	Gly	Lys	Gly	Met	Lys	Val	Ile
				80					85					90
Ile	Ala	Gly	Ala	Gly	Ile	Gly	Gly	Leu	Val	Leu	Ala	Val	Ala	Leu
				95					100					105
Leu	Lys	Gln	Gly	Phe	Gln	Val	Gln	Val	Phe	Glu	Arg	Asp	Leu	Thr
				110					115					120
Ala	Ile	Arg	Gly	Glu	Gly	Lys	Tyr	Arg	Gly	Pro	Ile	Gln	Val	Gln
				125					130					135
Ser	Asn	Ala	Leu	Ala	Ala	Leu	Glu	Ala	Ile	Asp	Pro	Glu	Val	Ala
				140					145					150
Ala	Glu	Val	Leu	Arg	Glu	Gly	Cys	Ile	Thr	Gly	Asp	Arg	Ile	Asn
				155					160					165
Gly	Leu	Cys	Asp	Gly	Leu	Thr	Gly	Glu	Trp	Tyr	Val	Lys	Phe	Asp
				170					175					180
Thr	Phe	His	Pro	Ala	Val	Ser	Lys	Gly	Leu	Pro	Val	Thr	Arg	Val
				185					190					195
Ile	Ser	Arg	Leu	Thr	Leu	Gln	Gln	Ile	Leu	Ala	Lys	Ala	Val	Glu
				200					205					210
Arg	Tyr	Gly	Gly	Pro	Gly	Thr	Ile	Gln	Asn	Gly	Cys	Asn	Val	Thr
				215					220					225
Glu	Phe	Thr	Glu	Arg	Arg	Asn	Asp	Thr	Thr	Gly	Asn	Asn	Glu	Val
				230					235					240
Thr	Val	Gln	Leu	Glu	Asp	Gly	Arg	Thr	Phe	Ala	Ala	Asp	Val	Leu
				245					250					255
Val	Gly	Ala	Asp	Gly	Ile	Trp	Ser	Lys	Ile	Arg	Lys	Gln	Leu	Ile
				260					265					270
Gly	Glu	Thr	Lys	Ala	Asn	Tyr	Ser	Gly	Tyr	Thr	Cys	Tyr	Thr	Gly
				275					280					285
Ile	Ser	Asp	Phe	Thr	Pro	Ala	Asp	Ile	Asp	Ile	Val	Gly	Tyr	Arg
				290					295					300
Val	Phe	Leu	Gly	Asn	Gly	Gln	Tyr	Phe	Val	Ser	Ser	Asp	Val	Gly
				305					310					315
Asn	Gly	Lys	Met	Gln	Trp	Tyr	Gly	Phe	His	Lys	Glu	Pro	Ser	Gly
				320					325					330
Gly	Thr	Asp	Pro	Glu	Gly	Ser	Arg	Lys	Ala	Arg	Leu	Leu	Gln	Ile
				335					340					345
Phe	Gly	His	Trp	Asn	Asp	Asn	Val	Val	Asp	Leu	Ile	Lys	Ala	Thr
				350					355					360
Pro	Glu	Glu	Asp	Val	Leu	Arg	Arg	Asp	Ile	Phe	Asp	Arg	Pro	Pro
				365					370					375
Ile	Phe	Thr	Trp	Ser	Lys	Gly	Arg	Val	Ala	Leu	Leu	Gly	Asp	Ser
				380					385					390
Ala	His	Ala	Met	Gln	Pro	Asn	Leu	Gly	Gln	Gly	Gly	Cys	Met	Ala
				395					400					405
Ile	Glu	Asp	Ala	Tyr	Glu	Leu	Ala	Ile	Asp	Leu	Ser	Arg	Ala	Val
				410					415					420
Ser	Asp	Lys	Ala	Gly	Asn	Ala	Ala	Ala	Val	Asp	Val	Glu	Gly	Val
				425					430					435
Leu	Arg	Ser	Tyr	Gln	Asp	Ser	Arg	Ile	Leu	Arg	Val	Ser	Ala	Ile
				440					445					450
His	Gly	Met	Ala	Gly	Met	Ala	Ala	Phe	Met	Ala	Ser	Thr	Tyr	Lys
				455					460					465
Cys	Tyr	Leu	Gly	Glu	Gly	Trp	Ser	Lys	Trp	Val	Glu	Gly	Leu	Arg
				470					475					480
Ile	Pro	His	Pro	Gly	Arg	Val	Val	Gly	Arg	Leu	Val	Met	Leu	Leu
				485					490					495
Thr	Met	Pro	Ser	Val	Leu	Glu	Trp	Val	Leu	Gly	Gly	Asn	Thr	Asp
				500					505					510
His	Val	Ala	Pro	His	Arg	Thr	Ser	Tyr	Cys	Ser	Leu	Gly	Asp	Lys
				515					520					525
Pro	Lys	Ala	Phe	Pro	Glu	Ser	Arg	Phe	Pro	Glu	Phe	Met	Asn	Asn
				530					535					540
Asp	Ala	Ser	Ile	Ile	Arg	Ser	Ser	His	Ala	Asp	Trp	Leu	Leu	Val
				545					550					555
Ala	Glu	Arg	Asp	Ala	Ala	Thr	Ala	Ala	Ala	Ala	Asn	Val	Asn	Ala
				560					565					570
Ala	Thr	Gly	Ser	Ser	Ala	Ala	Ala	Ala	Ala	Ala	Ala	Asp	Val	Asn
				575					580					585
Ser	Ser	Cys	Gln	Cys	Lys	Gly	Ile	Tyr	Met	Ala	Asp	Ser	Ala	Ala
				590					595					600
Leu	Val	Gly	Arg	Cys	Gly	Ala	Thr	Ser	Arg	Pro	Ala	Leu	Ala	Val
				605					610					615
Asp	Asp	Val	His	Val	Ala	Glu	Ser	His	Ala	Gln	Val	Trp	Arg	Gly
				620					625					630
Leu	Ala	Gly	Leu	Pro	Pro	Ser	Ser	Ser	Ser	Ala	Ser	Thr	Ala	Ala
				635					640					645
Ala	Ser	Ala	Ser	Ala	Ala	Ser	Ser	Ala	Ala	Ser	Gly	Thr	Ala	Ser
				650					655					660
Thr	Leu	Gly	Ser	Ser	Glu	Gly	Tyr	Trp	Leu	Arg	Asp	Leu	Gly	Ser
				665					670					675
Gly	Arg	Gly	Thr	Trp	Val	Asn	Gly	Lys	Arg	Leu	Pro	Asp	Gly	Ala
				680					685					690
Thr	Val	Gln	Leu	Trp	Pro	Gly	Asp	Ala	Val	Glu	Phe	Gly	Arg	His
				695					700					705
Pro	Ser	His	Glu	Val	Phe	Lys	Val	Lys	Met	Gln	His	Val	Thr	Leu
				710					715					720
Arg	Ser	Asp	Glu	Leu	Ser	Gly	Gln	Ala	Tyr	Thr	Thr	Leu	Met	Val
				725					730					735
Gly	Lys	Ile	Arg	Asn	Asn	Asp	Tyr	Val	Met	Pro	Glu	Ser	Arg	Pro
				740					745					750
Asp	Gly	Gly	Ser	Gln	Gln	Pro	Gly	Arg	Leu	Val	Thr	Ala		
				755					760					
       
<210> SEQ ID NO: 82
<211> 1379 
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CAC GGC GAG TCT CAT GGA GCT TAC TAC ATC AAT TGT GAC CCT AGT ACA 48
GCG TAA CAC GCA CTA GGT AGA CGA TTA GAG ATA GGC CTT TCC TGC CCA 96
TGG CGT CGA GTA CAA CCA CCA CGT CTA CCC GCA CGT CGC CCC ACC CAT 144
GCC TAC CGA AGC TCC CTC GTG GGC TAC ATG GTG CCC GGC ACA GGT TTA 192
CGA GGC ATT CAG CGC CCT TGT TAG ACC ATT GCA CGC GTA AAC GGA ACC 240
TCA CGC GGG CCA TGA GGG CCA CCA TGA CCC CGT CTG TCG CCG AGG CCT 288
CGG CGA CCC AAG ATC CGA TGC ATG TGA TCA TAG CCG GGG CTG GCA TTG 336
GCG GGC TAG TGC TGG CCG TGG GAC TGC TCA AGG CGG GAT TCA GGG TCA 384
CGG TCC TCG AGC GTG ACC TGA CCG CCA TTC GTG GCG AGG GGA AGT ACC 432
GCG GAC CCA TCC AGC TGC AGA GCA ATG CGC TGG CAG CGC TGG AGG CCC 480
TGG ATG AGC ATG TGG GCC AGA GGA TCC TGG ACG AGG GCT GCA TCA CGG 528
GGG ACA GGA TCA ACG GAC TCT GTG ACG GCA TCA CAG GGG ACT GGT ATG 576
TCA AGT TTG ACA CCT TCC ACC CTG CTG TGG GCC GCG GCC TGC CAG TCA 624
CTA GAG TCA TCA GTC GGA CCA GAT TGC AGG AGA TCT TGG CCG AGC GAT 672
GCT GCG AGC TGG GCG GAC CTG ATG CCA TCT CAA ACA ATG CCA ACG TCG 720
TTG ACT TTA TTG ACG AGC GGG ATG CTG CGG GTC ACG TCA CTG CCA TCC 768
TTT CGG ATG GAC GGC GGG TGA AGG GGG ACT TGC TGG TCG GGG CAG ACG 816
GCA TTT GGT CAA AGG TCC GAT CCA AGC TGC TAG GCG ATT CCA AGC CAA 864
ACT ACT CTA ACT ACA CGT GCT ACA CTG GCA TAG CTG ATT TCA CCC CCG 912
GTG ACA TTG ATA CAG TGG GAT ACC GGG TGT TCC TGG GCA ACG GCA AGT 960
ACT TTG TGT CCA GCG ATG TGG GCG GCG GCA AGA TGC AGT GGT ATG CCT 1008
TCC ACA AAG AGC CAG CGG GGG GCT CAG ACC CGC CAG GTC AGC GCC AGG 1056
AAC GAC TCA TGC GTA TCT TTG GCT CCT GGT CTG ACA ACG TGA CGG ATC 1104
TCA TCA TGG CCA CTC GAG AGG ACG ACA TTC TTC GGC GCG ACA TCT TCG 1152
ACC GGC CCC CCA CCA TGA CGT GGT ACA AGG GCC GAG TGG TTC TGC TGG 1200
GTG ATT CGG CAC ATG CCA TGC AGC CCA ACC TGG GTC AGG GAG GGG GCA 1248
TGG CGA TCG AGG ACA GCT TCC AGA TGG TGC AGG AGC TGC GGT CCA GCG 1296
GCA GCC GCT CCG TGG CGC ACA TCC GCG GGC AGT ACC AGC TGC GGC GCA 1344
TGC TGC GCT CCT CCG TGG TGC ATG GTA TGA CGG GC 			1379

<210> SEQ ID NO: 83
<211> 856
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Leu	Leu	Phe	Ser	Gly	Ala	Leu	Arg	Asn	Trp	Arg	Lys	Thr	Arg
1				5					10					15
Lys	Gly	Pro	Asp	Ala	Ser	Ala	Ser	Gly	Ser	Ser	Val	Ser	Glu	Ala
				20					25					30
His	Ile	Lys	Ala	Val	Thr	Val	Val	Lys	Gly	Ala	Lys	Thr	Thr	Arg
				35					40					45
Thr	Thr	Thr	Ala	Gly	Gly	Val	Gly	His	Ala	Gly	Ala	Ala	Ser	Asn
				50					55					60
Ala	Pro	Val	Pro	Pro	Pro	Arg	Arg	Glu	Ala	Arg	Ala	Ser	Ala	Arg
				65					70					75
Ser	Ala	Lys	Ala	Pro	Leu	Lys	Leu	His	Lys	Val	Val	Glu	Leu	Cys
				80					85					90
Ala	Thr	Pro	Ser	Ala	Leu	Ala	Glu	Gly	Pro	Ala	Ser	Tyr	Glu	Val
				95					100					105
Val	Leu	Ser	Asp	Gly	Thr	Asp	Val	Ser	Asp	Pro	Glu	Glu	Glu	Gly
				110					115					120
His	Lys	Gln	Gln	Ala	His	Gln	Gln	His	Gln	Ser	His	Ala	Thr	Ala
				125					130					135
Asn	Ser	Lys	Pro	Arg	Thr	Asp	Ala	Ala	Ala	Pro	Pro	Pro	Arg	Arg
				140					145					150
Ser	Arg	Ala	Pro	Ala	Pro	Ala	Ala	Ala	Leu	Ser	Ser	Thr	Val	Leu
				155					160					165
Ala	Ser	Val	Arg	Arg	Arg	Thr	Lys	Gln	Ala	Leu	Pro	Pro	Ala	Val
				170					175					180
Ala	Ser	Leu	Ala	Ala	Ile	Pro	Pro	Gln	Arg	Ser	Val	Pro	Ala	Ala
				185					190					195
Ala	Ala	Ala	Ala	Ala	Val	Ala	Ala	Gly	Pro	Gly	Ala	Val	Lys	Ala
				200					205					210
Gln	Ala	Gly	Gln	Thr	Arg	Pro	Arg	His	Gly	Gly	Asn	Gly	Trp	Ser
				215					220					225
Glu	Ala	Val	Ala	Arg	Leu	Thr	Ser	Gly	Ala	Ala	Ala	Arg	Pro	Val
				230					235					240
Leu	Thr	Val	Leu	Leu	Leu	Leu	Ala	Ala	Gly	Ala	Leu	Met	Ala	Ser
				245					250					255
Gly	Ser	Gln	Arg	Leu	Val	Thr	Ala	Phe	Arg	Gln	Pro	Val	Pro	Arg
				260					265					270
Gly	Ser	His	Arg	Leu	Gly	Gly	Ser	Ala	Ser	Pro	Leu	Thr	Ser	Phe
				275					280					285
Arg	Pro	Val	Ala	Ala	Ala	Ala	Ala	Ala	Ala	Gly	Ser	Gly	Ser	Ser
				290					295					300
Pro	Gly	Arg	Gly	Ala	Arg	Gly	Ala	Ala	Ser	Ala	Ser	Val	Trp	Gly
				305					310					315
Thr	Ala	Ala	Trp	Ile	Gly	Gly	Ala	Met	Arg	Leu	Ala	Ser	Ser	Arg
				320					325					330
Ser	Leu	Ala	Ala	Arg	Ala	Val	Pro	Ala	Pro	Ser	Thr	Ala	Ser	Ala
				335					340					345
Thr	Gly	Ser	Ala	Pro	Gly	Met	Pro	Gly	Arg	Asn	Gly	Gly	Ala	Ala
				350					355					360
Leu	Ala	Thr	Arg	Ser	Gly	Ala	Gly	Trp	Trp	Ser	Ala	Ala	Ala	Ala
				365					370					375
Val	Arg	Gly	Ser	Gly	Ser	Gly	Ser	Ser	Arg	Ser	Pro	Val	Ala	Ala
				380					385					390
Ala	Ala	Ala	Ala	Ala	Ala	Asp	Ala	Ala	Gly	Gly	Ala	Ala	Ala	Ala
				395					400					405
Pro	Glu	His	Ser	Tyr	Glu	Trp	Arg	Glu	Ser	Ala	Ala	Val	Arg	Val
				410					415					420
Ile	Ser	Ile	Val	Ser	Asp	Ala	Ala	Ser	Pro	Tyr	Arg	Thr	Ser	Trp
				425					430					435
Asp	Ala	Leu	Ala	Glu	His	Thr	Ala	Gln	Arg	Leu	Glu	Trp	Thr	Asp
				440					445					450
Pro	Ser	Tyr	Gln	Met	Ile	Val	Phe	Arg	Gln	Asp	Gln	Leu	Ala	Ser
				455					460					465
Lys	Pro	Ala	Thr	Ala	Thr	His	Cys	Cys	Thr	Leu	Gln	Ser	Ala	Phe
				470					475					480
Thr	Gln	Thr	Pro	Ser	Ala	Ser	Leu	Ser	His	Leu	Gln	Thr	Cys	Ser
				485					490					495
Arg	Leu	Leu	Asn	Leu	Thr	Met	Cys	Phe	His	Gln	Met	Pro	Arg	Ser
				500					505					510
Leu	Leu	Lys	Leu	Thr	Trp	Thr	Pro	Asp	Gly	Ala	Gly	Leu	Ala	Val
				515					520					525
Trp	Asp	Thr	Val	Gln	Leu	Leu	Leu	Gly	Arg	His	Asp	Ser	Asp	Asn
				530					535					540
Phe	Leu	Phe	Val	Phe	Leu	Val	Leu	Val	Asn	Gln	Tyr	Val	Thr	Thr
				545					550					555
Val	Arg	Gln	Val	Asp	Thr	Thr	Lys	Gly	Phe	Asp	Leu	Thr	Ser	Ile
				560					565					570
Ile	Cys	Met	Ile	Lys	Asn	Cys	Gly	Ser	Lys	Val	Val	Gly	Cys	Val
				575					580					585
Gln	Asp	Pro	Thr	Cys	Lys	Thr	Ala	Leu	Asp	Cys	Leu	Asn	Gly	Cys
				590					595					600
Thr	Phe	Asn	Asp	Gln	Val	Cys	Gln	Tyr	Arg	Cys	Ile	Val	Ser	Tyr
				605					610					615
Glu	Ser	Pro	Leu	Leu	Glu	Gln	Phe	Ser	Leu	Cys	Ile	Leu	Gln	Leu
				620					625					630
His	Asn	Cys	Arg	Asn	Leu	Asp	Ala	Lys	Pro	Pro	Ala	Leu	Pro	Asp
				635					640					645
Pro	Ala	Pro	Met	Thr	Ser	Phe	Arg	Gly	Ala	Ala	Leu	Thr	His	Glu
				650					655					660
Ala	Ala	Glu	Asp	Leu	Phe	Ile	Gly	Trp	Leu	Asp	Gln	Pro	Gly	Gln
				665					670					675
Gly	Ala	Pro	Ala	Gly	Gln	His	Leu	Gly	Gln	Met	Pro	Gly	Lys	Arg
				680					685					690
Tyr	Ser	Trp	Leu	Val	Ala	Ala	Gly	Lys	Asn	Pro	Ala	Tyr	Asp	Tyr
				695					700					705
Phe	Pro	Cys	Gln	His	Gln	Leu	Tyr	Tyr	Arg	Gly	Lys	Gly	Arg	Gly
				710					715					720
Gln	Met	Trp	Tyr	Glu	Pro	Ile	Phe	Lys	Ala	Ile	Thr	Leu	Asp	Gly
				725					730					735
Arg	Glu	Val	Trp	Arg	Arg	Arg	Val	Tyr	Arg	Val	Arg	Arg	Ala	Lys
				740					745					750
Val	Pro	Gly	Thr	Phe	Tyr	Leu	Ser	Val	Leu	Asp	Asn	Gly	Val	Thr
				755					760					765
Ser	Asn	Glu	Tyr	Trp	Arg	Ile	Val	Asp	Cys	Asp	Glu	Asn	Leu	Asp
				770					775					780
Trp	Cys	Leu	Phe	Tyr	Tyr	Ser	Gly	Ala	Ala	Ser	Thr	Ala	Gly	Leu
				785					790					795
Ala	Tyr	Ser	Gly	Ala	Val	Leu	Gly	Thr	Pro	Asp	Gly	Gly	Met	Pro
				800					805					810
Gly	Pro	Gln	His	Thr	Gln	Arg	Leu	His	Thr	Ala	Leu	Arg	Arg	Ala
				815					820					825
Gly	Ile	Glu	Pro	Trp	Glu	Leu	Ser	Phe	Val	Asp	Asn	Ser	Lys	Cys
				830					835					840
Ala	Asp	Ala	Pro	Leu	Gln	Ile	Thr	Gly	Pro	Thr	Pro	Ala	Pro	Val
				845					850					855
Val														

<210> SEQ ID NO: 84
<211> 1377
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
GTG GGA AAG GGG CTG CAG GAA ATG ATT CAA AAT CAT GTT CCT GGG TAC 48
AAG ATT GAA GGA TCA ATG CTG GGA TCC TTT ATG AGC TGC CTC ATG CTA 96
GCT GTC TGC CTT TTT ATC GAC CGC CCC TCC TAT TTG TAC AGC GGC ACT 144
CCC CAG CCT GGC AGC AGA CAT GGT GAA GAC CGG GAC CTG TCT CCT GCA 192
AAA GTG TCA GCC GGA GCT GGC GAG GTG CTT GAT AGA TGG GGA GTG CGT 240
GGA GAA CCT CGT CTG CCT CCA GGT CTG CAA TGG GAA GGA CGA TGA GAG 288
CGG GTG CCA AGT AAA ATG TGC CGA TCG GTA CAA GAA TGA CAT TAT CGA 336
CAA GTT CAA CGC CTG CGC CAT TAC CTC TGG AGG CTG TGT GCC ACA GAA 384
AAT TGA GAA GGA CAC ATG GAA GGC CCC GCC GGC CTC GGC CAT CGA TCG 432
CGC ATT TGA CCC CGA GCT GTT CCA GGG CCG CTG GTA TAT CAC CGC AGG 480
CCT GAA CCC CCT GTT CGA CGT GTT TGA CTG CCA AGA ACA TTT CTT TGC 528
AAG CCC TAG CCC AGG ATC CTT GTA CGG CAA GAT CCG CTG GCG CAT CCC 576
CAA GGG CAG CTC TGA CTT CAT TGA GCG CTC CAC GAT GCA GAG GTT TGT 624
GCA GGA CCC AGG GAA CCC TGG TGT CCT GTA CAA TCA CGA TAA TGA GTA 672
CCT GCA CTA TGA GGA CGA CTG GTA TGT TGT CGG CTG GGA GCC TGA TTC 720
ATA TGC CTT CAT TGT GTA CAA GGG ACA GAA CGA TGC CTG GAA AGG GTA 768
TGG CGG AGC CAC TGT GTA CAC CAG GGA CTC CTC ATT CCC ACC CGA GCT 816
CAC AAA CAA GAT GCG AGA GCT GGC GAG CCA AGC CGG ACT GAA ATG GGA 864
GGA GTT CAA GTT GAC AGA CAA CTC GTG TGG ACC CCA CCC ACC CCA GCG 912
AGG TGG TAT GAC AGA CTT CGA GCG GGA GAT GGA TGA CCT GGC AAG GGA 960
CGA AGC TCG ACA GGG TCT TGC ATC CTT CAG TAG GGG ACT CAC AGT GAT 1008
CGA ACA GCG TGC TGA GAG AGC AGA GAG AGA CAT TGC ACA GAG CGT GGT 1056
GGG ATT GGA GAA GAC GCT GCA GGA CGA ATT CTA CGA AGC TGA AAA GCA 1104
GCT TGC AAG CAT TGA GAA ACA ATT CTC TGC CAG CGG TGG GTT TGG TGC 1152
ATG GCT GCA GAA TAT CTT CAG GTT CTG AAG GCT CCA ACA TTG GCC CGT 1200
GTC TCC CTG CAA GAT CCT ATT GGA CGT GTA CCC GTA GCT CGA TTG AGC 1248
TTT TCC ATA TAC ATG TGT GTG TTC AGG ACC ACT TCC CAG CTC CCC CTG 1296
TAA GCC AAT CCG CCC GCA CCC CCC GCG GCG CGC AGC GGT GTG GTT CAT 1344
CCG GCT AGG TGC GCG CAT GCA CAA GCA CCA CTG 			1377

<210> SEQ ID NO: 85
<211> 227
<212> Amino Acid Sequence
<213> Chlorella variabilis

Met	Gly	Ala	Ala	Ala	Arg	His	Ala	Leu	Leu	Val	Gln	Asp	Phe	Ala
1				5					10					15
Ala	Leu	Gln	Leu	Glu	Glu	Ile	Glu	Lys	Asn	Ile	Ala	Ser	Arg	Arg
				20					25					30
Asn	Lys	Ile	Phe	Leu	Leu	Met	Glu	Glu	Val	Arg	Arg	Leu	Arg	Ile
				35					40					45
Gln	Leu	Arg	Leu	Arg	Gly	Val	Ala	Glu	Glu	Ala	Thr	Pro	Glu	Glu
				50					55					60
Glu	Tyr	Pro	Ser	Ser	Ile	Pro	Phe	Phe	Pro	Pro	Ile	Asn	Glu	Lys
				65					70					75
Thr	Ile	Lys	Met	Tyr	Thr	Arg	Phe	Tyr	Ala	Ile	Thr	Val	Ala	Gly
				80					85					90
Ile	Ile	Thr	Phe	Gly	Gly	Leu	Val	Ala	Pro	Ile	Leu	Glu	Val	Arg
				95					100					105
Leu	Gly	Ile	Gly	Gly	Ser	Ser	Tyr	Phe	Asp	Phe	Ile	Arg	Ser	Leu
				110					115					120
His	Leu	Pro	Thr	Gln	Leu	Ala	Gln	Val	Asp	Pro	Ile	Val	Ala	Ser
				125					130					135
Phe	Cys	Gly	Gly	Gly	Val	Gly	Val	Leu	Thr	Ala	Leu	Leu	Ile	Val
				140					145					150
Glu	Leu	Asn	Asn	Ser	Lys	Met	Gln	Glu	Lys	Arg	Arg	Cys	Ile	Tyr
				155					160					165
Cys	Glu	Gly	Ser	Gly	Tyr	Leu	Thr	Cys	Gly	Asn	Cys	Val	Gly	Thr
				170					175					180
Gly	Val	Ser	Gly	Gly	Glu	Gly	Ala	Met	Cys	Ala	Asn	Cys	Ala	Gly
				185					190					195
Thr	Gly	Lys	Val	Met	Cys	Thr	Ser	Cys	Leu	Cys	Thr	Gly	Lys	Lys
				200					205					210
Leu	Ala	Thr	Glu	His	Asp	Pro	Arg	Val	Asp	Pro	Phe	Thr	Leu	Gly
				215					220					225
Met	Glu													

<210> SEQ ID NO: 86
<211> 647
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CAT GGT ACA TGG AAC TGG GTG TGG GGT CAC CCC CCC CGC CCG CAG ATC 48
CCA TGC AGC GCT GAG TAG AGG GCT AGC ACA TGA GTC TGC TTG GAC AGC 96
TTG AAG TCC TCA CAC CCA GTT CCA TGT ACC ATG CGT TTT CCG GGT ACC 144
CCT CCA TAC AAA GCA GGG CGA GAT CGC AAA CGG TAA TGT CAC TAA CAA 192
TTC ACC CTG ACG CGT TGT ACA TCC GCC CTG GCC CAG CAA GCA GCC CCG 240
TGC CCT TGC CCA TTG TTC CAG TGA CAA ACA CTG TAG CTC TTA CAT GTT 288
GTT GTT CGT CGC AGA CCA GGC AGA CGG TTC AAG AAG CTC CGC AAA GTA 336
TTG GCA AGC GCA TGC TCT TAC TAG AAG GGG TTG AAG GGG TCA ATA CGT 384
GGA TCA TGC TCG GTG GCC AGC TTC TTG CCT GTG CAC AGG CAG GAG GTG 432
CAC ATG ACC TTG CCC GTT CCC GCG CAG ATG GCG CAG GCC CCC CCC CCG 480
GCA ACC TTG GTG TTG CTG CCG GCC CCC TCA CAT GCG CCG CAG GCC AGG 528
TAC CCT GTC CCC TCG CAG TAC AGG CAG CGC TGC TTG TTG TGC ACT TTA 576
GCG TTG TTG ATC TCC ACC AGC AGC AGG GCG GAG AGA ACG CCG ACC GCG 624
CCC CCA CAG AAG GAG GCC ACA AC        				647

<210> SEQ ID NO: 87
<211> 205
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Leu	Ala	Ser	Arg	Thr	Ser	Ser	Ala	Ala	Val	Val	Gly	Arg
1				5					10					15
Ser	Thr	Arg	Ser	Ala	Ala	Val	Val	Pro	Val	Arg	Ser	Ile	Ala	Ser
				20					25					30
Arg	Cys	Gln	Ala	Ala	Arg	Pro	Ala	Arg	Arg	Ala	Ser	Val	Ala	Val
				35					40					45
Arg	Ala	Ser	Asp	Glu	Asn	Gly	Ser	Val	Ser	Val	Arg	Arg	Ala	Pro
				50					55					60
Tyr	Ala	Glu	Leu	Glu	Ser	Ile	Gln	Cys	Asp	Leu	Ser	Ala	Phe	Pro
				65					70					75
Gly	Val	Lys	Phe	Phe	Arg	Ile	Glu	Ala	Ile	Phe	Arg	Pro	Trp	Arg
				80					85					90
Leu	Pro	Phe	Val	Ile	Asp	Thr	Leu	Ser	Lys	Tyr	Gly	Ile	Arg	Gly
				95					100					105
Leu	Thr	Asn	Thr	Pro	Val	Lys	Gly	Val	Gly	Val	Gln	Gly	Gly	Ser
				110					115					120
Arg	Glu	Arg	Tyr	Ala	Gly	Thr	Glu	Phe	Gly	Pro	Ser	Asn	Leu	Val
				125					130					135
Asp	Lys	Glu	Lys	Leu	Asp	Ile	Val	Val	Ser	Arg	Ala	Gln	Val	Asp
				140					145					150
Ala	Val	Val	Arg	Leu	Val	Ala	Ala	Ser	Ala	Tyr	Thr	Gly	Glu	Ile
				155					160					165
Gly	Asp	Gly	Lys	Ile	Phe	Val	His	Pro	Val	Ala	Glu	Val	Val	Arg
				170					175					180
Ile	Arg	Thr	Ala	Glu	Thr	Gly	Leu	Glu	Ala	Glu	Lys	Met	Glu	Gly
				185					190					195
Gly	Met	Glu	Asp	Met	Met	Lys	Lys	Lys	Lys					
				200					205					

<210> SEQ ID NO: 88
<211> 901
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TCA AGA CAT CAT ACC TGA CAA AGT GAG GGA TCA GAC AGA GGT ACA CAG 48
CTT TTC GGC AAA CTT GCC ATG GCA ATT CTT GTT TGC CCT CGC GTG GTC 96
CTG GCG CCT GCC AAA TCC AAC CCC AGG TCG GCA GCC TCC TTC ATT CGG 144
TGT CCA GTC CAC ACA CCC CAA CGA GTC CAC CGT GTC GAA GGC AGA GCT 192
GGT GGC AAT GGG TCG GGT TTG GCC ACC GAA GCA ACA CCC TAC TCG AAG 240
CTG GAG TCC ATC AGA TGC GAC CTG TCT AAG TTT CCT GAC ATC GAT TTC 288
TTC CGT GTG GAG GCC ATC ATT CGT CCC TGG AGA CTT GAA AAG GTG ACG 336
AGG GAG CTC TCT GCT GAA GGC ATC TGC GGC ATG ACA GTG TCG CAA GTC 384
CGA GGG GCG GGG GTG CAA GGC GGG CAC AAG GAA AGA TAT GCT GGG ACA 432
GAG TAT GGT GGC AAG ACA AAC TTC CTG GTG GAC AAG ACC CGC TTG GAC 480
ATT GTT GTT GTC AGG TCT CAG GTG GAC AAG GTG ATC CAG ACA ATT GCC 528
AGC ACC ACG TAC ACT GGG GAG ATC GGG GAC GGC AAG ATT TTT GTG CAC 576
CCT GTG GCC GAT GTG ATC CGA GTG AGG ACG GGG GAG ACG GGG GCA ATC 624
GCA GAG CGG ATG GAG GGG GGC ATG TCG GAC AGG ACT TCG TGA GGC GGC 672
TGA GGT TGC GAT TCT CAA TAT TGC AGT GCT GCT GTA CTG CTA CAA GCT 720
GAT TGC GTC TCC AGC AAG ACG CAC ATT CCC TGT GCA TGG CCT TCA CGC 768
TGT CCC TTG GTG AAC GCT GGT GCG TCT TGC GTG CTG GTG TGA GCC TCA 816
GAC AGT AGT TCC TTG TGA CAC TCT GTT GTG ACA CAA GGA CCT ACA GTG 864
CCC ACC ATT AAA GGA TTG GCC CAG ATA CAT TCC AGA C 		901

<210> SEQ ID NO: 89
<211> 473
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Ala	Gln	Ala	Ala	Ala	Ala	Pro	Ala	Asp	Asn	Thr	Lys	Lys	Leu
1				5					10					15
Trp	Gly	Gly	Arg	Phe	Thr	Gly	Lys	Thr	Asp	Pro	Leu	Met	Glu	Lys
				20					25					30
Phe	Asn	Glu	Ser	Leu	Pro	Phe	Asp	Lys	Arg	Leu	Trp	Ala	Glu	Asp
				35					40					45
Ile	Lys	Gly	Ser	Gln	Ala	Tyr	Ala	Lys	Ala	Leu	Ala	Lys	Ala	Gly
				50					55					60
Ile	Leu	Ala	His	Asp	Glu	Ala	Val	Thr	Ile	Val	Glu	Gly	Leu	Ala
				65					70					75
Lys	Val	Ala	Glu	Glu	Trp	Lys	Ala	Gly	Ala	Phe	Val	Ile	Lys	Ala
				80					85					90
Gly	Asp	Glu	Asp	Ile	His	Thr	Ala	Asn	Glu	Arg	Arg	Leu	Thr	Glu
				95					100					105
Leu	Val	Gly	Ala	Val	Gly	Gly	Lys	Leu	His	Thr	Gly	Arg	Ser	Arg
				110					115					120
Asn	Asp	Gln	Val	Ala	Thr	Asp	Tyr	Arg	Leu	Trp	Leu	Val	Gly	Gln
				125					130					135
Val	Glu	Val	Met	Arg	Ser	Glu	Val	Gly	Glu	Leu	Met	Arg	Val	Ala
				140					145					150
Ala	Asp	Arg	Ser	Glu	Ala	Glu	Val	Glu	Val	Leu	Met	Pro	Gly	Phe
				155					160					165
Thr	His	Leu	Gln	Asn	Ala	Met	Thr	Val	Arg	Trp	Ser	His	Trp	Leu
				170					175					180
Met	Ser	His	Ala	Ala	Ala	Trp	Gln	Arg	Asp	Asp	Met	Arg	Leu	Arg
				185					190					195
Asp	Leu	Leu	Pro	Arg	Val	Ala	Thr	Leu	Pro	Leu	Gly	Ser	Gly	Ala
				200					205					210
Leu	Ala	Gly	Asn	Pro	Phe	Leu	Val	Asp	Arg	Gln	Phe	Ile	Ala	Lys
				215					220					225
Glu	Leu	Gly	Phe	Gly	Gly	Gly	Val	Cys	Pro	Asn	Ser	Met	Asp	Ala
				230					235					240
Val	Ser	Asp	Arg	Asp	Phe	Val	Ile	Glu	Thr	Val	Phe	Ala	Ala	Ser
				245					250					255
Leu	Leu	Cys	Val	His	Leu	Ser	Arg	Trp	Ala	Glu	Asp	Leu	Ile	Ile
				260					265					270
Tyr	Ser	Ser	Gly	Pro	Phe	Gly	Tyr	Val	Gln	Cys	Ser	Asp	Ala	Tyr
				275					280					285
Ala	Thr	Gly	Ser	Ser	Leu	Met	Pro	Gln	Lys	Lys	Asn	Pro	Asp	Ala
				290					295					300
Leu	Glu	Leu	Ile	Arg	Gly	Lys	Gly	Gly	Arg	Val	Gln	Gly	Asn	Leu
				305					310					315
Met	Gly	Val	Met	Ala	Val	Leu	Lys	Gly	Thr	Pro	Thr	Thr	Tyr	Asn
				320					325					330
Lys	Asp	Phe	Gln	Glu	Cys	Trp	Glu	Leu	Leu	Phe	Asp	Thr	Val	Asp
				335					340					345
Thr	Val	His	Asp	Val	Val	Arg	Ile	Ala	Thr	Gly	Val	Leu	Ser	Thr
				350					355					360
Leu	Arg	Ile	Lys	Pro	Asp	Arg	Met	Lys	Ala	Gly	Leu	Ser	Ala	Asp
				365					370					375
Met	Leu	Ala	Thr	Asp	Leu	Ala	Glu	Tyr	Leu	Val	Arg	Lys	Gly	Val
				380					385					390
Pro	Phe	Arg	Glu	Thr	His	His	His	Ser	Gly	Ala	Ala	Val	Lys	Met
				395					400					405
Ala	Glu	Asp	Arg	Gly	Cys	Thr	Leu	Phe	Asp	Leu	Thr	Val	Asp	Asp
				410					415					420
Leu	Lys	Thr	Ile	His	Pro	Leu	Phe	Thr	Asp	Asp	Val	Ala	Ala	Val
				425					430					435
Trp	Asp	Phe	Asn	Arg	Ser	Ala	Glu	Met	Arg	Asp	Thr	Glu	Gly	Gly
				440					445					450
Thr	Ser	Lys	Arg	Ser	Val	Leu	Glu	Gln	Val	Gln	Lys	Met	Arg	Thr
				455					460					465
Tyr	Leu	Ala	Ala	Glu	Gly	Gln	His							
				470										

<210> SEQ ID NO: 90
<211> 1419
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG CAG CAG CAG CAT GTG CGC AGC CTG AAC CTG GCA ACC AGC GTG GGC 48
ATC GGC GTG TAT GAG GCG CTG CGG CAG CTC GAC GGC GCC GTC CTG CCG 96
GAC GCC ACC GCG GCC CCC ACC GAC ACC CCG GCG GCC AAG AAG CTG TGG 144
GGC GGG CGC TTC ACC GGC GCC ACC GAC CCG TTG ATG GAG AAG TTC AAC 192
GAG TCC CTG CCC TTT GAC AAG CGC ATG TGG GCC GAG GAC ATC CGC GGC 240
AGC CAG GCC TAC GCC AAG GCC CTG GCC AAG GCG GGC GTG CTG ACC GAC 288
GAG GAG GCG GCC ACC ATC GTG GAG GGC CTG GCC CGC GTG GCG GGC GAG 336
TGG GAG GCC GGC ACC TTC CGC ATC GTG GCC GGG GAC GAG GAC ATC CAC 384
ACC GCC AAC GAG CGG CGC CTC AGC GAG CTG ATC GGC GCG CTG GGG GGC 432
AAG CTG CAC ACG GGG CGC AGC CGC AAC GAC CAG TGC GTC ACC GAC ACG 480
CGC CTC TGG CTG CTG GGC GCG CTG GGC CGC GTC CGC GCC TCG CTG CAC 528
GCA CTC ATC GCC GCG GCG GCG GAC CGC GCG GCG GCC GAG GCC GAT GTG 576
CTC ATG CCT GGG TAC ACC CAC CTG CAG CCC GCG CAG ACG GTG CGC TGG 624
AGC CAC TGG CTG CTG AGC CAC GCG GCG GCC TGG CAG CGT GAC GAC CAG 672
CGC CTG GCG GGC CTG CTG CCG CGC GTG GCC ACG CTG CCG CTG GGC TCC 720
GGC GCG CTG GCG GGG AAC CCC TTT GGC GTG GAC CGC CAG TTC CTG GCG 768
CGC GAG CTG GGC TTC CAC GGC GGC GTC TGC CCG AAC TCC ATG GAC GCG 816
GTG TCG GAC CGA GAC TTC GTG GCG GAG ACC ATC TTC ACC GCC AGC CTG 864
CAC CTG GTG CAC CTC TCA CGC TGG GCG GAG GAC CTC ATC ATT TAC TCC 912
TCA GGG CCC TAC CGC TTC GTG CAG TGC AGC GAC GCC TAC GCC ACC GGC 960
TCC AGC CTG ATG CCC CAG AAG AAG AAT CCC GAC GCA CTG GAG CTG ATC 1008
CGG GGC AAG GGC GGG CGC AGC ATC GGC GGC GTG ACC TCC ATG CTG GCG 1056
GTG CTC AAG GGC ACC CCC ACC ACC TAC AAC AAG GAC TTC CAG GAG TCG 1104
TGG GAG CTC ATG TTC GAC GCG GTG GAC ACG CTT CAC GAC TGC GTG CGC 1152
ATC GCC ACG GGC GTG CTG AGC ACG CTG CGC ATC GAT CCC GAG GCC ATG 1200
CGC CGC GGC CTC TCC GCC GAC ATG CTG GCC ACC GAC CTG GCC GAG TAC 1248
CTG GTG CGC CGC GGC GTA CCC TTC CGC GAG ACG CAC CAC ATC TCC GGC 1296
GCG GCG GTA TGG GAC TTT GCG CGC TCT GCC GAC ACG CGC GAC ACC GAG 1344
GGC GGC GCC AGC CAT CGC AGC GTG GCA GAG CAG ATC TCC AAG CTG CGC 1392
GCC TAC CTG GAG CAG AAC GCG GTC TGA 				1419

<210> SEQ ID NO: 91
<211> 882
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Thr	Val	Ala	Glu	Gln	Pro	Val	Ala	Leu	Val	Pro	Ser	Gly	Lys
1				5					10					15
Val	Gln	Ala	Pro	Asp	Ala	Ile	Val	Ser	Gln	Ala	Val	Glu	Leu	Gly
				20					25					30
Ala	Pro	Tyr	Glu	Pro	Pro	Leu	Ser	Pro	Glu	Asp	Ala	Asp	Trp	Ser
				35					40					45
Gln	His	Val	Leu	Pro	Ser	Ala	Val	Asp	Lys	Arg	Asp	Gln	Asp	Thr
				50					55					60
Pro	Asp	Asn	Trp	Val	Arg	Arg	Asp	Pro	Arg	Ile	Leu	Arg	Leu	Thr
				65					70					75
Gly	Arg	His	Pro	Leu	Asn	Cys	Glu	Pro	Pro	Met	Ser	Val	Leu	Met
				80					85					90
Gln	Tyr	Gly	Phe	Ile	Thr	Pro	Pro	Ala	Val	His	Phe	Val	Arg	Asn
				95					100					105
His	Gly	Ala	Ala	Pro	Arg	Ile	Arg	Trp	Asp	Glu	His	Arg	Ile	Glu
				110					115					120
Ile	Asn	Gly	Leu	Val	Asn	Lys	Pro	Leu	Thr	Leu	Thr	Met	Asp	Glu
				125					130					135
Leu	Val	Ala	Leu	Pro	Ser	Val	Thr	Phe	Pro	Val	Thr	Leu	Val	Cys
				140					145					150
Ala	Gly	Asn	Arg	Arg	Lys	Glu	Glu	Asn	Met	Leu	Lys	Lys	Ser	Ile
				155					160					165
Gly	Phe	Asn	Trp	Gly	Pro	Cys	Ala	Thr	Ser	Thr	Thr	Tyr	Trp	Thr
				170					175					180
Gly	Val	Arg	Leu	Arg	Asp	Leu	Leu	Gln	His	Ala	Gly	Ile	Lys	Thr
				185					190					195
Pro	Ala	Glu	Gly	Ala	Arg	Phe	Val	Cys	Phe	Arg	Gly	Pro	Lys	Gly
				200					205					210
Glu	Leu	Pro	Arg	Gly	Glu	Asp	Gly	Ser	Tyr	Gly	Thr	Ser	Leu	Thr
				215					220					225
Tyr	Ala	Lys	Ala	Met	Asp	Pro	Ala	Ser	Asp	Val	Ile	Ile	Ala	Tyr
				230					235					240
Lys	Gln	Asn	His	Arg	Trp	Leu	Thr	Pro	Asp	His	Gly	Phe	Pro	Val
				245					250					255
Arg	Ile	Ile	Ile	Pro	Gly	Phe	Ile	Gly	Gly	Arg	Met	Val	Lys	Trp
				260					265					270
Leu	Ser	Glu	Ile	Thr	Val	Met	Asp	Thr	Glu	Ser	Gln	Asn	Phe	Tyr
				275					280					285
His	Phe	Met	Asp	Asn	Arg	Val	Leu	Pro	Ser	His	Val	Asp	Glu	Glu
				290					295					300
Leu	Ala	Lys	Lys	Glu	Gly	Trp	Trp	Tyr	Lys	Pro	Glu	Phe	Ile	Ile
				305					310					315
Asn	Asp	Leu	Asn	Ile	Asn	Ser	Ala	Met	Ala	Arg	Pro	Trp	His	Asp
				320					325					330
Glu	Leu	Val	Pro	Leu	Asp	Ala	Asn	Arg	Pro	Tyr	Thr	Ile	Lys	Gly
				335					340					345
Tyr	Ala	Tyr	Ala	Gly	Gly	Gly	Arg	Lys	Ile	Ile	Arg	Cys	Glu	Val
				350					355					360
Ser	Leu	Asp	Asp	Gly	Lys	Thr	Trp	Arg	Leu	Gly	Asp	Ile	Gln	Arg
				365					370					375
Phe	Glu	Glu	Pro	Asn	Glu	Tyr	Gly	Lys	His	Trp	Cys	Trp	Val	His
				380					385					390
Trp	Thr	Leu	Glu	Val	Asn	Thr	Phe	Asp	Phe	Leu	Ser	Ala	Lys	Glu
				395					400					405
Val	Leu	Cys	Arg	Ala	Trp	Asp	Glu	Thr	Met	Asn	Thr	Gln	Pro	Ala
				410					415					420
Val	Ile	Thr	Trp	Asn	Leu	Met	Gly	Met	Met	Asn	Asn	Cys	Tyr	Phe
				425					430					435
Arg	Ile	Lys	Ile	His	Pro	Glu	Val	Asp	Pro	Ala	Thr	Gly	Val	Met
				440					445					450
Gly	Leu	Arg	Phe	Gln	His	Pro	Ala	Pro	Val	Glu	Leu	Gly	Asp	Lys
				455					460					465
Gly	Asn	Met	Gly	Trp	Arg	Glu	Glu	Asp	Asn	Leu	Val	Ala	Gln	Ala
				470					475					480
Val	Ala	Ala	Ala	Arg	Asp	Gly	Gly	Gly	Ala	Ala	Ala	Ala	Pro	Pro
				485					490					495
Pro	Pro	Pro	Pro	Ala	Ala	Leu	Leu	Ala	Asn	Gly	Gly	Pro	Lys	Gln
				500					505					510
Tyr	Thr	Leu	Glu	Glu	Val	Ala	Glu	His	Ala	Ser	Glu	Glu	Ser	Cys
				515					520					525
Trp	Phe	Val	His	Glu	Gly	Arg	Val	Tyr	Asp	Ala	Thr	Pro	Tyr	Leu
				530					535					540
Asn	Asp	Gln	Pro	Gly	Gly	Ala	Glu	Ser	Ile	Leu	Ile	Thr	Ala	Gly
				545					550					555
Ala	Asp	Ala	Thr	Asp	Glu	Phe	Asn	Ala	Ile	His	Ser	Ser	Lys	Ala
				560					565					570
Lys	Ala	Met	Leu	Ala	Gln	Tyr	Tyr	Ile	Gly	Asp	Leu	Val	Ala	Ser
				575					580					585
Lys	Pro	Ala	Thr	Ala	Asn	Gly	Thr	Ala	Thr	Ala	Asn	Gly	Asn	Gly
				590					595					600
Thr	Ala	Thr	Ala	Asn	Gly	Thr	Ala	Ala	Ala	Ala	Pro	Pro	Ala	Asp
				605					610					615
Pro	Leu	Val	Val	Leu	Thr	Gly	Arg	Ala	Lys	Val	Lys	Leu	Pro	Leu
				620					625					630
Val	Glu	Arg	Ile	Glu	Leu	Asn	Arg	Asn	Thr	Arg	Ile	Phe	Arg	Phe
				635					640					645
Gly	Leu	Pro	Ser	Pro	Glu	His	Arg	Ile	Gly	Leu	Pro	Val	Gly	Lys
				650					655					660
His	Val	Phe	Val	Tyr	Ala	Gln	Val	Gly	Gly	Glu	Asn	Val	Met	Arg
				665					670					675
Ala	Tyr	Thr	Pro	Ile	Ser	Gly	Asp	Glu	Glu	Lys	Gly	Arg	Leu	Asp
				680					685					690
Met	Leu	Ile	Lys	Val	Tyr	Phe	Lys	Gly	Glu	His	Ala	Ser	Tyr	Pro
				695					700					705
Glu	Gly	Gly	Lys	Met	Ser	Gln	His	Phe	Asp	Ser	Leu	Ala	Ile	Gly
				710					715					720
Asp	Cys	Leu	Glu	Phe	Lys	Gly	Pro	Leu	Gly	His	Phe	Val	Tyr	Asn
				725					730					735
Gly	Arg	Gly	Ser	Tyr	Thr	Leu	Asn	Gly	Lys	Val	Thr	Lys	His	Ala
				740					745					750
Ser	His	Met	Ser	Phe	Val	Ala	Gly	Gly	Thr	Gly	Ile	Thr	Pro	Cys
				755					760					765
Tyr	Ala	Val	Ile	Lys	Ala	Ala	Leu	Arg	Asp	Pro	Glu	Asp	Asn	Thr
				770					775					780
Lys	Leu	Ala	Leu	Leu	Phe	Ala	Asn	Thr	His	Glu	Asp	Asp	Ile	Leu
				785					790					795
Leu	Arg	Glu	Glu	Leu	Asp	Glu	Leu	Ala	Asn	Asn	His	Pro	Glu	Arg
				800					805					810
Phe	Arg	Leu	Trp	Tyr	Thr	Val	Ser	Gln	Pro	Lys	Asp	Ala	Ala	Thr
				815					820					825
Trp	Lys	Tyr	Asp	Val	Gly	Arg	Val	Ser	Lys	Asp	Met	Phe	Thr	Glu
				830					835					840
His	Leu	Phe	Ala	Ser	Thr	Gly	Glu	Asp	Cys	Leu	Ser	Leu	Met	Cys
				845					850					855
Gly	Pro	His	Gly	Met	Ile	Glu	His	Cys	Cys	Val	Pro	Phe	Leu	Glu
				860					865					870
Ala	Met	Gly	Tyr	Ser	Lys	Asp	Arg	Gln	Ile	Gln	Phe			
				875					880					

<210> SEQ ID NO: 92
<211> 2267
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CGC AGG TTC ATC TCG CTG TGA AGC AGC CTC GCG CAT CAA AAT CCT GCA 48
ACA GCG CAC TTT GCC TGA CCA GGA CCC GAT TGG CAG GAC GTG GTG GGC 96
CTC CAG GGA CAC CTT ATG TCC TCC GGC GTA GGA TTC GAG AGC ATG GTG 144
GAC TCC GTC TTC CTG GAC GTG ATC CGG GTG GTT CCA CAG CTC CTG CTG 192
GTG GTC CTT GTC GTC ATC GTA GCA GTA CTG ATC GGG AAA GAT ATC TTG 240
AAT GGC ATG CGG CGG CCC TTC CTC AAG CCC GAC CAG TGG CAA GCG CTC 288
CCC CTG ACT GAG GTG AAG AAG ACA ACG CAT AAC ACG CGC CTT TTC CGC 336
TTC GCG CTA CCC CAT GTC GAT CAG CAG CTG GGA CTG CCC GCG GGT CAG 384
CAC ATC ACG CTC AAG GTT ACA GGA CCC CAT GGC GAC GAG ATC CTC AGG 432
CCC TAC ACA CCA GTC TCC GAC ATC TCC CAG CGT GGG ACC GTC GAT TTC 480
CTG ATC AAG GTC TAT CCC GAG GGC CGC ATG TCC CAG GCG TTG GAT GCC 528
CTG GCT GTG GGA GAC AAG GTG CTC TTC AAG GGG CCA AAG GGG CGA TTC 576
GTG TAC AAG CCG GAC ACC TGG AAT GCC ATC GGC ATG CTG GCT GGG GGC 624
ACG GGG ATC ACG CCC ATG TAC CAG CTG CTG CAG AAG ATC CTG AAG GAC 672
CCG AAT GAC ACG ACA GAG ATC TCG CTG ATC TTT GGA AAT GTC ACC GAG 720
GAT GAC ATA CTG CTG CAG AAG GAG CTG GAG GAG ATG GCG AGT AAG CAC 768
AAG CGA TTC AAG GTA TTC CAC GTC CTC AAC AAC CCT CCA GCT CAG TGG 816
ACT GGA GGT GTT GGG TTC ATC TCC GAA AGC ATG ATC AGG GAG CAC TTC 864
CCG GCA CCG GCT GCT GAT GTC CTC ATT CTG CGG TGT GGG CCT CTG CCC 912
ATG ATG AAG GCC ATG GAA ACC CAC CTG GAC AGG CTC GGG TAC ACC CGC 960
GAA GTC CAG TTT CAG TTC TGA GGG CCT CTT GCA GTG GCT TGC ATG CGA 1008
CTA TCT CAC AAT GTT GCA ATG CAC AGC AAG CAG CAT GGG TAA GAA TGC 1056
AAG GGA TGG ACA GGT GAG CAA GCA TCA TCC TCG TCG CCT GCC CTG TGA 1104
GCG TGG GAA CAG GCC GAC AGA CTG CCT TCT GCC AAA TTC GAC CCT GTT 1152
TGG CAC AGC CAG GTC CAC ATT TGC CAG GCT CAG CCT GGA GAG ACT CTC 1200
TTC CAC GCC CAC CGG TCC TCC CTC TGC ATC CAT CCT CAC AAC GTC TGG 1248
CTG GGA CAG GTC TCC ACG TGT GGG CGG TCC CTT GCA TGC CTG TAT GAC 1296
AGT TAC AGG CTT CGC TCG AGC CCT GCG CCC CCT GAG ATG CGA GGT TTC 1344
AAA GTG AAA GGC GGG GTC GTA CCC ATG CGC TTC CAG GAG GTG CTC TCT 1392
TCT CTC CTC TGT GCT GCA GGT TGT GTG CTC GCA GCT CTC CAC CAA GCA 1440
ACG GTA CAC CGG AAG GCG ACG CGC AGC CTG TGC AGC AAA GAA GCT GTC 1488
GTG CAT CTC AGT CAG GTG GAG GTC AAG GAG GTG CGG ACT GGG AAG TGA 1536
GGC TCG ACA GTG GCG GCA TGC AAA CTC CTG CGA TCT CTC GGT GAC ATG 1584
CAC CAT CAT TTT GGG GGC TGG ACG AGG GGC ACT TAC TGT AAT CGC ATC 1632
GGC CAG CGA CCG AGC TCG CTG TTC GGC AGC GAT CCT CTC CCT CTC GCG 1680
ATT GCC AAA CTC AAA CAA GGA ATC ATC TGG CTG AAG GCG TCG CCG CCC 1728
TCC GAC AAA CGG CAA TGA CTG TGT TTG GGA GGG GAC ATG CCC AGC CAT 1776
GCT CAA GCA GTC CTC GTT GTC AGG GGG AGC GCA CTA AAA GTT GGA ATC 1824
CAT GTC GAA ATC CAT CAG GAC ATC CGA CAT GAA GGG CAG GAG AGG GGC 1872
AGG TGG CGC GGT GGC GAC CTG AAC CGA ATC AGT CTC GGC CTC TGA CTT 1920
GAT GTT GGC CTT GGT GTC CTC CAC AGC CTC GGT CTT TGC TTC GGG GTC 1968
TGG GCT CTG TAG TTC CTG CCT GAG CGC CCA GTC CTG GAC GAA TGC CTG 2016
TAG GGA AGA AGC CAG CCA AAT TTC ATG CAG CGA CCC CGA TAA AGG TGG 2064
ATC CAC CCT GCG GCT GAA CAG CGG TGG CTT CGC CCA ACC TCT CCA TGC 2112
TCC CTT CAT CTG CCG AAT GTA CCA GTT CGA ACT CGG CCT GTG TTA GGG 2160
CAG CCT CCA GCT CCT CTT TCG ACT GGA CCA GCT TAT TCG CCG CAG CGC 2208
CTT GTA AAG ACC TGC TTT GAG CAG ACG CAT ACA GGT CGG CAG CAG CGC 2256
CCA GAT ACT TG             					2267

<210> SEQ ID NO: 93
<211> 2,523
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Met	Leu	Ser	Gly	Val	Gly	Pro	Val	Pro	Thr	Lys	Pro	Ala	Phe
1				5					10					15
Lys	Ala	Gly	Gly	Asp	Thr	Leu	Ser	Arg	His	Leu	Glu	Glu	Leu	Cys
				20					25					30
Arg	Ser	Gly	Ala	Trp	Glu	Arg	Arg	His	Lys	Asp	Gly	Asp	Lys	Ala
				35					40					45
Leu	Leu	Glu	Tyr	Ile	Glu	Ala	Glu	Ala	Arg	Asp	Leu	Ser	Val	Glu
				50					55					60
Ala	Phe	Gly	Arg	Leu	Met	Thr	Asp	Val	Tyr	Gln	Arg	Ile	Gly	Asn
				65					70					75
Met	Leu	Leu	Lys	Gly	Asn	Asp	Ile	Thr	Arg	Arg	Met	Gly	Gly	Val
				80					85					90
Leu	Ala	Ile	Asp	Glu	Leu	Ile	Asp	Val	Lys	Leu	Ser	Gly	Asp	Asp
				95					100					105
Ala	Ala	Lys	Thr	Ala	Arg	Leu	Ser	Gly	Leu	Leu	Ser	Arg	Val	Leu
				110					115					120
Glu	Glu	Ser	Glu	Asp	Pro	Val	Leu	Ser	Glu	Ser	Ala	Ser	His	Thr
				125					130					135
Leu	Gly	His	Leu	Val	Arg	Ser	Gly	Gly	Ala	Met	Thr	Ser	Asp	Ile
				140					145					150
Val	Glu	Lys	Glu	Ile	Arg	Arg	Ser	Leu	Ala	Trp	Cys	Asp	Pro	Arg
				155					160					165
Asn	Glu	Pro	Asn	Glu	Ser	Arg	Arg	Leu	Thr	Ala	Leu	Leu	Val	Leu
				170					175					180
Thr	Glu	Ala	Ala	Glu	Ser	Ala	Pro	Ala	Val	Phe	Asn	Val	His	Val
				185					190					195
Lys	Ser	Phe	Ile	Asp	Ala	Val	Trp	Phe	Pro	Leu	Arg	Asp	Ala	Lys
				200					205					210
Gln	His	Ile	Arg	Glu	Ala	Ala	Val	Arg	Ala	Leu	Lys	Ala	Cys	Leu
				215					220					225
Cys	Leu	Val	Glu	Lys	Arg	Glu	Thr	Arg	Tyr	Arg	Val	Gln	Trp	Tyr
				230					235					240
Tyr	Lys	Leu	His	Glu	Gln	Thr	Met	Arg	Gly	Met	Lys	Arg	Asp	His
				245					250					255
Arg	Thr	Gly	Ala	Leu	Pro	Ser	Pro	Glu	Ser	Ile	His	Gly	Ser	Leu
				260					265					270
Leu	Ala	Leu	Ala	Glu	Leu	Leu	Gln	His	Thr	Gly	Glu	Phe	Met	Leu
				275					280					285
Ala	Arg	Tyr	Lys	Glu	Val	Val	Glu	Asn	Val	Phe	Arg	Tyr	Lys	Asp
				290					295					300
Ser	Lys	Glu	Lys	Asn	Ile	Arg	Arg	Ala	Val	Ile	His	Leu	Leu	Pro
				305					310					315
Arg	Met	Ala	Ala	Phe	Ser	Pro	Glu	Arg	Phe	Ala	Ser	Glu	Tyr	Leu
				320					325					330
Ala	Arg	Ala	Ile	Ala	Phe	Leu	Leu	Ile	Val	Leu	Lys	Asn	Pro	Pro
				335					340					345
Glu	Arg	Gly	Ala	Ala	Phe	Ala	Ala	Leu	Ala	Asp	Met	Ala	Ala	Ala
				350					355					360
Leu	Ala	Arg	Gly	Cys	Leu	Ser	Pro	Ile	Tyr	Val	Ala	Ile	Arg	Glu
				365					370					375
Ala	Leu	Ser	Ala	Pro	Pro	Ala	Ala	Arg	Ala	Ala	Ala	Arg	Pro	Arg
				380					385					390
Pro	Ala	Thr	Cys	Tyr	Glu	Ala	Leu	Gln	Cys	Val	Gly	Met	Leu	Ala
				395					400					405
Val	Ala	Leu	Gly	Pro	Leu	Trp	Arg	Pro	Tyr	Ala	Ala	Ala	Leu	Val
				410					415					420
Glu	Ala	Met	Val	Leu	Thr	Gly	Val	Ser	Glu	Val	Leu	Val	Gln	Ala
				425					430					435
Leu	Thr	Gln	Val	Ala	Asn	Ala	Leu	Pro	Glu	Leu	Leu	Glu	Asp	Ile
				440					445					450
Gln	Tyr	Gln	Leu	Leu	Asp	Leu	Leu	Ser	Leu	Val	Leu	Ser	Lys	Arg
				455					460					465
Pro	Phe	Asn	Ser	Ser	Thr	Thr	Gln	Pro	Lys	Phe	Ala	Ala	Leu	Ser
				470					475					480
Ala	Ala	Ile	Ala	Ala	Gly	Glu	Leu	Gln	Gly	Asn	Ala	Leu	Thr	Lys
				485					490					495
Leu	Ala	Leu	Gln	Thr	Leu	Gly	Thr	Phe	Asp	Leu	Gly	Gly	Ile	Gln
				500					505					510
Leu	Leu	Glu	Phe	Met	Arg	Asp	His	Ile	Leu	Ala	Tyr	Thr	Asp	Asp
				515					520					525
Pro	Asp	Lys	Glu	Ile	Arg	Gln	Ala	Ala	Val	Leu	Ala	Ala	Cys	Pro
				530					535					540
Arg	Ala	Gly	Ala	Ala	Arg	Ser	Ser	Leu	Arg	Val	Arg	Ser	Leu	Arg
				545					550					555
Ser	Gly	Trp	Arg	Arg	Ala	Ala	Ala	Ala	Val	Trp	His	Thr	Arg	Val
				560					565					570
Val	Glu	Arg	Cys	Val	Gly	Arg	Leu	Leu	Val	Val	Ala	Val	Ala	Asp
				575					580					585
Pro	Ser	Glu	Arg	Val	Arg	Lys	Glu	Val	Leu	Arg	Ala	Leu	Val	Ala
				590					595					600
Thr	Thr	Ala	Leu	Asp	Asp	Tyr	Leu	Ala	Gln	Ala	Asp	Cys	Leu	Arg
				605					610					615
Ala	Leu	Phe	Val	Gly	Met	Asn	Asp	Glu	Ser	Val	Ala	Val	Arg	Gly
				620					625					630
Leu	Ala	Ile	Arg	Leu	Val	Gly	Arg	Leu	Ala	Glu	Arg	Asn	Pro	Ala
				635					640					645
His	Val	Asn	Pro	Ala	Leu	Arg	Lys	His	Leu	Leu	Gln	Leu	Leu	His
				650					655					660
Asp	Met	Glu	Phe	Ser	Pro	Asp	Asn	Arg	Ala	Arg	Glu	Glu	Ser	Ala
				665					670					675
Phe	Leu	Leu	Glu	Val	Leu	Ile	Thr	Ala	Ala	Ala	Arg	Leu	Ile	Met
				680					685					690
Pro	Tyr	Val	Ser	Pro	Ile	Gln	Lys	Ala	Leu	Val	Ser	Lys	Leu	Arg
				695					700					705
Gly	Gly	Ser	Gly	Pro	Gly	Ile	Thr	Val	Leu	Ser	Thr	Leu	Gly	Ala
				710					715					720
Leu	Ala	Glu	Val	Ser	Gly	Thr	Thr	Phe	Arg	Pro	Phe	Ile	Ser	Glu
				725					730					735
Val	Met	Pro	Leu	Val	Ile	Glu	Ala	Ile	Gln	Asp	Asn	Ser	Asp	Gly
				740					745					750
Arg	Arg	Arg	Val	Val	Ala	Val	Lys	Thr	Leu	Gly	Phe	Ile	Val	Ser
				755					760					765
Ser	Cys	Gly	Asn	Val	Met	Gly	Pro	Tyr	Leu	Glu	Tyr	Pro	Gln	Leu
				770					775					780
Leu	Ser	Val	Leu	Leu	Arg	Met	Leu	His	Glu	Gly	His	Pro	Ala	Gln
				785					790					795
Arg	Arg	Glu	Val	Ile	Lys	Val	Leu	Gly	Ile	Ile	Gly	Ala	Leu	Asp
				800					805					810
Pro	His	Thr	His	Lys	Leu	Asn	Gln	Ala	Ser	Leu	Ser	Gly	Glu	Gly
				815					820					825
Lys	Leu	Glu	Lys	Glu	Gly	Val	Arg	Pro	Leu	Arg	His	Gly	Gly	Gly
				830					835					840
Gly	Ala	Gly	Gly	Ala	Gly	Gly	Gly	Ala	Gly	Gly	Gly	Gly	Val	Gly
				845					850					855
Gly	Gly	Val	Ala	Gly	Asp	Ser	Asn	Asp	Gly	Gly	Met	Gly	Pro	Gly
				860					865					870
Asp	Asp	Gly	Gly	Pro	Gly	Gly	Asp	Leu	Leu	Pro	Ser	Ser	Gly	Leu
				875					880					885
Val	Thr	Ser	Ser	Glu	Asp	Tyr	Tyr	Pro	Thr	Val	Ala	Ile	Asn	Ala
				890					895					900
Leu	Met	Arg	Val	Leu	Arg	Asp	Pro	Ala	Leu	Ala	Ser	Gln	His	Leu
				905					910					915
Ala	Val	Ile	Arg	Ala	Leu	Ala	Ala	Ile	Phe	Arg	Ala	Leu	Gln	Leu
				920					925					930
Ser	Val	Val	Pro	Tyr	Leu	Pro	Lys	Val	Leu	Pro	Ile	Leu	Leu	Gly
				935					940					945
Val	Leu	Arg	Gly	Gly	Asp	Glu	Ala	Leu	Arg	Glu	Glu	Ile	Leu	Ala
				950					955					960
Ser	Leu	Arg	Ala	Leu	Val	Gly	Tyr	Val	Arg	Gln	His	Met	Arg	Arg
				965					970					975
Phe	Leu	Pro	Asp	Leu	Thr	Gln	Leu	Val	His	Glu	Phe	Trp	Pro	Ala
				980					985					990
Ala	Pro	Arg	Thr	Cys	Leu	Ala	Leu	Ile	Ala	Asp	Leu	Gly	Met	Ala
				995					1000					1005
Leu	Arg	Asp	Asp	Ile	Arg	Ala	Lys	Pro	Leu	Pro	Pro	Leu	Pro	Leu
				1010					1015					1020
Leu	Pro	Pro	Ser	Ser	Pro	Pro	Arg	Thr	Pro	His	Asn	Arg	Gln	Tyr
				1025					1030					1035
Val	Pro	Glu	Leu	Leu	Pro	Lys	Phe	Val	Ala	Val	Phe	Ser	Glu	Ala
				1040					1045					1050
Glu	Arg	Ala	Gly	Ser	Trp	Asp	Leu	Val	Arg	Pro	Ala	Leu	Gly	Ala
				1055					1060					1065
Leu	Glu	Ser	Leu	Gly	Ser	Ala	Val	Asp	Asp	Ser	Leu	His	Leu	Leu
				1070					1075					1080
Leu	Pro	Ser	Met	Val	Arg	Leu	Ile	Ser	Pro	Ala	Ala	Ser	Ser	Thr
				1085					1090					1095
Pro	Ala	Glu	Val	Arg	Arg	Ala	Ala	Leu	Arg	Ser	Leu	Arg	Arg	Leu
				1100					1105					1110
Ile	Pro	Arg	Met	Gln	Leu	Gly	Gly	Tyr	Ala	Ser	Ala	Val	Leu	His
				1115					1120					1125
Pro	Leu	Ile	Lys	Val	Leu	Asp	Gly	His	Ser	Asp	Glu	Gln	Leu	Arg
				1130					1135					1140
Arg	Asp	Ala	Leu	Asp	Thr	Ile	Cys	Ala	Val	Ala	Val	Cys	Leu	Gly
				1145					1150					1155
Pro	Glu	Phe	Ala	Ile	Phe	Val	Pro	Thr	Ile	Arg	Lys	Val	Arg	Val
				1160					1165					1170
Arg	His	Arg	Leu	His	His	Glu	Trp	Phe	Asp	Arg	Leu	Ala	Gly	Lys
				1175					1180					1185
Val	Cys	Ala	Val	Ser	Pro	Pro	Cys	Met	Ser	Asp	Ala	Glu	Asp	Trp
				1190					1195					1200
Glu	Gly	Ala	Gly	Gly	Ala	Ala	Ser	Gly	Ala	Gly	Ser	Ala	Gly	Ala
				1205					1210					1215
Ala	Gly	Gly	Trp	Ala	Val	Glu	Ile	Asp	Leu	Leu	Ala	Arg	Met	Gln
				1220					1225					1230
Ala	Glu	Gly	Gly	Gly	Ala	Leu	Gly	Gly	Gln	Pro	Pro	Val	Pro	Pro
				1235					1240					1245
Gly	Pro	Asp	Gly	Gly	Pro	Ser	Ala	Lys	Leu	Pro	Val	Asn	Ala	Ala
				1250					1255					1260
Val	Leu	Arg	Arg	Ala	Trp	Glu	Ser	Ser	His	Arg	Val	Thr	Lys	Glu
				1265					1270					1275
Asp	Trp	Ala	Glu	Trp	Met	Arg	Asn	Phe	Ala	Val	Glu	Leu	Leu	Lys
				1280					1285					1290
Glu	Ser	Pro	Ser	Pro	Ala	Leu	Arg	Ala	Cys	His	Gly	Leu	Ala	Gln
				1295					1300					1305
Val	His	Pro	Ser	Met	Ala	Arg	Glu	Leu	Phe	Ala	Ala	Gly	Phe	Val
				1310					1315					1320
Ser	Cys	Trp	Ala	Glu	Leu	Glu	Gln	Gly	Leu	Gln	Glu	Gln	Leu	Val
				1325					1330					1335
Arg	Ser	Leu	Glu	Ala	Ala	Leu	Ala	Ser	Pro	Thr	Ile	Pro	Pro	Glu
				1340					1345					1350
Thr	Val	Thr	Ala	Leu	Leu	Asn	Leu	Ala	Glu	Phe	Met	Glu	His	Asp
				1355					1360					1365
Asp	Lys	Arg	Leu	Pro	Leu	Asp	Thr	Arg	Thr	Leu	Gly	Ala	Leu	Ala
				1370					1375					1380
Glu	Lys	Cys	His	Ala	Phe	Ala	Lys	Ala	Leu	His	Tyr	Lys	Glu	Leu
				1385					1390					1395
Glu	Phe	Gln	Thr	Ser	Pro	Gln	Ser	Ala	Ile	Glu	Ala	Leu	Ile	His
				1400					1405					1410
Ile	Asn	Asn	Gln	Leu	Arg	Gln	Pro	Glu	Ala	Ala	Val	Gly	Val	Leu
				1415					1420					1425
Ala	Tyr	Ala	Gln	Lys	His	Leu	His	Met	Glu	Leu	Lys	Glu	Gly	Trp
				1430					1435					1440
Tyr	Glu	Lys	Leu	Cys	Arg	Trp	Asp	Glu	Ala	Leu	Asp	Ala	Tyr	Glu
				1445					1450					1455
Arg	Arg	Leu	Leu	Lys	Glu	Ala	Pro	Gly	Ser	Met	Glu	Tyr	His	Thr
				1460					1465					1470
Ala	Leu	Leu	Gly	Lys	Met	Arg	Cys	Leu	Ala	Ser	Leu	Ala	Glu	Trp
				1475					1480					1485
Glu	Asn	Leu	Ser	Asn	Leu	Cys	Arg	Thr	Glu	Trp	Arg	Lys	Ser	Glu
				1490					1495					1500
Pro	His	Val	Arg	Arg	Glu	Met	Ala	Leu	Ile	Ala	Ala	His	Ala	Ala
				1505					1510					1515
Trp	His	Met	Gly	Ala	Trp	Asp	Glu	Met	Ala	Met	Tyr	Val	Asp	Thr
				1520					1525					1530
Val	Asp	Asn	Pro	Glu	Ala	Val	Gly	Pro	Asn	Ser	His	Thr	Pro	Thr
				1535					1540					1545
Gly	Ala	Phe	Leu	Arg	Ala	Val	Leu	Cys	Val	Arg	Ala	Asn	Gln	Val
				1550					1555					1560
Ser	Gly	Ala	Gln	Ala	His	Val	Glu	Arg	Thr	Arg	Glu	Leu	Met	Val
				1565					1570					1575
Ala	Asp	Leu	Ala	Ala	Leu	Val	Gly	Glu	Ser	Tyr	Glu	Arg	Ala	Tyr
				1580					1585					1590
Thr	Asp	Met	Val	Arg	Val	Gln	Gln	Leu	Ala	Glu	Leu	Glu	Glu	Val
				1595					1600					1605
Cys	Ala	Tyr	Lys	Gln	Ala	Leu	Asp	Arg	Arg	Ala	Ala	Asp	Pro	Gly
				1610					1615					1620
Gly	Ser	Glu	Ala	Arg	Ile	Gly	Phe	Ile	Gln	Gln	Leu	Trp	Arg	Asp
				1625					1630					1635
Arg	Leu	Arg	Gly	Val	Gln	Arg	His	Val	Glu	Val	Trp	Gln	Ser	Leu
				1640					1645					1650
Phe	Ser	Ile	Arg	Ser	Leu	Val	Val	Pro	Met	Ala	Gln	Asp	Val	Asp
				1655					1660					1665
Ser	Trp	Leu	Lys	Phe	Ala	Ser	Leu	Cys	Arg	Lys	Ser	Gly	Arg	Ser
				1670					1675					1680
Arg	Gln	Ala	Tyr	Arg	Met	Leu	Leu	Gln	Leu	Leu	Arg	Tyr	Asn	Pro
				1685					1690					1695
Met	Asn	Ile	Thr	Gln	Ala	Gly	Asn	Pro	Gly	Tyr	Gly	Ala	Gly	Ser
				1700					1705					1710
Gly	Ala	Pro	His	Val	Met	Leu	Ala	Phe	Leu	Lys	His	Leu	Trp	Thr
				1715					1720					1725
Gln	Gly	Asn	Arg	Thr	Glu	Ala	Tyr	Asn	Arg	Ile	Lys	Asp	Leu	Ala
				1730					1735					1740
Ser	Leu	Asn	Gly	Arg	Ala	Phe	Leu	Arg	Leu	Gly	Ile	Trp	Gln	Trp
				1745					1750					1755
Ala	Met	Asn	Asp	Leu	Asp	Asn	Pro	Gly	Val	Ile	Ala	Glu	Asn	Leu
				1760					1765					1770
Ala	Ser	Phe	Arg	Ala	Ala	Thr	Glu	His	Ala	Pro	Asn	Trp	Ala	Lys
				1775					1780					1785
Ala	Trp	His	Gln	Trp	Ala	Leu	Phe	Asn	Val	Ala	Val	Ser	Ala	His
				1790					1795					1800
Tyr	Arg	Cys	Asp	Pro	Met	Arg	Asp	Glu	Asn	Gln	Ala	Val	Ser	His
				1805					1810					1815
Val	Pro	Pro	Ala	Val	Gln	Gly	Phe	Phe	Arg	Ser	Val	Ala	Leu	Gly
				1820					1825					1830
Gln	Ala	Ala	Gly	Asp	Arg	Thr	Gly	Asn	Leu	Gln	Asp	Ile	Leu	Arg
				1835					1840					1845
Leu	Leu	Thr	Leu	Trp	Phe	Asn	Phe	Gly	Ala	Tyr	Ala	Glu	Val	Arg
				1850					1855					1860
Ala	Ala	Leu	Thr	Glu	Gly	Phe	Gln	Leu	Val	Ser	Ile	Asp	Thr	Trp
				1865					1870					1875
Leu	Leu	Val	Ile	Pro	Gln	Ile	Ile	Ala	Arg	Ile	His	Thr	His	Asn
				1880					1885					1890
Thr	Asp	Val	Arg	Gln	Leu	Ile	His	His	Leu	Leu	Val	Lys	Ile	Gly
				1895					1900					1905
Arg	His	His	Pro	Gln	Ala	Leu	Met	Tyr	Pro	Leu	Leu	Val	Ala	Thr
				1910					1915					1920
Lys	Ser	Gln	Ser	Pro	Ala	Arg	Arg	Gln	Ala	Ala	Tyr	Ser	Val	Leu
				1925					1930					1935
Glu	Cys	Ile	Arg	Gln	His	Ser	Ala	Ala	Leu	Val	Glu	Gln	Ala	Gln
				1940					1945					1950
Leu	Val	Ser	Gly	Glu	Leu	Ile	Arg	Met	Ala	Ile	Leu	Trp	His	Glu
				1955					1960					1965
Met	Trp	His	Glu	Gly	Leu	Glu	Glu	Ala	Ser	Arg	Leu	Tyr	Phe	Gly
				1970					1975					1980
Glu	Ser	Asn	Val	Glu	Gly	Met	Leu	Asn	Thr	Leu	Leu	Pro	Leu	His
				1985					1990					1995
Glu	Met	Leu	Glu	Lys	Ala	Gly	Pro	Thr	Thr	Leu	Lys	Glu	Ile	Ala
				2000					2005					2010
Phe	Val	Gln	Ser	Tyr	Gly	Arg	Glu	Leu	Ser	Glu	Ala	Tyr	Glu	Trp
				2015					2020					2025
Leu	Met	Lys	Tyr	Lys	Ala	Ser	Arg	Lys	Glu	Ala	Glu	Leu	His	Gln
				2030					2035					2040
Ala	Trp	Asp	Leu	Tyr	Tyr	His	Val	Phe	Lys	Arg	Ile	Asn	Lys	Gln
				2045					2050					2055
Leu	Arg	Ser	Leu	Thr	Thr	Leu	Glu	Leu	Gln	Tyr	Val	Ser	Pro	Ala
				2060					2065					2070
Leu	Val	Arg	Ala	Gln	Asp	Leu	Glu	Leu	Ala	Val	Pro	Gly	Thr	Tyr
				2075					2080					2085
Ile	Ala	Gly	Glu	Pro	Leu	Val	Thr	Ile	Ala	Ala	Phe	Ala	Pro	Gln
				2090					2095					2100
Leu	His	Val	Ile	Ser	Ser	Lys	Gln	Arg	Pro	Arg	Lys	Leu	Thr	Ile
				2105					2110					2115
His	Gly	Gly	Asp	Gly	Ala	Glu	Tyr	Met	Phe	Leu	Leu	Lys	Gly	His
				2120					2125					2130
Glu	Asp	Leu	Arg	Gln	Asp	Glu	Arg	Val	Met	Gln	Leu	Phe	Gly	Leu
				2135					2140					2145
Val	Asn	Thr	Met	Leu	Ala	His	Asp	Arg	Ile	Thr	Ala	Glu	Arg	Asp
				2150					2155					2160
Leu	Ser	Ile	Ala	Arg	Tyr	Ala	Val	Ile	Pro	Leu	Ser	Pro	Asn	Ser
				2165					2170					2175
Gly	Leu	Ile	Gly	Trp	Val	Pro	Asn	Cys	Asp	Thr	Leu	His	Ala	Leu
				2180					2185					2190
Ile	Arg	Glu	Tyr	Arg	Glu	Ala	Arg	Lys	Ile	Pro	Leu	Asn	Trp	Glu
				2195					2200					2205
His	Arg	Leu	Met	Leu	Gly	Met	Ala	Pro	Asp	Tyr	Asp	His	Leu	Thr
				2210					2215					2220
Val	Ile	Gln	Lys	Val	Glu	Val	Phe	Glu	Tyr	Ala	Leu	Asp	Ser	Thr
				2225					2230					2235
Ser	Gly	Glu	Asp	Leu	His	Lys	Val	Leu	Trp	Leu	Lys	Ser	Arg	Asn
				2240					2245					2250
Ser	Glu	Val	Trp	Leu	Asp	Arg	Arg	Thr	Asn	Tyr	Thr	Arg	Ser	Ala
				2255					2260					2265
Ala	Val	Met	Ser	Met	Val	Gly	Tyr	Ile	Leu	Gly	Leu	Gly	Asp	Arg
				2270					2275					2280
His	Pro	Ser	Asn	Leu	Met	Leu	Asp	Arg	Tyr	Ser	Gly	Lys	Leu	Leu
				2285					2290					2295
His	Ile	Asp	Phe	Gly	Asp	Cys	Phe	Glu	Ala	Ser	Met	Asn	Arg	Glu
				2300					2305					2310
Lys	Phe	Pro	Glu	Lys	Val	Pro	Phe	Arg	Leu	Thr	Arg	Met	Met	Ile
				2315					2320					2325
Lys	Ala	Met	Glu	Val	Ser	Gly	Ile	Glu	Gly	Asn	Phe	Arg	Thr	Thr
				2330					2335					2340
Cys	Glu	Asn	Val	Met	Arg	Val	Leu	Arg	Ser	Asn	Lys	Glu	Ser	Val
				2345					2350					2355
Thr	Ala	Met	Leu	Glu	Ala	Phe	Val	His	Asp	Pro	Leu	Ile	Asn	Trp
				2360					2365					2370
Arg	Leu	Leu	Asn	Thr	Thr	Glu	Ala	Ala	Thr	Glu	Ala	Ala	Leu	Ala
				2375					2380					2385
Arg	Thr	Asp	Gly	Gly	Gly	Gly	Gly	Gly	Gly	His	Met	Asp	Gly	Pro
				2390					2395					2400
Gly	Gly	His	Pro	Gly	Gly	Arg	Asp	Ala	Leu	Gly	Gly	Gly	Gly	Gly
				2405					2410					2415
Gly	Ala	Gly	Gly	Gly	Gly	Gly	Gly	Asp	Pro	Gly	Ala	Met	Pro	Ser
				2420					2425					2430
Pro	Pro	Arg	Arg	Glu	Thr	Arg	Glu	Lys	Glu	Leu	Lys	Glu	Ala	Phe
				2435					2440					2445
Val	Asn	Leu	Gly	Asp	Ala	Asn	Glu	Val	Leu	Asn	Thr	Arg	Ala	Val
				2450					2455					2460
Glu	Val	Met	Lys	Arg	Met	Ser	Asp	Lys	Leu	Met	Gly	Arg	Asp	Tyr
				2465					2470					2475
Ala	Pro	Glu	Leu	Cys	Val	Gly	Gly	Gly	Ser	Gly	Ala	Ser	Gly	Met
				2480					2485					2490
Glu	Pro	Asp	Ser	Val	Pro	Ala	Gln	Val	Gly	Arg	Leu	Ile	Asn	Met
				2495					2500					2505
Ala	Val	Asn	His	Glu	Asn	Leu	Cys	Gln	Ser	Tyr	Ile	Gly	Trp	Cys
				2510					2515					2520
Pro	Phe	Trp												
														
<210> SEQ ID NO: 94
<211> 1190
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TGG GAG TCC AGC CAG CGC AGC ACC AAG GAC GAC TGG GCG GAG TGG ATG 48
CGC AAC TTC AGC ATC GAG CTG CTG AAG CAG TCG CCC TCC AAG GCG CTG 96
CGT GCC TGC GCC TCC CTT TCC CAG GCC AAC CCG GCC ATG GCG CGG GAG 144
CTG TTT GCG GCG GGC TTT GTC TCC TGC TGG TCG GAG CTG GAG GAG AGC 192
CTG CAG GAC CAG CTG GTG CGG TCG CTG GAG GCG GCC CTG GCC AGC CCC 240
ACC ATT CCC CCC GAC ATC GTG ACC ACG CTG CTC AAC CTC GCA GAA TTC 288
ATG GAG CAC GAT GAG AAG GCG CTG CCC CTG GAC ACG CGG ACA CTG GGC 336
GCC CTG GCC GAG AAG TGC CAC GCC TAC GCC AAG GCG CTG CAC TAC AAG 384
GAG CTG GAG TAC AAC ACC TCC CCC TCC ACG GCC ATC GAG GCG CTC ATC 432
AGC ATC AAT AAC CAC CTG CGC CAG CCG GAC GCG GCG GTG GGC ATT CTC 480
ACA GCG GCG CAG AAG ACC AAG GAA ATG ACC CTC AAG GAG TCC TGG TAT 528
GAG AAG CTG CAG CGC TGG GAT GAA GCG TTG AGG GCC TAC CGC AAG CGC 576
CTG GAG ACG ACT AAG CCG GGG TCG CCG GAG AAC ATC GAG GCG CTG CTG 624
GGG GAG TGC AGG TGC CTG GCG GCG CTG GCG GAG TGG GAG GAG CTG TAC 672
TCG GTG TGC CGA AGG GAG TGG CAG AAG ATG GAG CCG CAG ACG CGG CGC 720
GAG CTG GCC CCC GTC GCC GCA CAC GCC GCC TGG CAG CTG GGC GAG TGG 768
AAG TGC ATG GAG GAC TAC GTG GAC GTC ATC AAG CAC CAG CAA GCG GGG 816
AGC TCC GAG GGC GCC TTC CTC TCC GCC GTC CTG CAC GTG AAG AAG GAG 864
GAC TAC ACC GCG GCC ATG GTC GAC GTC GAC CAG GCG CGG GAG CTG CTG 912
GGC ACG GAG CTG TCG GCG CTG GTG GGC GAG AGC TAC GAG CGC GCC TAC 960
GGC GAC ATG GTG CGT GTG CAA CAG CTG ACG GAG CTG GAG GAC ATC CTG 1008
ACC TTC AAG CTG GCC GAG CAG GTC ACC AAA GGA AAC CCC GCC ATC ATG 1056
ACG CAC ACC AAG AAC TTC ACG CAG GCG ATG TGG GCG GGG CGG ATG CAG 1104
GGC GTG CAA CGC AAT GTG GAG GTC TGG CAG GCG CTG CTC AGC GTG CGT 1152
GGC CTG CTG CTG GAC ATG CAC GAG GAC ACC GCC ACC TG 		1190

<210> SEQ ID NO: 95
<211> 128
<212> Amino Acid Sequence
<213> Arabidoipsis thaliana

Met	Gln	Ile	Phe	Val	Lys	Thr	Leu	Thr	Gly	Lys	Thr	Ile	Thr	Leu
1				5					10					15
Glu	Val	Glu	Ser	Ser	Asp	Thr	Ile	Asp	Asn	Val	Lys	Ala	Lys	Ile
				20					25					30
Gln	Asp	Lys	Glu	Gly	Ile	Pro	Pro	Asp	Gln	Gln	Arg	Leu	Ile	Phe
				35					40					45
Ala	Gly	Lys	Gln	Leu	Glu	Asp	Gly	Arg	Thr	Leu	Ala	Asp	Tyr	Asn
				50					55					60
Ile	Gln	Lys	Glu	Ser	Thr	Leu	His	Leu	Val	Leu	Arg	Leu	Arg	Gly
				65					70					75
Gly	Ile	Ile	Glu	Pro	Ser	Leu	Met	Met	Leu	Ala	Arg	Lys	Tyr	Asn
				80					85					90
Gln	Asp	Lys	Met	Ile	Cys	Arg	Lys	Cys	Tyr	Ala	Arg	Leu	His	Pro
				95					100					105
Arg	Ala	Val	Asn	Cys	Arg	Lys	Lys	Lys	Cys	Gly	His	Ser	Asn	Gln
				110					115					120
Leu	Arg	Pro	Lys	Lys	Lys	Ile	Lys							
				125										

<210> SEQ ID NO: 96
<211> 894
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
TGC TCT TTC GGC ACC GTA GAC GGG CGT TCT GGG GGA GCT TCA CAA TGC 48
AGA TCT TCG TGA AGA CCC TCA CGG GCA AGA CAA TCA CCC TTG AGG TGG 96
AGT CCA ATG ACA CCA TTG ACA ACG TGA AGG CCA AGA TTC AGG ATA AGG 144
AGG GAA TCC CCC CGG ACC AAC AGC GCC TGA TCT TTG CGG GCA AGC AGC 192
TGG AGG ATG GCC GCA CCC TGT CCG ACT ACA ACA TCC AGA AGG AGT CCA 240
CCC TCC ACC TGG TCC TGC GCC TGC GTG GTG GAA TCA TCG AGC CCT CCC 288
TGC TCA TCC TGG CCC GCA AGT ACA ACC AGG ACA AGA TGA TCT GCC GCA 336
AGT GCT ATG CCC GCC TGC ACC CGC GCG CGG TCA ACT GCC GCA AGA AGA 384
AGT GCG GAC ACA GCA ACC AGC TGC GCC CCA AGA AGA AGC TGA AGT GAT 432
CTG AGC CTC ATG CTT TGT GAT TGG GTG TAA TGT GGT GCC AGA ACG AGG 480
CAG GGA CGA TGC CTG GAG TTT GGG GGT AAC AGC TGC CGG GGC GTG TGC 528
TGC AGG GAG GGA TGC TCT CTG CCC TGG GAG GCT CGT GCG AAA GTG GCT 576
TGG ACC CAT GTA TGT TGG ATG TTA AGG GTA GAG GTC CCT AAA GGC CCC 624
TCC GAC ACT GCT GCG TCT CGG TAC ACC CGG GAT GCA TGC TGC CTC TCA 672
TTT CGC GTA TTT TTA AAG ATT GCC ACA GAG CCA GCA AGC ATG GCA TTT 720
TTG TGC TGG CAG CAG GAG GAA TCC GAG CAA TGG AGT TGG AGC GAC GCC 768
ATT CAC AAA GTC ATA GTA CAA ACT TGA AAG GGG GAG GGG GGG TGG TCC 816
AGA AGA CAG ATA CAG AGT CAC AAT GCA GCG AGG TCG TCA TCA TGC ATC 864
TGC CTT CTT CTT CTT CTT CTT CTT CTT CTT 			894

<210> SEQ ID NO: 97
<211> 353
<212> Amino Acid Sequence
<213> Chlamydomonas reinhardtii

Met	Leu	Arg	Asn	Ala	Val	Met	Leu	Trp	Trp	Met	Gly	Val	Arg	Glu
1				5					10					15
Ala	Leu	Ala	Val	His	Arg	Cys	Val	Val	Phe	Leu	Val	Leu	Asp	Glu
				20					25					30
Ser	Lys	Ala	Leu	Gly	Lys	Ala	Val	Leu	His	Cys	Phe	Val	Leu	Asn
				35					40					45
Gly	Ala	Ile	Leu	Leu	Gly	Ser	Ile	Leu	Ala	Trp	Asp	Tyr	Gly	Leu
				50					55					60
Gln	Pro	Ala	Val	Gly	Trp	Leu	Leu	Arg	Val	Leu	Val	Ala	Pro	Val
				65					70					75
Tyr	Gly	Gly	Ala	Val	Ala	Gly	Gly	Ser	Arg	Trp	Leu	Leu	Gly	Thr
				80					85					90
Ala	Phe	Gln	Ala	Leu	Trp	Leu	Ala	Pro	Val	Tyr	Leu	Val	Thr	Met
				95					100					105
Leu	Val	Ser	Cys	Gly	Ile	Tyr	Asn	Asp	Val	Ala	Lys	Tyr	Ala	Tyr
				110					115					120
Gln	Ile	Lys	Thr	Arg	Gln	Gln	Lys	Gly	Gly	Ala	Gly	Gly	Lys	Gly
				125					130					135
Ala	Ala	Ala	Gly	Ser	Gly	Gly	Ala	Gly	Gly	Gly	Ser	Ala	Ala	Gly
				140					145					150
Gly	Gly	Gly	Gly	Ser	Gly	Gly	Gly	Gly	Gly	Leu	Glu	Asp	Ala	Ala
				155					160					165
Gln	Glu	Leu	Tyr	Arg	Val	Val	Leu	Phe	Cys	Ile	Phe	Phe	Ala	Glu
				170					175					180
Val	Ser	Leu	Val	Gly	Lys	Leu	Pro	Tyr	Val	Gly	Tyr	Phe	Leu	Asn
				185					190					195
Val	Leu	Ala	Leu	Ser	Trp	Leu	Tyr	Ala	Tyr	Tyr	Cys	Phe	Asp	Tyr
				200					205					210
Lys	Trp	Gly	Leu	Gln	Gly	Val	Arg	Leu	Thr	Glu	Arg	Leu	Ala	Tyr
				215					220					225
Phe	Glu	Arg	Arg	Trp	Ala	Phe	Phe	Ala	Gly	Phe	Gly	Leu	Pro	Met
				230					235					240
Ala	Leu	Ser	Thr	Val	Leu	Leu	Ser	Phe	Tyr	Pro	Gly	Ala	Ala	Val
				245					250					255
Leu	Ala	Val	Leu	Phe	Pro	Val	Tyr	Ile	Leu	Val	Ala	Cys	Asp	Cys
				260					265					270
Asp	Val	Asn	Ala	Ala	His	Asp	Cys	Val	Leu	Gly	Pro	Gly	Gly	Ala
				275					280					285
Ala	Gln	Leu	Arg	His	Leu	Pro	Ile	Phe	Ala	Leu	Ala	Leu	Trp	Pro
				290					295					300
Thr	Gln	His	Val	Val	Gln	Leu	Ile	Thr	Gly	Ser	Ser	Ser	Ser	Asn
				305					310					315
Ile	Ser	Arg	Ser	Arg	Ala	Ala	Ser	Val	Ala	Gly	Met	Arg	Ala	Val
				320					325					330
Val	Gly	Phe	Ser	Asp	Ser	Ala	Glu	Lys	Ala	Gly	Ala	Asp	Gly	His
				335					340					345
Gly	Gly	Gly	Ala	Arg	Ala	Tyr	Arg							
				350										

<210> SEQ ID NO: 98
<211> 798
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
ATG CCG TGG GGC CGC AGT GCC GAT CTG GCG ACA TTC AAG CGC CAG GCC 48
AGC TGC CAG CTA CAC GCA GGG TCC ACC AGG ACC ATA CAA TGC ACG GTC 96
CTG AAC GGC GTC GTC TTC CTG GGC AGC GTC CTC CTG GTC CGC CAC GGG 144
CTG CAG CCG CTG CTG GCG ACC ACC CTG TCT GCC CTG CTG GGC CAG GGC 192
CCC GGC TCC GCG GCC ACC TCG CTG CTC ATG GGG GCA TAC GGC GCG CTG 240
TGG GTG CTG CCC GCC CAC GTC GTG AGC CTG CTG ATC AAC ACG CTG TGG 288
TAC GCT GAG CTG GCG GAG CTC ACC ATT GCC GCG GGC CAG CGC CAG GCC 336
CTG GTG CGG CGG GCC GAG AGG CTG GGC GAG GGC CTG AGC GCG CGG CGC 384
TCG CCC ATC AAG GCC AGG ATC CCC GAC CCC ATG ACC CTC ATC TCC GAA 432
TCG GTG TAC CGC ATC CTC CTC CTC TGC TCC ATC TCC CTG CAA ACC TTT 480
GTG CTG GAT CTC CTT CCC GTC ATA GGG CGG GCG ACC AGC GCG GCC CTG 528
ATG GCC TGG CTC TAC GCC CTC TAC TCA TTC GAC TAC AAG TGG GCG GCT 576
CAT GGC ATC CCG CTC ATC ACC CGC GTT TCT TTC TTC GAG ACG CAC TGG 624
GCC TAC TTT GCA GGG TTT GGC CTG CTC CCC ATC CTG CCG GTC CTG CTG 672
ACC AGC TTC TTC ATA GGT TCC GCC GTC GTG GGT GTG GCC TTC CCC CTC 720
TTC ATC ATC TTG GCT GCG GAT GCC AAC CCC AAA GCG CGG CAA GCG GAG 768
GGG AAC CTG CAG GCT GAT GAT GCT GGG TAG  			798


<210> SEQ ID NO: 99
<211> 4083
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CCTCGAGCTG CAGAAGGTTC GAAGGGTTCG GCTGTTCGCC GATTAAAGTG    50
 GTACGTGAGT TGGGTTTAAT ACGTCGTAAA GAGTCTTGCG ACCTTAAGGA   100
 GCAATCCTTA TGTAAAAATT GCCTCATAAC GGTGAAGTCC TAATAGTAGC   150
 CTCATACACT CATTAGGGGA ATACCGTGGG AAGTCTATAA TATAAGAAAT   200
 AAAATGTGTA TCTCTTTAGT CTATTTTGCC TAGAAGGTCA TGATATACTG   250
 TGCACAATTC TTTATATTTC AGTGACCCCG TAACGACTTT AATTCTATAT   300
 CTAATAAAAA CAACTGGACA TAATAAATCT CATTTATTTT AAATTATAAT   350
 CTATATCTAT GTAAACAAAT ATAAGTTATA ATTTATATTT AGTTTTAGAT   400
 TAGAATTAAG ATATCTTTTA AATTAAAGAT ATAACACGGC AACTCTTATT   450
 ATAATAATAA GATGAAGGTA TAGTCTGAAC CTTAAGTAAT TAGGGATTTT   500
 TCTAAGTAAT TAGAACTTGT TTATTAAAAC TGTAATCTTA TTTAATACGT   550
 TTGTAAACGC TTAGAGACAG TATGTATCCT ATCTTTTTGG GGTGTAAAAA   600
 AACTGACAGG ACGATTTCTT TGTACGAGAG GACCGGAAAT CACGAACCTC   650
 TTGTGTATCA GTTGTTATGC CAATAGCATA GCTGAGTAGC TACGTTCGGA   700
 CTAGAGAACC GCTGAAAGCA TCTCAAGCGG GAAACTAACC TGAAAACAAG   750
 TTTTTAGTAT CGTAGAAGAC TACTACGTTA ATAGGTGGAA TGTGTAAAAA   800
 TAGCAATATT CCAATTTTAT TTCTAGTTTT TAACAAAAAA TAAAAGAGCA   850
 AATCCATACT AAACTCTATT TTAAATATTA TTTATATTTT ATAAATTAGG   900
 TGTAGGTAGA AATAAATAAA ATTTATTTAG ATCAATATCT AATTTATAAA   950
 ATATATAATT TTTATCATTC TGTTGCGGGC ATATATTAAT GGTAAATTTG  1000
 TTATTTTCCA AGTAATCGAT ATGGGTTCGA TTCCCATTGT CCGCTTGTGT  1050
 ATCGAGACAA GTAGATAAGA TTTAGGATAT AACTCTATTT ATAAATTTAG  1100
 GTAGATAAAA AAATTATATC AATTCTGCTA TTATGTACAA ATATGTTTTT  1150
 TACAGAGTAT AGCGCAGCCT GGTAGCGTGT TCGCTTTGGG AGCGAGAGGC  1200
 CGCAGGTTCG AATCCTGCTA CTCTGATTTC GTGTTTTATA GCCCTGCATA  1250
 ATAAACTTTG TTTTATTAAA TAGCTTTATA CCCAAAATGT ATCTGATATA  1300
 ACTATATAAG GTTGTTTTTA GTTACAGTGG GTTTCTCATC ATCTTGATCA  1350
 TCGTTTTATT TTTAGTCCTA TGCATCTCTG TACCAATAAT AACAAAAACT  1400
 AAAGGTAAAT CAAATTCTTC AATTGCATGT AAAAGGCCAG TAGGTATTTA  1450
 TAATCTATTT TTTCCCTGCT TTTTTATTAA CTAAGACTCT TAGAATATAA  1500
 ATTTAAGAAA GTATATAAAA TAAACTATAA CATATATCTT TTAGTAAATA  1550
 TGTATGATAG TGAATATGCT TGATAGTTAA CCAGCTTATT TAGTAACATC  1600
 TCCCTAATGG ACCCCCCGGA AGACAAGCTG TAGAGCAAGT TAGAGAGCAA  1650
 AGAAAAGAAT AAAATATAGA TTGATTAGTC ATATAACTTT TAACTAGTAA  1700
 AGCCTATATT TCCTATTATT GTATAAACTT TGTAATAAGT AATTCGAATA  1750
 ATTAATAATG TTATATACTT TTTACTATAT TATTATAGTA ACTATTAATT  1800
 GTATAGATTT ATTAATTGTT AATTAGAATA AGTAATTAGA TTGATTACGC  1850
 AAGTAAAAAC CTAAATCAAC TTTGTAAAGA CTAATTGTAA AACTAAATAG  1900
 ATTGGTATAA ATAAAATGCT GTTGTTCCAA ATGAGGCATT GTAATAATTG  1950
 CAACATATAA TTTATAATAC CAAGGATTAA AAGTATGGTT ATTGAGCTTG  2000
 TGAGCATATA AATTGATAGG TAATTAGTTT CAATTATACA GTAAAACGTA  2050
 AATATATCAA ATATAGCATC TTTTATTTAA AATATAATAT AAAAGCATTT  2100
 CTAGGTATAC ATTTTTTATC ACACCACCTT AAAAAGTATA CTTATTTTCC  2150
 TTATATTCTC CGTACCATGT AAAATAAATG AAAATTTATT AAANNNNNNN  2200
 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2250
 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2300
 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2350
 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2400
 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNT AAAATAAATG AAAATTTATT  2450
 AAAATAAATG AAAATTTATT TATTGGTAAG TGGTTCAACA ATATGATCAT  2500
 TANNTAATGA TCATATTGTA GATTATTGTC TAGTCTAAAA CATTGAAAAT  2550
 AATTTTATTG AAGTGATTAG TAGGCTTAAA TTCGTCGATG GGTTTTATTT  2600
 TTATCTTTCG TAAATCAAAT AAATAAATAT CATTGGTTTT AAACTTGACT  2650
 AGAAATGCGA AATGTAATAA AAAACCTCCT TATAAACAAA ATTTCTATGG  2700
 TTAATTTTGT TTATTTATTT TACTTATATT AGAAAAACAA AAGCCCCATA  2750
 GTGTAATAAA AAATAGAAAT GTCAATCGAA TTGCAGCCTG AAATTAGAGA  2800
 CAATAGAATC TATTGTATTG TCTCTTTTAA TAGAATCTTT AAATTATACT  2850
 AGTGAAATAA ACAAAGTTTA TTAAAATAAA CCAAGTTTAT TAAAATAACT  2900
 GAATTTTATT TGATTTATTT CCGAGTAGTA AATGCTTTTT CAAGCAAATT  2950
 TAATGTCTCG TCTTTATTTG GATAATATCT ATCGTATTTT ACAGTAGTGC  3000
 TGATATTAGT ATGTCCTATT AAATTTGATA CAATTTGTAA AGGTGCATGT  3050
 TTTAAGAGTG ATGTTACAAA ATTAACTCGA AAAGAGTGTG ATTTAATATT  3100
 GAGTCCTAGG TGTCGAGTCT TCTCTAACAA TCTACTATTA ATAAAATAAA  3150
 TCCAACTATT TTCGGCGATG CCTCCAGATA ATGTAGAATA CTTGTTAAAA  3200
 ACAATAGTAC ACTCATTTTC AAGTGCTCTA AACTGATCAC AACTTTGTTT  3250
 GGTAAAAAGT ATTAATCTAT ATTTATTAGT CTTAGGTTGA TAAACTTGTA  3300
 AAGATTTGTG ATGTATAATT GAATCAATAT CATCCTGTGT AATGTTTCGA  3350
 ATTTCATTTA CCCTCAATCC AGCAGCCCAC AATAGAGTAA CTGCAACTCT  3400
 GAATCTAGAA TATTGTAAAT TTTTTTCTTT AAATTCTCTT TGTAGACTCA  3450
 TAAGATAAAA GTAGACATCA TTGTTAGCTG GTTGCCTTAG AGGTAAAGTA  3500
 TTTTGTTTAT CTCTCCTTTT TGTTTTTATC GTATGTAATA TTTCACGAGT  3550
 GACTTGTTGT AACTGGTCGA CACTCTGTTC GATTCTCCTT ACAGAATCTT  3600
 GTGTTTCCGT AATAACGACA TATGGGTTTT TATCGTAGTG TATTTCAACC  3650
 ACAGGGTCAT TATGAGGTGT CTCGATAATC TGTAAATCTT TATCTTTTTT  3700
 TAAAACAGAC TCTAACTTAT TTTCTAGTTT ATTTATTTGA GGCATAAATA  3750
 ATTAATTTCG TGATTATTAT ATTCATATCA AGCAATATAG ATGTTTGNTT  3800
 CTANANNANN NTNNTNNNAT NTNGCTTGAT AATAATCACG TCTCCCGCAT  3850
 AAAGTAGTAC ATATGATGTA GAATGCTTTC ACAAATCGAG TTCGGAAAGG  3900
 GTTCGGGTGG TTCACATTCC CAGATATCAA GCTTTTNTNC TTTKNKNNKA  3950
 ANNNNTAGGC GAATAACGGG ATTTGAACCC ATGTTTCCAG AGTCACAGTC  4000
 TAGCACTTTA ACCGACTAAG TTATACTCGC CAAGAAAAAC AATAAATTTA  4050
 TACTGAACTA ATAACTGATA TTCATAAAGA ATA                    4083

<210> SEQ ID NO: 100
<211> 613
<212> Nucleic Sequence
<213> Auxenochlorella protothecoides

<400>
CAC CGA GAT CTA CCC TCT TTC CCG ACC CGA CGC TCT TCC GAT CTC TGG 48
TGG CCA CGG CGG TCG TGG AGG AGG GCA TCG ATG TCC GTC GTT GCC AGC 96
TGG TGG TGC GAT TCG ACC TCC CGC CCA CCG CGC AAA GCT ACA TCC AGT 144
CAC GTG GAC GGG CGC GCA TGC AGA ACA GCG TAC TGA TCC TCA TGG TGC 192
AGA CAG AGC TTC CCG AGG AGC TGG AGA TGA TCA ATC ACA TGA TCA AGT 240
TTG AGG CGG ATT TGA GGC AGG AGG TGC TGT CCA ATA TTC ACA AGA TGA 288
AGG AAA AGG GCC TGG TGG AAG ACG CCT CCG AGT CCT CAG AGA GCG AAG 336
AAG ACG ATG GCA TCA ACG ACG AGG AGC TGC GGT ACA TGG TGG CCA GCA 384
CCG GCG CAC GAG TGA GCG CAG GTA ACG TCC TCC GAC TCC TCC ACA CCT 432
ACA TCT CCA AGC TGC CCA CTG ACA GGT ACT CCA TCC TGC GGC CAA CCT 480
ACC GCA CAG AGG CAC TGC TCG ACG GCA TGT ACT TGA CCC ATG CCT TCC 528
TGC CCA GCA ACT GCC CCC TGC GGG AGA CGG TGG GCA TGC CGC AAT CCA 576
CCC GCC GAA AGT CCA TCG CCT CAG CGG CCC TCA ACG C 		613

<210> SEQ ID NO: 101
<211> 924
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GTGCCAGGAG GATGACTAGG AGGGTGATTT TGAATGGCGG GCTGGCATGT   50
GCATAGGGGG TGCACCCACT GCCAGATAAA AAGTAATGCC CAGTGATACC  100 
TCCTCCACTC CACACACAGA TACATGCGCT AGACTCTCAC AATCATATGA  150
CCGTGATGGT CGCGCGCAAC TCAAGCCGGC TATAACGTGC ACCAGCAGAA  200
ACGCATTCCC CCCTTGCGAC CACCACTTCT CCGACGTACA CCAGGGAGTG  250 
CAAACACATC CCTCGCCAGA TTTATCTCAG CATGGCGCTG GTTGCCTTCT  300
GTAAAAACTG ATAATCTAAA TCTGGAGTTT GTGAGTGGCC TCGACCTGTT  350
CTTGAACTCC GCCGCTGGGA GGGAGTGGTG TGCTGCTGCC CAAACCTAAT  400
AGATTCTCCG TCAAAATATG GCGGGACCGG GGCGATCCAA AATCTCCGAC  450 
ACAGATTTCG CACCTTCCGT GGCGCTTCAC CCGCGTGCTC TCGCTGGGTC  500
TCGCAGAATA ACGTGGGCGA CAAACACACC TGCCAAAAGA ATTATAAGAA  550
TTAATAAATT AAAGGGATTA CTGACTGTCA TCGCCCGTCA CCCCGTGGTG  600
AGGTACCCGA GTCGAAGCTG AACCCCGTCA TCCCGGCGGC TGCGCAAGGT  650
ACGGATGCGA GGATGGGGGG CCCGGATCTA TAAATCTTGC CCGTATTCGC  700 
AGTGCATGTC GGCACCACAC GGGCTCGCAC CATCTGCCGT GGGGATCACG  750
GGCCCGCGGC CTTTGGTTGA CGTAAGCGGT TTCTTTGCTG AGTCAGACTA  800
ACGTTCACGA AGTGCGGCTG CGAGGTATAC GTCTCACGAG CCCCTGTACG  850
GCCACAAAAC CTCCACTCAG GGGAAACAAC TTCGAGTTGT CACATTTATC  900
TACCCTCCCG ACAGAGTAAC CATG                              924

<210> SEQ ID NO: 102
<211> 706
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TCTTCCGATC TAGCCAGGCC GCGATGGTGG CCACCCCGCC CCTCATCGGC   50
CACTACCAGC TGCGGCGGCT GCGGGAGGGG CTGGCGCGGC AGGAGGCGGG  100
CCTCTCCGGC ATGCTGGCCG GGCTGGGGGC GCTGGAGCGG GCGCTGGCGC  150
AGGCCGGCGC CTGAGGTGGG GAGTGGAAGG GGGGTGGGAG GTGGGGGACC  200
TCAGACCCAC GCGGCTCCGC ACCGAGCCTT CTTGGCCCCA GTCGGCGGTG  250
TGCTGCTGGC TGTAAGAGTG TGCATGAGCA CGGTCACAGG CAGGAGATCT  300
AAAGAGGGTG ACTGCACGCG AAGTGCCCCA CCTGGTGGAG CAGGTGCCCT  350
GGCTCCAGGG AAAACGTGAT CATTCCGGGT GAGAAGCACC TGCTCCGGAA  400
GGGTGGCGTT CCCGGATCCA GGGCCCCGCC GGCCCCCGAA AGACTCCGGC  450
TTTGGGAAGA TGTGGTGTCA GCGACACGTT TGGGTGGCCA AAGCGATGGC  500
TCGACCGCCC AGGATCGCAG CACCTCGTTT CGGGCGGGGT GGCTGGAACC  550
CACCACGAAA CCGTTCATGC ACTTTCCGGG GTCGGATGGA GCCGCCCCCT  600
TATCAAACAG CATCCGCATG CCCCCCGATG AGCCGACGNC CCCCCCCCCT  650
CGCCCTTATC CCGGACTCCC CAGCCCTTTT GCACCTCCGC CCTCCCCCCT  700
CCCACG							706
                                                                                                                    
<210> SEQ ID NO: 103
<211> 1105
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
AACATTACTT GGTGTGGCCA ATGATCTTTC TGACAAGTAT TTGCTCGGAT    50
ATCCAAACGT TCTTTGCACC TCCTCGATCA TGCCAAACCC TAAGAGAACC   100
GCATTATCTT CAAACGGATA TTGCCCTCGC CGAGGGCAAC GTGCGATGGA   150
CACACGCTCC TTCATACTCT AAGAACCCTG GAAATGGCAA TTCATGCACG   200
ACTGCATCGT CATGTAAAAT ACTGTTGTCC GGAATAATCC AAGGTTGCAC   250
GGTCCATTGA CACTGATGCA CATGGAGTAG AAGCCCTCAT CCTCCAATGG   300
ATTGCAGCTG GATCATTCTG CCACGAGCGC CGTCTCTCGG AGGGCACCAA   350
TCTGCATTGA ATGACTAGAT TGTAAATGCT TGTCCAAACG ACACCAGCTG   400
AAGCAATTGA CTAGCGAGAA TACATGTACA TGGAAGCCCC AAGGGGGATA   450
GTTGCAGAAG CTTCCAAGCA ATACAGCTTG TTCACCCTCC CTTCCTTTCT   500
CAATTTTACT GAATCATGGA TATGACGTCC TTCAACATGA TACCTACAGC   550
TTCAGCCATC ATTGAGTCCA CCAAAGGTGA ACTTGATGAA ACGTTAGGCT   600
TCACAATTGA GTGGGTGTGG CATACCTCTG CTTTGGGGAA GAAAGATCCT   650
TATGCTCTCT TCGATTATCT CGGTGAAGCT TAGTATTCTG TCTCTGGGCC   700
CCAAATGGCG GTCCATGCCT GGCGTTCCAT TTGCGCGGGA AATCGTGGGA   750
ACTCCTGCGG CGCACTTTCT ATCGGGCTTA TTCTGCCGCC CGAGAAGTGC   800
CTGGGCGCAA GGAGGGGTCG CCCTCTTTCG GCACCGTAGA CGGGCGTTCT   850
GGGGGGTGAG CTGTGAGCAC GGGCAGGCAG AGGCTCGAGG GGATGTATAG   900
GCTGCGCGGG ACGGTGACGA CGGCTGATGC CAGCGCGTGG CTGCCCCGCT   950
TGTGCTACTG GGCGGTTGAG GTTCTGTGCG GTAGCGACCA GATACGCTGA  1000
TTGAATATAT GGTGCCGTTT GAGGACCACT GGTGCCCTTG CACGGCCTCT  1050
GAGGGCTGCT ACCGCCTTCC AACCACGACC TCCCCTTCCC GCACAGAGCT  1100
TCACA 							1105                                                     

<210> SEQ ID NO: 104
<211> 1016
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATGCCGCCGC CGCCGCCCTC CCGCGTCCAT GCGTCTTGCC TGAAGGGGGT    50
GCCATCTGCG GCCGAGATGG CCTCGCAGAT GGAGTCCTGG GCGGCTCTGA   100
TGGCCTTTTC GAAGCGCCCT CGGATGCTGC GGGGGTGTCA AGTGGGGGAA   150
GGTAGACAGT GGAATGCTGG CGACCATGCT GAGCAGCCTT TGGTGAGGAC   200
ACATCATGGG ACCCCCTCTG TCTTTCACAG TGGACAGCGG GATTCCCACC   250
AGCGACACCA CCCAGCATAT CGAGGCTCCA ACACATGTGG TTTTGAATAA   300
GAGGGCGACT CATGACCACA CCTGTCCGCC GAATCAACTC CCGGCCGAAG   350
CAGGGAGCCC GGAGTTGTGT CGATGTAGTT CTCCTTGGGG ATGGACTGCA   400
GGATTTTGGT TGGCATTTAT TCGGTTGATA GACAGTCATG GCCTGCTGCG   450
CGGGCATCAA GACGTCCCCC TGGCTTCCCA GAGGAGGGGA ACTCCACCAT   500
CGTACCGAAA CAGCCATGGG CGCCGTCCGT CTCTGTCGTC GTGGAGCCGG   550
GGTGCAGACC TGGAAGGAAT GAAGATTCAA CTTCTGCCTG GGCACTGCAG   600
GAATGTGCTT CGAGCCGTGC TGTACGCGGC CCAGAGTGCA GCCAAGAGGC   650
TGCACCATTG CGGTAGCACC CATGTGTCCC ACGATCTACA GCACTTGCGC   700
ACACACTAAA GCACTGTGGG GTTGAGCACC GAAAAATCTC GAGGACAGAG   750
CAAAACGTAC TCAGTAAGCT CTAAATATTT GCATGGATCC GCTCATCCGG   800
TAGGATCGTC TACTGTGCTG TAGGGGGACA CCGCACCAGC TACGTGCCAT   850
CGGCATTTCT TCCTCCCTTA TCTCCCGAGA TTGGACCGCA GCATGCACGA   900
TACCTCGCTG AGGCGCCGTG AAATTGCGAT TTCGGGAGCA GCATTCATTC   950
GCGTTGAACC ACTCATTGTT ACATGCTGTT CTCAGAGCGC TTGATGAATT  1000
GAATCAAAGA CTCAGG                                       1016

<210> SEQ ID NO: 105
<211> 1016
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GAATTCCGCC GCAGCACACC ATGCCGACAA CACTCACACC AGCCTCTTCA    50
CCCTTGATGT TTTTCAACCA CCCTCGTCGC GCAGTGTCAC ACAGACGGGT   100
CAGTTTGTGA GTCTCAAGGG AGCCCTCCGA GAGCCCGACA CCGGTGCCGT   150
GGAGGTCGTG GGATACCCCG GGCAGCCGAG GAAGGAGGTG GCGGTGGCGG   200
AGGAGGCGCA TAAAGAGGAG AACGGCAAGA GTACCGCGAA GCGCAAGCGG   250
CGGGAGAGCG CTGGAGATGC GGCTGGTGAG GAAGCCCGGG AAAATGGGAC   300
GAAGAAGAAA AAGAAAAAGG AGAAGACGAA GACGAAGAAA CGCAGCAAGG   350
ACGAGTGAAA AGTGCAGCCG GGCCGACCAT TGAAAGCTCT GCTGCTGAGA   400
AGTTTCCTGT TTGGCTGGCA GATGCTCACC AAGTCATTCG TCCTCCACTC   450
TCCTGTGGTC CCCCCATCTT GGGCCCCTTC TTCACTCCCT GTACTGACTT   500
CGCCCTGGTG CACCCCTTGA CTCGCAACAC CCCCTCAAGA CTTTTCCTCG   550
ACACAAAGTG TACTGAGCGT ATAGATAGAG CTGCAATTCC TGTTCAGTGG   600
GAGTCAAAGC TGTGGAAGGC TGGAAGAGGG GCCACCAGGT AGACTCCCCA   650
CTTGCTGCTT TTCCAGGAGA CAAAAGAGGC CCCGGCATTT GAATAGTAGT   700
TGAGTGGTCA CTTAATGTCT AGAAACTCCT TAGGTACTTG ACTGACGCGA   750
GATAGATGAG GGAACAACCT CTTGCCGAAG ATGGTGGCCT CAGGCATCCC   800
TGTGCCCCCC AAAAGCATCT GCGAGGTGCC ATTCCTGGGT CGATCGCGAG   850
CGCAGGGGCT GCCCCAGACC CGCCGGCTAC AACCCCGATC CTGGCTCCTG   900
ACCCTGTCCT TTTCGGCTGT GCGAAAACCA CAGCTACCTT CCCACCATCC   950
TTTACCAGGG CCCTCTTTCT TCGCCAGTGG CCGTATTCCC CGGACACTAC  1000
CGCCAAGGGT ACCATG                                       1016


<210> SEQ ID NO: 106
<211> 1074
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GAATTCGCAA AATGCCTGGC CGACAGGCAG GCCCTGTCCA GTGCAACATC    50
CACGGTCCCT CTCATCAGGC TCACCTTGCT CATTGACATA ACGGAATGCG   100
TACCGCTCTT CCAGATCTGT CCATCCAGAG AGGGGAGCAG GCTCCCTACC   150
GACGCTGTCA AACTTGCTTC CTGCCCAACC GAATGCATTA TTTTTTGAAG   200
GGGGGAGGGG GGGCAGATTG CATGGCAGGA GGTCTCGTGA GGAACATCAC   250
TGGGATACTG TGGAACACAG TGAGCGCAGT ATGCAGAGCA TGTATGCCAG   300
GGGTCGTGGC GCAGGAAGGG GGCCTTTCCC AGTCTCCCAT GCCACTGCAC   350
CGTATCCACG ACTCACCAGG ACCAGCTTCT TGATCGGCTT CCGCTCCCGT   400
GGACACCAGT GTGTAGCCTC TGGACTCCAG GTATGCGTGC ACCGCAAAGG   450
CCAGCCGATC GTGCCGATTC CTGGGTGGGG GATATGAGTC AGCCAACATG   500
GGGCTCAGAC TGCACACTGG GGCACGATAC GAAACAACGT CTACACCGTG   550
TCCTCCATGC TGACACACCA CAGCTTCGCC CCACCTGAAT GTGGGCGCAT   600
GGGCCCGAAT CACAGCCAAT GTCGCTGCTG CCATAATGTG ATCCAGACCC   650
TCTCCGCCCA GATGCCGAGC GGATCGTGGG CGCTGAATAG ATTCCTGTTT   700
CGATCGCTGT TTGGGTCCTT TCCTTTTCGT CTCGAATGCC CGTCTCGACA   750
CAGGCTGCGT TGGGCTTTCG GATCCCTTTT GCTCCCTCCG TCACCATCCT   800
GCGCGCGGGC AAGTTGCTTG ACCCTGGGCT GGTACCAGGG TTGGAGGGTA   850
TTACCGCGTC AGGCCATTCC CAGCCCGGAT TCAATTCAAA GTCTGGGCCA   900
CCACCCTCCG CCGCTCTGTC TGATCACTCC ACATTCGTGC ATACACGACG   950
TTCAAGTCCT GATCCAGGCG TGTCTCGGGA CAAGGTGTGC TTGAGTTTGA  1000
ATCTCAAGGA CCCACTCCAG CACAGCTGCT GGTTGACCCC GCCCTCGCAA  1050
CTCCCTACCA CTAGTGGTAC CATG                              1074

<210> SEQ ID NO: 107
<211> 1527
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GCCCAGGCTG AAGGCCAGCA GGGAGACCGC GGCCGCCTGC ATGGGGCTGG    50
TCAGGTTGTC GGCATCCACG CCGTCGCGGT CCCTGGCGTG GGCGGCGGCC   100
GGGTCTTCTA CCTGATTGGT GAGGGTGCTG GCCACCTGCC CGGCCAGCTT   150
GGGAGGCAGG CCGCGGTCAA TGTGCACCTG CGCCAGGTCC CTCTCTGCCT   200
CCGCCTGCGT GGCGCTCATG TGGTTAATGG GAGACATGCG GTGCAAGTCT   250
GGGGCCGCCA GAGGCGTTTG GAGTCGCAAC ATCCTCAACG CTCTCTGAGC   300
AGGGGCCGGT ATGGCTGACA TCCATGCTGG AGATTGGAGG CAGAGCTTAG   350
ATTTCCCCCA TCCCTCTTCC AACTAATCCT TCGGACGAGG GTCAGTTCAA   400
CACCCGCTGC AATCCCGTCA GCCTGGGGTC CGACCTTGCT TGCCGCCCCC   450
GCCTGCAGCT GCCGCTCCAG GGCCACCAGG TCTGCCCTCT CCGCATCGCG   500
CTGGGAGCTG ACCGACACGT ACTCCCCCAC GGCCATGGAG GCGGCCCCCG   550
CCACGACCCC GGCGACCCCG GTCAGCATGA GCGTCCGGTG GTCCACGGCC   600
GCGGCGCCCA CCCCGGCGAG CAGGGAGGAA GTGGTGACCA GGCCATCGCT   650
GGCACCCAGG ACCGCAGCCC GCAGCCAAGG AGCCCTCGAG CTGAAGCTCC   700
TGTGAATGTG ATTAGGCGGC TTGGTGGGAG GGTGTTACGG GAAGAAGGGG   750
CGTGGGGAGG CCAGGGCACA CTCTGCCCCA TCAGCAGGTG GTGCATCCCT   800
TTCAACCATA CGGGATGCAG CACCCCAGGG ACACGATACA ACGCCTGTGC   850
CTTACTCTCG CTCTTTGTCT GCTTTGGATT CTTGCTCGGC GGCTTGGCTG   900
GTGGTCAGTG ATCGGGCCTC CAGCCCCAGG CTCGCATCCA GCCAGTCTGG   950
GCGAAAGCTT GAGGCGGCGG GAACGAGGGA TCGGAAACGC CCTGCCACCG  1000
ACGTCAAGCG TAACATGATC GCCTTCTGGC CGGTTGAATT TATTTTGAAT  1050
AATCTGATGG CATGTTCGAG GCCCTTTTCA TGGCTTTGAT CTACAGGCTG  1100
TTTGCTGGCC CACACAGCCG ATTCCTCGTC GATTCCGTCG TTTGCCTGAA  1150
TGATTTATGG CGAGTTCGTG GCGTGGTGAT CGTTTCGGCC CCTTACGCTA  1200
CTCACTCGAC ACAACGCATG TTGCATTCAT GCAATCATTA CTTTCGTGAT  1250
TGATGGTCGC CAGCTGCACT CAGAATGTTG AGTTTGCAAC GATGCAGTGC  1300
TATCATTGGC GCACCCACAA TACAAAGTCG GCATGAAGTG CAGCCAAGAT  1350
GGCGACGTGC TACTCGGCAG TCGGGAGTTT GTGCCGCCTC CAACTTCACA  1400
GACCGAGATG CCGCCCTCAA CAAAAGGATT GAAACCCCCG ACCTCCTACC  1450
GACGAGCATC ACCTGTCCAG CCGTCGCTCT CCTATGTGCG ACACCCAGGC  1500
TCAACCTCTT GATGGAGGCC AACCATG                           1527

<210> SEQ ID NO: 108
<211> 93
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
CCCGCTAAAA AAAAAAAGTA TTTTTAAATT TACAAAATTT AAAAAATTAC  50
AAACACTACA AAAATCAACT AAATCTTTAA AATTATGGCT GTT       	93

<210> SEQ ID NO: 109
<211> 118
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATAAAATTAA TTAATTTTTA AAATTGTTTT TTTTATAAAA AAAATGTTAA   50
AATAATTCTA AGTAAATCAT AAATGTTTTA CACAAGTAAA AAACTTTATT  100
GGAGGAAATA ATCTTATG   		                        118

<210> SEQ ID NO: 110
<211> 149
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
AAATTAATTC TAGCATTTTT TTAGATTTTA TATTTTTTAA TTATTAGTAT   50
CACATTTAAA ACTATTTTGT GATTCTAGAA AAATAATTAA AAAATAAAGT  100
CTATTTTTTT AAGGATAATT GGCAGGAGGA TGTTCATCTT ATTATGGTA   149

<210> SEQ ID NO: 111
<211> 202
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TGTGAAAAAA AAAGATGAAA TCCATAGAAT GGCAGAAGCC AATAAAGCGT   50
TTGCAAAACA ACGTTTTTAG ACTTGAAAAA ATTTGAAGGT TTTATTATAA  100
TTATTTGTTA ATCAGGATTA ATAATCTATT TCTTTAAATT GTTAAATTTT  150
TAAAATTTGT ATACAACATA ACTAAAAAAG GACTTTTTTA TGGCACGCAC  200
AA							202                                                                                                                            

<210> SEQ ID NO: 112
<211> 204
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TCAGAAAAAG TAGAAGCTTC GTTTGAAGAA GCCAAAAATC AATTAGAAAA   50
TGCAGAAAGC GAAAAACAAA GAGTAGATGC TCTTTTTCAA TTTAAGCGCG  100
CAAGAGCTCG ATATCAAGTA ATTAAACAGT TAGTTAATTA ATTTTTTTAA  150
TTAAAAAGCA TTTTGTTAAA CAAATTTAAA TAATAAAAAT TGAATATGTC  200
GCGT                                                    204

<210> SEQ ID NO: 113
<211> 236
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
CGGGATCTTA ATATATTTTA AAAATAGTAA TTTCTAGGTA TTAGAATTTA   50
AAAAAGGTTA ATAAAATATT TTAAAAATTA AAAAAAAACA AATTATTATA  100
TGCAAAACTT CAAAACTTAT TTATCAACTG CACCAGTTCT TGCTTTAGCA  150
AGTCTAACAG TTGTTGCGGG ATTATTAATT GAAATTAATC GATTTTTCCC  200
TGATTCTTTA ATTTTTACTT TTTAGGAATT TATGCC                 236

<210> SEQ ID NO: 114
<211> 202
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAAATCTTAA CGGAATAAAT GATATTTATT GTAATAAATA TCATTTATTT   50
TAACCTGTTT ATTTCACTAT ATGGATTGTG GATTACACTC TTATAGTACT  100
TGAGTAAATA AATATGGTCC TTTTTATGTT TTTTGCGTTG TTTATTAAAC  150
AGTACAAAAA ACATAAAAAT ATAAATTATA TAAAAGCATT TATAAAAATA  200
TG                                                      202

<210> SEQ ID NO: 115
<211> 250
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GTACATCTAA AGAAAATATA GAAACAAAAA AAAATATTTT GATATATTTA   50
TATTTTTGGT ATATTCTGAA GGATTTACTA TTCTTTGTTT CTGCCTTTGT  100
TATAGCATCA TTCTTTTTTC TGAAATAGAA TAATAAATTT TGTTTATTTT  150
AGTATGATGA GATAGCAGTT TATTTTATAT ATTTACAATA TATCTTTTAC  200
TCGTAGGTTC AATTCGAGTT ACTATATAAA ATAAGAATTT TAGATTTATG  250

<210> SEQ ID NO: 116
<211> 296
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TTATTTAATA TAAATGATAC TTATATAAAT CAATTTGATT TATTAGCGTA   50
ATATTTTTTT ACGTAAAAAT ATTATTTAAT TGAAATACCA AACATACATA  100
TTTGGTGTCT CAATTTAATC TATAAGAGTG TATTTTAAAA ATCACAAAGT  150
AAAATAACGT ATATTCTTTA TGAATATCAG TTATTGTACG AAGTAAAATA  200
ACGGAATAAA CATTATTTAT TCAACAAGTA GTTTTTATAA AACAACAAAA  250
CCCTATGCCT AATTCAATAT TTAAATCTTC TTTTTATAAT AGGATG      296

<210> SEQ ID NO: 117
<211> 300
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATAAATTAGA ACAACAAAAA GTTCAAAATC AAATAGCTGA AAAATTAATT   50
GAATTATCAT TAAATCAAGT TAAAAAAAAA ATAAAATTAC GGTTAAATTC  100
GTCTAATCAT AGTATTTTAA ATAACTTTCA AATTGTACTT TTTACTAATT  150
ATAAAAAAAA TTAATTCTAG CATTTTTTTA GATTTTATAT TTTTTAATTA  200
TTAGTATCAC ATTTAAAACT ATTTTGTGAT TCTAGAAAAA TAATTAAAAA  250
ATAAAGTCTA TTTTTTTAAG GATAATTGGC AGGAGGATGT TCATCTTATT  300

<210> SEQ ID NO: 118
<211> 766
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TGAGCGCTGC GCAGCGGCAT TCTGTCGCGT CGCGGCAGAG AGGGCAGGAT   50
TCTTCCCTGA GCCTGAAGCC GGCGCGCTAC TGCCAGCACG GGCTGGGACG  100
GTTTTATGAT TTTCCAAGCG CGACCGGTGA GGATCCTTCC CCTTCTTCCC  150
TGCGACACGC CCCACACGCG CACACACCCC TTCACCTCTC TACTACTAGT  200
ATCTAAGAGA GCATGCACGC CCACCGCCGC GTCATTTTGG ACGAGTGCAG  250
TCGTGGCCGG CGCGGCTGTG GCCTCTGCGT TTTGCCACCA GTCACTTTAA  300
AAACTCAACT GCCTGGCCGG ACGATGGCTT GGGTCTGGCT GGAGTGGCAT  350
GTCATCGCCC CACCCCCATC CCCTTCCGCC CCTCCCGCCA GCGCATGAGA  400
TTTGAAAAAT GGACTCGACC CATGCTCCCA GTTGAACCGC CTGTGTGTGC  450
CCTTGTCTCT TGTTCTGAGC GGTACGAGTC GAGGCGCACT GACCCCCCCC  500
CTACTAGCAA GCCCCTCCGT GCAGGCAGGG CGCCACCCCC GCCCCTGCCT  550
TGCCATGCCA AACTCGACAC AGTACGCCTG CCCTGCAAAA ACTCATCCCC  600
TGCATGCGCC CGTCTTCCCG CTGTACAGTC GAGCTGGATG ATGGGTGACT  650
GAGAGAAGTG GGAACAGTGA CTGGGAAGTG TTCAGAGCGG TTTCAGAGTA  700
CGTGTGCACC GAGACGCTCG CTTTGTGGGG TGGCGTGACT CGAGTCTCCA  750
ACCAAGGCCA TGCACG                                       766

<210> SEQ ID NO: 119
<211> 525
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
GGAGGCTCGG GCCCCACGCC CTCTGGGGTC CGCCGCTCAA TGAGCCGCCG   50
CCCTGGGGCA ACCGCTGCCG GTCTGGACCC TGAATTCTTT GGGCATGCAG  100
CCCCATGGAC TTGTAAGCAC CATCCCACCT GGCCTTCAGA CTGTTATTTT  150
GCCGAGTTGA TGACGTTAAT AATCGGGTGT GAATGTCCGG GTCAGTTACA  200
GCAGGGCAGN CAGTCCAGGG TAAGGGCAGA CCTCNGTGTT TGCCTCGGTG  250
TCTTCGGACT CCTCGGGCGT GGGCCTCACG TCCGGGAACT CCTCCAGACC  300
GGGGGAGGAT CCAAGGAAGC AGCGGGGAGT GTTGTGCACC CGAATTGCGT  350
CCCTNCTATG GTCTCCCACT CCATGACCTG AGCAGGGAGC TATGCATCCC  400
CTTCACCTAC GCTGCCGCTC CCTTTTGCGC GCGTCGTCCA AACCTGCTCA  450
GTATCCCAGG TCACACCCCT CGGCCGGGAC CCCCAACCGC ACCTGAGCCG  500
CTGGACCTTG GAGACGGAGG CAGAG                             525

<210> SEQ ID NO: 120
<211> 675
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATCTGAGCCT CATGCTTTGT GATTGGGTGT AATGTGGTGC CAGAACGAGG   50
CAGGGACGAT GCCTGGAGTT TGGGGGTAAC AGCTGCCGGG GCGTGTGCTG  100
CAGGGAGGGA TGCTCTCTGC CCTGGGAGGC TCGTGCGAAA GTGGCTTGGA  150
CCCATGTATG TTGGATGTTA AGGGTAGAGG TCCCTAAAGG CCCCTCCGAC  200
ACTGCTGCGT CTCGGTACAC CCGGGATGCA TGCTGCCTCT CATTTCGCGT  250
ATTTTTAAAG ATTGCCACAG AGCCAGCAAG CATGGCATTT TTGTGCTGGC  300
AGCAGGAGGA ATCCGAGCAA TGGAGTTGGA GCGACGCCAT TCACAAAGTC  350
ATAGTACAAA CTTGAAAGGG GGAGGGGGGG TGGTCCAGAA GACAGATACA  400
GAGTCACAAT GCAGCGAGAT CGTCATCATG CATCTGCCTT CTTCTTCTTC  450
TTCTTCTTCT CCTTCTGCGA GGGGTGCGTG ATGGGGTGAG TGCGTTGGAC  500
AGACTTCAGA TCATCGATGT CGAAGATCTT GTGTGAAAAC GCATCGCTCA  550
GGTTTGCTGT TTCCTGACAA GGATACGGAG CCTTGCATCC CACAAGCCAC  600
GCCCATGGCC GAAATACTTG GCGCTACAAA GGCCAGCTGC TTCACCACGA  650
CAGGAAAGAG CCACCGCTTA CCTTA                             675

<210> SEQ ID NO: 121
<211> 500
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TCCTGTATGT CTAGGTGCAT GGTGAAATTG TTCTGTGGAA TGTAATGTGC   50
CTTAGCCAGT GCACAGGAGT TGCCACCCTG GAAAGCCGGA AAGCGATCGA  100
GGATCATACC ATGTGCATGG CGGTGGCGAT TCGTGTGCCG AGACGCTTCA  150
GATGCCGCAT GACTTCTGTG ACGACTGCCT AATTGTCAGC TCNNNNNNTG  200
CGACGGGGCA CCCAACGAGC CCGGGGCGGC CAGGACATCG TCACATACAC  250
TCCCAGGCAC GGTGTCCGCA TGGCTGCCTG CATCGGTCAA TCTTCCCACG  300
CTCCCTCCCT CACATTCCTA TGCACTCGAC GAATCAATGC ACTTTGTTGC  350
CAACAGGGAG GATGGCGGGT ATGAGCTGAG CATCTCGGAT GCATGCGGTC  400
GCCAGATGTG GGGTCCACGC TCCCTCAGGG CCGTGCCATT CGCAGCTGCC  450
CTCCCTGCAG GCGGCGTGGC GGGCACTGTG CTCAGCACGG GCGTCAGTCA  500

<210> SEQ ID NO: 122
<211> 487
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAATCTAGAG GCGGAGGAGC GGGGCTGGCC CGCTGCTGTG CATGTAAACA   50
CTGTGATGGG TGACGGAGGA GGGAGAGGGG GGGGGGGAGG AGGGGAGGAA  100
GAGGGGATCG AGAGGCACCA GGAGAGCCAA GGCGTTGTGA GGGAAGGAAA  150
TTGGCAGTCT GACCTCTGCA GGTGTCGAAA ACACCGCCAG GAGGATGGGA  200
GGGGATCCTC GTTCTATTCC AGTTCGGCAA GGGGTGGACC TTCCCCCTTG  250
TGAGCCACAT GAGCGAGAAT GTGCTGCATG GATCATCTCG GTGGAGCCTC  300
GAGCCCATAG GACGCCTGTA CTGCCTCGGG GGCCTGGTAC ATCCCTGCTG  350
TACGGCCAAG GCGGATGTAC GGTACAGCAG GTGGAGCCTC GAGCCCATAG  400
GACGCCTGCA CTGCCTCGGG GGCCTGGTAC ATCCCACCGT AACCGAGGGG  450
GGGTACGATG TATGTTTTCC TGCACCATAT GAAGCTT                487

<210> SEQ ID NO: 123
<211> 621
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAATCTAGAG CGTACGCGTC TGATCGCGGC CCTGGCAATC CATGACCTCC   50
CCCGGTCTTT GGGTCTCCAG CTCGACCCTC CACACTCCAC CCCACCCCCC  100
TGACCCCCTC CCTTGCAACA ACAGAGCGCG AGATTGTGCG CGACATCAAG  150
GAGAAGCTGG GCTACGTGGC GCTGGACTAC GAGCAGGAGC TGGCCACGGC  200
GGCAGGCTCC TCCACGCTGG AGAAGACGTA CGAGCTGCCC GACGGCCAGG  250
TCATCACCCT GGGTTCCGAG CGCTTCCGCT GCCCGGAGGT GGTGTTCCAG  300
CCGGGCATGA TTGGCGTGGA GGGCCCCGGA ATCCACGAGA CCACCTTCAA  350
CTCGGTGATG AAGTGCGACG TGGACATCCG CAAGGACCTG TACGGCAACA  400
TTGTCATGAG CGGCGGCACC ACCATGTTCC CAGGCATCGC GGATCGCATG  450
TCGCGCGAAA TCACCGCGTT GGCCCCCTCC AGCATGAAGA TCAAGGTGGT  500
GGCGCCCCCG GAGCGCAAGT ACAGCGTCTG GATTGGCGGC TCCATCCTGT  550
CCTCCCTGTC CACCTTCCAG CAGATGTGGA TCGCCAAGAG CGGTGAGTGG  600
GCAGGAAGAC ATATGAAGCT T                                 621

<210> SEQ ID NO: 124
<211> 822
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TGAGATGGTG TCATGCAGCA AGTGCCAGAG CAATCCCAGA GGCAAGCCAC   50
TCCTGGCTGA AAGACGAGGG GACACCTAGG GTGCAACGTT GTGAGAGGCT  100
GCTCTGTCAT CAATGCCCAT GGTGGTCCCT AAAACTGATC TTGTTGGACA  150
CTCAGTGGCT CAGCAGCCAC ATGATGCAAG TTTGGTGTAT AGAAAATGAC  200
GGACCGACAC AAGACTTGCA GATTCATCCA TCAGACTCCA GATCTCCCGC  250
ATGGAAACGC GGGCGCGATG GGCAATGCAT GATCCAAACA TCGACGCACA  300
AGGTCCCATG CCTCGACACA GGTGTACAGC CATGAGCTCG TGCAGAGCTC  350
TCTACATTGA TTTCAATTAT TGTTGATTTA TCACATCGGC GATGCAGAGG  400
GTGTGCCGGG GGATCCTGCG ACTGCATGGC CCCGTTATCG GAACCCTGCA  450
CCCTCGCGGG ACATCCCTAG GGAACTCGGC GAGCCTGTAC AGCAATGGCG  500
CGCGACGTTG CAGCAGCAAT GCATCGCTGC GATGCTGGTC CAGCTCCAGA  550
TCCCTGGAGC TGGCGCCGAG GCCTTCGGGC ACGGTGGCCG AGGGGGCGGC  600
AGCACCCCCC ACAGAGGACA GCGAGGCCGC CTCCATCACC TTCCAGGACG  650
CCGTGACGCG GCTGCAGTCC TTCTGGGCCG ATGAGGGGTG CGCCATGTGG  700
CTCCCCCACA ACACAGAGGT GCGATATGGG GTTGGTGTGC TCTGATGTCC  750
TGCCAGGGGG ACTTTCTCCC CACTCACTGG TGGTACCACA GCCGACCGCG  800
ACAATGCGAG CCCTTCATCC CC                                822

<210> SEQ ID NO: 125
<211> 93
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAAGCAAATA ATAAAAATGC ATTGATATGT CTTGTATATT AACACATATC  50
AATGCATTTT TATATAATTA ATTATATGAG AGGCCTACTT AGC         93

<210> SEQ ID NO: 126
<211> 145
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAATTCTAGC ATTTTTTTAG ATTTTATATT TTTTAATTAT TAGTATCACA   50
TTTAAAACTA TTTTGTGATT CTAGAAAAAT AATTAAAAAA TAAAGTCTAT  100
TTTTTTAAGG ATAATTGGCA GGAGGATGTT CATCTTATTA TGGTA       145

<210> SEQ ID NO: 127
<211> 198
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAATTTTTTG GTTACTATAG ATTTTTTATA GTAACCTAAA AAATCTATAG   50
TTTATAGTTA TATAAATTTA TACAAAATAT TTAAGTAAAT ACTTGTTTTT  100
TAATTATAAA AATGAGTATT ATTAAAATTA GGGATAAACT AAAAAATTAT  150
GGTACTTCAA GTTTCAGTTA TGACACCGGA TGGTATTTTT TGGGATAA    198

<210> SEQ ID NO: 128
<211> 199
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAAAAAATTT AAAACTAAAT CTATTTTAAT ATAATTTTCA ATAGAGTTTT   50
TTTGATTCTA TTGAAAATTA TAATTTAATC TTTACTAAAT ATATTCTTTA  100
ACTATTTATG TCACGTCGAC GTACAGCAAA AAAACGTACT GTAATGCCTG  150
ATCCTCTTTA TAAAAGTAGT CTTTTAGAAC TTTTAGTACG ACAGGTTAT   199

<210> SEQ ID NO: 129
<211> 217
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TTATAATCAT TATATCTTTA TGCTTAAGGT GCTAAGCATA AAGATATAAG   50
ATCATTAAAT GCGGAAAGAT GTCTGAGTGG TTGAAGGTAT AGATCTGGAA  100
CATCTATGTG ATGCTTTTAT ATCACCGAGG GTTCGAATCC CTCTCTTTCC  150
GTCTTTTTGC GATTATGATG AAATTGGTAG ACATGCAAGC TTGAGGGGTT  200
TGTGGGTAAA CACCCAT                                      217

<210> SEQ ID NO: 130
<211> 71
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
TAAAAATGCA TTGATATGTC TTGTATATTA ACACATATCA ATGCATTTTT  50
ATATAATTAA TTATATGAGA G                                 71

<210> SEQ ID NO: 131
<211> 200
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATAATAAACT TTATTTATTT TAGGTGTATA TCTATAATAT AACAATGAAA   50
TGTTATAGAA TACCGCGGAA ATAGCTCAGT GGTAGAGTGT TGCCTTGCCA  100 
AGGCAAATGC CGAGGGTTCG AATCCCTTTT TCCGCTTTTT TTTAATAAAG  150
TGAATATAAC CAATTATAGT CTTGATCTCC GAGCTTTTTT TTATCAAGAT  200

<210> SEQ ID NO: 132
<211> 200
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
ATACAATTCT ATGTATTTAG ACAAACAAAT AAATCGAATT TATTTTAATA   50
AACTTCATTT ATTTTAACTT GTACCAAGTA TTTTATTGGC CTCGACAAAT  100
AAAAGCACTT GCTCAAAAAA ATAAAAGCAT TATATTATAA TGGGTACTTT  150
TTCGGTACAT ACAGTATATA ATAAGTACTT TTATTTTTTT GAACAAGTGC  200

<210> SEQ ID NO: 133
<211> 200
<212> Nucleotide sequence
<213> Auxenochlorella protothecoides

<400> 
AAATTTAGTG TGTTTTTATA AAACAACAAA AATCTTTAGT TACATATTGA   50
TGAAATTATG ATTTTTATTT ACTTTAGCTA TCTTGAGAGG TAGCTAAAGT  100
AAATAAAAAA ATAATTTTTT CTTATAAATA CAATGACACT TATATAGGAA  150
TAATCTACAT AGACATATAT AGGAATAATC TATATGGACA GATATAGCAA  200

<210> SEQ ID NO: 134
<211> 9741
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
GATACGGGAG CCAAGATATC CGTTATTTTT AATAAAATAA ATGAAAGATA    50
TTCCAAGAAT TAATCATAAA CTTCGTTTAT TTTACTTCGT ACACCTAAGA   100
ATACGTTATA AAACGAGTTT TAACAGGTAA TTTGGAATCT GCTAGTCTTG   150
CAGCTTGTTT GGCTAATTGA GCAGAAACAC CATCAAATTC AAATAAAATA   200
GCGCCTCGTT TCACTCGACA TACCCAGTAT TGAGGGGCCC CTTTACCTTT   250
ACCCATACGA ACTTCTGCGG GTTTTTCAGA TACTGCAATA TCAGGGAATA   300
CTCTAATCCA AACAATACCT AATCTCTTAA ATTTTCTAGT GATCACTCGA   350
CGAACCGCTT CAATCGTTCT AGCAGGTATT CTAGCAGCTG AAACAGTCTT   400
GATACCGAAT TTACCATATC TTAATTGATC TGTATTAGAT TGTACGCCTT   450
TAACTCTTGA TTTTTGAAAT TTACGAAATT TTGTGTTTTT CGGTTGTAAC   500
ATATAAATAA TTTTTTACAA GTAGAATCTT TTGATTCAAA GATCCGTAAT   550
TCTCTATTTA GATTAGAAAA AATAATAATG AAATATATAA CGGTTTTTTT   600
ATAGATAAAG TATAAGTATT GTATTTGCTG TGCCTAACTG TACTTACTTC   650
GTAATAAATA CAAATTTGAT AATGAAAATA GAGTATATAT ANATNTATTC   700
CTATATATGT CTATATAGAT TATTCCTATA TAAGATACAC AACAAATTTG   750
ATTCATTTTA AATTTTACCA GCTAGCTTTT ACACATCCTT GTATAAGGCC   800
TTGATTTGCT AAATCACGAA ATTTTATACG AGAAATCCTA CAAAATTTAT   850
ATACTGAATG ACCTCGGCCT GTGACAATAC ATCTATTTTT TATACGAACA   900
CGACTACTAT TTCTTGGTAA CTGATTAAGT TTTTGTGCAC ACAAAAACCG   950
TAAATCTTGG TCTAAATTAC AATCATTAGC TAAAGCTTTA TATAAAATTC  1000
TTTTAGACTC ATACTTCATA TATAATTTTC GTCTTTTTAA ATCGCGTTTG  1050
ATTGAGTTAA ACATAATATA AAATATTTTA ATTTTTTGAA ATGAATTGTT  1100
ATCAAATAAA CTTGGTTTAT TTGATTTTTG TTCTTTATTT ACTAACAACA  1150
AGCTAAAATA AACAAAAATT TATAAAATAA ACAAAGTTTA TTATCCAGAG  1200
ATAATGGGTT TCAATCTATC GAGGCAAATA GGCATAATTT TATAATTTTT  1250
TAAAAAAATT AAATTTACTT ATTTGTCTCG ATAGTAAAAT AAACAACGTG  1300
TATTAAGGTT TATAACCCCA AATGAGAACA TTTTTTTGTT TTGATTGGAT  1350
CTACCGTGTG AAATAAATGA AATTTATTAC CAAGTAAAAT AAATCTAATT  1400
TGTTTTAAAT TAGATTTATT CAGTTATTTC ATACTAGATT ATCGATCAAC  1450
TTCTCCAAAT ACAATGTCTT GTGTTCCAAT AATTGTAACA ACATCGGCTA  1500
ACATATGATT ACGAGCCATA AAATCAAGTC CTTGTAAATG TGCAAAGCCG  1550
GGCGCTCTAA TTTTACATCT ATAAGGTCTT GAAGTACCAT TACTTACAAG  1600
ATATACACCA AATTCACCTT TAGGCGCTTC AACAGCAGTA TATGTTTCAC  1650
CTGCTGGAAC AACAAATCCT GATGTATAGA ACTTAAAGTG ATGAATTAAA  1700
GATTCCATAG ATTGTTTCAT TTGGTCACGT GTAGGTGGAG TAATTTTTTT  1750
ATCATCTAAA CGTATTACAC CTTTAGGCAT TTCATTAATA GTTTGCATAA  1800
TGATTCTTAA ACTTTCACGC ATTTCTTGAA CACGAATCAA GTATCTATCA  1850
TAACAATCTC CACGAGTACC CACAGGTATA TTAAATTTCA TGCGATCATA  1900
AACTTCATAA GGTTGTGTTT TTCTTAGATC CCAAGCAATA CCAGAACCAC  1950
GTAATAATAC TCCAGAAAAA CTCCAATCCA TAGCTTGTTC AGCAGAAACA  2000
ACCCCAACAT CAACAAGTCT TTGTTTCCAA ATTCTGTTAG AAGTCAGCAT  2050
TTCTTCGATT TCATCAATAC GTGATGCAAA TTGTTTAGAA AATTTATAAA  2100
TATCTTCACA TAAACCTAAA GGTAAATCAA GAGATACACC ACCCGGTCTA  2150
ATATAAGCAG CATGCATACG TGCTCCAGAA ACTCTCTCAT AAAATTCCAT  2200
TAGTTTTTCG CGTTCTTCAA ATCCCCAAAG AAATGGAGTT AATGCACCTA  2250
CATCCATAGC ATGGCAAGTA ACAGCTAATA AATGATTTAA AATACGTGTA  2300
ATTTCACTAA ACAATACTCG TATATATTGA GCACGTAAAG GTACAGTCAT  2350
TTTATTTTCA TCATTATTAG ACGAACTAGG CCCAGTGTTG AAGTTTAAAA  2400
GTTTTTCCAC TGCTAATGAA TAAGCATGTT CTTGACACAT CATAGAAACA  2450
TAATCTAGAC GATCAAAGTA AGGTAATGCT TGTAAATAAT TTTTATATTC  2500
TATTAATTTC TCAGTCCCTC TATGTAATAA CCCAATATGT GGATCAGAAC  2550
GTTGAACTAC TTCCCCATTC ATTTCCAGTA CGAGTCTTAA TACACCATGA  2600
GCCGCTGGAT GTTGAGGTCC AAAATTAATA GTAAAATTTT TATATTTTGG  2650
GGCTGTAATT ACTTTTTCAA TAGCCATAAT TTTATTTTTT TATTATAAAT  2700
AGTTTTCATA CTTATTTATG GCTTGTTGTA CGTNGTGTNC TACGTACAAC  2750
AAGCCATAAT AGTATTTTAA TACTATTTTA CTTGAAATAA ATACAATTTA  2800
TTTTAATAAA TTATATTTAG AATTGTATAG TATCGTTTTA AAATTAATAT  2850
ATGGATGTAA CAAAGTTAAA ACCAAGTCAT AACCAAGTCG AAACAAAATT  2900
GAAACAAAGT CGAAAAGAAT CGAATTTATT TTAATTATGC ACCACCCCAG  2950
TAATATACAC ATACAAATAA GAATAACCAT ACAACGTCAA CCATGTGCCA  3000
ATACCAAGCA GCAGCTTCAA ATCCAAAATG ATGAGATTTT GAGAAATGAT  3050
GTAATATTAA ACGGAATAAA CATACAGCTA AAAACAGAGT ACCTATTATA  3100
ACATGAAATC CATGAAAACC AGTAGCTAAA TAGAAAGTAG ATCCATAAAT  3150
TCCATCAGAA ATTGTGAAAG GAGCTTCAAG ATATTCTAAT GCTTGGAATC  3200
CTGTAAAGAT AACAGCTAAA ACTACAGTAA CTGCTAAGGC TATAATACCT  3250
TGTTTACGAT TTCCAGCTAA AATAGCATGG TGTGCCCAAG TAACTGAAGC  3300
ACCTGAAGTT AAAAGTATAA TAGTATTTAA AAAAGGAATT TCCCAAGGAT  3350
TTAAAACTTG GATTCCTTTA GGAGGCCAAA TCGCACCAAT TTCTACAGTT  3400
GGGGCTAAGC TTGAATGGAA GAAAGCCCAA AAAAATGCAA AGAAAAACAT  3450
TATTTCAGAA AGAATAAATA ATAACATTCC GTAACGTAAT CCTACTTGCA  3500
CAGGAGTAGT ATGATGGCCT TGATATGTAG CTTCTCTAAT AACGTCTCTC  3550
CACCATACAT ACATAACATA AAATAGTGAT AATTGACCAA AAGCCAATAA  3600
AATACCTCCA CCAGAATATG CATGCATGTA CATTACTCCA CCTGTTGTTG  3650
AAAAAAATAC TGCAAAACTA GCTAATAAAG GCCATGGACT TGGATCCACT  3700
AAATGAAAAG GATGTTTTCT CATAATTAAA ATTTTTTTAT TTAGAAGAAA  3750
TTTAGTTTAT TATATTTCTG TACTGGACAT TGATACAATA AATTTTAAAT  3800
CGGTTCTTTT TTTTATTTTA ATCTATAGGA GCGTGTTTTA TTATGTATTT  3850
TGCTCTTAAA TAAACAAAGT ATGGAATAAA TATCATTTAN NCCGTTATTT  3900
TACTTCGTAC AATAAATGAA ATTTATTATA TCGTTTTGTT ATTTATTATA  3950
ACCACTTTGG TGTTTAATAG AGATATGGAT TTGTACACCA ACTACATGTA  4000
GATTTTTGCA AATTTCAATA AAAGCATGGA GACTATTTAT ATTTGCAAAA  4050
GAAAGTACCA GTGATCTCTT ATATGTTTTA ATTTGAAACT GATCTCGAGA  4100
TTTTTTATGT ATATGTGGAG AACGTATTAC AGTTATTTTC TTTAATTTAG  4150
ATGGGATAAA TATTTCTTTA GAATCCGGTG TTTCCAAAGT ATATAAAATT  4200
TCATCAAATA AAGAAACCAG TTGATTAATA TACAACGGAT CAAATGATTT  4250
TAATTTTAAT TGTACTTGTT GCATAAAAAT GATTTTAAAC TTTGTTTTTT  4300
TTGTTTGTCT AAATTGTATA TAATAAATTA CAATAAATTT GATTTATTTT  4350
ACTTTGTACA CAAAGCACTT GTTCAAAAAA ATAAAAGTAC TTATTATATA  4400
CTGTATGTAC CGAAAAAGTA CCCATTATAA TATAATGCTT TTATTTTTTT  4450
GAGCAAGTGC TTTTATTTGT CGAGGCCAAT AAAATACTTG GTACAAGTTA  4500
AAATAAATGA AGTTTATTAA AATAAATTCG ATTTATTTGT TTGTCTAAAT  4550
ACATAGAATT GTATTTAGAC ACATAGAGCA ATTGCTATTT TATGTGTGAT  4600
CAAGTGTAAC GGTGCAGGGT ACGCCATACA AAAAATCAGA AAAAATAAAG  4650
AAATTGCTAA TGCTAAAGCA TTTGTTCGTG GTACGCGAGC AGTACTACAC  4700
CAAGTTTTAG GAGTACTAAA ATACATTATT CTTATTAATC TAAGATAATA  4750
AAAACAACTA ACTACACTTG TACACACACC TACGACAGCT AATACACCTA  4800
GATTACTTCC AAGAGCGGCG AAAAAAATAT AGGCTTTACT ATAGAATCCA  4850
GCTAATGGTG GTATACCAGC CATTGAAAAC ATAGCAATAC TAAAAGTAAA  4900
TGCTAAGAGA GGGTTAGTTT TAGCTAAAAA AGCTAAATCA GTAATATATT  4950
TAATTCTAGG TAGCCCATTA TTATGTGTAT GACGCATGTT ACATAGTAAA  5000
ATACTAAATA CCGCAATATT CATAAAAATA TATACAATTA AATATATAGA  5050
TAATGCTTGT ATTCCTTCAA TACTAGCGCA AGCAAATCCT GCAAGTAAAT  5100
ATCCCACATG GTTAATACTA CTAAAAGCCA TTAATCGTTT GATTTGACTT  5150
TGTGAAAGCG CAGCTAATGC ACCTACAAGC ATAGAAGCTA AACTAGCAAA  5200
AATAACAAGT GTTTGCCAAA TTGGAAACAA GTCATAAAAA CTTTGTAAAA  5250
ATACACGAAG AAACACTGCA AATAATGCGA TTTTAGGTGT AATTACAAAA  5300
AATGCTGTAA TAGGCGTAGG GGCTCCTTGA TATACATCTG GAGACCACAT  5350
ATGGAAAGGT GCTGCAGCTA TTTTAAATAA AAATCCTGCA AGTATAAATA  5400
CCATTCCTAA TGAAACAAGT GCCGATTCTG TTGTTACAGC TTGAATACCA  5450
TCAAATGGCA CAACACCTGC TGCAAAAATT TTAGCACATT CAGAGAATGA  5500
AAGTACACCA GTATATCCAT ATATTAAAGA ACAACCAAAA ATTAATAAAC  5550
CTGAAGAAAA AGCCCCAAGT AAAAAATATT TTAGTCCTGC TTCAGTTGCA  5600
AATTCTGAAT TTCTTTTTAA CGCAGCTAAC ACATAAAAAC TTAAACTTTG  5650
AAGTTCTATT GCTAAATATA GTGATATAAA ATCTGCTGAT GATGTTAAAA  5700
ATAACATACT TGATACAGCA AATAGTATTA ATATTAAAGT TTCAAATGCA  5750
TTGTCTCCTT GTGTTTGGAT ATATTGTAAT GACATTGATA TACAGCATGC  5800
ACTTGATAAT AAACATGTAA TTTTTACAAA TTGAGTGAAA TCATCGATAA  5850
CGAGTGTGTT ATAGAGTAAC ACAGCATTAT TTATTGGACT ATTATAAATC  5900
AATAAAGAAG TATATAACAA GGCCATTATC GCATACCAGC TGATATTATA  5950
CGATAATAAT GGATAATTTA ATGATTTTGA TGTACTAGTA ATTACTCCAT  6000
AAACTAATAA AAAAATACCA GTAGATACTA AAAATATCTC CGGAAACAGA  6050
GCTAGAAAGT CATTTTCAAA TAAAAAACTC GAAAAGGATA TCTGTTTATG  6100
AAAAGATTGT TCTAAATACA TATTTTTATA AATGCTTTTA TATAATTTAT  6150
ATTTTTATGT TTTTTGTACT GTTTAATAAA CAACGCAAAA AACATAAAAA  6200
GGACCATATT TATTTACTCA AGTACTATAA GAGTGTAATC CACAATCCAT  6250
ATAGTGAAAT AAACAGGTTA AAATAAATGA TATTTATTAC AATAAATATC  6300
ATTTATTCCG TTAAGATTTA TTACCACCAA CGATAACGAA TATTTGCACG  6350
ATTGGTGATA GCTGCAGTAT GCAACTCATC GCGTTTTTGT CTAGCTTTTC  6400
CTTGTTTGGA ATAGGCTTGA TATATTTCAT CTGCTAGGCA TTCTGAAAAA  6450
TTTTGCGATG AACCTTGCTG TTTTTTTCGT GCTGATTCAA TTATCCATCG  6500
CATAGCTAGA GTGTCTGCTT TGTGTTGTGA TAATATAGCT GGAATTAGAA  6550
ATGTATTTCC AGCCCTTCGT ACTTTTTTTA ATTCTACACT AGGTGTTACA  6600
TTTTCTATCG CTATCGATAC TACCTCAAGT AATGAAGACA TTTCAATAGA  6650
CATAGAATCA TTATTTAAGG GTATCGTAAT TTCTTCAAGA GCTTTTTTAT  6700
TAAATTCATA AATGTAATTT ATTTTACCCT TTTCTAGAAC TTCGTTTATT  6750
TTCCCTTTAC CTATTCTTTG ATTATGATTC ATGTGGATTT TATCCATTTT  6800
AACTGTATCC TTTGGAGTCT GGTCTGAAAA TACATTTTTT TTTAATTTTT  6850
TATGCTTTAA GATTAACAGA CTATCATAAA ATAATTGTAC AGCTACAGAT  6900
TTTTTACCAT CTATCATTAA TAAATTTATA AATTTATTAC TATTGATTAC  6950
AGTATTTCTT GTAGTCCCGT ATTGTTTTTG ACCGTTCATA TAATTTATTT  7000
ATTATATCGT TTATTTTTTA AATAAGACAC TAATAATGCT TTGTTTTGTC  7050
TAAAATAAAT TGGATCTATT ATATCCATTT TTTTTTAGAG ACGAAACAAA  7100
ACATCATTAT GCTTTACTTT TTGGTGTTCC ATAGAGTGAC CTAGATTGTT  7150
TTCTAGAAAT AACCCCTTGT AAATCTAATC GACCACGAAC TACTTTGTAT  7200
TTCACTCCAG GTAAGTCTTT TACACGACCA CCTCGAATAC ATACTACGCT  7250
ATGTTCTTGA AGATTATGGC CTTCACCTGG AATAGAAGCA ATAACTACAA  7300
TTCCATTACA TAGACGAATT TTTGCTACTT TTCTTAGTGC TGAATTAGGC  7350
TTTTTAGGTG TTCGTGTGTA TACACGAAAA CATACACCTC TTTTTTGAGG  7400
ACATCCAGCT AATGCTGGTT TTTTACTTTT AAGTTTTGGA GCTTGTCTAG  7450
AGCTTTTTTT TCGTAATAAT TGATTGATTG TAGGCATAAA ATATTGCTCT  7500
ATTTTTCGTT TGTTGGGTAA AATAAATGAA ATTTATTNNA AAACAGCAAA  7550
TGCATAAAAT TAAATCTTAC AACTGAAAGG TAGAGGGTTA AATGAGCGAA  7600
ATTCATTGCC GGGTAAGTTC CGGCCTGCAT GAATGGTGTA ACGATTACTC  7650
CGCTGTCTCT AATATGGTCT CAGTGAAATT GAATTCTCCG TGAAGATGCG  7700
GAGTTCCCCA AAGTAGACGG AAAGACCCTA AGCATCTTTA CTGCAACTTT  7750
ACGTTGAAAA AGGATATTGA TTGTAGAGGA TAGGTGGGAG CCTTGATAAA  7800
AAAAGGCAAC CTTGAAATAC CACCCTTTCT ATATTGTTTT TCTAAGCACA  7850
GCAACAGCGT ATAGTGGGTA GTTTGACTGG GGCGGTCGTT TTCTAAAAAG  7900
TAACGAAAAC GTGCGAAGGT AAGCTCAAGC TGGAAGGAAA TCAGCTGTAG  7950
AGCGTAATGG TATAAGCTTG CCTGACTGTG AGACTTACAA GTCAAACAGA  8000
GACGAAAGTC GGCCATAGTG ACCCGGGAGT ACTTCGTGGA AAGGCTCTCG  8050
CTCAACGGAT CAAAGATACG CTAGGGATTC AAAGTTTGTA GTCCCGCTTT  8100
GTAGCGATAC AAGGGCAATA ACTGGGTGAA TTGCAAGAAG GCTAAGGTAA  8150
AATAAAACCG ATTGGTTTAA TATTTACTAT GCTAATTTGC AGCGAAGTTC  8200
TTCGAATAGG CTCTAAAAAA AATGTAGAAG AAAACGTTCA ACGACTAGAG  8250
AATGAGGATT TCTTACCAAT AATTTCTCCA CGAGCGCCCA GCAACTTAAA  8300
AAAACAAACA TGTTATATAA ATCATTACGT GGTAAAGTAT TAAAAAATTA  8350
TAAATTATGG ATGATGGAAG TAAATTATGT TATAATAAGG ATTATCCTCG  8400
TAAAGGGTTT GTCCTTAATA CGCAAGGATT TAAAACAGCT GATGTCATTT  8450
TAGCATCTCA AGAAATTTTT GTTAAATTTA ATATAATTAC AAAAATAAGA  8500
TAAAATAAAA ATTTCGCAGT AATTGTTATA CCTGCCATTC ATCACTATAA  8550
ATTTGTAAAC TACATAAAAC CTTATATTGT TCCATCTATG ATTTATAAAA  8600
TAACACTTTA AAAGTTGATG ACATAGTCTG AACTATATAG AAATATATAG  8650
AAGTAAAATA TAAACAATTT TACNNNANCN ANANNTNNAC NSNNTNNNNA  8700
NNNCNNNNNN CNNNTNTCGA CGGAATCGTT TGGCACCTCA AATCGGGGTC  8750
TTATTTTGGC AACAAGATAA TGCTATTTGG CTGTATGCTG GAACATCCGG  8800
GGTATCTAAA ATCATATGCT CATAATAAAT TTTAAAGAGC ATATAAAAGT  8850
ATTTTAGTGC GGACAATCAG CAGGGAAGTT GATTCTATCG AGTTTAACCC  8900
CTCAACGACT ACACGCCGAA CATCTTATTA TAATAATGAG ATGATGATAT  8950
AGTCTGATCT TTTAGGCGAC TAAAAGGCTA TTTTATTTCT AAAAATAAAG  9000
AATAAAATAG CATTGAATAT TTTCTTGTAT AATAAAATAC AAGTAGAAGA  9050
GTAAAATTAT GAAAAGGTTA ACTAATATAA ATTTCATTTA TTTTAATAAA  9100
TGAAGTTTAT TTTACCCAAC GTTAAGTTTT GATTGGTATA CTTTTAGGAA  9150
ATGCTAGTTT ACAAACATTT AATAATGGTA TTACTTACCG ATTGCGTGTT  9200
TTACAAAAAA ACCAAGAATA TTGTAAACAT TTATATAATA TATTTCAAAA  9250
TTTTGTAACA ACCCCTCCAA AATCTATAAC AACAAAAAGG GAGAAACACG  9300
ATGGTATTTT AATACTTCAA CAAATCCGTG TTTTAGGTTT TATGGACACC  9350
AATTTTACTG TCAAGATAGT AAAAAAAAAG TCATTAAAAA AAACCAAAGT  9400
CGATTCATAA ATGGTTAACA CCATGCAGTA TAGCTTATTG GTTTATGGAT  9450
GATGGTTGTG CAAAATGGAA AAAACACTCA AAAGCTATTC AGTTTTGTAC  9500
TGATCACTTT TCAAAAAAAG AAGTTCAACT TCTTTGTGAT GTTTTTATTG  9550
AAAAATATAA AATAAAAGCT AAACCACAAA AGTATCTTAA AAAAATCTGG  9600
ATTAGATTTA AAAAACATTA TTTTTCCTTT ACTTCGAGAT GAAATGTTAT  9650
ATAAATGTAA AATAAACGAA GTTTATTCTA GAAAAATGGT ATAATTAATA  9700
TTCAATTAAC ATATATGCGA TGTCGACTCG TCACATCCTC G           9741

<210> SEQ ID NO: 135
<211> 4819
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TTTGCTTTTA TTTAAATAAA AGCAAATATC ACATAAAAAT AATAGAGTTT    50
ATATTGCGTT TTACAATAAA CCTAAATTTA TTGAGCATAT AGCAAATAAA   100
TGCAAGAACA GTGCATTGCA TCTAAAAATG GTTCGGGATA GACACCTATC   150
CATAAAATAC CAGCAATTAG AGGGGCAAAC ATGAAAAATT CTCGACGATT   200
TAAATCACTC CAAGTATTAA TAAAATCAGG TTTTGATACA CCATACACTA   250
ATCGATTACA TAACCATAAA GCATAAGCAG CACCTAAAAC CATACCAGTA   300
GCTGCACAGA AAGCAACGAA ACTATTGTTT TGGTAAGCCC CAATAAATAC   350
TAAAAATTCT CCAGGGAAAG AACTAGTACC TGGTAAACTT ATATTTGCCA   400
TTGTAAAGAA TACAAATAGG AGAGCAAATA ATGGCATAGT TTGACCACAT   450
CCAGAATAAT AACGTAATAA TCGAGTTTTA TGTCTATCAT AAAGTACTCC   500
AACACATAGA AACAGTGCAG GAGAGACTAA ACCATGACTA ATCATAAGTA   550
AAATACTTCC TTCAATACCT TGAGTATTTT GACTAAATAA TCCTATAGTT   600
ACAAAATTCA TATGAGCTAC AGATGAATAT GCAATAATTT TTTTAAGATC   650
AATTTGTCGA ATAGTAGTAC AACTTGTATA TACGATAGCT ATGATACTCA   700
TTGTGTAAAT TAGAGGTGTA AAATATATAC ATGCATATGG AAATAATGGA   750
ATAGAAAATC TCAAAAATCC ATATGTACCT AATTTTAGAA GTATACCAGC   800
TAATATTACA GAACCTGCTG TAGGAGCTTC TACGTGAGCT TCAGGTAACC   850
AAATATGAAC AGGTACCATA GGAACTTTAA CAGCAAAACT TGCGAAAAAA   900
GCTAACCAGA GTATACATTG TCTAGTTTCA CTAAAATCAG ATAGATATAA   950
CATTTCAATG TCTAGAGTAC CTGTTTGAAA ATATATTAAT AATATAGCTA  1000
ATAACATTAA TACAGAACCA AATAATGTAT ATAAAAAAAA TTGATATGCT  1050
GCTCTAATTT TTCTTTCTCT AGAACCCCAA ACACCTATGA TAATAAACAT  1100
AGGAATTAAT ACACTTTCAA AAAATATATA AAATAATAAC AAATCCAATA  1150
CTGAAAATAC AGTTAGCATT AAAGTTTCTA ACACTAAAAA TGCAATACAA  1200
TATTCTTTTA CGTATATTTC TATGTTATTC CAACTTACTA AAATGCAAAT  1250
GGGAACTAAT AATGTAGTAA GAATAATAAA AAATAATGAA ATACCATCAA  1300
CCCCAAGAGC GAAATTTAAT GCAGAAAAAC TAGATGAACT CGCAGCTCGA  1350
GCTAAAGTAA CATCTGAAAA ACCACTTGGA GAACAAATAC CATTGGTATA  1400
TTGAAATACT GCTGAACTAC TATCGAATTC GATCCAAAGT AGGAGAGAGA  1450
TTAAAAAAGT CAATAACGAT GAGTTTAATG CAATATTTCG GATAGTTTGT  1500
GTTTTCCAAC TTGGAACAAA TAATAATGCT GTTACGCCTA ATAGGGGTAC  1550
AGCTAAAACC ATTTCAATAT ATGACATAAG GCTTAGAAAA AATATTGTTT  1600
TTTAAATTAT AAAATAATAA ATTTTAAATA ATATACGTTA TTTTACTTCG  1650
TACAANNNNA TATTATTTTA TTGTTGTACA TATATTTATT ATTTTTTTAA  1700
TATTCTTTAA TTATACATAT CGATACTACC AAGAAAAATG TGATGGAAAA  1750
TATTTTGTTG CTATTTATAT TTATTTCCAT GAATATAAAT AGCAACAAAA  1800
ACGTTCTGTT TTTAATATAT ACTTATTATT TTTCTAAATT TTCAGTTGAG  1850
CCTAGGGCTC GAGATGCTGA ATTATAATAG TTATAAAATA AAAAACAAAT  1900
AGCATATATA AATAATATAC GACTATCTAA AACATTTTCT AGTACACTCC  1950
AAGCACTACT TACAAGAACG AGCATAGTAA GTCCTAATAG CATAACAACT  2000
GCGTAATGAT ATATCAGACC ACTTTGTAAT CTGCTCCAAT ACTGAGCAAG  2050
TGCTGGAAAA CTATAAGCGA TACCAGTAGG TCCACAAATT TCAAAAAATC  2100
CTTTATCAAG AGCTTTAAAA CTTACATTAT AACCAAAAGA CATAGTGTTT  2150
TTTGATATGT AATCGTTGTA TACTTTATCA AAGTACCAAC GTTTGTTTAA  2200
AAAACTATAT AGGCTTCTGA AGAAAGGTAT TTGTTTAAGA GCAAAGCTTG  2250
TTTTAGTACT ATAGACAGAA AAAAAATAAG CAATTACAGC ACCTGCAACA  2300
GTAGCTATTA AAGGTAGAAA TTTTTGAGAT TGTGGTATAA ATTCAGATTC  2350
CAATAAAACA TTATTTTGTG GTAAGACAAA AATAGCATTA GCCCAAAAAG  2400
GGGTAGCTAG TCCAATCATC ATATCTTTAG CACAATACCC TACAAAAATA  2450
CTTCCTATAG ATAATAGAAA CAATGGCAAG GCTAATAAAA ATGGCGCATC  2500
ATGAGCAGCT TTTACAGTAG ATTTATAAGC ATTCGTTGGA GCTATAAAAG  2550
TAAGAAATAA CAAACGAAAT GAATAATAAG ATGTAAGAGC TACACAAATA  2600
CTTCCAAATA AATAGGCAAA ATTAGCATCC CAAGTATATC GAGCGAAAGC  2650
TACTTCTAAA ATAACATCTT TACTATAGAA TCCAGTAAGG AATGGAAATC  2700
CCACAAGTGA AAAGCTTCCA ATAGACATCA TAGCATAAGT AAAAGGTAAA  2750
AGTTTTGCCA TTCCTCCCAT TTTTCTCATA TCTTGTTGAT CTGAAAGAGA  2800
ATGAATAACA CTCCCAGCAC TTAAGAATAA TAATGCTTTA AAAAAAGCAT  2850
GATTCATTAG ATGAAATACA CCTACAGTAT ATTGAGAAAT ACCACACGCA  2900
AAAATCATAT ATCCTAATTG ACTAGCAGTT GAATAAGCAA TAACTCTTTT  2950
AAGGTCATTT TGTACTACTC CAGTTGTTGC TGCAAAAAAA CATGTCATAG  3000
CTCCTACAAT AGCTACTACA ACAAGAGCTT TAGGAGCATA TTCAAATAAT  3050
GGTGAACAAC GTGCTATCAT AAAAACACCT GCTGTTACCA TTGTAGCTGC  3100
ATGAATTAAT GCAGATACCG GTGTAGGACC TTCCATAGCA TCTGGTAACC  3150
AAGTATGTAA ACCTAATTGT GCTGATTTAC CTACAGCCCC TACAAACAAT  3200
AATATACATA TCACTGTTAA TGCATGAAAT TCTATATTAC AAAATATCAA  3250
CACCGCTTCT GATTGATGAG CAGCAAGTGC AAACACCGTA TAAAAATCTA  3300
CCGTCTTAAA TACTGAAAAA ATAGCCATTA CACCTAATGC AAGACCAAAA  3350
TCACCTACAC GGTTCACTAA CATCGCTTTA ATAGATGCCT TTGATGCTTG  3400
AAGTCTGGTA AACCAGAAAT TAATTAATAA ATATGATGCA AGACCTACAC  3450
CTTCCCAACC AAAAAACATT TGTAAAAAAT TATCAGCTGT TACCAACATT  3500
AACATGAAAA AAGTAAAAAT AGATAGATAT GCCATGAAAC GTGGTAAATG  3550
AGGATCTGCT TCCATATAAG AAATAGAATA TAAATGAACT AGAGTACTTA  3600
TACTTGTAAT TACACAAAGC ATTATTACTG TAAGACTATC AAAATAGAAA  3650
CCCCAACCGG CATCAAACAT CTCTGAAAAA AACCAAGGCA TCCATTCAAT  3700
ACTACAGGTT ACTCCTGATA ATGCTACCTC ATAAAAAGCT ATGCAACTAA  3750
TAACAAAAGA TAAAAATACA CTACTAGTTG TTAATAGAGC TGCTCCTCGA  3800
AATCCTAAAA ATCGACCAAA CAACCCTGCT GAAATACTTG CAAGTAAGGG  3850
TAATAAAAGT ACCAGTAAAT ACATAAGTTT AGTTTTATAA TAATATTCAC  3900
ATTTTTTAAT AATGTGAAAA CATAGAATTA TATTACTTAC GACATAAATA  3950
ATATAAGTTA TGTTGTATCA AGTAAAATAA ATCCAATTTA TTGTCGTACC  4000
AAGTAGTACG AAGTAAAAAG TATTTTTATT TCGATAATAA AAGTAGAAAT  4050
GATACGCTTT TACTTATTAA ACGAATAAAT GATATTTATT CCGTTATAAA  4100
AACTACAAGC ATTATCGATA TACAAAAAAA CAATACAAAA AAATTATAGT  4150
TTAAAAATTT TAAATTGATA CGCACTCTGT AGGAATTGAA CCTACGACCG  4200
TCAACTTAGA AGGTTGTTGC TCTATCCAAC TGAGCTAAGA ATGCCCTTAT  4250
CCGAAAATAA GAATGCCCTT ATTTTTGAAT AAAAAAATCC TTATCTTTGA  4300
ATTTAAAAAT TGCTCACTAT CGGACTTGAA CCGATACCCT AAAAGGAACA  4350
GATTTTAAGT CTGTCGTGTA TACCAATTTC ACCAAGCGAG CAGAATATAA  4400
TAAAAAAATA GAGTGGAATA AAATAACATA TCATATTTTC AATAGTGACC  4450
CAAAATATGA TATGTTATTT TTTTTTTTTG TGTACAGGTT TTTTACGAGT  4500
AACCGCAAAT TCGCCAAATT TATGACCAAT CATTTCTGGG GAAATTTTAC  4550
ATGGAATAAA AATTCTACCA TTATGAATTG ATACAGTTTT ACCAACAAAT  4600
TGAGGTAAAA TAGTAGAACG ACGTGACCAT ATTTTAATCA TATTGTCTTT  4650
TATTAGTCGT ATTTCAGAAA AAGGAGGTTT TGATAAAGAT CTTGACATAT  4700
ATAGTGGTTT TATATTTTAT AGAAATTGTT GTCGCGAATA AATCTCATTT  4750
TTTTTAAAGG AAAAAGGAAT AACATTATAT TCATAAAGAA TATACAATAA  4800
CATTATATTC ATAAAGAAT                                    4819

<210> SEQ ID NO: 136
<211> 6262
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TATATAGACA GATATAGGGA GAGTGGCTGA GTGGTTAAAA GCGGCAGATT    50
GTAAATCTGT TGAATGTTTT CTACGTAGGT TCGAATCCTG CCTTTCCCAT   100
TCTTTTGAAG TCTTTGGTTA TTTTTATATA AAAATAATAT AGACTATATA   150
AAAAGATTTA AAAATTAATA TTATTATTTT TTGTCTTNTN TANNATTNNN   200
AANATNNAGA AGGCAACTTG GCGGTTTAAA ATAAATTTCA TTTATTCGTA   250
ATAAATTTTA TTTATTCACA ATAAATTTCA TTTATTCACC CTTCTATTTT   300
AAAAATTTTA TATCATTATG CCACAATTAG ATTTAGTTTC GTTTTTATCA   350
CAATATTTTT GGTTATTAGT AGCTTTTGTA GGTTTTTATT TTTATTTATA   400
TAAAAGTTTT CTTCCAAAAA TGTATCGTAT ATACAGCGTT CGTGAAAGAT   450
TAAGTAACAA ATCACACGAA TCAAAATCAA ATTTTCAACC GTTATACATA   500
GAAGCATCTA AAAAAAAAGA AACTTTATTT GCTAATGTTT TAGGTTATGT   550
AAATCAAACA GTTGCAACAT CTCAAACAGC TGTGGAAAGT TGGAAAGATA   600
GTAATTCAAA ACAATTACTT GAAAATTCTC TTTCAAAATT CACAAATACT   650
TTCAAAAAAA CTCTAACTTT ACAGTCTCTA TCACAATTTC TTGTATTAAG   700
ATATGCAGCG CCATTAAAAG TGTCTTCTAC TTTTTCTGCT AATGTATTAC   750
CAAAAGCAAA ATCTGCAGTT AAAACACTTG AAACTGTAAA TATACATAAA   800
CTAGGGGACA AAAAAAAATG GAAATCAAAT TTATTAAAAT CTCCAAATTT   850
ATATGTACCT TATTTTGGAT TCCAAAGTTT AGCAACAGAG ATTTTAAGTA   900
ATAAATCTAC TAAACTAAAT GGAGTGTCTA ATTTAAGTTC TGAAAACAAC   950
GTTCAAGATT TAGCAACAAC AACGCAAACA AAAACTGTAT CAAAAAATAC  1000
ATCTAAGTCT AAATCTTCAA AAAGTTCTTC TAAGAAAAAA GCATAAAATA  1050
TAAATCGGTT TTTTTACATA CATATTTTAC CATAATAAAC CCTGTTTATT  1100
CTACCGTAAT AATTATAGAC TATATATGAA ATCATTATCT AAACCGTACA  1150
TATTATTATA TATTGTTTTA GCGTTTTGCG TAGCAAGCTC AAAACATATA  1200
TTAATTTATA ATGAAGAGAC ATTAGTATTA ATTTGTTTTT TTGGGTTTAT  1250
ATTTGCAATT AAACATTACT TTGGAGATAC TTTAACAGCA TCGATCCAAG  1300
AACGAGGTGA TACTATTCAC GATGAATTAA ACCAATCATA TATCACTCAA  1350
ATTCATCAAG ATTTATCTAT TCGTGATCAA TATAAACAGA TTATACCAGT  1400
TTCTCAAGTG TTATCTGTTT TTTCAGCTAG TTTAGTAGAA CAAAGTGAAT  1450
CTACTCAGAA ATTAGCAGAG TATAGTTTAA AACACCAAAT AGATTCTCAA  1500
ATTTCTAAAA TGTATACACA TTTATATAAA TTACAACAAT CGTTTCCTCA  1550
AAAACTTCAA AAAAAAATAC ACGATAACGT ATTACCTAAT ATACAACAAA  1600
GTTTAGCATT AAATAAAAAA CCAATAGATT TTATTAAAAA TTCTAAAAAT  1650
AAATTAAAAG CAAAAGATTT TGCATAATTT ATTAAGTCGC TAAAATTTAT  1700
ATCGTTTTAT ATATCTAAGA TTTTTATATT TATAAATTTG AAATAGTAAT  1750
TTATAAATAT AAAAATCTTT TAAAATNNNN NAAANAATNA AACCTAAAAA  1800
TCTTCTTAGT ATAGCTTTAC TAAGCATCTA AAATTTTTAG AGAAAAAAAG  1850
AGACTATATT TTTTTCAATT GTATTTTTTG TTAAAAAAAA TAAATTCGAT  1900
TTGAAAAAAA TTATAAAAAC CATCTATTTA TGAAATTTTT ATTATTTGCA  1950
TATACAGTAA TACCATTTGT TTCATTTTCT GATGCGCCAG AAGCATGGCA  2000
AATTGGTTTT CAAGATCCAG CAACTCCTAT TATGCAAGGT CTTATTGATT  2050
TACATCATGA CATTCAATTT TTCTTAATTA TTATATTAGT ATTTGTATTA  2100
TGGATGATAT CTAGAGCGTT ATATTTATTT CATTATACAC GTAATCCCGT  2150
TCCAGAAAAA ATCATTCATG GTACAGTGAT TGAAATTGCA TGGACAATAA  2200
CACCAAGTTT AATATTAATA TTAATTGCAG TACCTTCATT CGCATTATTA  2250
TATAGCCTTG ATGAAGTAGT AGACCCTGCA GTAACTATAA AAGCTATTGG  2300
TCATCAATGG TATTGGAGTT ATGAGTATTC AGACTATAGT GTAGCTGATG  2350
ATCAAAGCAT TGCTTTTGAT AGTTACATGA TTCCTGATGA TGATTTAGAA  2400
TTAGGGCAAT ATAGATTACT AGAAGTAGAT AATCGTGTAG TTGTTCCAGT  2450
TGAAACTCAT ATTCGAGTTA TTATTACAGC AGCTGATGTT TTACATAGTT  2500
GGGCAGTGCC TTCACTAGGG GTAAAATGTG ATGCAGTACC TGGTAGATTA  2550
AACCAAATAC CTATTTTCAT TAAACGTGAG GGCGTATTTT ATGGACAATG  2600
TAGCGAACTT TGCGGTGCAA ACCACGCATT TATGCCTATC GTTGTTGAAG  2650
CTGTGTCTCT AGAAAATTAC ATTTCTTGGG TATCTAATAA ATTAGAAGAA  2700
TTGTAATTAT TTAATATAAA TGATACTTAT ATAAATCAAT TTGATTTATT  2750
AGCGTAATAT TTTTTTACGT AAAAATATTA TTTAATTGAA ATACCAAACA  2800
TACATATTTG GTGTCTCAAT TTAATCTATA AGAGTGTATT TTAAAAATCA  2850
CAAAGTAAAA TAACGTATAT TCTTTATGAA TATCAGTTAT TGTACGAAGT  2900
AAAATAACGG AATAAACATT ATTTATTCAA CAAGTAGTTT TTATAAAACA  2950
ACAAAACCCT ATGCCTAATT CAATATTTAA ATCTTCTTTT TATAATAGGA  3000
TGGTTTGGTC CACAAATTAC ACTCCTCTTA ATAAATCCGG TTCATTTTAT  3050
TTTAGATTAA CTAAAAGAAC CATAAAAACT AGTCCCAAAA CAAGGAGTAT  3100
AGGTATCGAT AAGGAAACTA CTAATTATTA CAGAAAAGCA ACTCATTTTG  3150
GTCATAAATC TATCTATTTA TCGGAAACTA ATTCATGGCA TCCTTCTATG  3200
TCTCAATATC ATTTAGGGAT ACGTAATGAT ATTACTATCT GTAATCTTCA  3250
ACAAACGCAA AAATGTTTAT CAAGAGCATT CTATGTATTG AACAAGATTT  3300
TAGAAAATGG GGGTAATGTA CTAATAGTAA ATACTAATCC AGAATTTTAT  3350
AAATTATCAA TCAATAGTAT GAAATTTATA GAAAAAAATT TACCTTTTAA  3400
AACTTTTAGA CTCCTTTCTT ATTGTTTTTA TAAATGGATT GGTGGTACGT  3450
TAACAAATTA TAAACAAATA TCTCGATCTA TTTATTGTTA TATTAAATTT  3500
TCACAACGTT GTAGTGACTT CTGCGAAAGA AATAACATAG ATTTTACAAG  3550
ATATCAAAAA ATAAAAAAAT GTTTTCAAGG ATATGGTACT GTTTCTGACA  3600
ATTCTATAGT TGTGTCTTTA CAACACAAAC CTGATATTAT TTTTTTATTT  3650
AATCCTTCTG ATAATCAAAA TCTTATATCA GAAGCAAATC GATTACAAAT  3700
TCCAATTATT GGTTGTACTG ATACTAGTAG TGATGCAACT GGAATTACTT  3750
ATCCTATTCC TTGTAATAAT ACTAGTGTAG AATTTACTCT ATATTTATAT  3800
AAAAAATTAT ATAGAGTCTT AAAAAATTTA AAATAAATAA TAAACTTTAT  3850
TTATTTTAGG TGTATATCTA TAATATAACA ATGAAATGTT ATAGAATACC  3900
GCGGAAATAG CTCAGTGGTA GAGTGTTGCC TTGCCAAGGC AAATGCCGAG  3950
GGTTCGAATC CCTTTTTCCG CTTTTTTTNA ATAAAGTGAA TATAACCAAT  4000
TATAGTCTTG ATCTCCGAGC TTTTTTTTAT CAAGATATCA AATAAATCGA  4050
ATTTATTTTA GGTATTAAGA GTAGAGATGG GGGCGAAATA GTATATACTT  4100
TTTTAATGAA AATGAGTCGA ATTTATTTAA ACTATTCTAT ACATATATGA  4150
ATAAAATAGC CATAAGAATA TGTATGTGGA TGTAAAATGT GTTTTTTTTG  4200
TAGTACATCT AAAGAAAATA TAGAAACAAA AAAAAATATT TTGATATATT  4250
TATATTTTTG GTATATTCTG AAGGATTTAC TATTCTTTGT TTCTGCCTTT  4300
GTTATAGCAT CATTCTTTTT TCTGAAATAG AATAATAAAT TTTGTTTATT  4350
TTAGTATGAT GAGATAGCAG TTTATTTTAT ATATTTACAA TATATCTTTT  4400
ACTCGTAGGT TCAATTCGAG TTACTATATA AAATAAGAAT TTTAGATTTA  4450
TGGTTACACG TTGGTTATAT TCTACAAATC ATAAAGACAT AGGTACTCTA  4500
TATCTTATAT TTGGTGCATT TTCTGGTGTA TTAGGAACAG TATTTTCATT  4550
AATAATACGT ATGGAATTAG CACAACCTGG TAATCAAATA TTAAACGGAA  4600
ATCATCAATT ATATAATGTA ATTATTACAG CACACGCATT TTTAATGATT  4650
TTCTTTATGT TGATGCCAGC ATTACTTGGA GGATTTGGTA ATTGGTTTGT  4700
TCCTATTTTA ATAGGGGCAC CAGATATGGC ATTTCCTCGT TTAAATAATA  4750
TTTCATTTTG GTTATTACCA CCATCATTAT TATTATTAGT AAGTTCTGCT  4800
CTCGTAGAAG TGGGAGCAGG AACAGGTTGG ACGGTATATC CTCCATTAGC  4850
AAGTATAGCA AGCCACTCTG GTGGTAGTGT TGACTTAGCT ATTTTCAGTT  4900
TACATTTAGC TGGTGTTAGT AGTATTTTAG GTGCTATTAA CTTTATTTGT  4950
ACTGTATTTA ATATGCGTGC ACCTGGTTTA TCTATGCATA GACTTCCATT  5000
ATTTGTATGG GCCGTATTTA TTACAGCTTG GTTACTATTA TTATCTTTAC  5050
CCGTTCTTGC AGGGGGTATT ACAATGCTGT TAACAGACAG AAATTTTAAT  5100
ACTAGTTTCT TTGATCCAGC AGGAGGTGGA GATCCTATCT TATACCAACA  5150
CTTATTCTGG TTTTTCGGTC ACCCTGAAGT ATATATTTTA ATTATACCTG  5200
GTTTTGGTAT CATTAGTCAC GTAATATCTA CTTTTTCTAA AAAACCTATT  5250
TTCGGTTATT TAGGAATGGT TTATGCTATG TGTAGTATTG GTATTTTAGG  5300
TTTTATTGTT TGGGCTCATC ATATGTATGT TGTAGGTTTA GATATTGACA  5350
CTCGTGCCTA CTTTACTGCA GCTACAATGA TTATTGCTGT ACCTACTGGT  5400
ATTAAAATTT TCAGTTGGGT TGCAACAATG TGGGGTGGTA GTATTGAATT  5450
ACGTACACCT ATGTTATTCG CTATTGGTTT CTTATTCTTA TTCACTGTTG  5500
GGGGAGTAAC GGGAGTTGTT TTAGCAAACT CTGGCTTAGA TGTGGCGTTT  5550
CATGACACTT ATTACGTAGT AGCGCACTTT CATTATGTGT TGTCGATGGG  5600
TGCTGTATTT GCTCTATTTT CAGGTTTTTA TTACTGGATT GGTAAAATTT  5650
CTGGACTACA ATATCCTGAA ACTCTAGGAC AAATCCATTT TTGGATGATG  5700
TTCTTAGGAG TTAATATTAC ATTTTTTCCA ATGCATTTCT TAGGATTAGC  5750
TGGAATGCCT CGTCGTATTC CTGATTATCC TGATGCTTAT GCTGGATGGA  5800
ACGCAGTAGC GAGTTACGGA AGCTATTTAT CAATTGCTGC TGTTTTATTC  5850
TTCTTTTATG TTGTATATAA AACACTTACA AGTGATGAAG TATGCCCACG  5900
TAACCCTTGG GAAACTACAC CTGGTGTATC TCCAACATTA GAATGGATGT  5950
TACCTTCACC GCCAGCATTC CATACTTTTG AAGAAATTCC AAGTATAAAA  6000
TCATCTTCTT CAAACTAAAA ATTTAGTGTG TTTTTATAAA ACAACAAAAA  6050
TCTTTAGTTA CATATTGATG AAATTATGAT TTTTATTTAC TTTAGCTATC  6100
TTGAGAGGTA GCTAAAGTAA ATAAAAAAAT AATTTTTTCT TATAAATACA  6150
ATGACACTTA TATAGGAATA ATCTACATAG ACATATATAG GAATAATCTA  6200
TATGGACAGA TATAGCAATA ATCTATCAAT CAAAGAATAA GGAATTATAT  6250
TGAGTTTGAT CC                                           6262

<210> SEQ ID NO: 137
<211> 2961
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
GAACAAAGTC CTTTATAAGA CAAGACTCTA TATTAAGCTT GATTGTTAAT    50
AAATTTCATT TATTAATACC AGGATTATAA ACATCGTTGT ACTTTGTTTA   100
TAAAAAAATA AAAAAATTTC ATGAAAAGAC TATCAATTAT AAAACAACCA   150
ATATTATCAA TATTAAATGA TCATATTGTA GATTATCCAT CACCAAGTAA   200
CCTTAATTAT TTTTGGAGTT TTGGTAGTAT TGCAGGAATT TGTTTAGTAG   250
TTCAAATAGC TACTGGTATT TTTTTAGCAA TGCACTATAC ACCACACGTT   300
GACTTGGCAT TTATGAGCGT AGAACATATT ATGAGAGACG TTGAAGGTGG   350
ATGGTTACTT AGATATATGC ATGCAAATGG AGCAAGTATG TTTTTCATTG   400
TAGTATATAT CCATATGTTT CGTGGTTTAT ATTATGGAAG CTATACAAGT   450
CCTCGTGAAT TATTATGGAT TGTAGGTGTT GCCATTTTAT TATTAATGAT   500
TATTACAGCT TTTATCGGTT ATGTATTGCC ATGGGGTCAA ATGAGCTTTT   550
GGGGTGCTAC TGTAATTACA AGTTTAGCTA GTGCTATTCC CGTAGTTGGA   600
AATTCTATAG TAACTTGGCT TTGGGGAGGA TTCTCTGTAG ATAATGCAAC   650
TCTAAATAGA TTCTTTAGTT TACATTATTT ATTACCATTT GTTATTGCAG   700
GATTAACAAT TGTACATCTT GCAGCATTAC ACCAATATGG ATCAAATAAT   750
CCTTTAGGTA CAAATTCAGC TGTAGACAAG CTTGGATTAT ATCCTTATTT   800
TTACGTAAAA GACTTAGTTG CTTGGGTAGC TTTTGCTTTA TTTTTTTCTG   850
TGTTTGTTTA TTTTTTCCCT AATTTATTAG GACATCCAGA TAATTATATC   900
CCTGCAAATC CTATGAGTAC TCCTGCACAC ATTGTGCCTG AATGGTATTT   950
CTTATGGGTA TATGCTATTC TTCGTAGTAT TCCAAACAAA TTAGCTGGTG  1000
TTGCAGCCAT TGCTCTAGTA TTTGTTTCTT TATTTTCATT ACCTTTCTTA  1050
AATACGTCAC CTATACGTAG TAACAATTTC AGACCAATAC ATAGAAAATT  1100
ATTCTGGTGT ATTTTAGCTG ATTGTTTCTT ATTAAGTTGG ATAGGTCAAA  1150
AACCTGTTGA AGATCCTTAT ATTATTATTG GACAACTTGC TTCAGTATTT  1200
TTCTTTTTTT ATTTTTTAGT AGCTGTACCG TTAACAGGAA AAATTGAACA  1250
CTATTTAATT AAATATAAGT CTTAATTTAT ATTTTATTGT TGGTTGTACT  1300
TGGTAACATC GTTGAACCAA GTAACTTGGT TCAACCAAGT AAAATAAATC  1350
TGATTTATTT AATAAATAAG GTATTTTAAA TTGCTGCTTT TCGCTAATAA  1400
AAATATAGTA AAAAGCAGCA ATTTAGCTTA TTATTTCGAT TCGATATTGT  1450
TGTATAAATT TCATTTATTT TAGGCATAGA CATAAATAAC TTTCATTTAT  1500
TTTCATTCGT TTTAATTTAG TTTAAATGAG GTAAATCTCA TGAATGAGGT  1550
AAATCTCATT AATGAGGACG TAGCTCAGTT GGTTAGAGTA TTGGCTTGTC  1600
ACGCCAAGGG TCGCGGGTTC GACTCCCGTC GTCCTCGTAT ATCATTAATT  1650
GCTTTCTATT TTCAATTCGA TTTGTTTTTA CTTCGTTTCG ACTTCGTACA  1700
ACGAAGTAAG GTAATGATAT ATTTTTGTAA ATGTTTAGGT AGCTCAGGGG  1750
TAGAGCGGAG GACTGAAAAT CCTTGTGTCG GTGGTTCAAA TCCACCTCTA  1800
GACAATTGAT AGAGTTGGAC AAAAGATTTA ATAAATTTAA TTTATTTTAC  1850
TTCGTTCAAT GAAGTAAACG AATTTAAATA ACGTATATTC TTTATGAATA  1900
TCAGTTGTTT TAATTTCATA CAACCAATCG GATTTATAAA TATCATATTA  1950
ATAAAACAAA ATTTATGAAT ATGATATGAA ATTGGACGAC ATATATAAAA  2000
AGTATATAAA CTTCTATTAT AATCTCTTTT GCATGGTAAA TAATAAATAA  2050
AAGCAAGTGG TTGGATTTAA TGAATAACAC CGAATCAAAT GAAATTTATA  2100
TTTATTTTGT TTTATAAATC CACATAATTA TAGAAATAGA TATATAATTG  2150
ATTATAAAAT AGAAGCTTAC GTACTATTTT TTTTGGTTTT GTATTAAAGC  2200
AGACCAAACA TTTTCGATTT TTCTCTCTAG AACATCAGAT AATTCGTTTA  2250
GTTGCGAATC AGGACTTAAA GTTTCTAATA ATCGACTATA AACTTCTTGA  2300
CTTGATGTAT TATATACACA CTCAATTCCT TTTTTTAAGT CCAAATGATT  2350
ATACGCTGTT TTATCTATAA TGGCTCCACA AAAAAACAGA TGTGGTATTG  2400
AATTTAGAGC TTTCCAAATA GCGGGTAATT GATTTAAATT CATACCAGCT  2450
AAAAGCATAT TAGGTCCTTG AAAAAGATTT AAAAAATTAA TATTTTTTCT  2500
ATCCAAACCT AAGTTAGGAA CTCCAGCTCT AGATTTATCT GTAGGAACCC  2550
CAGCTCTAGA TTTATCTAAA GATTTAGAAA CGATACAATC ATTATTATAT  2600
AAAAACGGAC TATCTGCCAA AAATTTTTGT ATAATAGTGT TTTTATATTG  2650
AAAAATACGT ATATTATTTT TTGTATCCAG ACCTAAATCA ATTTTTAGAT  2700
TAGATATATT TTTTTTTTGA GTTTGTTGTA TAAATAAAAC GTATCGAGAA  2750
TTTTGTAAAA TGCTGGCAGT TTTAGATAAC TGTATAAGTT TTTTTTTACT  2800
TGCCATAATA TATTGAAAAT TTTTTATTAA TTTATTAGAA TTATAATTTT  2850
TATGCGATCT ATTTATTTGT TTTTTTATTA AATTAAAAAA CACGCATAAA  2900
TTACCGAAAA AAATGCATTC GTTATAATAA CAAGTTGAAA TAACTGATAT  2950
TCATAAAGAA T                                            2961

<210> SEQ ID NO: 138
<211> 2650
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TTCGGTTTAC AAAACCGATA CTCTACCAAT TGAGTTAAGT AGGCTTGGTT    50
GTACGAAGTA ATTTCATACA ACCAAGTTTA TTAAAACCTT ATAAATAAAA   100
GCAGTTTAAA AACTGGAATA CGTGCTTGGA AAGGGTCGAA CTCTCAACCT   150
TCAAATCCGT AGTTTGACGC TCTATCCAAT TGAGCTACAA GCACTAGTAA   200
ATCGAATTTA TTTAAATGAT ATTTATTCCG TTATTTTACT TTGTACACCA   250
AGTAATACTT TGTTTTAATA AATATCATTT ATTTTATATT TTAACCTATG   300
CTTTTTTAGC CTATTCTATC AAATAGGCTA AAAAAAAGCA TAATTTAAAA   350
GTTTAGTAAT TTTTAATAAA AAATAGAGTC CAAATAATAG AGTAGAGAGA   400
TGAAACCAAC CGGGTCTATT GGACTCTAAC CTTTCATTAA ATTAATAAAT   450
TCTACTGCAA TGGTTCCACG AATACGATAA TATACTACAA GTAATGCTAA   500
TCCTATCGCG GATTCAGCTG CAGCAACTGT TAATATCAAT AGCGCAAAAA   550
GTTGACCTAT ACAATCATCA ATATAGACAG AGAATAATAA AAAATTTAAA   600
TTCACTGCTA ATAACATTAG CTCAATTGAC ATCAACATTA CAATAATATT   650
TTTTCTATTT AAAAAAATGC CCCAAATTCC TAATAGAAAT AATATCATGG   700
AAACTGTAAG ATATTTAGAT AAATCCATAG TATTTATTTA TGTGTTTTTC   750
TTTTTTTTAT AGTCTAGAAA CAAATATAAT TATATTTGTT TTATGTTTAT   800
ACTTTTATAT CTATATTTAT ATACTGAAAT AACGTTTTTC TAAAGGTTAT   850
TTTATGCTAC TTTTCTGATA GTTTTAGCAA AATCTCTTGT ATTTTGTAAA   900
AAAACTTGTT GTCTTTTAAT ACGAATACCT TTTTGCATTG TTAAAACAAT   950
AGCTCCAATC ATAGCAACTA GTAATATAAG ACTAGCTACT AAAAAAAAGT  1000
AAAAGTAGTA AGTATATAAT AGACTACCTA GAGCATCTAT TGTATGAGAT  1050
GTAGATAAAA ACATTCGCCA ATCTATAAAA GATAATTGAT TGTAGTTGGC  1100
TAGTAATTCA GTATTTTCAA TATTATATGA TAGTAATGGA ATCAAATCAT  1150
TATCGATAAG TATACAAATT TCAAATAAAA ATAGTACCGC CAATACACCA  1200
CCAACTGGAA GATAACGTAA TCTTTTTTCG CTGATCTCAG AAATTCTTAT  1250
ATTTAACATC ATAACTACAA ATAAGAATAA TACTGCTATA GCCCCTACAT  1300
ACACAACTAA AAATAATAGT GCAAAAAAGT CTAATCCCAA TAATACTAAT  1350
AAACCTGCAG CATTAAAAAA TACTAGAATT AGAAATAATA CAGAGTGTAC  1400
AGGATTTCGG GCTTGAATAA CTAGTGCACC TGATACTAAT GTTAAACTGG  1450
AAAATATATA AAATAAAAAA TCCATAGATT TATATTTTGA TTTTTGCTTT  1500
GTAAAATAAA TGAAAGTTAT TTAAATAAAA AAAAGAAAAT AACTTGTTTA  1550
TTATGTTTTG ATTATGTATT ATATATGTAC AAAAATGTTA ATTCATAGAT  1600
ATATACGTAC ATAGATTCTT GTTTAAAATA AATGATATAT TGATTCAATA  1650
TCTCAAGGTC TATAGATTCG ACAATACCTA ACATCACTTT ATTATATTTT  1700
AAAATTCTTT GTTGTACGAA GTTACTTCGT ACAACCAAGT TATTTGGTAC  1750
AACCAAGTAA ATTTTTTATT TAAAAAAACG CAATACAAAG CCAATCCATA  1800
AAGAAAATTC AAAAAGAAAA CTGGTTACTA ATAAGCAACA AAATTGACTC  1850
AATATTTCTG GGGGAGATAG TAATGATGAA ATACAAATGA TACAAAAAAA  1900
TACAATTTTT CTATATCTAC AAAGACTAAA ACAGTCAAAA TAAGATAAAT  1950
AAAATAAAGC TATAAATATT ACTGGTATTT GAAAAAATAG GACAAGTAAA  2000
AAATAAATTT GTATACTTGA ACTTACTGTT GATTCCACAC GAGGAGACAT  2050
TTCTATTATT TTAAATGAAT CCGTTTTATT AAAATAAAAC AATAAATCCG  2100
TTTTTTCCGA ATTTTGGAGA TAAATATCAT TTATTTTACC TTGATTGTTA  2150
TCTGTATAGT TGATATAGAT TTGAAAACTT GAAAATAACT CACAAATTTG  2200
TGGAAATATA TAAAAATATA TCAAGTATGT TTCGATGCAA ACAATAGTCG  2250
AAAACAAGAG CAAAAAAGTT GTTATTTCTT TTCTTTCAAA ATTATAACGG  2300
CTTGGTACCA AAAAACTCCA ATATTGATAA ATATAATTAA ATATACAGAC  2350
GCCTAGAGAA ACATAAATAC ATAATTTGTA AGTAGTCGAT AATGCTTCTG  2400
TAATATCAGT TGATTGTAAT ATTTGATTTA ATTGTAGAAA AGGTCGAGAT  2450
ATAAGATACA TCAGCTCAAG TTGATAATAG TAGCAAGTAA TTAATGTACA  2500
AATTAAAGAA TAAAATATAT AAAATATACG ATAACGTATT TCATTGTAAT  2550
GAAATAGCAT TCTAGAAGAA TTCATTTTTA AATTTTGTTG TGTGAAGTTT  2600
ATTAAGAAAT AAACTTATAT AATTATTTTA CTTGGTTGTA TGAAATTAAA  2650

<210> SEQ ID NO: 139
<211> 2112
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TGGTCAATTT GTTTTTTTGT TTTTAGATAA ATTTCAAATT AAATCTACCC    50
CAAAAAAACC AAAAAATATA TTTCATTATG AAAAAAATTG GACATTTTTT   100
TCACCCTCAA AAACATTAAA TACATCTTTT TCTAGTTTTA ATAAATCAAC   150
AAAAAAATTT ATTCTAACAA AAAATGCTTT TTTAACAAAT TTCTCATTTG   200
TAAACCAACC TGTATCTTTA GAATATCTGT TATTAACGCA GATTAATGTT   250
TTACAAAAAA AAAATTTAAA CCATTGTCAA TATTCACAGC CTTATTGGTA   300
TTCTAAAAAA AATTTATTAA ATATTAAGTA TTCTTGGAAA TCGACTTGTC   350
ACTTAAGAAA AAAATCTTTT TTTAATAATA TAAGCTTTAA GCAAATAGAT   400
AAAACGTTAA TTTTCTATAC ATTACAAATA CCGTCTAAAT TAAATAATAA   450
ATTAAATTTT AGTTCTATTA AAAAAATAAG TTNNNNNANT AATATTGAAA   500
AAAATTTATT GTGTATTTCT ATTCAAAAAT TAAATCAAAA TAATTTTCCT   550
TGCAATTTAA ATTTACAAAA ACAATGGGGA TCTTTAATTT TTAAATCTGA   600
TAAATCGAAA TTTTTTGTAA ATAATACTTC GCCAGTTTTT TCAAAACAAA   650
CAACTTATAA AACTTCTTAT AAAGAAGTTT CAAATAAACT ATTTATAATT   700
CAACCTGATT TTAACGTTGG GTGGCATTCT CCAACAAATT TGATGAAAAT   750
TAAATTAGAA CTTAAAACAC AAAAAAACTT TAAAGGATTT TATAGTGGCT   800
TTGTTTCTGA TTTTTATAAA ACAAATTCAA AATCAAATTT AAAAAATATT   850
AGTACTAAGG AAAGAAAAAA AGCTTTCAAA CGGGAGAGTT TTTATTTAGG   900
TTCTTATATA CACTTACCAA AAGAAAATTT TGTTTTTGAT CATAAAAATT   950
TAAATTTTCA AACGTTTTCT AATGAGTGGG TACTACCAAA TATAAAGTTA  1000
GCTGTTGGGT TTAAAACTTC TACAATTAGT GGAGAATTTA TTCGATTAAA  1050
AAAATCGTTA AATCAAACTT ATTGGAGTAG TTTAAGTAAA GATGATATTG  1100
TTACATTAAA TTTACCGAGT AAAGCGACTG AATTAACTTT AAGGGTTGGT  1150
CAAATTTTAC GATGGGGTCA ACCAATTTAT CACAATTTTG TCTCGCCATT  1200
TAATGGTCAA ATTATAAAAA TAAATCAAGG CCAGTTATCA ATCCGTCGAG  1250
GTTTACCCAT TTTAGCTTCA AAACGTGGAA TTATTCATAT ATCCGATAAT  1300
GATTTGATTT TAAAGAACAA ACTTCTTGTA ACGTTAAAAT CACGCCGATT  1350
AGAGACACAA GATATTGTAC AAGGTATTCC AAAAATTGAA CAACTTTTTG  1400
AAGCAAGAGA GACACTCTCA GGGGAAAAAT TACCAAGTAT TCATAAAAAA  1450
TTACAACTTT ATTTTTTAGA TGCTTTGGAT ATGTTTTATA AAAACAGCAA  1500
TCAGGTGTTT TCTGCATTAA CTCCACAACA ACAGAAAGTA CATTATTTTG  1550
AAATTGCTAA TAAAATGGCA GTAATTAAAA TACAAAACTT TATAGTTGAA  1600
AATATAATTG ATGCATATTT AAACCAAGGG GTTTCAGTGT CTGAAAAACA  1650
TGTTGAAATT GTTGTTCGAG AAATGACAAC TCGTGTAAGA ATATTGTCAA  1700
GTGGAGCTAC TGGTTTTTTA CCAGGAGAGA TTATTCAATT TAAAACAGTT  1750
CAAGAAATAA ATAATAAATT ATATAATAGC CAACAAAGTC CAGCATTTTA  1800
TGATCCTATT ATTTTAGGTA TTACTAAAAG TGTATTACAT TCGGAAAGTT  1850
TTTTATTAGC AGCAAGTTTT CAAGAAGTGA GCCGTATTTT AGTACGAAGC  1900
GCGTGTATTA AAAAAACTGA TTTTTTATCA GGATTGCATG AAAATGTAAT  1950
TGTTGGTCAA TTGATTCCTG CTGGGGCAGT TTTATTTTCA AAATTACCAA  2000
AAAAATATAC AATATAATTT AATATTATTA TAAAACATCG TTTTTGTTGC  2050
AATAAACGAT GTTTTGTAAT AATATTACAA GTATAAAATA GGCTTTTAGC  2100
TCAGTTGGTA GA                                           2112

<210> SEQ ID NO: 140
<211> 1550
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TGTAAATTAG TTGCAACTAA TTTACATGAT AATCTTTTTT TTAAAACTAT    50
TCCTAAAATT AACAAACCAC ACGTATTTCA TGAAATAAGT AAAAAACTTC   100
ATAATAACAA TAATTTACAC GAATCTAATC TTGTTAATGG GTCTATTACT   150
CTTGCAAAAA AACCAGAACA TCAAGTGGGT GCACCCAATT TTTATTTTTT   200
AGAAAATATT AGTGCTAAAT TATTATTAAA TTCTTTAGTA TCAGACTCAT   250
TTTGGTCTAA ACAAACACCA AATTTAGAAA CAAATACATC TAAACGTAAT   300
TTAAAAAATT TATTATTACA ATATTATCTA GTAGGTGCAA GAAAAGAATC   350
TGGATATAAT AACGATATTG CATATCAATC CAACGATCTT ACAAATACGG   400
GACTTGAACA TATATCTAAT CATTTATTCG TTGATGGGTT TCAAAATAAA   450
GAGTATAATT CGTTAAATAG TGAAAATATT TTTAAATATT CTCTACAAAA   500
GAAAGGATCT AATAAAATTA AAGGTTTGAC TAAAACCAAT GAAATGAAAA   550
CTATTGGTGT GACGCATGTG ACGCAAAATA TGTCTTTATC TTTTATGCCA   600
GCATTACAAA AAGATCATTT TGAGACTGAT CAAAAATTAT ATAATAAACT   650
TGACTCATCA ACTCAAATTC AAAGCTTAAG TAAAATAAAC GAAGTTTATA   700
AAACCAATGA AATTAATTCG TTACTTGATA CAAACAATTT ATTAAAATTT   750
GGTCTGGAAC AAACTAATAT TTATAATGCA TTAGATAGCA CTCGATCAAT   800
GATTTCTCAA AAAAAGTTTA AATATAAAAC TCATATTGAG AATTTTTTAT   850
CATCACACTA TAATATTGAT TTACAATTAT TTCCTTTTTA TAGTAAACAA   900
AATTGGCAAA GTGCTGGTTT CGTAGCTGAT GAAATAGTCT ATTTTATTGA   950
ACGACGAGTT TCTTTTTCAA GAATCAAAAA TAGAATATTA AGACAAGCAA  1000
GTATGCAACC ATTTATTAGA GGTATACGTA TTACTTGTAA TGGGCGTGTT  1050
GGGGGTAAAT CTAAAAAAGC ACAGCGTGCT ACTCAAGAAT GTGTTAAATA  1100
TGGTGAAACA TCTTTACATG TGTTTGACTG TAAAATAGAT TTTGCTTCTA  1150
GAAGCGCAAA TACTTCTTTT GGTTTAGTAG GTATTAAAGT CTGGATTTGT  1200
TTTAAATAAA TTTAGAAGAT ATAAACATCA CCTCATTTTA TGAGGTAAGG  1250
GTTGGGTAAC ATAATGGTAA TGTATGGGAT TGCAAATCCT ATAATGGCAG  1300
TTCGATTCTG CCCTCAACCT TGATTAATGT TGAAAACGTT ATTTTAAACG  1350
GGTGTGTAGC TCAGTTGGTA GAGCACCGGG CTTTTAACCT GATGGTCGTA  1400
GGTTCGAGAC CTATCATACC CANNNNNNNN NNNANNAGCG GGGAAGAGCA  1450
AATGGTTAGC TCGTCAGGCT CATAATCTGA AGGTTGTAGG TTCAAGTCCT  1500
ACCCCCGCTC TAAAATGAAT AAACAATATT TATTCATTTA TTGTACGAAG  1550

<210> SEQ ID NO: 141
<211> 4083
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
CCTCGAGCTG CAGAAGGTTC GAAGGGTTCG GCTGTTCGCC GATTAAAGTG    50
GTACGTGAGT TGGGTTTAAT ACGTCGTAAA GAGTCTTGCG ACCTTAAGGA   100
GCAATCCTTA TGTAAAAATT GCCTCATAAC GGTGAAGTCC TAATAGTAGC   150
CTCATACACT CATTAGGGGA ATACCGTGGG AAGTCTATAA TATAAGAAAT   200
AAAATGTGTA TCTCTTTAGT CTATTTTGCC TAGAAGGTCA TGATATACTG   250
TGCACAATTC TTTATATTTC AGTGACCCCG TAACGACTTT AATTCTATAT   300
CTAATAAAAA CAACTGGACA TAATAAATCT CATTTATTTT AAATTATAAT   350
CTATATCTAT GTAAACAAAT ATAAGTTATA ATTTATATTT AGTTTTAGAT   400
TAGAATTAAG ATATCTTTTA AATTAAAGAT ATAACACGGC AACTCTTATT   450
ATAATAATAA GATGAAGGTA TAGTCTGAAC CTTAAGTAAT TAGGGATTTT   500
TCTAAGTAAT TAGAACTTGT TTATTAAAAC TGTAATCTTA TTTAATACGT   550
TTGTAAACGC TTAGAGACAG TATGTATCCT ATCTTTTTGG GGTGTAAAAA   600
AACTGACAGG ACGATTTCTT TGTACGAGAG GACCGGAAAT CACGAACCTC   650
TTGTGTATCA GTTGTTATGC CAATAGCATA GCTGAGTAGC TACGTTCGGA   700
CTAGAGAACC GCTGAAAGCA TCTCAAGCGG GAAACTAACC TGAAAACAAG   750
TTTTTAGTAT CGTAGAAGAC TACTACGTTA ATAGGTGGAA TGTGTAAAAA   800
TAGCAATATT CCAATTTTAT TTCTAGTTTT TAACAAAAAA TAAAAGAGCA   850
AATCCATACT AAACTCTATT TTAAATATTA TTTATATTTT ATAAATTAGG   900
TGTAGGTAGA AATAAATAAA ATTTATTTAG ATCAATATCT AATTTATAAA   950
ATATATAATT TTTATCATTC TGTTGCGGGC ATATATTAAT GGTAAATTTG  1000
TTATTTTCCA AGTAATCGAT ATGGGTTCGA TTCCCATTGT CCGCTTGTGT  1050
ATCGAGACAA GTAGATAAGA TTTAGGATAT AACTCTATTT ATAAATTTAG  1100
GTAGATAAAA AAATTATATC AATTCTGCTA TTATGTACAA ATATGTTTTT  1150
TACAGAGTAT AGCGCAGCCT GGTAGCGTGT TCGCTTTGGG AGCGAGAGGC  1200
CGCAGGTTCG AATCCTGCTA CTCTGATTTC GTGTTTTATA GCCCTGCATA  1250
ATAAACTTTG TTTTATTAAA TAGCTTTATA CCCAAAATGT ATCTGATATA  1300
ACTATATAAG GTTGTTTTTA GTTACAGTGG GTTTCTCATC ATCTTGATCA  1350
TCGTTTTATT TTTAGTCCTA TGCATCTCTG TACCAATAAT AACAAAAACT  1400
AAAGGTAAAT CAAATTCTTC AATTGCATGT AAAAGGCCAG TAGGTATTTA  1450
TAATCTATTT TTTCCCTGCT TTTTTATTAA CTAAGACTCT TAGAATATAA  1500
ATTTAAGAAA GTATATAAAA TAAACTATAA CATATATCTT TTAGTAAATA  1550
TGTATGATAG TGAATATGCT TGATAGTTAA CCAGCTTATT TAGTAACATC  1600
TCCCTAATGG ACCCCCCGGA AGACAAGCTG TAGAGCAAGT TAGAGAGCAA  1650
AGAAAAGAAT AAAATATAGA TTGATTAGTC ATATAACTTT TAACTAGTAA  1700
AGCCTATATT TCCTATTATT GTATAAACTT TGTAATAAGT AATTCGAATA  1750
ATTAATAATG TTATATACTT TTTACTATAT TATTATAGTA ACTATTAATT  1800
GTATAGATTT ATTAATTGTT AATTAGAATA AGTAATTAGA TTGATTACGC  1850
AAGTAAAAAC CTAAATCAAC TTTGTAAAGA CTAATTGTAA AACTAAATAG  1900
ATTGGTATAA ATAAAATGCT GTTGTTCCAA ATGAGGCATT GTAATAATTG  1950
CAACATATAA TTTATAATAC CAAGGATTAA AAGTATGGTT ATTGAGCTTG  2000
TGAGCATATA AATTGATAGG TAATTAGTTT CAATTATACA GTAAAACGTA  2050
AATATATCAA ATATAGCATC TTTTATTTAA AATATAATAT AAAAGCATTT  2100
CTAGGTATAC ATTTTTTATC ACACCACCTT AAAAAGTATA CTTATTTTCC  2150
TTATATTCTC CGTACCATGT AAAATAAATG AAAATTTATT AAANNNNNNN  2200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2250
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2350
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN  2400
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNT AAAATAAATG AAAATTTATT  2450
AAAATAAATG AAAATTTATT TATTGGTAAG TGGTTCAACA ATATGATCAT  2500
TANNTAATGA TCATATTGTA GATTATTGTC TAGTCTAAAA CATTGAAAAT  2550
AATTTTATTG AAGTGATTAG TAGGCTTAAA TTCGTCGATG GGTTTTATTT  2600
TTATCTTTCG TAAATCAAAT AAATAAATAT CATTGGTTTT AAACTTGACT  2650
AGAAATGCGA AATGTAATAA AAAACCTCCT TATAAACAAA ATTTCTATGG  2700
TTAATTTTGT TTATTTATTT TACTTATATT AGAAAAACAA AAGCCCCATA  2750
GTGTAATAAA AAATAGAAAT GTCAATCGAA TTGCAGCCTG AAATTAGAGA  2800
CAATAGAATC TATTGTATTG TCTCTTTTAA TAGAATCTTT AAATTATACT  2850
AGTGAAATAA ACAAAGTTTA TTAAAATAAA CCAAGTTTAT TAAAATAACT  2900
GAATTTTATT TGATTTATTT CCGAGTAGTA AATGCTTTTT CAAGCAAATT  2950
TAATGTCTCG TCTTTATTTG GATAATATCT ATCGTATTTT ACAGTAGTGC  3000
TGATATTAGT ATGTCCTATT AAATTTGATA CAATTTGTAA AGGTGCATGT  3050
TTTAAGAGTG ATGTTACAAA ATTAACTCGA AAAGAGTGTG ATTTAATATT  3100
GAGTCCTAGG TGTCGAGTCT TCTCTAACAA TCTACTATTA ATAAAATAAA  3150
TCCAACTATT TTCGGCGATG CCTCCAGATA ATGTAGAATA CTTGTTAAAA  3200
ACAATAGTAC ACTCATTTTC AAGTGCTCTA AACTGATCAC AACTTTGTTT  3250
GGTAAAAAGT ATTAATCTAT ATTTATTAGT CTTAGGTTGA TAAACTTGTA  3300
AAGATTTGTG ATGTATAATT GAATCAATAT CATCCTGTGT AATGTTTCGA  3350
ATTTCATTTA CCCTCAATCC AGCAGCCCAC AATAGAGTAA CTGCAACTCT  3400
GAATCTAGAA TATTGTAAAT TTTTTTCTTT AAATTCTCTT TGTAGACTCA  3450
TAAGATAAAA GTAGACATCA TTGTTAGCTG GTTGCCTTAG AGGTAAAGTA  3500
TTTTGTTTAT CTCTCCTTTT TGTTTTTATC GTATGTAATA TTTCACGAGT  3550
GACTTGTTGT AACTGGTCGA CACTCTGTTC GATTCTCCTT ACAGAATCTT  3600
GTGTTTCCGT AATAACGACA TATGGGTTTT TATCGTAGTG TATTTCAACC  3650
ACAGGGTCAT TATGAGGTGT CTCGATAATC TGTAAATCTT TATCTTTTTT  3700
TAAAACAGAC TCTAACTTAT TTTCTAGTTT ATTTATTTGA GGCATAAATA  3750
ATTAATTTCG TGATTATTAT ATTCATATCA AGCAATATAG ATGTTTGNTT  3800
CTANANNANN NTNNTNNNAT NTNGCTTGAT AATAATCACG TCTCCCGCAT  3850
AAAGTAGTAC ATATGATGTA GAATGCTTTC ACAAATCGAG TTCGGAAAGG  3900
GTTCGGGTGG TTCACATTCC CAGATATCAA GCTTTTNTNC TTTNNNNNNA  3950
ANNNNTAGGC GAATAACGGG ATTTGAACCC ATGTTTCCAG AGTCACAGTC  4000
TAGCACTTTA ACCGACTAAG TTATACTCGC CAAGAAAAAC AATAAATTTA  4050
TACTGAACTA ATAACTGATA TTCATAAAGA ATA                    4083

<210> SEQ ID NO: 142
<211> 3352
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
ACTCCTTTNN NNANNTANNT AANNNAACAA ACCCATAAAA ATAGTATTAT    50
TAAAAAAATG AAAATTATAA TTAATATCTT TAAGCGTGGT TTTTTATTAG   100
GTATTTTTTT AGAAGCTTTT AAATTAGGAA GTTAAATATT GTATTTTTAA   150
AATTAATTAC CTAGCATTAA GCTATTTTCC CAAAAGGCCC CCCTTTTAGT   200
ATCTTCGCCC CTTGCGCGTT TCACTTCTTA GTTCGGGATG GAATAAGGTG   250
GTTCCACACA AGCATGAACA CTAGGTATTT TAAAATAAAT TTTTGTATTT   300
TCTGGTCAAA ATCGATCGGT CTATTAGTAT ATTTCAGCTA CACCCATTAC   350
TGGTATTCCA CTTAATACCT ATCAAACGGC TAGTCTTGCC GTGACCGATA   400
TTTCTAAATT AGAAAAGAGA GCACTCATCT TGAGGTGAGC TTCCCACTTA   450
GATGCTTTCA GCGGTTATCT TCTCCATACA TAGCTACCCA GCGTTTCCTG   500
TTGGCACAAG AACTGGTATA CTAGAGGTAT GTTCTTCCAG GTCCTCTCGT   550
ACTATGGAAG ACTCCTCTCA ATGCTCTAAC GCCTACACCG GATATGGACC   600
GAACTGTCTC ACGACGTTCT GAACCCAGCT CACGTACCGC TTTCATGGGC   650
GAACAGCCCA ACCCTTGGGA CGTACTACCG CCCCAGGTTG CGATGAGCCG   700
ACATCGAGGT GCCAAACCTT CCCGTCGATG TGAACTCTTG GGGAAGATCA   750
GCCTGTTATC CCTAGAGTAA CTTTTATCCG TTTAGCGACG GCCCTTCCAC   800
TCGGTACCGT CGGATCACTA AGGCCGGCTT TCGCCCCTGT TTGACTTGTA   850
GGTCTTACAG TCAAGCTCCC TTCTGCCTTT ATACTCTACG GCTGATTTCC   900
GTCCAGCCTG AAGGAACCTT TGCACATCTC TGTTACCTTT TAAGAGATGA   950
CCGCCCCAGT CAAACTGCCC ACCTGAAACT GTCAAGCGTC CTGCTTCAAG  1000
GAACGCCATT AGAATTCTAG CTTTTCCAGA GTGGTCTCTC ACTGTTGGCT  1050
CCAATTTTCC CAAAAGAAAA TTATCAACGC CTCCCACCTA GTCTGCGCAA  1100
AAAAAACCCA AATTCAATTT CAGGCTACAG TCAAGCTTCA TAGGGTCTTT  1150
CTGTCCAGGT GCAGGTAGTC CGCATCTTCA CAGACATGTC TATTTCACCG  1200
AGTCTCTCTC CGAGACAGCG CCCAGATCGT TACGCCTTTC GTGCGGGTCG  1250
GGACTTATCC GACAAGGAAT TTCGCTACCT TAGGACCGTT ATTGTTACGG  1300
CCGCCGTTCA CCGGGGCTTC AGTCGCAAGC TTTTTCTTAA AAAAGAATAA  1350
CCTACTTCTT TAACCTTCCG GCACTGGGCA GGCGTCAGCC CCCATATGTT  1400
GTCTTACGAC TTTGCGGAGA CCTGTGTTTT TGATAAACAG TCGCCTGGGC  1450
CTGGTCACTG CGGCCAAAAG ATGAAAAATC ACCCCTTGGC ACCCCTTCTC  1500
CCGAAGTTAC GGGGTCATTT TGCCGAGTTC CTTAGAGAGA GTTATCTCGC  1550
GCCCCTTGGT ATACTCTACC TACCTACCTG TGTCGGTTTA TGGTACAGGT  1600
GTTTTGCATT TATAGATAAT ACAGGCTTTT CTTGAGAATA TGATAAGAAA  1650
TTAATTCATA TTTTATAAAT AAAATACAAA CGTATCACTC GTTAGCTTTA  1700
AAAAGGAAGA AGTTTTTTTT CTTTTCCCAG CTTTGCAAGT TTCCACCGAA  1750
ATCCAATAAC GGCTAATTGT TACCTCTTCT GTCCCCTGGT ATCAAATACA  1800
AAACAGTACA GGAATATTAA CCTGTTTGCC ATCGACTACG CCGTTCGGCC  1850
TCGCCTTAGG TCCTGACTCA CCCTTCATGG ACGAACCTAG TGAAGGAATC  1900
CTTAGGTTTT CGGGGCATTG GATTCTCACC AATGTTTGCG TTACTCAAGC  1950
CGACATTCTC ACTTCTGCTT GGTCCATCTA AACTTACATT TAAACTTCAC  2000
CCTAAAGCAG AACGCTCCCC TACCGATATA AATTGTATTT TTATATATAC  2050
ATTATTTTCA TATATCCCAC AGCTTCGGCA GATAATTAAG TCCCGTACAT  2100
TTTCGGCGCA AAAACGCTAG TACCAGTGAG CTATTACGCA CTCTTTAAAG  2150
GATGGCTGCT TCTAAGCCAA CCTCCTGGTT GTGTTTGCAT TTTCACCTCC  2200
TTTGTCACTA CATTACCATT TTGGGGCCTT AGCTGGTGGT CTGGGCTGTT  2250
TCCCTCTTGA CAATGGAGCT TATCCCCCAC AGTCTCACTA CTACACCGTC  2300
ATTTCTAGTA TTCAGAGTTT GCCACGATTT GGTACCGTTT TCACAGCCCG  2350
CACCGAAACA GTGCTTTACC CCTAGATAGA CTAGATGTAC TGCTGCGCCT  2400
CAACGCATTT CGGGGAGAAC CAGCTAGCTC CGGGTTCGAC TGGTATTTCA  2450
CCGCTAACCA CAACTCATCC GCCGATTTTT CAACATCGGT CGGTTCGGAC  2500
CTTCACTTGG TATCACCCAA GTTTCATCCT GGTCATGGTT AGATCACCCG  2550
GGTTCGGGTC TAGAAAAAAT GACTAAATTG TAAATATTAA AATATTTGAT  2600
CGCCCTATTC AGACTCGCTT TCGCTAAGGC TTCGGAATTT TTCCTTAACC  2650
ACGCCACTTT TTCTAAGTCG CCGGCTCATT CTTCAACAGG CACGCGGTCA  2700
GTTTAAAAAA AACCTCCCAC TGTTTGTCAG CATAGGATTT CATGTTCTAT  2750
TTCACTCCCC TTCAGGGGTT CTTTTCACCT TTCCCTCACG GTACTAGTTC  2800
ACTATCGATC AAGAGTGCGT ACTTAGGCTT AGAGGGTGGT CCCCCTATTT  2850
TCAAACAAGT TTGTTCTCGT TCTACTCAAA CTATATAGTA TCTCTAAAAG  2900
AATACTATAT TATCAATATA TATATTGCTT CACAGTTCAT GTAAAATAAA  2950
TGAAATTTAT TAAATGAAAT TAAAACTTTC TCTGTAAATT TTAATAACAT  3000
AAATACATGA AATACACAGG GCTTTAACCT TCTATGGCGT GTTATTTCAA  3050
ATTACTTGGA TTTGCTACAA CAATATAGAT TACATGTTTC ACAGAGTTGT  3100
GTTTAAAATA TTGATTTGGC CTATTCCGCT TTCGCTCACC ACTACTAACG  3150
GAGTCTCGGT TGATTTTTTT TCCTCTAGTT ACTAAGATGT TTCAATTCAC  3200
TAGGTAAAAT TTATAATTGA GTTTCCTCTA TGGAGATCTA TATCATCTTG  3250
TATTACGACA ATTGTATATA GCTTTTCGTT TCGCTATACG TCCATTTCAC  3300
TCTTGTCTAG GCATCCACTA AATGTACATA TAGCGAAACG AAAAGCTATA  3350
TA                                                      3352

<210> SEQ ID NO: 143
211> 2440
212> Nucleotide Sequences
213> Chlorella protothecoides

<400>
GTCAATCTGA GAAATAGTCA ATCTGAGAAA TAGTCAATCT AGTTACTATA    50
GTAATCAATA AAGAAAGAAG ACAAAGAATA AATATCATCC ACTTTACAAA   100
AAACTAACCT ACTTATTAGA ATCAATCTCG ATATTCTCAT AACTAACAAA   150
GAAATAAAGT ATTCATGCTT AAACTAAGGA GAGCTCTCCA TATAATCGAC   200
AAGAACTTTA TGAAAAAAAT TATTTAAGTC GCTTTTAATA AATTTTATTT   250
ATTTTAATTT CATACAACCA AGTAAAGACG ATAATTAATT ACAGTTATTC   300
GTTATCGTGC TGTAATAAAT CTATCAGTGA ATCCATCAAA GAAAGTTTTT   350
AAAGTATTTA ATAATTCGTC AGATAAAGCT TTTTTATCTT TGATAGTACT   400
TAAAATAGCA GGATCAATTT CTTTTAATAA AGTTTGTTCA AATTTTTCAA   450
TATCTGAAAT ATTTAATTTA TCTAAATACC CTTTAGTTGC AGCGTATAAA   500
ACAACAACTT GTTTTTCAAT AGGCATTGGT GAATATTGAC CTTGTTTTAA   550
AACTTCAGTT AATCGTGCTC CACGGTTTAA TAAATATTGA GTAGAAGCAT   600
CTAAATCACT ACCAAATTGT GCGAAAGCAG CAACTTCACG ATATTGAGCA   650
AGTTCTAATT TCATAGTACC AGCTACTTGT TTCATAGCTT TTACTTGAGC   700
TGCAGAACCT ACACGACTAA CAGATAAACC TACGTTGATA GCAGGACGTA   750
TACCTTTATA AAATAACTCA GTTTCTAAGA AGATTTGACC ATCAGTAATA   800
GAAATAACGT TAGTAGGAAT ATAAGCAGAA ACGTCTCCAG CTTGTGTTTC   850
AATAACTGGA AGTGCTGTTA ATGAGCCACC ACCTACAGAA CTAGACATTT   900
TAGCTGCTCT TTCAAGTAAT CTTGAATGTA AATAGAAAAC ATCTCCAGGG   950
AATGCTTCAC GGCCAGGAGG TCTACGTAAT AATAATGACA TTTGACGGTA  1000
AGCTACAGAT TGTTTACTTA AGTCATCATA AATTATTAAT GCGTGCATTC  1050
CATTATCACG GAAGTATTCA CCCATAGCAC AACCTGAATA TGGTGCTAAA  1100
TATTGTAATG GTGCAGGGTC ACTTGCAGTA GCCGCTACAA TTACAGTATA  1150
TTCTAATGCT CCTGCATCTT CTAATGTTTT TACAATTTGT GCTACAGTTG  1200
ATCTTTTTTG CCCAATTGCA ACATATACAC AGTATAATTT AGCTGATTCG  1250
TCTTTTGATA CGTTTACGAA TTTTTGATTA ATGATAGCAT CTAATGCAAT  1300
AGCTGTTTTA CCAGTTTGTC TATCACCAAT AATTAATTCA CGTTGTCCAC  1350
GACCAATTGG TACAAGACTA TCAACTGCTT TTAAACCTGT TTGCATAGGT  1400
TGGTTTACTG ACTCACGAAC AATAATACCA GGAGCTTTTA ATTCTGCACG  1450
AACACGAGTT ACATCTTTTA ACGGACCTTT TCCATCAATA GGGTTACCTA  1500
AACCATCTAA TACACGGCCT AATACACCTT TACCTGATGG TACGTCAACA  1550
ATGGTACCTG TACGTTTTAC TAAATCTCCT TCGCTAATAG TACTGTCACT  1600
ACCGAATAAT ACAACACCTA CGTTGTCATT TTCTAAGTTT AGAGCCATAC  1650
CTTTAACACC ACTTGCAAAT TCTACCATTT CACCTGCTTG GATTTTTTTT  1700
AAACCATAAA TACGAGCAAT ACCATCACCA ACTGATAATA CTCGACCTAC  1750
CTCGTCTACA TTAATTTTTG AATATGTATT TGAAATTTTT TGTTCTAGTA  1800
ATACTGATAT TTCACTTAAA GATAATGCCA TAAATTTGGA AATCTTATGT  1850
ACTCTTTTTT CTTATAACAT TTACATATGT AACGCTTTAT CTATTATTGA  1900
TATGACGTTT TTTACTATAA TACTCTCGTT AATACTATTA ACGAGACAAA  1950
AAACAATAAC AACACCATAA TAAATTTTAT TTATTTTAAT AAATAAAATT  2000
TATTTTACCA ATAATATCAT ACATACGATA GGTCAAAACA AACTACATTT  2050
ATTTTGACCA GATACAAACT TCATTTATGT TGCAGGAAAA TAGCTAAATA  2100
AATATTATTT ATTAGTTTAC CTGGAATCAG GAGAAAAGGG ACTCGAACCC  2150
TTAACCTTTG GTTTTGGAGA CCATTATTCT ACCATTGCAA CTATTCTCCT  2200
AAGATGCACC AAGTAAAACT TGACAAATAA AATTTATATC TAATGTTACA  2250
ATCAATTTAT TAAGAATTAT AGATATTGAA GTTTTCTTCA TTTAGCCAAC  2300
TATCGGACTC GAACCGATAA CCTTCGGTTT ACAAAACCGA TACTCTGCCA  2350
ATTGAGTTAA GTAGGCTTTA TTACTTTAAG TTTTTTTTTT TTTTTTTTTT  2400
TTTATTTTTT TTTTTTTTTT TTTTGTTAGT GACAGGATGT             2440

<210> SEQ ID NO: 144
<211> 2423
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AGTGTACGAA GGAAGTAACT TCGTAAGTGT ACGAAGGAAG TAACTTCGTT    50
CAACCAAGTA AAATAAAAGA AATTTAGTTA TTTTAAGATA GATAAAAAAA   100
ACAAGCAATA AATAAAATTA TAGAGCCTAT AATTTATTTA TCCATTATAT   150
ATTTATTTAG GATAATAAGT CTCTATCTTA TATAAATATA TAAATACAAC   200
CATTAATAAA ATAGAAAAAT TATGAATATT TTAATATTTA GTATAGTATT   250
GAAAATTCTA GCCTTAACTG TACCTTTAAT CTTAAGTATT GCTTTTTTAG   300
TATTAGCTGA AAGAAAAGTA TTAGCTTCAA TGCAACGACG AAAAGGTCCT   350
AATGTTGTTG GTATTTATGG TATATTTCAA CCTATTGCAG ATGGCTTAAA   400
GTTAATAGTA AAAGAACCTG TACTTCCAAG TAGTGCAAAT TTAGTAATTT   450
TTCTTTTTGC TCCCGTATTA ACTTTTTTAT TAAGTCAAGT AGCATGGGCT   500
TCGATTCCAT TTGGTGAAGG TATAGTATTA GCTGACATAC ATGTAGGATT   550
ACTTTATGTT TTTGCTATTT CATCATTAGG AGTCTATGGT ATTATTACAG   600
CTGGATGGTC TAGTAACTCA AAATATGCCT TTTTAGGGAG TTGTAGATCT   650
GCTGCACAAA TGGTATCATA CGAAGTTTCA ATTGGATTAA TTTTAATCAC   700
TGTATGTATT TGTGCTGGGT CTCTAAATTT AACAGAAGTT GTATTAGCTC   750
AAACACAAAT ATGGTATTGT ATACCCTTAT TTCCACTATT AGTGATGTTC   800
TTTATCTCTT GTTTAGCTGA AACCAACAGG GCTCCCTTTG ATTTGCCCGA   850
GGCAGAAGCG GAGCTTGTTG CTGGGTATAA CGTTGAATAT TCTGCTATGG   900
GGTTTGCCTT ATTTTTCTTA GGAGAATATG CTAATATGAT TGTAATGAGT   950
AGTTTATGTT GTATATTTTT TTTAGGTGGT TGGTTACCTC CTTTTGATTT  1000
TGCAATATTT TACTGGATTC CTGGTATATT TTGGTTTAGT TTAAAAGTAG  1050
TGTTCTTCCT ATTTGCTTTT ATTTGGGTAC GTGCCGCTTT CCCAAGATAT  1100
CGTTATGACC AATTAATGCG TTTAGGTTGG AAAGTACTCT TACCCCTATC  1150
GCTAGCTTGG GTAATTTATG TTGCAGGTGT CTTATTATGT TTTGACTGGT  1200
TAGCTTAATA TAAATTGATT CTAATCTTTA AATCATAATA AATATGCTCT  1250
ATTTATTTAT TTCCAAACCA AACTCAATCA AATTTGCTAT TAAAATAAAC  1300
ATAATTTATT ATAATATGCT GTACTTGTTT TGTACAGGTT TGGAAATAAT  1350
TATAACATAA TAATAAATAA AATTTATTTT ATATAAGTGA AATCTATAAT  1400
TTTAGTAGTA TGTTATCAGA AAAACAATAT TTTAAACAAT GGTTAGTGGG  1450
CTTTACTGAC GGTGATGGTA CTTTTACAGT TCATAAAGCT GATCCAAAGT  1500
ATCCTCTAAA CAGAAAATTT ACATTTAAAT TATCTATTTC CATGCCAAAT  1550
AGTCAAGTGT TGCATTATAT ACAAAAGCAA TTACGTGTTG GTACAATTTC  1600
AAAACCACAT GATAATATAA TCAGTTTTTA TGTATCAAAT AGAGAAGATT  1650
TAATAAATAT CATTTTACCT AAAATAAATG AAATTTATAT TTTTGATGAA  1700
TTTTCTTTAT TGACTGTAAA ACAACACAGA TATGATAGAT TTAGAAGGTG  1750
TCTATTAATT TATGACAATC CATCACTATC GAGTGCAGAA AAGGATAGAC  1800
GAATAACAGC TATTTGGGGG GAGAAATTAC CTGATAATTA TAAATCTCAT  1850
GTTTGGATGT ACTATTTAAA TCAAGAAGAT TTTAAAGCTA AGGTTTTAAA  1900
AGGCGAAACC CGATATCATA GTGATCCTCA CTTGATAATT AGCAAGCCCT  1950
GGTTAAAATA AATTTCATTT ATTATCTGGA TTTATTTTAC TTCGTACAGC  2000
TCATGGAAGT TTTGTTTTTA AAGCCATAGG CACTAGTACC AAAAAAAGTA  2050
TAATACACTT TTTTAACCTT ACACGAAAGG AAGATCTTCA TTTGTTAGAA  2100
GCATTTCGTA AAATGTTTGG TATTAAGGCT AAAATAGCGA TTAATAAAAA  2150
AATTTATTAT AACTTAACTA CTGGAGCTTC TTCATCTATT GAACAAATTA  2200
TTAATTATTT CACTGCTCCC GACAATAGTT ATAGATATGC AATAAAAAGT  2250
TTAAAAAGCG TAGAATTTCA AATATGGGTC CGCTCTTATC AAAAATATAA  2300
AGGTGATTTC GAAAGGTTGC AAAAAATACA AAAATTTGTT CGCAAATTAC  2350
GCACTAAAAA CTATTAAATA AATTCAGAAT AGATTACATT AGTTGTACCA  2400
AGTGTACGAA GGAAGTAACT TCG                               2423

<210> SEQ ID NO: 145
<211> 2226
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AAATAGACTT TGTACTTTTA TTATTTTTAC CAATACTCGT TAGTTATTGA    50
GCTTATAAAT TAAGAAAACC CCATATTGGT AAAATAATTA TTTTTACCAA   100
TATTAGTGTA AATTAATAGC ATCGTTTAAG TAGATACAAA TAAGGATTGT   150
AAATACATAT GCTTGTAAAC AAGCAACTCC AATTTCAAGT CCTGTAAGAG   200
CAAAAACTAC AGCAAAAGGT AAAACTGATG CTACTGATAG AGCACCACCT   250
ACAGAAAGCA TAGTCCATGC AAACCCACTT AAAATTTTTA CTAAAGTATG   300
TCCAGCCATC ATATTAGCAA AAAGACGAAC ACCCAAACTA ATAGCTCGGA   350
AGCAATAAGA TACTAACTCT AAAACTACTA ATAATGGTGC TAATATTAAA   400
GGTGCGCCTT TTGGTAAAAG AAAACTAAAA AAGTGTAGGC CATGTATTTG   450
GAAACCAATA ATAGTGATTG CAATAAAGAG TGAAAGTGAT AATCCAAAAG   500
TTACAGCTAA ATGAGATGTA GCTGTAAAAC TATACGGAAT CATACCTATT   550
AAATTCGTAA ATAATAGAAA TACAAATAGG GTAAATACTA TTGGAAAGTA   600
TTTTCTTCCT TTTGCACCGA TTTGTTCTTG TATTAGACTT CTAACAAACT   650
CATAAATCAT TTCTACTAGT GATTGCCAAC GGCTTGGTAC TAAAAACCCT   700
CCATTTTGAG TTACAAAATG AAATAATAAA CCAGCTGTAC CAATAGCTAA   750
AAATAAATAT AGAGATGAAT TTGTAAAAGA AAAGTAATAT TTACCAATAT   800
GTAAAGGTAT TATACTTGAT ATTGTAAATT GTTCTAAAGG TGTAGAAATC   850
ATATGTGTTA TTTTATTGAA TATACAAAAA TTAAAGATAG ACTCTTTTTT   900
TGTATAAATA TATTTATATA TATATTTTTT ATAATTAAAA TATAAAATTA   950
CAATCTTAAT TTTATACTAT TTTAATAAAC AAAGTTTATT TGACCTTTCT  1000
TTTTTTATTT TTTTGTCGGT AATAGCCTTT TAAAACCTTA GGCAAAAAAA  1050
TAATATAAGT TATTAGCTCC AATCAAGAGC TCCTTTGCGC CATTCGTAAA  1100
CAAAACCTAT AGTTAATATT AATAAGAATA GCATCATAGA CCAAAACCCA  1150
AAAAAACCTA TTTTATTTAG AGTTAAAGCC CAAGGAAATA GAAATGCTAC  1200
TTCAAGATCA AAAATAATAA ATAGAATTGC TACCAGATAA AATTGAATAT  1250
CGAATCGGCC TCGAGCATCG TCAAAAGGAT CAAAACCACA TTCATATGCT  1300
GAGATTTTTT CAGGATCCGC TTTTCTTGTA GAAACTACAA AAGGTAATCC  1350
TAATAATAAT AGTGATAATG CTAGTGCAAT AAAAAAATAA ATAAGTATTC  1400
CTAAATATTC GTACATTTTT GTTTTTTTGT CTTTATTTTT TTGATTATTG  1450
ACATTTTTCG ACCTAAAATA CTTCGTTGTA TGAAGTTACT TCGTTGAACG  1500
AAGTTAAAAC CAAGTAAAAT AAATCGAATT TATTAAAGTC AATAATAGAG  1550
ATTGCAGTTT TTATTTACTA GATAAACAAA TAAATGAAAT TTATTACCAA  1600
TAAATGAAAT TTATTTTAAA CTGGGGAGAA AGGATTCGAA CCTTTGCATG  1650
TTGAAGTCAA AGTCCAATGC CTTACCACTT GGCTACACCC CAAAAAATAA  1700
AGTGTTCCGT TTTTTTACTC TATGTATATA AAACTCTTGT CTATATGATC  1750
GACAAGAACT TTATATATTA ATCATACACC TTTTCTTTTA AAAGAGAACA  1800
AATAATACAT ATTATATGTA TCTCCGAATA AAATAAACCA AATTTATTTC  1850
GGTTATTTTA ATGCGCAAAT TGATCTATTA CATGGACGAT AATGTTGGTG  1900
GCAGGGAAAA TGATAATATT ATTGAGCCTT TAACTAATGA GACTGTTTTT  1950
TAGTACAAAA CATATACCAA AATACGCGCT CACGAGCGAA ATAAAGCTAC  2000
TTTTTTACAT CACCTGATAA AATGTATATG CAATGGACTT TTATACTAGA  2050
ATTTGAGAGA TAGATGGGTA AAATAAATAT ATCGACTAGA AATCAATCAA  2100
TTACAGTATA TCTAATTGCA TTCTAATAAG CCAAATAAAC ATATCTTTTA  2150
AGGGTGATAT TATAAATATA TATACTTTTA TTTACATGTT TTATTTTAAC  2200
AAGCAGCTAC TATATTTACA CTATCT                            2226

<210> SEQ ID NO: 146
<211> 4434
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AGTACCGTGA GGGAAAGGTG AAAAAGAAGT GAAATAAATA CTGAAACTGG    50
ATGTATACAA GCACTCAAAG CTTTGTAACT TATTTATATT TAAATAAGCT   100
GTTTAACAAC AACACAGAGT AATGGGGTAC CTTTTGTATA ATGGGTCAAC   150
GAGTAAATAA ATGAAGCAAG CTTAAGCGAC TAAGTGGAGG CGTAGTAAAA   200
ACGCTAATAA ATTAGATTTA TTTTATAAAT TTCATTTATT TTAACAAGTG   250
TCATTTATTT GACCCGAAAC CGAGTGATCT AGTCATGAAC AGGTTGAAAA   300
AATGCTAACA CGTTTTGGAG GACCGAATTC ACGTCTGTGG CAATAGACGG   350
AAATGATTTG TGACTAGGGG TGAAAAGCCA ATCGAACTCG GAGATAGCTG   400
GTTTTCTGCG AAATTTATTT AGGTAAAGTC TAGAATATAT ATAACAAGGG   450
GGTAGAGCAA CTCGCTGGGT AAGGGTGGCC CAAAGCCTTA CTAAATCCAA   500
GGAAACTCCT AATACCTTGT TATACTCTCC TAGAATCAGA CTTTGGGCGC   550
TAAGGTCCAA AGTCGAAAGG GAAACAGCCC AGATCGAACG CTAAGGTTCC   600
AAAGCGTATA TTTAGTGGTA AAGGAGGTTA TTGAGCCCAG ACAACCAGCA   650
AGTAGGCTTA GAAGCAGCCA TCTTATAATG AAAGCGTAGT AGCTCACTGG   700
TCTATATAGT TATACATTTT TATTAATGTA TAGGCTCAAT AGCGCCGAAA   750
ATGTAACGGA GCTCAAGTAT ACCACCGAAG CGGCGGCTAT ATATAGTAAA   800
TCAAAAACTA CTATTTATAG GGTAGCAGAA CGTTCTGTAA GTAGGTTTAT   850
AATAAATACG TAAGCATTTA TTTATATATC AGAAGTGAGA ATGTTGACAT   900
GAGTAACGTA AAATCATGTG TGATTCATGA TCGCCGAAAA TCCAGTGTTT   950
TCTACAAACT AGAATTTAAT GTAGAGTTAA TCGGCCCCTA AGTGTTTCTA  1000
GCGATTGCTA TAACATAATG GGTACAAGAA TTTAAAATTT CTTGATGTAT  1050
ACTATATTAC GAATAGTACC TATTTTAAAA GATAAACGGA TTTCTTTTGA  1100
AATATACCTA TTCCAGGAAA AAAGTAGTAG CTTGTTTCTT TAATTAGAAA  1150
TAAGATTTAG ACACCGTACT TTCCGACACT GGAGGATGAG TAGAATATAC  1200
TAAGGCGTAG AGAGAACCAT ATTAAAGGAA CTCGGCAAAT TGCCTCCGTA  1250
ACTTCGGGAT AAGGAGGACC ATGTAGAAAT ACCTGTTATA GGTATTTAAG  1300
TGGTGGCACA AAATAGGGAG TAACGACTGT TTAATAAAAA CACAGGACTC  1350
TGCAAAGCGG TAACGCGACG TATAGGGTCT GACACCTGCC CGGTGCTGGA  1400
AACTTAAGTG GAGAGGTTTG ATGCTGTTTT ATTTGCAAAA ATAAAACAAA  1450
TTCCAGCTTT TAAATTAATG TCCAGTAAAC GGCGGCTGTA ACTCTAACAG  1500
TCCTAAGGTT AATGTTATAG CTCATTGTTT TATATACAAA AAAACAAAAA  1550
TTTCGCAGCC TTAGTAAAAC TATACCGTAT GCTGGAAAAC CCTCGTAGTT  1600
TATTAACTAG ATAAAAAAAT GCTCTCCATT TATAATAAAT TAAATAAATA  1650
GAAAGCAATA TTTTTATTAC AGCCTATTAG TACATATGAT TAACCATAAA  1700
AAGTATTCCA TATTGTGAAA ATCTAATAGG ACATGGGGAC AATCAGCAGG  1750
AAACGTAAAT TTAAAACCCA ACGCGAGTTT AAAACCCAAA TAAATTTTAT  1800
TTTTTAACTC AATACGGTTA TAACTTACGG ATCCTCAGAG ACTTTACGTA  1850
TAGGCATTTG AATAATAAAT ATATACATTT GATATCATGA ATTTACATGC  1900
ACAATGGATT ATAGGGTTTA TTGATGGAAA AGGATTTTTC AATATACATA  1950
CTAGAAAAAA CACACAGGAA GTTTGTGGTA TTGAAATAAT ATCAGAATTT  2000
GTTGTAGTTT TACCTCAAAA AGATATACAA GTTCTATATG CTCTAAAATC  2050
TTATTTTGGG TGTGGAGTTG TAAAAAACAA TTCTTTCGTT ATAAAAAATC  2100
GTAAACATCT ATATGATAAA ATTATACCAT TTTTTGAAAA ACATAAATTA  2150
TTAACAAAAA AACGCATTGA TTTTGAAAAA TTTCGTTTTG TTGTTAAATT  2200
TATGATTGAA AAGAAACATT TTACAGGTAA TATTGATCAA GATCTTGAGA  2250
ATTTAAAAGA TTTTAAAGAT TTGGGTGAAT TAAAAAAATG TTTACTCTAT  2300
TTTGAAAATG CAAATAAATA GTATTTATTA ACTACATTCA AATGTAAGAG  2350
CAAGTCCATC CTCTTGATTT AATAATACGA GATAGATATC AATTCGATTC  2400
GCTTTGATTT GTCTATTCAT TTTATGCACT TAAAATAAAT GAAATTTATT  2450
TTAACTGAGA TTTATTCATT TATTTCAAAT TAGTTTTACG AAAATTAGAG  2500
TTATTGCAGT AATGATTAGA ATTTAATATT ATTTTTCAAT TTTCGTACTG  2550
TGCGTGCATT TCCATGTGTT CTTTGGCCTC TAACAGGTAA ACCTTGAATA  2600
TGTCTATATC CTCTAAAACT ATGTATATTT ACTAATCGTT GTATGTTTTG  2650
TCTTGTAAAT CGTCTAACAT CACTACCTGT ATCATAGTTT TGAGTTATAA  2700
TTTGTGCTAA TAAAGAATGC TGACCTGCTG ATAACATGCC TAATCGAGTT  2750
TCTGGACTAA CACCTAATAC ATCACAAATT TGTAAACAAT GATGATGCCC  2800
TAATCCATAA ATTTGTGTGC ATGCTTGATA AATTGGTTTT TTATCATTTA  2850
AATGCGTATT TTGTATGTAT ACCATAAATA TTTTTCTATA TTTTATATTT  2900
AAATAGAAAA ACTTAAATAC TGACGTATAT TTTATATCAA TTACTATCAT  2950
GAAAATATAC GTTCTATTAT TTACGTTTTC CTGTTCGTGT CGTTATATTT  3000
TCATTAGATA AGCGAATACC CTTACCTTTA TATATACTAG GTGGTCGAGT  3050
GTTACGTATA GAATGAGCTA CTTGTGAAAC TTGATTTAGG TCCACACCAA  3100
ATAAACATAT TTCTGAAGGA CTTTGTAAAA ACACACGTAC CCCACTAGGA  3150
ACTTTATAAT GTATATCATG GCTATGTCCT AATTTCAACA CAATTGTATC  3200
TTTAATTGGA GAGCCTTGAT CTGTTTCTCC TTTCGAAGGT TGTACGGAGT  3250
AAAATAAATC AAGTTTATCA AGATAAATTT CATTTATTTT ATTTTGTACA  3300
CCAACTACAC TTTCAGAAGA AATTAAAGAT GCACGATAAC CAACTCCAAC  3350
AATATCAAGA TAGATACAAA AACCTCTAGA AACACCAATA AATTTATTTT  3400
CAATGAGTTG TTTATATAAT CCATTATATG ATGGATGGAA AGAACGTACA  3450
ATTACTTGTT TATTTCTAGT ATCTATATTT ATACAAGCTG ATCCAAATCG  3500
ATCTAATTTT TTCAGATCGA GTGTTGACGA ACCTAATGGA CCTGTTATAG  3550
TATAGACTGA ACTACTGATA TTCTTTTCCA AACTTATATT TTCTGGTATT  3600
TTTATTACGG TATCTATAAG AGATTTGTTT TTCATGATTT TTTTGTATAT  3650
TTTGTACAGT TTTACGTAGG TATTTGGAAA GCACTATATA AAAGTGTACC  3700
ATCTTTTTTA AAAAAAGAAG TTGTAGATAA TGATATTTGT ATGCCAGTAA  3750
CATTTTGAAA TAATTCATCA TATTGCTGTA ATTCAGGAAA ATAGTTAGAA  3800
TTTAGTACTA CAAAGTCTCG ATTATATGTT CTTTTATAAG CAGCTCCGTG  3850
ACCCTTTTTC GGCTCTAAAT TGACAGATTT TTTATTTTTT ACTTCTAAAC  3900
TTGACCAGTC TTTCACACGT GTAGAAACAA TAGATATATA TTTTTCTAAA  3950
AATCTATACA TTGTTTCACC TCGCAAACTA GCTTTACAAC CTAATATTTG  4000
ATTTTGTCTA AGTTTAAAAC TAGATATTGA TTTTTTTGCA CAAGTGTATT  4050
TTGGTTTTTG ACCTGTAATC ATTTCAAGAG CAGAACAGGC AGGTAAGATA  4100
GACTCTTTTT CTCCTACATA GTGTTTAGAA CTAGTATTAC ATACAATTTG  4150
ATCTAAAGAT GAACATTCCA TTATATTTTT ATAATTTTGT TTTAGACAAA  4200
GATCGTGGCA TATAATGTCT TGATAATATC GTTGTAAATT TGTATCTGTA  4250
TATAACGAGT AAAAAAGTCC ACTATTATCA TTTAGTTTGA AAGCTGCTGT  4300
TGTTTCAGTA TGATTTAATA ATAAATTATT ATTATCTTTT TTTTTGTTAT  4350
AATTTTGCAT AATAAACTTG GTTTATTTAA TTATTCTATA TAATGTTGGG  4400
TAAAATAAAT AAAATTTATT AAATGAAATT AAAA                   4434

<210> SEQ ID NO: 147
<211> 2709
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AAAAAAATTA GAGATTTTAT GAAAAAATTA AAAAAAACGT AACTGTTTTT    50
ATAATTTTAT AAATTCAATT GATTTTAACT TCGTTGTACG AAGTCGAAAC   100
CAAGTAACTA TTTTTTAATA AAAAATATTA TATCAAAATA ATTTGTTATC   150
ATTAAATATC ATTTAGTACG ATTCGATATT ATTTAATTTA CCATTTAATC   200
TTTTTACTAC AAAATAAACT TGATTTTTCA AGTTGAAGGA ATAGTCCGAA   250
CTTAATTGAA AAATTAAGAG TACAATGAGA GCAATATTAT TATATACTAT   300
TGTGTTGATT GTCAACAAAA ACATTTCTAG TACAATAAAA CTTTTATTGT   350
ACATGATTTA TTGATATCCT CCATTAGCAA GTATAGCAAG CCACTCTGGT   400
GGTAGTGTTG ACTTAGCTAT TTTCAGTTTA CATTTAGCTG GTGTTAGTAG   450
TATTTTAGGT GCTATTAACT TTATTTGTAC TGTATTTAAT ATGCGTGCAC   500
CTGGTTTATC TATGCATAGA CTTCCATTAT TTGTATGGGC CGTATTTATT   550
ACAGCTTGGT TACTATTATT ATCTTTACCC GTTCTTGCAG GGGGTATTAC   600
AATGCTGTTA ACAGACAGAA ATTTTAATAC TAGTTTCTTT GATCCAGCAG   650
GAGGTGGAGA TCCTATCTTA TACCAACACT TATTCTGGTT TTTCGGTTAA   700
CAATCAGGGA TGGCCATTTT AAAGAGTGAT CTTTAAATTA GTTTTTTCCA   750
GAATACTATT AAATTTACAA AAACACACGT TACTATTTGC TGGAACAATA   800
ATAAACTTTT TAGTACTAGG TTTATAAGCG CAACTGATAC TATACTATAG   850
ACTATCGGTT TGCCCAAACC AGTGAAAATT TAAAAAGCTT GATTCAATCA   900
GCAGGAAACG TAAGTTTAAA ACCCAACGCG AGTTTATAAC CTAACGCGAG   950
TTTATAACTT ACGGATCCTC AGAGACTATA CGTAACGAAT CTTTTAATCT  1000
CTATCTTTTA TGATTACTAA TCAATTTTTC GAATGGCTTG GTGGGTTAAT  1050
TGATGCTGAC GGGGGTTTTT ATATTTCAAA AAAAGGTTAT GGTTGTATTG  1100
AAATCACTAT GCATATAAAA GAAATACAAA CATTATATTT TATTAAAAAA  1150
AAATGTCATG GGAGTGTTTC ATTGCGTGCA GGTGCGCAAG CGGCTCGATG  1200
GCGCTTACAT AGAAAACAAC ACCTTCTTTA TATTTGTAGT GGCCTTTCAG  1250
GGCATATACG AACATTAAAT AGACAATCGC AATTTATAAA AATATGTAAA  1300
ACTTATAATT TACATTACAA AGTTCCTGAG ACTTTAAGAT ATGAAAATGC  1350
TTGGTTTTCA GGTTTTTTTT CTGGGGAGGG ATGTCTCTAT ATTAATAAAA  1400
ACACTTTTCA ATGTACAATT TCAGTATCAC AAAAAGAAAA AAATATTTTA  1450
TACAATATAC AACGAATATA TAAAGGTAAT ATTTTTTTTG ATATTTCATG  1500
GCAAGGGTGG GTTTGGCAAT TATCAAACAA GCATCATTGT AATCAAGTAC  1550
TAGATTATTT TTCTAAATAT TCAACTCAAA ATCCATATAA ACAAGCTAAA  1600
ATTAAAAGTT TTAAACGCTT TTTAATTTAT AAAGAAAAAG GATATCATTT  1650
AGACCCCCTT AAAAAAGAAA AATTAGTACA TTTTATAAAA GTATTTGAAA  1700
TAAAAGATTA AGATAGAGTC CATTGTGCTA TTATAAACAA AGCACCCTGA  1750
AGTATATATT TTAATTATAC CTGGTTTTGG TATCATTAGT CACGTAATAT  1800
CTACTTTTTC TAAAAAACCT ATTTTCGGTT ATTTAGGAAT GGTTTATGCT  1850
ATGTGTAGTA TTGGTATTTT AGGTTTTATT GTTTGGGCTC ATCATATGTA  1900
TGTTGTAGGT TTAGATATTG ACACTCGTGC CTACTTTACT GCAGCTACAA  1950
TGATTATTGC TGTACCTACT GGTATTAAAA TTTTCAGTTG GGTTGCAACA  2000
ATGTGGGGTG GTAGTATTGA ATTACGTACA CCTATGTTAT TCGCTATTGG  2050
TTTCTTATTC TTATTCACTG TTGGGGGAGT AACGGGAGTT GTTTTAGCAA  2100
ACTCTGGCTT AGATGTGGCG TTTCATGACA CTTATTACGT AGTAGCGCAC  2150
TTTCATTATG TGTTGTCGAT GGGTGCTGTA TTTGCTCTAT TTTCAGGTTT  2200
TTATTACTGG ATTGGTAAAA TTTCTGGACT ACAATATCCT GAAACTCTAG  2250
GACAAATCCA TTTTTGGATG ATGTTCTTAG GAGTTAATAT TACATTTTTT  2300
CCAATGCATT TCTTAGGATT AGCTGGAATG CCTCGTCGTA TTCCTGATTA  2350
TCCTGATGCT TATGCTGGAT GGAACGCAGT AGCGAGTTAC GGAAGCTATT  2400
TATCAATTGC TGCTGTTTTA TTCTTCTTTT ATGTTGTATA TAAAACACTT  2450
ACAAGTGATG AAGTATGCCC ACGTAACCCT TGGGAAACTA CACCTGGTGT  2500
ATCTCCAACA TTAGAATGGA TGTTACCTTC ACCGCCAGCA TTCCATACTT  2550
TTGAAGAAAT TCCAAGTATA AAATCATCTT CTTCAAACTA AAAATTTAGT  2600
GTGTTTTTAT AAAACAACAA AAATCTTTAG TTACATATTG ATGAAATTAT  2650
GATTTTTATT TACTTTAGCT ATCTTGAGAG GTAGCTAAAG TAAATAAAAA  2700
AATAATTTT                                               2709

<210> SEQ ID NO: 148
<211> 2107
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
ATAAAATAAA ATTGATTATA CTAATTCTTT TGGATTTTTA AATATTTAAA    50
AATTCTCAAT AAATAGTATA ATCAATTTTA TTTTATTATA CAATTTTTAC   100
ATAACTTTAT TATGTAAAGC TATCTAAAAC ATAGGCAATA TATACAAAAA   150
TTATATGTTA GACGGAGCAA AATTAATCGG AGCAGGTTGT GCGACAATCG   200
CACTTGCTGG TGCCGGCGCT GGTATTGGTA TTGTATTTGG ATCATTAATT   250
AATTCAGTAG CTCGTAACCC AAGCTTAACT AAACAACTAT TTGGTTATGC   300
TATTTTAGGT TTCGCGTTAA CTGAAGCGAT TGCATTATTC GCTTTAATGA   350
TGGCATTCTT AATCCTTTTC GTATTTTAAT TATTTTCTTG TTTGGAGAAA   400
CTATAATCTT ATATGATTTT CTCCAAACAA AATCTATATA TATAACAACG   450
TAGCATTGTT ATGTTGTTAT ATATATAGAT TGTTTTCAGT AGTGAGACTA   500
TACAATAAAA TACTCTTTAT TCTATACTCG TGTAAAACAA ATTCAATTGA   550
AACTTCTTAC AACCAATCGA ATTTATTCTC GATATAAAAC GAAATGTAGG   600
TATAATGAGG GCGGAATAGT TCAGTGGCTA GAACGATGGA ATCATATCCC   650
ATAAGCCGAG GGTTCGAATC CCCCTTCCGC TAAAGCATGA AGTAATGCCT   700
TCCGCTAAAG CATGAAGTAN TGCCTTCCAA ACCTTTCGAT GGGGCATGAA   750
AGTATGAGGT AAATTATAAA TTATCGAGTT AAAATACTTG GTACAACTTA   800
AAATAACGGA ATAAATTCCA TTTATTCTGT TATTTTAGAT GCATATAGTA   850
CATAAAACAA TAAAATTATG AAAATTTCTC TAAAATCTCA AAATAATTCT   900
AATCTATTAG GCACAAATTT CACACCTGTT AGTATATATA ATGAAAATGT   950
TCATGCTGCA GAAATGTATA AAAAATGTTT ATCATTTGCA TACAATGATA  1000
ATCCGTTGTA TAATAATTTT CAAGGTGTAC CAAGTAAAAT AAATGAAAAT  1050
TATTCGGAAA TAAAAATGAT ACAGAAAAAA AATTCAAAAC GATTTTCTAG  1100
GTTTGGAAAT TTGAATGTAG AAAAAAAAAC TCAACAAGTA GTTTTTTCTA  1150
CATTAACAAG AATCTATTTA AAAAAGAATA TAAATTTATT ATTTCCAATA  1200
CAAGATTTTC TTTTTGAGTG TAATAAACAT GTAAATAAAT CAACAGGTTC  1250
AAGGTCAAGA GAATATTTGG AAATTCTTTT ATGTGTTAAG AAACTAAAAA  1300
TGTTTTATGG ATATATTCCT TTAAAACAAT TACATAGAAT TTTATCTCAA  1350
GCATCAATCA TGCCTGGGTA TTTTTCAAAA AATTTCTTTT CATTAATAGA  1400
AAAAAGATTA GATGTCGCGT TATACCGTAG TGGTTTTGCA AAAACTATAG  1450
TAGCAGCAAG ACAAGCTTGT AGACATTCTA AAATTTATGT AAATTCTAAG  1500
ATTTGTCGTT TTCCTAGTAC TCTACTAAAT CCGGGAGACG TGATTTCATA  1550
TACACCTAAA AAAAGTCCAT ATACTCTATA CAACAAAACT CAATTTTCTA  1600
CTGAAAATGA TAAAATCCAA AAACCAAATT TTTTTCTTTT TGATGAAAAG  1650
ACTCAAATTC ACTTTGACTG TGCACAACAT AGATTAAACG ATTCATTAGC  1700
ACTCCTAGTA TCATTCTGTG GAAAATGTGT GTCAAATTCT ATTATACTGG  1750
ATAATAAAAA TAATAAACAA AGTTTACAAA TTAAACAGTC TAATAAAAAG  1800
TGTATAGTGC TCGATAGCAA TTTTAATCCT AACATATCTA ACGTGAAAAC  1850
TTTTGGCATA CCTCGATTTA CCGCTATTGA CAACAGTCAT TACATTGATA  1900
ATACTGGTTT TTTTTCATCA AATCAGTCCA AAATAAATAA AAAAGACAAT  1950
AGCTTAGATT ATTTTTCTTT TATGGAAACA ATGAAATCCA CTAGTAAATC  2000
TAATCTAAAA TTTGGTACAA GTTTGGAAAC AAAAAATAAT ACATATTTTT  2050
CAAAACAAAC AAAAAAAAAT TTTTTATTTC CGAATAATTT TCAAGGTTAA  2100
ATAAATG                                                 2107

<210> SEQ ID NO: 149
<211> 13,452
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TTATATTTAC TTTTTTTAAG TTTCTTGTAA ATTCTCCAAT ATCAATAAAA     50
AAATATTTTA AAGTTTTTGG TTTATATAAT TGAATTGAAT TTGAAACTAC    100
TTGTGTAATA AACCCACTCT CTCTTGTTTG TATAATATGA TTAATATCTG    150
ATACAACTCT CGGCTCTAAA CTAGTTTTAA CAAGGGCAAC TTCGGGTTTT    200
AATAAAGGAA CTGCCTGTCG TTGCATATTT GATCCCATAA GAGCACGGTT    250
TGCATCATCG TGTTCTAAAA ACGGAATAAG ACTCGTTGCT AACGAAATCA    300
TTTGTAAAAT TGAAATAGAT TGGGTTGTAA TTTGATCTAA TTTGGCATAA    350
TCAAAATCCC AGTTCTTTCG AGCGGGAAGG CTCATTTTTG GCAAACGTCC    400
CCATTTTGAA ATTAGGACAT CTGAAGGAAC TAAAACATGT GAAGATTCTT    450
TTTCTGGGTT TAAAAAAATA ACATTAAGTT TATTTTGGGT TTGTCCTTTA    500
TATACTTTAT AATAAGGAGT TAAAATAGTA CCATTAGAAG CTTTATGCCC    550
AAAAACTGTA AGTGAATGCA CCAATCCTGC ATTTTGACCT TCAGGTGTTT    600
CTATTGGGCA AAGTCTCCCA TAGTAAGTCG GGTGAATTCC TCTAATTTTT    650
ATTGTAGTTT GTTTACTATT TACCCCCCCA GGCCCTAATA ATGTTATACG    700
TCTTTTATGT GTGATTTCTG CTAAAGGATT TGTTTCATCT AAATATTGTG    750
ATAAAGTTCC TCCCAAAAAG AAACTTCGCC AAGCTCTTGT AACATATTTT    800
GATAAAATAG TTCTATTACT TTGCCAAAAA CTAGCAACAA GGGTTATTTT    850
ATTTTTTTTA TTTAATTTCA GTTTTTTCTT ATTTTTTAAT AAAAGTGAGG    900
ATTTAGTGTG AGAACTAGTT GTTTTTATTA CTTTTTTTTG TTTTTTTAAA    950
ACGCCAGATT TTACTTTTTT CTTTTTACGC GGTAATGAAA AATTTTTAAA   1000
TTTTTTATCA AGTAAAGTCT CAAATTCATT AAGTCCACGA GCTAATTCTT   1050
GATATAAAAA GTCTCCACAA CCTCTAATTT TTTTATTATT TAAATTATCT   1100
ATATCATCTA TTGTTTCTTC ACCGTATAAA ATATTAATTA TTGTTTGCGT   1150
AATTAAAATA AAATCTTGTT TTGTTAATTG TATTTGATTA AAAGCAGAAC   1200
CTGTAGTTCC TAATTTTTTT AGAAATTGAT CACGACCAGA AATACCTAGA   1250
TACTTATTCT CTGAATTCCA AACTACTTGC TGAAAAAAAA GAAGTGCTTT   1300
TTTTTCAGTT ATAGGTTGCC GCTGTTTGTT TGCAGGTGAT CGATATTCCA   1350
TAAAATGAGA ATATAAATAT TTGCAAGCTT CTACTTGAGT TACAGGATGA   1400
GATATTAAAC CAGCATCAAT TAAAATGGCA TCATGCCGTG TAAGTCTTTC   1450
ACTATTTAAT GATTTTACAG GAGGAACAAT ACTTGATAGT AAAATTTCCG   1500
AATGACCAAT AGTATTAAAA ATAGTATGAA TATCAATACC TAAAGCTTGT   1550
AAAAATACTA ACATAGATAT TTTTTTTCTA AAAATTTTTG TTGTAATCCA   1600
TAATCGATTT TTAGAATCAA AACTAATGTT TAGCCAACTA CCTTTATCAG   1650
GAACTATGCG TATGTAAGGG ATTGCTTTTA TTCCTTTAGT TGTACGTTTA   1700
TTTTCAATTA ACTTATAAAT ACCAGGAGTA CGTACTAATT GATTTAACGC   1750
AACTCGTGGA ATTCCATTAA TTAAAAAATG CCCAGGTTTT GTCATAAACG   1800
GTAAAAACCC TAAATAGATC CATTCAAAAA TAGCTTTAGA TTGACTATTA   1850
GATTTGAGTA ATATTGGTAA ATAAATAGGG CATCCGTAAG TTTTATTTAA   1900
ATTTATAGTT TCTGTTATAT TTTTTTCTGG TTTTTTAAAT TGTAAGTATT   1950
TACTTAAATA AAAAACCTCA AATGAATAAA GTGTATTTTT TATTTTTAAA   2000
GGGTTTTGAC TTTCAATTGC GGATATTAAA CCAATCTCTA AAAATTGCTT   2050
AAAACTTAAT TGTTGTATAT TTATTAAATT AGGCAATGTT GACAAAACAT   2100
TAATTTTTTG TTTTTTTTGT TTTGTTTTTA TATAAAAATC GAGTAATAAG   2150
TTTACAGTTT CATCAATAAT TATAGAATGA TTTTTTAATA CATCATTAAC   2200
AGAAGTATCA CTTATTCGTA AAGGTAACTC TTGATTTAAA AAAATATCAT   2250
TTTGATTAAA CATTTTTAAA TTTGTTTTTA AATTCATCTT TATTTTATAT   2300
TAGAAAAAAT TACTACTAAA AAGATTTTTT TTAATAATTT CTATCACGTA   2350
GTCTAACTAT AATATTTTAG AAATTAGTTC AATATATATA TTAAAATTCA	 2400               
TATATTTAAA AATTTTGAAA ACTAATAAAA AAAATTTTTA TAATACCGCG   2450
TATATTTAAA TTTTTTGTTA ACTTAAAATA TAAAATTGGC TCAAATATTA   2500
ATTTCTTTAT CTAGAAAACA AGAAAATGTT AAATTAATTA GTTGTTAGTA   2550
ACTGTTAATT AATATAAACC ATATTTTCAA TATTTTGTTG TCAATTAGTA   2600
TATATATAAG GCGTAACTTG TACAAAAAAA GTAAGAGCGA TAACTAATGT   2650
CATTTTTTAG TTGGTAGTTA AATATAATAT AGGCGGCACT CAGACTTGAA   2700
CTGAGGAAAT GAAGGATTTG CAATCCTTTG CCTTACCACT TGGCTATGCC   2750
GCCCTTCAAA TGATTCAAAA TTATTTTGAA TCATTTGAAA GAGTGTTCAG   2800
ATTTAGAATA GGTGAGTAAA CCCACCTATA GAACCCTCGT GGTTTTATCA   2850
GCTTAGAATG TTGCTCCTTT TCAGTCCTTA CATAATTTCA AGCTAGTTAC   2900
CGCAGAGTAC TAAATCATAT TTACAATACT ACTTTTAGTA ATAATAATTT   2950
GTCAAGTAAC AAAAAACATA CATTACAATA AATATTAAAA TAACCCTAGA   3000
GTTAATGAGA TATCAATTGG TAAAGCAGCA CCAATACCTA ACCAAATTGC   3050
TGCAACAGTA CCAACTAAAA ATACTGTCGT AGCAACAGGA CGGCGAAATG   3100
GATTTTGGAA TTTATTAATA TTTTCAATAA AAGGCACTGT AATTAAACCT   3150
GCCGGAACAG CAGCCATTAA AAGAACACCT AACAGTTTAT TAGGAACAGT   3200
ACGTAAAAGT TGAAAAACAG GGTAAAAATA CCATTCTGGA AGAATTTCTA   3250
ATGGAGTAGC AAATGGATTT GCTGGCTCGC CAATTGCTGC TGGGTCTAAT   3300
ACAGCTAGAC CAATAACGCC AGCAAAAGTC CCAAAAATTA CAACAGGGAA   3350
CATGTAAAAT ATATCGTTCG GCCATGCCGG TTCGCCATAG TAGTTGTGGC   3400
CCATACCTTT TGCTAATTTA GCACGTAATT GAGGGTCTGA TAAATCTGGT   3450
TTTTTTGTAA CAGCCATAAT GATTTTGGAT CTTTTTTATA ATTGTTTCTT   3500
TTGCGAGTTA TTATTTTAAA GTGGTCCAGA TATACCTTGT TTGCGAATCA   3550
TTAAAAAATG CATTAGCATA AACACGGCTG TAAAAAGAGG TAATACAAAA   3600
GTATGTAAAC TGTAAAAACG TGTTAATGTG GATTGACCAA CAGCAACACC   3650
CCCACGTAAA AGTTCAACAA CACCTGGCCC TATTACTGGA ATAGCATCTG   3700
GAACACCAGT TACAATTTTA ACAGCCCAAT ACCCAATCTG ATCCCACGGT   3750
AAAGAATAAC CCGTTACCCC AAATGAAACA GTACATACGG CCATTAATAC   3800
ACCAGTTACC CAAGTTAATT CGCGTGGTTT TTTAAAGCCA CCAGTTAGAT   3850
AAACACGAAA AATATGAAGA ATCATCATCA AAACCATCAT ACTTGCTGAC   3900
CATCTATGAA TAGAACGAAT TAACCAACCA AAATTAACTT GCGTCATTAA   3950
ATATTGAACA GAAGTAAAAG CTTCAGCTAC TGTCGGTCGA TAATAAAAAG   4000
TCATCGCAAA TCCTGTAGCT ACTTGAACTA AAAAGCAAGT AAATGTTATT   4050
CCACCAAAGC AATAAAAAAT ATTAACGTGT GGTGGTACAT ATTTACTGGA   4100
AATATCATCA GCAATTGATT GAATTTCAAG GCGTTCTTCA AACCAATCGT   4150
AAATTTTACT CATAATTTTA ATTTTTAAAA TTAAAAAATC TCTTATTTAA   4200
TTAATTTAAT AATATAATTT AGATTTTTTA AAAAATATAT TACCGCAATT   4250
AGGCTGATTT TATCTTATTA TTTATTATAC TACTTTTTTT AAAGCATAAA   4300
AAAATTCTTA TTTTTAAATG AAAAAAATCT ATTAAATATA AAATTAAATT   4350
TAAATTACTA AATATTATTT TATTTAAAAT TTTTAATTAC GTTCAGGATA   4400 
CATATTTGCC ATATTAACCA CAACTTTATC TACAATGCCA TAATTTTTTG   4450
CTTCTTGCGC AGATAAAAAA ACGTCTCGGT CTAAATCAGC TGATACTTTG   4500
TTTAAGGGTT GCCCAGTACG CTCAGCGTAA ATACGACCTA TTTGGTGTCT   4550
AATCCGTACT ACTTCTAAAG ATTCATAAAT TACTTCTGCC GCTTGTCCTT   4600
TACTTCCACC TTCTGGTTGA TGCAGCATCA CACGGGCATG AGGTAATGCA   4650
ATTCGATTTC CTCTTGCCCC ACCAGCTAAT ACAAAAGAAG CCATTGAAGA   4700
AGCAGTACCA GCACAAATTG TAGTAACTTC AGCATTTATA TAATTAATAG   4750
CATCAAAAAC AGCTATACCA CAAATTGCAG ACCCACCAGG AGAGTTAATA   4800
TAAAGAAATA CATCATTGGT TTCTTCTTCT GCATTTAAAT ACAACATAAT   4850
ACCAATCAAC TGATTAGCTA ATTCATCTTC CAAATCTTGA CACAGAAAAA   4900
GAACACGTTC TCTATATAAT CGATTATATA AATCAACCCA TTGAGCATCA   4950
AGCTCGCCTG GAAGGCGAAA TGGGACTTTT GGTACACCGA TAGGCATTTT   5000
TTTTTATTAA TTTTATTAAG TTATTTTATT CTGCTTTAAT TTTATTAATT   5050
TTTTAGAAAA ACAAAAAATT AGTTACTAAC TGTCAAAGTA ATAGAAAAAT   5100
TATCTGCAAA TTTATTACTT TTTAATTAAA AAAAAAGCAA TAAATTTGCA   5150
GATAATTTTA ATTAATATAA ATATTAACAA ATTAAAAATT AATGCAAATA   5200
TTTATATATA ATTGTATAAT TTTAATTTTG AGAGAATCCC TTTTATAAAA   5250
ATATAAGGAA TTAATTATGA CTGCAATTTT AGAAAGACGT GAAAGAGCTA   5300
GCTTTTGGGC TCGTTTTTGT GAGTGGGTAA CAAGCACTGA AAACCGCTTA   5350
TATGTTGGTT GGTTTGGTGT TCTTATGATC CCTACATTAT TAACAGCAAC   5400
TACTGTTTTT ATTATCGCTT TTATCGCTGC TCCGCCAGTA GATATTGATG   5450
GTATTCGCGA ACCAGTATCA GGTTCATTAC TATATGGAAA TAACATAATT   5500
TCTGGCGCTG TAGTACCAAC TTCAAACGCA ATTGGTTTAC ATTTTTACCC   5550
AATCTGGGAA GCTGCTTCTC TTGACGAATG GTTATATAAT GGTGGACCTT   5600
ACCAACTTAT CGTATGTCAC TTTTTCTTAG GAATTTGTTC ATACATGGGT   5650
CGTGAATGGG AATTATCTTT CCGTTTAGGT ATGCGTCCGT GGATCGCAGT   5700
TGCTTATTCA GCACCTGTTG CTGCAGCTAC TGCTGTATTC ATTATTTATC   5750
CAATTGGACA AGGTTCATTC TCTGATGGTA TGCCATTAGG TATTTCTGGT   5800
ACTTTCAATT TTATGATCGT ATTCCAAGCA GAACATAACA TTTTAATGCA   5850
CCCATTCCAT ATGCTTGGTG TTGCTGGTGT ATTTGGTGGT TCATTATTTT   5900
CTGCAATGCA TGGTTCATTA GTAACTTCTT CTTTAATTCG TGAAACAACT   5950
GAAAACGAAT CTGCTAACGT TGGTTACAAA TTCGGACAAC AAGAAGAAAC   6000
TTACAACATC GTAGCTGCTC ATGGATATTT CGGTCGTTTA ATTTTCCAAT   6050
ACGCATCTTT TAATAACTCT CGTTCTTTAC ACTTCTTCTT AGCTGCTTGG   6100
CCAGTTGTTG GTATTTGGTT TACTGCTTTA GGTATTTCAA CTATGGCATT   6150
TAACTTAAAT GGTTTCAACT TTAACCAATC AGTAGTTGAT TCACAAGGTC   6200
GTGTTATTAA CACTTGGGCT GATATTATTA ACCGTGCTAA TCTTGGTATG   6250
GAAGTTATGC ATGAACGTAA TGCACACAAT TTCCCACTAG ATTTAGCAGT   6300
AGTTGATGCT CCGGCAATCA ATGGCTAGTA TTTAAACTCT AATCTTTTTT   6350
TTTAGCGGGG CAACCCGCTA AAAAAATAAA AAATTAAAAG AATTAAATTT   6400
TTTTTTAATT AAAAAAAATT ACCAACTTGA CTTTTTAACT CCGGGTAATA   6450
ATCCTTTTAG TGCATATTCT CGTAATGTAT TACGAGATAA ATTAAAATTT   6500
CTATAAAATC CTTTAGGTCT TCCAGTAATA TTACAGCGAT TATGTAATCG   6550
ACTTGGGGCA CTATTTCTAG GTAATTGTTG CAATTTTATT TGTAACTGAA   6600
ATTTTTCTTC CATTGAGTTT GCTTGTTTTA GCGCGCTTTT TATATCTTGA   6650
CGTTTTTTTA AATATTTTTG CACTAACGCA AAACGTGTTT TTTCACGCTC   6700
AATCATACTT TTTTTTGACA TTTTGGTAAT CCTTATAATA TTTGATATAA   6750
TATATAATAT ATAAAATTTA ATTTTTAAAA ATATTAGAAT ATATATATAT   6800
ATTAAGAAGA ATGCGGTATT AATTTCAATA TTCAAATATT GAAATTATAA   6850
CAATATTTTG TATAAATAAT ATTTATATGC TATTATAAAA ATAAATAAAA   6900
AATTTTTATT TATTACAATA AAATTTAACT AATATTTCGG GTAGATTTCT   6950
AATCGTAAAA TAAAATAATA ATTTATTAAA AATGGCTCCA CAAACAGAAA   7000
CAAAAACCAG TTCAGGTTTT AAAGCAGGTG TTAAAGATTA TCGTTTAACT   7050
TATTATACTC CTGATTACCA AACAAAAGAT ACGGATATTT TAGCAGCTTT   7100
CCGTATGACT CCTCAACCAG GCGTTCCACC AGAAGAATGT GGTGCAGCTG   7150
TTGCAGCAGA ATCTTCAACA GGTACTTGGA CGACTGTTTG GACTGATGGT   7200
TTAACTAGTT TAGACCGTTA TAAAGGACGT TGCTATGATC TTGAACCAGT   7250
AGCAGGCGAA GAAAATCAAT ATATTGCATA TGTTGCATAT CCAATTGATC   7300
TTTTTGAAGA AGGCTCTGTA ACAAACTTAT TTACATCAAT TGTAGGTAAC   7350
GTTTTTGGAT TTAAAGCACT TCGTGCATTA CGTCTAGAAG ATCTTCGTAT   7400
TCCACCAGCA TATGTTAAAA CGTTCCAAGG ACCTCCACAT GGTATCCAAG   7450
TTGAACGTGA TAAACTTAAC AAATATGGTC GTTCATTATT AGGTTGTACA   7500
ATTAAACCAA AACTAGGTCT TTCTGCTAAA AATTACGGCC GTGCTGTATA   7550
CGAATGTTTA CGTGGTGGTC TTGATTTTAC AAAAGATGAT GAAAACGTAA   7600
ACTCACAACC TTTTATGCGT TGGAGAGACC GTTTCTTATT TGTTGCTGAA   7650
GCTATTTATA AATCTCAAGC TGAGACTGGT GAAATTAAAG GTCACTATTT   7700
AAATGCAACA GCAGCTACAA ATGAAGAAGT ACTAAAACGT GCAGAATTAG   7750
CTAAAGATCT TGGTGTACCA ATCATTATGC ACGATTACCT TACTGGTGGT   7800
TTTACTACAA ATACTACTTT AGCGCATTAT TGTCGTGATG AGGGTTTATT   7850
ACTACACATT CACCGTGCAA TGCACGCTGT AATTGACCGT CAAAAAAATC   7900
ATGGTATGCA CTTCCGTGTT TTAGCAAAAG CTCTTCGTTT ATCTGGGGGT   7950
GACCATTTAC ACTCAGGTAC TGTTGTAGGT AAACTAGAAG GTGAACGTGA   8000
AGTTACTTTA GGTTTCGTTG ATTTAATGCG TGATGATTAC ATCGAAAAAG   8050
ATCGTAGTCG TGGTGTTTAC TTTACTCAAG ACTGGGTATC ATTACCAGGT   8100
GTAATGCCAG TAGCTTCTGG TGGTATTCAC GTATGGCACA TGCCAGCTCT   8150
TGTAGAAATT TTTGGTGATG ATGCTTGTTT ACAATTTGGT GGTGGTACTT   8200
TAGGACACCC TTGGGGTAAT GCACCTGGTG CTGCTGCAAA CCGTGTAGCA   8250
CTTGAAGCAT GTGTACAAGC TCGTAACGAA GGTCGTGACC TTGCTCGTGA   8300
AGGTGGCGAT GTTATTCGCG CTGCTTGTAA ATGGAGCCCT GAATTAGCTG   8350
CAGCTTGTGA AGTATGGAAA GAAATCAAAT TTGAATTTGA TACTGTAGAT   8400
ACTCTATAAG TTTGTAACTT TCACTTTAAA TTACTCTATT TAAAAAATAG   8450
AGTAATTTAA AGTAAAAGCC TTTGTGACGA AATGGTATAC GTGTCAGTTT   8500
TAGGAACTGA TGTCTTTGAC GTGTCGGTTC GAGTCCGACC AAAGGCAATT   8550
AAATATTTGC ATCCAGTTGG GTTTGAACCA ACGGCGCCTT AAAAAGAACC   8600
GCATTATGAG TGCGGTGCGA TCGACCACTC TGCCATGGAT GCATTTAAAA   8650
AAAAAACTAT TACTTTTTAT TTAATATTCA AAGTATTATT CAAAATATAA   8700
GAACTAAAAA TTTTTTTTAT TACAGTATAA TTACTAATTT TGTTATAATT   8750
TATATAAACG CACCGTAATT AATTATTATA AATTATAACA AAATTAGAGT   8800
AAATTATACT GTAATTTTTT TAAATTTACT GTATGTTTTT TATACGGGAC   8850
TGACGGGCTT CGAACCCGCA ACCTCCGCCG TGACAGGGCG GCGCTCTGAG   8900
CCAATTGAAC TACAGCCCCC GTAAAATTTA TTTATAAATA ATATATATAT   8950
ATATTATTTC TTTGTCAAGT ACTTGATTTC ATAAATAATA CATATTTTTT   9000
CGTAAACTAA CATCATTTAT AATTAACTTT TATAAGTGTG CCTACTACTG   9050
GATTTGAACC AGTGACATTC CGCGTATGAG ACGGATGCTC TAACCAGCTG   9100
AGCTAAGTAG GCCTCTCATA TAATTAATTA TATAAAAATG CATTGATATG   9150
TGTTAATATA CAAGACATAT CAATGCATTT TTATTATTTG CTTGATGCTA   9200
AAAACTCTTC AGTGAAATCT TTTAAACTAT TTTTCAAAAT TTCTTCTGCT   9250
TCTGAGGTAA AAGTATTACT AGATTTTATA ATTTCACCAT ATTTTGGTTG   9300
ATTATTAACT AAATATTGTC TTAAACCTAT TAGAAAGTCT GATACACGAT   9350
CAACTGGGAT ATCATCTAAA TAACCATTAG TACCAGCAAA AATACTTGAT   9400
ACTTGGTCTT CTAGATTTAA TGGAGAAGAT TGTGTTTGTT TTAATAATTC   9450
CCGTAAACGT TGTCCTCGTG CTAACTGATT TTGAGTCGCT TGGTCTAAAT   9500
CAGAAGCAAA TTGAGAAAAA GCTTCTAATT CTGCGAATTG AGCTAATTCT   9550
AATTTTAATT TCCCAGCAAC TTGTTTCATT GCTTTTGGTT GTGCTGCAGA   9600
ACCAACACGG GAAACAGAAA TTCCAACATT AATCGCAGGT CTTACATTAG   9650
CATTAAAGAT ATCTGCTGAT AAGAAAATTT GACCATCAGT GATCGATATA   9700
ACATTAGTTG GTATGTAAGC TGAAACGTCA CCTTCCTGTG TTTCTACTAC   9750
TGGAAGTGCT GTCATACTTC CACCACCTAA CTTATCACTA AGTTTAGCAG   9800
CTCGTTCTAA AAGACGTGAA TGTAGATAAA AAACATCTCC CGGATAAGCT   9850
TCACGACCCG GAGGTCGTCT TAATAATAAA GACATTTCGC GGTATGCTTG   9900
TGCTTGTTTT GTTAAATCAT CATAAATAAC AAGAGTATGC TGTCCATTAT   9950
ACATAAAGAA CTCAGCTAAA GCTGCGCCAG TATAAGGAGC TAAATATTGT  10000
AATGTTGCTG GTGAATTAGC AGTAGCTGCA ACAATTATTG TATATTCAAG  10050
AGCACCTTTT TCTTGTAATG TATTTACTAC TTGAGCAATA GAAGAAGCTT  10100
TTTGACCAAT TGCAACATAA ACACAAACAA CGTTTTTTCC TTTTTGATTA  10150
AGAATAGTGT CTACGCCGAT AGCTGTTTTA CCAGTTTGTC GGTCCCCAAT  10200
AATTAATTCA CGTTGTCCTC GACCAATAGG AATCATTGAA TCAATAGATA  10250
AAATTCCAGT TTGTAATGGT TCATGTACTG ATCGTCGAGA AATAATACCA  10300
GGAGCAGAGG ATTCAATTAA TCTGTTTGTA TTACTTTGAA TTTCACCTTT  10350
CCCATCAATA GGGCGAGCTA AAGGGTTCAC TACGCGGCCT AAAAAAGCTT  10400
CCCCTACAGG AATCTGAGCG ATTTTTCCTG TGGCACGTAC AGAACTACCT  10450
TCTTGTACAT TTGTACCGTC TCCCATAAGA ACAGCCCCAA CATTTTTTGC  10500
CTCTAAGTTT AATGCAATAC CCACAGTACC GTCTTGAAAC TCAAGTAATT  10550
CACCAGCCAT TACATCATCA AGACCATAGA TACGAGCGAT TCCATCCCCA  10600
ACTTGAAAAA CAGTTCCAAC ATTAACTGTT TTAATTTCTT GTTGGTATTG  10650
TTGGATTTGT TGTTTAATAA TACTACTAAT TTCATCTGGT TTGATTTTTA  10700
CCATAATAAG ATGAACATCC TCCTGCCAAT TATCCTTAAA AAAATAGACT  10750
TTATTTTTTA ATTATTTTTC TAGAATCACA AAATAGTTTT AAATGTGATA  10800
CTAATAATTA AAAAATATAA AATCTAAAAA AATGCTAGAA TTAATTTTTT  10850
TTATAATTAG TAAAAAGTAC AATTTGAAAG TTATTTAAAA TACTATGATT  10900
AGACGAATTT AACCGTAATT TTATTTTTTT TTTAACTTGA TTTAATGATA  10950
ATTCAATTAA TTTTTCAGCT ATTTGATTTT GAACTTTTTG TTGTTCTAAT  11000
TTATAGCTTT CTTGTTGAAA AACCTTCAAT CTATTTAAAT CTTCTGTAAT  11050
TTCGCGATTA AATTTTTCTT TTTCGTTTTC TATAGTAATT GAAGCTTGAT  11100
TTCTAATTTT GGAAGCTTTT TGTTTTGCTT CTTCAAATTG TATTTGAGCT  11150
TGGTTCACAC GTTCTTTAGC TTGTAAAGCA CGCCGATCTG CTTCTTGAAA  11200
ATTGGTAACA ATTGTTTCTT TTCTATTTGC AAGAAGGCCT TTTAGAGCGT  11250
CTCCGACGTA AATAACCACA ACAGCTAATA CTACAGCTAA ATTTAATACA  11300
TTGGTTTCAA TTATATTTGT ATTAAAACTA AAATGATCCG CTAAAATAAA  11350
AAGCGAGTTA GCATTCATAA TAAATTTTTT TATTTAATTT TGTTATGATA  11400
CAAAAGGGTT TGCAAATAAA AGAGCTAAAG CAACAACCAG ACCATAAATT  11450
GTTAACGATT CCATAAAAGC AAAACTTAAA AGTAAAGCAC CTCTAATTTT  11500
ACCTTCTGCT TCTGGTTGGC GAGCAATACC TTCTACTGCA TAACCAGCAG  11550
CAGTTCCTTG TCCAATACCA GGACCAACAG CAGCAAGACC AACAGCAAGA  11600
CCAGCAGCAA CAACGCTAGC AGCAGCAACA AGTGGATCCA TAAGATTATT  11650
TCCTCCAATA AAGTTTTTTA CTTGTGTAAA ACATTTATGA TTTACTTAGA  11700
ATTATTTTAA CATTTTTTTT ATAAAAAAAA CAATTTTAAA AATTAATTAA  11750
TTTTATTAAA TACTAAAAAA AATTAATTAA TAAAAAAAAA AATTAAAAAT  11800
TATTTTTAAT GGTGATCTTC TAACGATTCA CCAATATATG CCCCGGCTAA  11850
TGTAGCAAAA ACCAAAGCTT GGATGGCACT TGTGAATAAA CCAAGCAACA  11900
TTATTGGAAT TGGAACTACC AAAGGTACTA AAGTAATTAA TACACCTACA  11950
ACTAATTCAT CAGCTAAAAT ATTACCAAAA AGTCGAAAAC TCAAAGAAAG  12000
AGGTTTTGTA AAATCTTCTA AAATATTAAT TGGTAATAAA AATGCCGCTG  12050
GAGATATATA ACGTTTAAAA TACGAGAATC CTTTTTTACT AATACCTGCA  12100
TAAAAATACG CAAGTGATGT TAAAAGAGCT AAAGCAACTG TAGTATTAAT  12150
ATCATTTGTA GGAGCCGCTA ATTCCCCATT TGGGAGTTCT AAAATACGCC  12200
AGGGTAATAA AGCACCTGAC CAGTTAGATA CAAAAACAAA TAAAAATATT  12250
GTTCCTAAAA AAGGAACCCA TTTTATATAA TCTTTTTCTC CAATTTGGGT  12300
TTTTGCTAAA TCTCTAATAA ATTCAGTAAC ATATTCCGTT AGATTTTGTA  12350 
AATTTTCTGG GATAGTTTTT AAATTTTGAT TAGCTATTAA TGCTATTCCA  12400
ATAATAAGAC TTAATACAAC CCATGATGTT ATTAGTACTT GACCATGAAC  12450
AAAATAGGGT CCTAAGTTCC AATACCAATG TTGACCAACA GAAACTTCCG  12500
CTAAATCAAA CAAAACGTTT GTCATAAATA CTATTTCCTC TATAAAAATA  12550
TAAATTCAAG TGGCTTTTAT TAAAAAGAGT TTTAATTATA GCTTTCTTAT  12600
ATTTACTATA CATTTACTTT ATAAAACATA AAAATTTTTA TGTTTTTTTT  12650
GATTTAATTC GACTTGATTT TTTTATATTT TGTTTAACAT TTAATAATTG  12700
TTGGTTCTCA TTATTAATTT TTTCCCCAGA AAGTATTGCG TCTTGAAAGT  12750
TAGAAAGTAA GAATTTAACC GTTGCCGCAG AATCATCATT AGCTGGAATA  12800
TACCAATCTG CTAAATAAGG GTCACAGTTA CTATCAAGAA GGGTCACTGT  12850
AGTAATACCT AATTGTCGAC ATTCTTTAAC AGCATTTAAT TCTTCAGATT  12900
GCCCTATGAT TATTACAACA TCAGGCAATT TTTTCATATT TTCTAGCCCA  12950
GCTAAATTTT TTTCTAAACG TAATTTTTGG CGTTTCTTTT TTGCAATTTC  13000
TTTTTTCTTT AATAAATCCC ATTCACCTGA TTCTTCAGCT TTTCTAAAAG  13050
ATTGTAATTG TTGTAGAGAC TGTTGAATCG TTTTCCAATT TGTTAGCATT  13100
CCACCTAGCC AACGTTGATT AACGTAATAG CTATTACACG CAATAGCTGT  13150
TTTTTCAATT AAAGGGGCTG CTTGTTTTTT CGTGCCAACG AATAAGAATT  13200
TTTTTCCTTG AGCAGCACTT TTTTCTAAAA ACCCTAATAA GTCATTTAAA  13250
AGATAATATG TTTGCACTAA ATCTAAAATA TGCCGCCCAT ATCGTTCGCC  13300
ATAAATAAAC GGTTTCATTC GCGGATTCCA TTCAGAAGTA AAATGCCCAA  13350
AATGCATTCC ACTTTGTACC ATATCCTCTA ATTTTATTAT TTTATTATTA  13400
TTTTTAAACA TAATATTTAA TTTTTTTTTA TTACTAATTA TTTATGTTGT  13450 
TT                                                      13452

<210> SEQ ID NO: 150
<211> 9794
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AAAATTATCT AATTTTAACA TTATTAATCA TTTTATTCTA GCATATTATG    50
GTAAGTAGCA ACAGTAGAAG GAGATATTCT ATTTAAAGAT CTAAAGACCC   100
AATATTTAAA GATTGTATCT AAAAAAACTG GAAATGTTGA TACAAATAGC   150
GATAAAAATA GACTACTCTC AGAAAAGCCA AAATGTATAA AAATCCACTT   200
TAAAAAAATA TTCCAACCTC TTGGTGAATG AAAACCAACA AGTAAATCTG   250
TCCCTAAAAT TAATAAAAAG GATTTTGTAG TATCACTAAA ACTATAGAGA   300
GATTCCGCCA AAAATGATTG CAAAATAATC ATTTGAGGTT TCATCGAAGA   350
GAATATAATA AATATACTGA AAATACTGCA AAAATCTGCA ATTAAATTAG   400
TAATTGCTCT AATACTTTGC TCATTATAAG TTTGTGTAAT TTCAAAAATT   450
TTTTTTTGTA ATGATTGACC TATGTTTTTG TTTGTGTCAA AAAAATCATC   500
ATCAACAAGT GAATCAAAAT AAATAATTTC ATGAGCAAAA TTAAATTCTT   550
TTAATGCACG CTTTTTTTGA TCAGAATTCA AAAAAATCTC ATTACCGGGA   600
CGATTCCAGT AATGTTCTAC AATAGGTCTA ATTAGCAAAA ATTTCAAACT   650
TATGTTTACT AAAGGTGGAA CAAGAAATAA AATTAATAAA GATTGAACAG   700
ATACAATAAG TTGATAGCGC GATATTCTAT ATTCTTCAAT AGCTAGTACT   750
TCTACACCAG GCAATAATTG TTTTTTTAAT TTATCTAAAG TACGAATAAT   800
GGATCTTGGT ATTAATCCAA TTCGTTCTAC AGCTTCATAT CTCATCTTTT   850
TTTAATAGAT TTTATAATAT TTTAACAAAA AATTGTTGTA TTTTTTTATT   900
ATTAAATCAA AATTTTATAT TAAGAAGCTA ATTCAGAAAT TTCTAAAGAC   950
ACTTTTAATA ATTTAGCTAT ATCAGTAGCT TGTGTTTCAA TTTCTTGTAT  1000
ACTCATTGGT TGCCCAGCCC CAATTAAAGG AATTTCACGA TTACCTCGTA  1050
ATTTTAAAAA AATAGCCCGT TTTGTATTAA TCCCTTCTTT TATTTCAACT  1100
CTAATACTTT GAACATCTTT AATCGGGTAT CTTAAGTCAA TACGTCTATT  1150
ATTTCCCGGA AAGCCCCACC TAAAAACACG AACCTCATCT GTTTTTAAAT  1200
TAAATTCGTT ATACCCGTGA CCTAAATTCC AAATAATAGT TAACCAAATA  1250
TAAATTCCTA ATAATAAACC TATGAGTCCA TAAAAACACA TAACTAACCC  1300
TTGCGGAATA AATTGAATAT TAGGTGCTTT AAATATAAAT AATTGATCAT  1350
TTATGTTTAA AGCACTTAAG CTTCCTACTG CTAAAAATCC TATTCCACCG  1400
ATAAACATTA AAGATCCCCA AAAATAATTA CTAAATCGCC GAGCGCCAGA  1450
AACAGTATAA TAGAGACTTT CTTTTTGTGT TGTCATTTCT TTTTAATTTT  1500
GTTAATGTAA CTTAGCACGC GTTTGTTTAT TAAGAAGATT TCGCAACCCA  1550
AATTGGTGGT AATTTATCAA ATTTTTCTTT TAATCGAGTT GCTTTACCTA  1600
TTCGATTACG AAGATAATAA AGTTTTGCAC GTGATACTTG AGATCGACGT  1650
AAAATATGAA TACTTGTAAC GCAAGAAGCA TGTATCGGAA ATACGCGCTC  1700
AACACCAATA CCTTGTAAAA TTTTTCTTAC AGTAACTGTG GTATTTAAAC  1750
CTGCTTTATG TAGAGCAATT ACTGTTCCTT CAAAAGGTTG TACACGTTGT  1800
TTTCCGGATT CTTGAATAGA AATACCAAGA CGGATAAAAT CGCCGACTTT  1850
TATCTCAGGT AATTGGTTTT TTGAAAATTT ATATTCAATC TTTTTTATAA  1900
TATTTTGAAA ATTGTTCATA TATATTTTTT TTTATCAATA AAACTCAATA  1950
TATATTTATT TTTAAATATA TATTGAGTTT TATTTTATTG TAAAATTTTT  2000
ACTACAACAC CAGCACCTAC TGTTTTACCA CCTTCACGAA TAGCAAATCT  2050
CATATTGCGT TCAATAGCAA TAGGTTGAAT TAATTCTACT GTCATTTGAA  2100
TACGGTCGCC AGGTAATACC ATTTTAAGAG GCTCACCAGC GTCAGTAGAA  2150
AAAGATGCGA TTTTACCAGT TACGTCTGTT GTTCTTACGT AAAATTGTGG  2200
TCGATATCCT TTAAAAAAAG GTGTATCACG TCCACCTTCG TCTTTGGTTA  2250
AAACGTAAAC TTGTGCTTCA AAACTAGTAT GTGGTTTAAT ACTTTTTGGT  2300
TTTGCTAAAA CCATTCCACG TTGGATATCA GCTTTTTGAA TACCACGAAG  2350
TAAAATACCA ACATTATCCC CCGCCATACT TTCATCAAGT GTTTTTTGAA  2400
ACATTTCTAA ACCAGTTACA GTAGTTACTT TAGTACTACC AAAACCAATA  2450
ATTTCTACAG TATCCCCGAT TTTAACAACA CCACGCTCTA CGCGACCTGT  2500
AGCAACAGTA CCACGACCAG TAATAGAAAA AACGTCTTCA ATTGCCATTA  2550
AAAATGGTTT TTCTACGTTA CGTTTTGGAG TTGGTATATA AGAATCAACC  2600
GTATCCATTA ATTTATAAAT TTTATCAACC CATTTGTTCT CACCAGGTTT  2650
AATATTTGGA TTTTCTGTTA ACGCAGATAA TGCCATAAGT GCAGATCCAG  2700
GAAGCATCGG TACTTCATCA CCAGGAAAAT CGTATCTTTG TAATGTTTCA  2750
CGAACCTCTA ATTCTACTAA TTCAATTAAT TCAATATCGT CTACTTGGTC  2800
TTCTTTATTG ATAAATACTA CAATATTAGG AACACCTACT TGTTTTGCTA  2850
AAAGAATGTG TTCTTTAGTT TGTGGCATAG GGCCATCAGC ACCAGATACA  2900
ACAAGAATAG CACCATCCAT TTGAGCAGCA CCAGTAATCA TGTTTTTTAC  2950
ATAATCCGCA TGTCCAGGGC AGTCTACATG AGCGTAATGG CGATTTTCTG  3000
TTTCATACTC TACATGAGCT GTATTGATAG TAATACCACG AGCTTTTTCT  3050
TCAGGTGCAG AATCAATATC GTCATATTTT TTACCTTTTC CACCATCACG  3100
TGCAGCTAAT GCCATAGTAA TTGCTGCAGT TAATGTTGTT TTACCATGAT  3150
CCACATGACC AATAGTACCA ATATTAACGT GTGGTTTTTT ACGTTCAAAT  3200
TTTGTGCGTG CCATAAAAAA GTCCTTTTTT AGTTATGTTG TATACAAATT  3250
TTAAAAATTT AACAATTTAA AGAAATAGAT TATTAATCCT GATTAACAAA  3300
TAATTATAAT AAAACCTTCA AATTTTTTCA AGTCTAAAAA CGTTGTTTTG  3350
CAAACGCTTT ATTGGCTTCT GCCATTCTAT GGATTTCATC TTTTTTTTTC  3400
ACAGCATTAC CTGCTTTATT AGAAGCATCA ATTAACTCAT TGGTTAATAT  3450
AACGGACATT GGTTTCCCGT GATTATTACG ACAAGCTGTT AATATCCAAC  3500
GAATAGCTAA AGTAGTTCCC CGTTCAGGAA GAACTTCAAT AGGAACTTGA  3550
TATGTCGAGC CCCCTACCCG TCGAGCTTTT ACTTCAATAA GTGGTGTTGT  3600
GTTTCGAATT GCTTGTTCAA GTATTACAAG TGGATCTTTT TTTGTAATTT  3650
CAGCAATTTT ATTCATGCTT TCATACATAA TTCTATATGC TAATAATTTT  3700
TTACCGTTTC GCATAACCTG TCGTACTAAA AGTTCTAAAA GACTACTTTT  3750
ATAAAGAGGA TCAGGCATTA CAGTACGTTT TTTTGCTGTA CGTCGACGTG  3800
ACATAAATAG TTAAAGAATA TATTTAGTAA AGATTAAATT ATAATTTTCA  3850
ATAGAATCAA AAAAACTCTA TTGAAAATTA TATTAAAATA GATTTAGTTT  3900
TAAATTTTTT ATTTTGGACG TTTTACACCA TATTTTGAAC GACCTTGGTT  3950
ACGATTTTTT ACTCCAAGTG AGTCCAAAGT ACCACGAACA ATATGATAGC  4000
GGACACCGGG TAAATCTTTT ACACGGCCTC CTCTAACTAA AACGACTGAA  4050
TGTTCTTGCA AATTGTGGCC AATACCTGGG ATATAAGCAG TAACTTCAAA  4100
TTTTGAAGAT AACCGAACAC GTGCTACTTT TCGTAAAGCG GAATTTGGTT  4150
TTTTTGGAGT ATTTGTATAA ACTCGAGTGC ATACACCTCT TCTTTGTGGA  4200
CATGCTTGTA ACGCCGGCGA CTTTGTTTTT TTAAACATAC GCTTACGTGC  4250
TGATCTTACT AATTGTTGAA TAGTTGGCAT AAATTCCTAA AAAGTAAAAA  4300
TTAAAGAATC AGGGAAAAAT CGATTAATTT CAATTAATAA TCCCGCAACA  4350
ACTGTTAGAC TTGCTAAAGC AAGAACTGGT GCAGTTGATA AATAAGTTTT  4400
GAAGTTTTGC ATATAATAAT TTGTTTTTTT TTAATTTTTA AAATATTTTA  4450
TTAACCTTTT TTAAATTCTA ATACCTAGAA ATTACTATTT TTAAAATATA  4500
TTAAGATCNN NAAAAAAAGT AAAAAATAAA TTTTATATTA TAAAAAAATA  4550
GATAAAATTT ATTTATTTTT AGTATATAAG TTTTAGTTTT TTACTAAAAT  4600
CCATTAAAAT TGAAAAATGG AAGATTACTT TATATCATAA AAATAAAATA  4650
TAAGTTTCGG GATGTAGCGC AGTTTGGTAG CGTGCCTGTT TTGGGAACAG  4700
GGGGTCGCAG GTTCAAATCC TGTCATCCCG AGAACGCTTT TAGTTCAGTT  4750
GGTAGAACGC AGGTTTCCAA AACCTGATGC CGTGGGTTCA AGTCCTACAA  4800
AGCGTGATTT AATAATTTTA AATAAATTTT TTTTTCTACA AAATTATTAC  4850
TTTTTACATT TATAGTATGT CATTTTTTTA ATAAAAATCA AATTTTTTTT  4900
GTTTTTAATT TTCTACTAAA TAACGATTAA CAAATGGCAG TAACCCACTC  4950
ATACGTCCTC TTTTTATAGC TGTTGCTACA GTACGCTGTT GTTTTGAGGT  5000
TAAGCCAGTA ATTTTTTTAG ACAAAATTTT GCCTTGTGGG CTAATGAAAT  5050
TCATTAAAAG TTTTGTATTT TTATAATCAA TAGTATAACG AGAATTAGAC  5100
TCATTTAAAT TTAATTTTTT TTTTATTTGT AAATAAAAAT TTTTGCTATT  5150
AGTATTTTTA AATGATTTCA TTTTTTATAT TTAGAATAAA TAGTTATAAT  5200
TTAAATTTTG ATAAATTAST NNATNNNNAN TAAAATCTAG TAATTTATCA  5250
AAAGATTCTT TATCTTGAAG TGCTATTTGA CTGCAAATTT TTCGATTAAG  5300
AAGAATTTTA GAGGTTTTAA GCTGATTTTG AAAGTTACTA TAATTTAATC  5350
CTGATAAACG AACTGCAGCA TTTATACGTG AAACCCAAAT TTTAACAAAT  5400
TCACGTTTTT TAATTTTTCT ATCTTTATAA GAATTTGTTA ATGCTTTTAT  5450
TGTTCTTTGT TGGGCGGTTC TATATAATTT AGACGATGAA CCACGAAAAC  5500
CTTTTGTTAA ATCTAATACT TCTTTTCGTC TTTTATGGGC AATATTACCA  5550
CGTTTTACTC GAGTCATAAA TAAAATTAAA GTTTTTTTAT ATACTAATTT  5600
GATAATTATT CAAATTAAAA CGAAATCTTA ATTAAATAAT TTTTTTAATT  5650
ATAGAGTAAT TTTTTTNANT NNANTNNNAA NAGTTACTCT TAAATAACAG  5700
AAAATTTAAT TCATATTATT TAAATTTTTT TCCAATTATT TGAAAAAAGA  5750
CAACATTAAA TGGTAACTAT TAGTATTAAA TATTTTAATT TTTTTTTTAG  5800
TTGTCTACCC CCAGGGGAAT TCGAATCCCC GTTTCCTCCG TGAAAGGGAA  5850
ATGTCCTAGG CCTCTAGACG ATGGGGGCTG GTTTGGAATT GTTTAATGAA  5900
AAATTTAGTT TAAGGAGTGG GAATAGGATT TGAACCTATG ACCTTAAGGT  5950
TATGAGCCTT ACGAGCTACC AGACTGCTCT ACCCCACTTA CATTATTAAT  6000
CATTGATTTT ATAATATTTA ATACTACTGA AAAGTGTCAA GAAAATAAAA  6050
TAATTGAATT TTATTTATAA AATTTTTATA ATAAAATTAT AATATAAAAA  6100
ACAAAATAAT TATATATTGG TTATAATCTT AGGGTTCAAT TCTATCAATT  6150
TAATGCGGAT ATAGCTCAGT GGTAGAGTGG CACCTTGCCA AGGTGCTTGT  6200
CGCGCGTTCG AATCGCGTTA TCCGCTATAA TTTGATACTT ATTAGGAAAG  6250
GTGGCAGAGC GGTTCAATGC ATCGGTTTTG AAAACCGACG TAGTGTTAAA  6300
GCTACCGGGG GTTCGAATCC CTCCCTTTCC GTTACTTCGA GTTTTAATAA  6350
TATAATTATA TTAAAATTTT TATCATTTTT TTATTTTAGA GTNGNNNNNN  6400
NNNAGGATAA ATTAAAAAAT TATTTTTTAT TTAAATTTTA TTGAATCGGA  6450
TTTAAATATA TAATACTTAA CAAATAAAAA GGATATATTT TAATGGAAGT  6500
AAATATTCTT GGTTTAATTG CAACTGCTCT ATTTATTATT ATTCCTACTG  6550
CATTTCTTTT AATTCTTTAC GTAAAAACAG AAAGTAGTCA GTCAGCAAAT  6600
TAATTTTTAT TTTTGGGCCG AGCTGGATTT GAACCAGCGT AGGCAAAGCC  6650
AGCGGATTTA CAATCCGCCC CCATTAACCA CTCGGGCATC GACCCAGTTT  6700
GAACCTTTAT ATTTTATTAC GGAAATTTAA CAAAATTTTT GTATATTTAT  6750
ATTTTTATAA TATTGTATTC GTCTCTAGAA AAAACTGCAA TATTTAATTT  6800
TAAAATTTTA CTAAAATTAA AAAAATTATT ACTATATTGT ATATATATAT  6850
GCAATTAAAT AATAATTTTT TTATTGGGGA TATGGCGGAA TCGGTAGACG  6900
CAACGGACTT AAAGGTACTA TTTATATTTA TATATAAATT GAGCCTATTA  6950
GAGGAAACTT TAATAGTGAA TGCTCTCAAA TTCAGGGAAA TCTTTAAATA  7000
AAAAAGACAA TCCTGAGCCA ATTTGCTTTA AAAATTATTT TTCTAGCAAA  7050
AGGTGCAGAG ACTCGACGGG AGCTATCCGC AAACAAATTT TAATTTGATT  7100
TAAGGATAAA GAGAGAGTCC AGCTTCTAAA AACTGAAAAT CCGTTGATTC  7150
ATTGAATCGT GAGAGTTCAA GTCTCTCTAT CCCCAAAATC AAATCTTTTT  7200
TTTATTTTTT ATTTCCAACT CACAGTAATG TCTTCTAAAA GAACAGAACT  7250
ATTGTAGATT TCAAGAATAA TTACAAGAAA AACTGCAAAT AATCCCATAA  7300
AAATCCCCAT TATTGCAGTA GTACCCCATC CTGGGGCTAC TTTACCATAT  7350
TCAGAATTTA AAGGTTTTAG TAGTGTTCCC AAAGAAGTTA TTTTCCCAGG  7400
TTGAATATTT TGTGTATTTG AAGCTTTTCT GCCAGATGTA GTTCCAGTTG  7450
CCATAATTTT AAATAAAAAT TGTATTTAAT TTACTTTCTA TATTTTTATG  7500
TGAATTTACA AATTAAATAT TGAGTTTAAA TTTTATTGAA ATTTTTCGAT  7550
TTTGTGGGTA TTTATTATTA ATATATAATG TATTAGGGAA AAAAATTATG  7600
GAAAATTCAG CTATTTTTAT TACTATTTTT TTGTGGTGTT TACTTATAAC  7650
CATTACAGGT TATTCAATTT ATATAGGCTT TGGCCCACCT TCGCAAGAAT  7700
TGAGAGATCC TTTTGAAGAG CATGAAGATT AATATTAAAA GAATTCTTTT  7750
AATATTAATC TTCATAAAAT AGATTTTTTA TTTAAATAAC ACCAACATTA  7800
TTTACCAGGA ACGCGTGGAG GTTCTCTAAA GAAAATCGAA AAAAATATAA  7850
TACCTAGAGT ACCGACTAAT AAAAACGTAT AAACTAATGC TTCCATATTA  7900
ACTGCTTATT TATATATTTA ATTTTTTAAT ATAAATAAAC GTTTTTTTTT  7950
ACTTAAATTT TTTAGCTTAA GTAAAAAAAA CGTTTATTTA ATTTTATATA  8000
ATAAAAATTT ATGTTAAACA CACAAATTAA TATTTAATCA ACCTAAAATT  8050
ATTTTATATT GATTGTCGAG GCGTAGATTT ATCACCAAGT TTTTGAAATG  8100
CACCGAATTC GATTTGTTCA TCTAAGTCAG CATCAATACC TGCAAAAACA  8150
TCTCGGAAAA TAGTTCGAGC GCCGTGCCAA ATATGTCCAA TAAAGAAAAA  8200
TAAAGCAAAA ATTAAATGTG CAAAAGTAAA CCAACCACGT GGGCTTGTAC  8250
GGAAAACACC ATCAGATTGT AAAGTTGATC TATCAAATTC AAAAACTTCA  8300
CCTAATTGAG CACGACGTGC GTATTTTTTA ACAGTAGCGG GATCAGAAAA  8350
AGTAACACCA TCTAATTCAC CACCATAAAA AGTAACTGAA ACACCTACTT  8400
GTTCAATACT ATATTTTGAT TCTGCTCTAC GGAATGGTAC ATCAGCTCGA  8450
ACAACACCAT TATTATCAAC AAGTAAAACT GGGAAAGTTT CAAAGAAAGT  8500
CGGCATACGT CGAACAAATA ATTGATTACC CTGTTTATCT TTAAATACTG  8550
CGTGTCCTAA CCAACCTACA GCAATACCAT CACCACTATT CATAGCACCT  8600
GTACGGAAAA GTCCACCTTT CGCTGGATTA TTTCCAATAT AATCATAAAA  8650
TACTAGTTTT TCTGGTATTT CAGACCATGC TGCTGTTGCT GATTTTCCAC  8700
TACCAATACT GCTCTGTACG CGGCGTTCAA TTTCTTGTTG GAAAAACCCT  8750
AAATCCCATT GGTAACGCGT TGGTCCATAT AATTCTACTG GAGTAGAAGA  8800
AGAACCATAC CACATTGTTC CAGAAACAAC AAAAGCTGCC CAAAAAACTG  8850
CTGCGATACT ACTAGAAAGT ACTGTTTCTA TGTTTCCCAT TCGTAAGCCA  8900
TTATATAGGC GTTGTGGAGG TCGAACACAA AGATGAAACA AACCTGCTAA  8950
TATACCAAGA ATACCAGCTG CAATATGGTG CGCCGGAACT CCACCAGGAT  9000
TAAAAGGATC AAATCCTGTA GCATCCCAAG ATGGTGATAC GGGCTGAATA  9050
CTTCCACTAA TACCATAAGG ATCTGATACC CAAATACCAG GACCAAATAA  9100
ACCAGTTACG TGGAAAGCAC CAAAACCAAA ACAAGCTAAT CCTGATAAGA  9150
AAAGATGAAT ACCAAAAATT TTTGGTAAAT CTAAAGCAGG GTTTCCTGTA  9200
CGTGGGTCAC GGAAAAGTTC TAAATCCCAA TAAACCCAAT GCCAAATTGA  9250
AGCTGCAAAT AATAATCCAG ATAATATAAT ATGAGATGCT GCTACACCTT  9300
CATAACTCCA AATTCCAGGA TTATTCGCTG TTTCTCCACT AATAGTCCAA  9350
CCACCCCATG ATTGAGTAAC ACCTAATCGT GTCATAAAGG GTAATACAAA  9400
CATCCCTTGA CGCCACATAG GGTTTAATAC AGGATCTGAT GGATCAAAAA  9450
CTGCAAGTTC ATAAAAGGCC ATTGAACCTG CCCAACCAGA AACTAATGCT  9500
GTGTGCATTA AATGAACAGA AATTAAACGA CCTGGGTCAT TTAATACTAC  9550
TGTATGTACA CGAAACCAAG GAAGACCCAT AAAAATACTT ATATTTTAAA  9600
AATTTCATTT ATTTACTTTC TTTCTATATT AAAAAATAAA AAATACTAAT  9650
AACTTTTTTT TATTTTTATA AATTATTATT TTTATTATTA TACCATCAAT  9700
TATAATATTT TATTTATACA TTTTTGCTTA CTAGTTAGTG TTTTAAAATA  9750
AATTTTATTT AGGTTCAATA TATCTTTATA ATATAATTCT GTAT        9794

<210> SEQ ID NO: 151
<211> 8866
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
ATATTTCATA TTAAGAAATA TTTAACAAGA ATCTCATATT ATTGAAAATA    50
AAAGATTTAT TTATATAANG TGGCACCTAA TCGTACAGCA AAAAGCCCTG   100
TAACTAATGC TAAAAATAAT GCTATAAATA TTTGAGAATC TGCAAGCGAC   150
ATAAATTTTA ATAATATTTA ATAAATGCTA TTTACTAATT TTTTAAGCTA   200
AAATTATTGG ATTTTAATAT TAATTAACTT ATAAATTTCC TTTACGACTG   250
GCAAGTAATA TAATAACAAG TGGGCCGGCC GCAATAATTA AAAATAAACT   300
TGTTAATTGA AAAAATATTT CTAAATTCAT GATGTATTTT TTAATTTCGT   350
AATTTAATAA AATTTTAGGT GAAGTAAAAT ATAAAATTTA AATAATGTTA   400
AATTTGTATT AGAAAAAAGC TTATATTAGT TTAAATAATA ATATAATAAT   450
TTTAAATATT TACTTTAACG AAAACTTACT GACGCCTGCC AAACAAACGC   500
TAATAATAAA AAAAATACAG GTATAACTGG TAAAACGTTT ACAATAGGGT   550
CGAAGGGAGC ATAAGCTTCA GGTAATTTTG CTAATAATAA CACGGGAATC   600
CTCCAAAATA ATATATTTTG AGCAATTGAA TTGTATGTTC AAAACTACTA   650
TTCAATTTTT TACTTAATTT AGTATTATTA TGCTAGAAAA ATATATTTTT   700
TACCAATGTT TTTAATACTT TAAATATTTA ATTCAAAAAT AAAANNANNA   750
NAANAAAAAN ATAAAATATT AATAAAATTA AAATATTGTA TAAACATAAC   800
TATAATAATG TAAAATAAAA AGTAGTTGTT CTTATATACC TATAAGATAT   850
ATTTCTTTTT ATAGTACTTA TTATTTTAAG TATATACTAA TTTTTAGTGA   900
TTTGTAAAAA AAAATTATGA AATTAGCGTA TTGGATGTAT GCAGGGCCTG   950
CTCATATAGG TACATTAAGA GTGGCGAGTT CATTTAAAAA TGTTCATGCT  1000
ATTATGCACG CGCCTTTAGG CGATGATTAT TTTAATGTAA TGCGCTCTAT  1050
GTTAGAACGA GAACGCGATT TTACACCCGT AACCGCAAGT ATCGTTGATC  1100
GTCATGTTCT TGCTCGGGGT TCTCAAGAAA AAGTTGTCGA AAATATTACT  1150
CAAAAAGACC AGCAACAAAA ACCTGATTTA ATTGTATTAA CACCAACTTG  1200
TACTTCAAGT ATTTTACAAG AAGACTTACA AAATTTTGTA AATCGTGCAT  1250
CTTTAGAATC TAATTCAGAT GTAATTTTAG CTGACGTTAA TCATTATCGA  1300
GTGAATGAAT TACAAGCAGC AGACCGGACA CTTGAACAAA TTATTAGATT  1350
TTCAGTTGAA AAAGCACGTA AAAATAATGT ATTAAATACA GAAAAAACTA  1400
TTAAACCTTC TGCTAATATT CTTGGGATTT TTACATTAGG ATTTCATAAT  1450
CAACATGATT GTCGAGAATT AAAAAGACTT TTAAATGATT TAGGAATCTC  1500
AATTAATGAA ATTGTTCCAG AAGGTGGGTC GATTAAAAAT TTAAAAAATT  1550
TGACTAAAGC GTGGTTTAAT CTAGTTCCGT ATCGAGAAGT AGGTTTAATG  1600
GCGGCAAATT ATTTACAAAA CGAATATTCC ATGCCTTATG TTGGAATAAC  1650
CCCAATGGGT ATCGTAGATA CAGCTAATTG TATTCGTGAA ATTTCTACTA  1700
TTATTAAAAA CCAAAACCCT GATATAGAAT TTAATACAGA AAAATATATT  1750
GATCAACAAA CAAGATTTGT ATCCCAAGCT GCGTGGTTTT CGCGTTCTAT  1800
TGATTGTCAA AATTTAACAG GGAAAAAAGC TGTGGTATTT GGTGATGCAA  1850
CACATGCAGC TAGTATGACA AGAATATTAG CGCGTGAGAT GGGGATTAAT  1900
GTTTCATGTG CTGGTACATA TTGTAAACAC GACGCAGATT GGTTTCGTGA  1950
ACAAGTTCAA GGATTTTGTG ATCAAGTTTT AATTACTGAT GATCATACTC  2000
AAGTTGCAGA TATGATTTCT CGCATTGAAC CTTCAGCTAT TTTTGGTACT  2050
CAAATGGAAC GTCATATTGG AAAACGGTTA GATATACCAT GTGGTGTTAT  2100
TTCAGCACCT GTACATATCC AAAATTTCCC ATTAGGTTAT AGACCGTTTT  2150
TAGGTTATGA GGGTACGAAT CAAATTGCTG ATTTAGTTTA TAATTCGTTC  2200
ACATTAGGTA TGGAAGATCA TCTATTAGAA ATTTTTGGTG GTCATGATAC  2250
AAAAGAAGCA ATTACAAAAT CACTTTCTAC AGAATCTGGC TGTAGTTGGT  2300
CACCCGATGG GTTAGCGGAG CTTAAAAAAA TACCAGGTTT TGTTCGTGGA  2350
AAAGTTAAAC GTAATACGGA AAAATTTGCT CGTGAACGAA ATATTACTGA  2400
AATTACAGTA GAAATAATGT ATGCGGCGAA AGAAGTTATT GGTGCATAGT  2450
AATTAAACCC GTATGCACTA TTTTTTTATG GAATAATAAA TATAAAAATT  2500
TAGCTTTTAT TATTAAAACA TGGTTTTATA TTTATCTATT TTTTTATAGA  2550
GTTTAATTTT TAAAATTTTA AATTAGATCA ATTTTAAAAA TTATTTAGAA  2600
AGCAACAATA ATAAATATTA TAATTTTTTT ATAATTAAAT ATATTATAAA  2650
ATAGAAATTT CAATCAATAT TATTTAGTTA GTTGAATTTA CCCTCGAGAG  2700
GGAAAGGAGA ATTTTTCGCC ATGACAATTA GCCCCCCAGA ACGCGAATCT  2750
AAAAAAGTTA AAATTCTTGT AGATCAAAAT CCTGTAGAAA CGAACTTTGA  2800
AAGATGGGCA AAACCTGGTC ATTTTTCACG CACGTTAGCA AAAGGTCCAA  2850
ATACAACTAC ATGGATTTGG AACTTACATG CAGATGCACA TGATTTTGAT  2900
ACACAAACAA GTGATCTAGA AGATATTTCG CGAAAAGTGT TTAGTGCACA  2950
TTTTGGTCAA CTTGGTATTA TTTTTATCTG GTTAAGTGGT ATGTATTTTC  3000
ACGGCGCACG TTTTTCAAAT TATGAAGCAT GGTTAACAGA CCCTATTCAT  3050
ATTAAACCAA GTGCGCAAGT TGTATGGCCA ATTGTAGGCC AAGAAATGCT  3100
GAATGCTGAT GTTGGTGGCG GATTTCAAGG GATTCAAATT ACATCTGGAT  3150
TTTTCCAACT TTGGCGTGCT TCAGGTATTA CTAGTGAGCT TCAATTATAT  3200
ACTACAGCGA TTGGTGGTCT TGTATTCGCT GCAGCTATGT TTTTTGCAGG  3250
TTGGTTTCAT TACCATAAAG CAGCGCCTAA ATTAGAATGG TTTCAAAACG  3300
TAGAGTCTAT GCTAAATCAC CATTTAGCAG GTCTTTTAGG TCTTGGAAGT  3350
TTAGCATGGG CTGGGCACCA AATTCACGTG TCTCTTCCAA TAAATAAGCT  3400
TCTTGATGCT GGTGTTGATC CCAAAGAAAT ACCACTCCCA CATGAATATA  3450
TTTTTCATCC AGAATATCTT GCACAATTAT ACCCAAGTTT TGGCAAAGGA  3500
TTAGTACCGT TTTTTACACT TGATTGGAAT CAATATAGCG ATTTCTTAAC  3550
TTTTAAAGGC GGTTTAAACC CAATTACTGG AAGTTTATGG CTAACAGATA  3600
CTGCCCATCA TCATCTTGCT ATTGCTGTTG TTTTCATATT AGCCGGACAC  3650
CAATATAGAA CAATTTTTGG TATTGGGCAT AGTATCAAAG AAATTCTAGA  3700
AAGTCACACT CCTCCGTCAG GAAATTTAGG TGCTGGTCAT AAAGGTATTT  3750
ATGATACTCT TAATAATTCT TTACACTTTC AATTAGGGTT AGCTTTAGCG  3800
AGTCTTGGCA CAATTACATC TTTAGTAGCA CAACATATGT ATTCGTTACC  3850
ACCGTATGCT TACTTAGCGC AAGATTTTAC TACACAAGCA TCTTTATATA  3900
CACATCATCA GTACATTGCT GGGTTTTTAT TAACAGGTGC ATTTGCACAT  3950
GGTGCTATAT TTTTTGTTCG TGATTATGAT CCAGAAGCAA ATCGTGGCAA  4000
CGTGTTGGCA CGTATTTTAG ATCATAAAGA AGCTATTATT TCACATTTAA  4050
GTTGGGTTAG TTTATTTTTA GGTTTTCATA CATTAGGATT ATATGTACAT  4100
AATGATGTAA TGCAAGCTTT TGGGACTCCA GAAAAACAAA TTCTAATCGA  4150
ACCTGTTTTT GCTCAATGGA TTCAAGCTTC TCATGGTAAA ACTGTTTATA  4200
ATTTTGATTT TTTATTATCA TCATCAACAA GTGCTCCAAG TATTGCTGGT  4250
CAAAATTTAT GGTTACCTGG ATGGTTAGAA TCAATTAATA ATGAAAGTAA  4300
TTCATTATTT TTAAACATAG GCCCTGGTGA TTTCTTAGTG CATCATGCAA  4350
TTGCTTTAGG ACTACATACA ACTACATTAA TTGGGATCAA AGGTGCTCTA  4400
GACGCGCGTG GTTCAAAATT AATGCCAGAT AAAAAAGATT TTGGTTATGC  4450
TTTCCCTTGT GATGGTCCTG GACGTGGTGG TACTTGTGAT ATTTCAGCTT  4500
GGGATGCTTT CTATCTTGCT ATTTTCTGGA TGTTAAATAC TATAGGTTGG  4550
GTTACTTTTT ATTGGCATTG GAAACATCTT GGCATTTGGC AAGGTAATGT  4600
TAGTCAATTT AATGAATCGT CTACTTATTT AATGGGTTGG CTTCGAGATT  4650
ATTTATGGCT TAATTCTTCA CAACTTATTA ATGGTTATAA TCCTTTTGGG  4700
ATGAATAGTC TTTCTGTTTG GGCGTGGATG TTTTTATTTG GTCATTTAGT  4750
GTATGCAACA GGATTTATGT TCTTAATTTC TTGGCGTGGT TATTGGCAAG  4800
AATTAATTGA GACTTTAGCT TGGGCTCACG AAAGAACACC TTTAGCTAAT  4850
CTAATTCGTT GGCGCGACAA ACCAGTAGCA CTTTCTATTG TTCAAGCACG  4900
TTTAGTTGGT TTAGCACATT TTTCTGTTGG CTATATTCTT ACTTATGCTG  4950
CTTTCTTAAT CGCTTCGACT TCTGGTAAAT TCGGTTAAAT TTTTTTAATT  5000
ACTCTTTTAT TAATAAAATA AAAGAGTAAT TATTATTACT GTTAAAATTT  5050
TTTGCGGTGT AAAAAGTTAC GATTTTTACA CTTTTTTATT TTTATGATAT  5100
AATATAAAAA ACTGGTTGTA TGTAATTACT ATTTATTACT TGTATAAGTA  5150
CTAATATGAT TAATTTAAGA TTATTTTTTA TTATTAATAA AATTTAATAA  5200
TTTCTTATTA GTAGAATTTA TTAATAATTT ATATATNNNS NTANNNNNAA  5250
AAAGTATTTT TAAATTTACA AAATTTAAAA AATTACAAAC ACTACAAAAA  5300
TCAACTAAAT CTTTAAAATT ATGGCTGTTT CTACAGAAAC TAAAAATGTT  5350
GGTCGTATTA CTCAAATAAT AGGCCCTGTT TTAGATATTA CTTTTACGGC  5400
AAATAAAATG CCTAAAATTT ATAATGCTTT ATCTATTACT AGTAAAAATA  5450
ATGAAGGTCA AGATATTTTT GTCGTTTGCG AAGTTCAGCA GTTATTAGGT  5500
GATCATTGTG TTCGCGCTAT TTCTATGAAC GCTACAGATG GCTTAAAACG  5550
AGGTATGGAG GTTTTAGATA CTGGTGATGC TTTAAATGTA CCGGTTGGTA  5600
AGGGAACTTT AGGTCGGATT TTTAATGTAC TTGGGGAAAC TGTTGATAAT  5650
TTAGGTCCGG CTAATACTTC AGATCAACTA CCAATTCACC GTTCTGCTCC  5700
TGAATTTGTA GATTTAGATA CGAAACTTTC TATTTTTGAA ACAGGTATAA  5750
AAGTAGTTGA TTTATTAGCG CCATATAGAA GAGGTGGTAA AATTGGATTA  5800
TTTGGAGGAG CTGGTGTTGG AAAAACAGTA TTAATTATGG AATTAATTAA  5850
TAACATAGCA AAAGCTCATG GTGGTGTTTC TGTTTTTGGT GGTGTAGGTG  5900
AGCGTACACG TGAAGGTAAT GATTTATATA TGGAAATGAA AGAGTCTGGT  5950
GTAATTAATG AGTCTAACAT AGCTGATTCA AAAGTAGCAC TAGTATATGG  6000
CCAAATGAAT GAACCGCCGG GTGCGCGTAT GCGAGTAGGT TTAACAGCTC  6050
TTACAATGGC TGAGTATTTT AGAGATGTAA GTGGACAAGA TGTTTTATTG  6100
TTTATTGATA ATATTTTCCG TTTTGTACAA GCGGGTTCTG AGGTGTCTGC  6150
TTTATTAGGT CGTATGCCCT CTGCTGTTGG ATATCAGCCT ACATTAGCGT  6200
CAGAAATGGG TACTTTACAA GAGCGGATCA CATCAACAAA AGATGGAAGT  6250
ATTACTTCAA TTCAAGCTGT ATATGTTCCG GCAGATGATT TAACGGATCC  6300
AGCTCCAGCA ACTACATTTG CTCATTTAGA CGCTACTACT GTACTTTCAC  6350
GAGGGTTAGC TTCTAAAGGA ATATATCCAG CGGTGGATCC TTTAGATTCA  6400
ACATCTACAA TGTTACAACC TTGGATTGTG GGTGATGAGC ATTACAAATG  6450
TGCACAAAAC GTAAAACAAA CATTACAACG TTATAAAGAA TTACAAGATA  6500
TTATTGCAAT TTTAGGTTTA GATGAATTAT CACCTGATGA TCGTTTAGTT  6550
GTTGCGCGTG CTAGAAAAAT TGAACGGTTT TTATCTCAGC CTTTTTTTGT  6600
AGCTGAAGTT TTTACTGGTT CTCCTGGAAA ATATGTTAGT TTGGCTGATA  6650
CTATTAAAGG GTTTAATTTG ATATTAGCTG GTGAGTTAGA TGATTTATCA  6700
GAGCAAGCTT TCTATTTAGT GGGTAATATT GAAGAAGCAG TAGAAAAAGG  6750
ATCTACAAAA AAATAATTTT TTGGTTACTA TAGATTTTTT ATAGTAACCT  6800
AAAAAATCTA TAGTTTATAG TTATATAAAT TTATACAAAA TATTTAAGTA  6850
AATACTTGTT TTTTAATTAT AAAAATGAGT ATTATTAAAA TTAGGGATAA  6900
ACTAAAAAAT TATGGTACTT CAAGTTTCAG TTATGACACC GGATGGTATT  6950
TTTTGGGATA ATAAAGCTGA AGAGGTAATA CTTCCCACAA ATACTGGCCA  7000
AATGGGAGTA TTAGCTAATC ACGCACCACT AATTACAGCA TTAGATGTCG  7050
GTGTAACATT AATTCGTACT GATAAAAAAT GGACTCCATT AGCTGTTATG  7100
GGTGGTTTTG CATTAGTAAA ACAAAATCAA ATAACAATTT TAGTAAATGG  7150
AGCAGAATCT GCAGATACTT TACAATCAGA AAAAGTAGAA GCTTCGTTTG  7200
AAGAAGCCAA AAATCAATTA GAAAATGCAG AAAGCGAAAA ACAAAGAGTA  7250
GATGCTCTTT TTCAATTTAA GCGCGCAAGA GCTCGATATC AAGTAATTAA  7300
ACAGTTAGTT AATTAATTTT TTTAATTAAA AAGCATTTTG TTAAACAAAT  7350
TTAAATAATA AAAATTGAAT ATGTCGCGTT ATAGAGGTCC ACGATTGCGT  7400
GTTATACGCC GTTTAGGTGA ATTACCTGGT TTTTCAAAAA AAGTAGATAA  7450
AAACCAGGTT CCACCAGGTC AACATGGATG GAAAAAAAAG ACAGGTGATC  7500
AAAAAAAATT AAAAGAATCA CAATACGGTA TTCGTCTTAA AGAAAAACAA  7550
AAATTAAGAT ATAATTATGG AATTAATGAG CGCCAACTTA TAAATTATGT  7600
ACGGGAAGCT CGTCGACGGA AAGGATCTAC TGGAGAAGTA TTATTACAAC  7650
TTTTAGAAAT GCGATTAGAT AATATCATTT ATCGTTTAGG ATTTGCGCCC  7700
ACTATTCCTG CAGCACGTCA ACTTATAAGC CACGGTCATA TTAGTATAAA  7750
CAAGAAAAAC ATTAATATTC CAAGTTATAC TTGCAAAATT AATGATAATA  7800
TTTCGGTTTT AAAAAATTCT CAACAATTAG TAAAAAATTA TTTAAATAAT  7850
GCAGGTATTG CTACTGTTTC TTCTTATTTA AATCTAAATA AAGAAACATT  7900
AGAAGCTAGT GTTAATAACA TCGTACCACG CGATTTAATT AAACTCCAAG  7950
TTAATGAATT ATTAGTTATT GAGTATTATT CTAGAAAATT ATAATCATTA  8000
TATCTTTATG CTTAAGGTGC TAAGCATAAA GATATAAGAT CATTAAATGC  8050
GGAAAGATGT CTGAGTGGTT GAAGGTATAG ATCTGGAACA TCTATGTGAT  8100
GCTTTTATAT CACCGAGGGT TCGAATCCCT CTCTTTCCGT CTTTTTGCGA  8150
TTATGATGAA ATTGGTAGAC ATGCAAGCTT GAGGGGTTTG TGGGTAAACA  8200
CCCATCCCGG TTCAAGTCCG GGTAATCGCA AAAAATATTT AATTTAGCAA  8250
TATTATTGAC AAATAAAAAA ATTTATATTT ATACTAAAAT TATTATTAAA  8300
TCCTCAGTAG CTCAGCGGTA GAGCGATCGG CTGTTAACCG ATTGGTCGTA  8350
GGTTCAATCC CTACCTGGGG AGAAAAAATT AAAATAAAAA TTTGGTTTTA  8400
TAACTTATTT AATTATTATA TAAATAACAG CATAAAAATT AGTTATTGTT  8450
TAAAGAAGTT ATAATACTAT AATTATTAAC TTCTTTTCAA TTTATAATAA  8500
TTATAAATAC GTTATATAAT AATTTTGAAA TTTTATATCT AAACATTTTT  8550
TAAAAGGAGC CTTTTTATGT CTCATACTGT AAAGATTTAC GATACATGCA  8600
TTGGTTGTAC TCAATGTGTT CGTGCGTGCC CAACTGATGT TTTGGAGATG  8650
GTACCTTGGG ATGGGTGTAA AGCTAGTCAA ATCGCGTCAG CACCGCGAAC  8700
TGAAGATTGC GTAGGTTGTA AACGTTGTGA ATCTGCATGC CCGACTGACT  8750
TTTTAAGTGT TCGTGTATAT TTAGGTCCTG AAACTACTCG TAGTATGGGA  8800
TTAGCTTATT AATAATAATA TTTATGTTGG AGTTGAAAAA ACCCAACATA  8850 
AATATTATTA TTAATA                                       8866

<210> SEQ ID NO: 152
<211> 8161
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
ATCATACCAG TAATAAAATC TATCAAGTAT TACTTGATGG GGTTCTTTTG    50
GCTCCTCTGA ATTAACTTTT CTTTTTTTCT TAGTAACCTG TTTTTCTTCA   100
ACAATAGGTT GAATTTCCGC TACCTCTCGA GCAGTTTCTA TATACTTTTT   150
AGCTTTAATA GGATCATACT CAATACCTAA CTCTTCGTAT TGTTTTTTTA   200
AGTAGTCCGT AAATGTTATT GGTTCATTAT GTGCGGGTGC AGGATAAGCC   250
ACAGAAGATG ATGTAGTGGA TGATTCAGAA GTTGGTGCAA ATATATTATT   300
TATAATCTCC TGAATTTTAT CATTTTTTTT CATCTTATTG TTTAATTCAA   350
GACGATAAAA TATCAATGGA GTTTCTTGTG GTAAAACCCG CTCTGATTGA   400
GATTGAGTTA CTCTATTAAA TCCTAAATTT TCTGCTAATC GCTGTCCTTT   450
AATATGAGTA GAAACCACAT CTAATGATAA TTTTTCCACT CTTCTCACCC   500
ACGCATAATC AGATCGATAG GATAATTCCT CCATACTCAC TGTAATATCC   550
TCTCCCGGAT AAACTAAATA TTGACCCCTA TTAAAAGGAA ATAGTTTTAA   600
TTCCATCAGT TTTCCTTCAG TAATATAACT GAATTCAGGA GAAATATCTG   650
GTACCGAATT ATGACTAAAT ATTGTCGATT CAAAAACGTC GTCTTCTGAA   700
ACAAAACCAA GTGGTCCTGT TAATATATAA TTATAAGCAT AATAAGGAAG   750
AGTTGCTAAA CTAATAGCTA TTGAGCAAAT AAAAGAAACT CTATTAATAT   800
CAGATATAAT TTTACTAGTA TAAAACATCG TACGGTCAAG TACAAAGTTT   850
TTTACTTTTA AAAAAAACCA CATCCAGAAG CTTGTAAAAA CAACAGATCC   900
AAGTAAAATG CCAGTTATAT ATAAAAAATT ATTGGCAATA CTTAACCCTA   950
AATGATTCGA AGTAAACCCT TGTAAAATTG TTGGGTTTGC TGAAAAAGTC  1000
ACATTACTTA AAAATTGAAA AATAGCGCCT TGCTCACACC ATGCCATTAA  1050
AAAACTATAT ATAAAAAATT TTTTATATTT TGGGTGATGC CAAGTTTTTA  1100
ATGCAAGTAA ACTTTCAAAC CGTAATGAAA AAATAATGCG CGCAACAAGA  1150
ATTAAGCCAG CAATATAGTT TAACGGTTCT AATGTTAACC ATGGAATAAT  1200
TATACCTTGT ATACCAAAAG TAACACAACT AATAAACACA ATTTGACCAG  1250
CAATATAACC ACCAAAAGAA AAAACGGCAG CTGGGATTCC TTGAATTAAT  1300
AATCTTCTAA TTGAAATAAT ATGTATTATA GAAAATGGAA GCGTGAAAAA  1350
AGCACTATTT AAAAACCCAA CTAATAAAGG ATTTTGCTCT ACATGTAAAA  1400
TCTCTAAAAA ATCAAATAAA GATCCTGGGG GCGTTTGCCC AACGAGTGAT  1450
TCATAAAAAG CTGGTATTTT TATAGGTAAT AGACTAAAAT CACGAATCCA  1500
TTGAAAACTT AAAAACTGTA AAATACAAGT TTGACATATT TTTAAAACGT  1550
AAAGTAAACT TTCCGAAATA AATTTTTCAA ACTCGATAGG ATTACCTGGC  1600
GTTTCAGTAA CTCCAGCTAA TACTTCAACA TAATCGCGAA TATTTTCACT  1650
TAATGACATA TACTAAATAT ATAATAAGTT CTTTTTAATA AAAAAAGTAA  1700
AAAGAGCCAT ATACCCTTTT TCTTTTAGTT AATGAATAAT TTAAAATAAA  1750
TTATATAACA AAATATAAAA ACACAATTTT CTAAATTCAA TATAAAATTA  1800
ACGACGAGAA TAGAAAGATT GTGCTGTATT TACTAAAAAA ATAATAATTA  1850
AAGCAAGAAG TAAAATTACA GCACCTATAA CAGACGCTCC AAAATAATCA  1900
TATTGCTCTA ATGATTGATA AATTAAAACT GAGGTTACTA AATCATCTAA  1950
TGGAAGATTA GATGAAATCA TAACAACAGA ACCAAATTCA CCCAATGCTC  2000
GCGAAAAACT TAATGTAAAT CCGGTAACTA ATGCTGGCAC CAATGTAGGA  2050
AAAATAACCC GCAAAAAAGT TTGAAATGAA CTTGCACCCA TACCCCATGC  2100
AGCTTCTTCT AAACCATGAT CAAATTCTTG TAATACCGGT TGTAATGATC  2150
TTATTACAAA AGGGAAAGAT ACAAAAATCA TAGCTAAAAG TACACCAAAT  2200
CTTGTATAAA TAATTTGAAT GTTTGCCATT TTTAGATAAT TACCAACCCA  2250
ACCTTGGTGT CCATAAACTG TAGCTAAAGT AAGGCCTGCA ACTGATGTTG  2300
GTAAAGCAAA AGGTAAATCA ACCGCAGCAT CAACAAATCT TTTACCCCTA  2350
AATTCATAAC GAGTGATCAT CCATGTAATT AAAAACCCAA AAACACTATT  2400
TAAAAGAGCT GCTGTTAAAG CCATTTTAAT AGTAAAAGCA TAAGCCGAAA  2450
TTGCAATAGG ATCTAATGCT TTTTCTAAAA TTTCTCTCCA ACTATTTTTA  2500
AAAATTAATG AAAATAGAGC AAAAAGTGGA AGAACTAATA TAAAAAATAC  2550
ATAATTTCCA AGCATTACTC CTATAAGTAA TTTTGGATTA GTTTTTTTTA  2600
AAAACGGTAA ATTGTGGAAA CTAGTTGTTT CTTGTTTATT CATAAAAATT  2650
CGCTCATTTT TATTTTTTTC TAAATAAACT GTATTAAAAT CATTATTAGT  2700
TTTCTTTATT TAATAATGAT TTTAATACAG TTTTTGCTTT AGCTAAAGCT  2750
TTTTTTGCTT GGACCGACGC TTGCGCCTTC CACTGTGATT TTCTTAAATT  2800
TTTTCTTGTT TTAGATTGTC GTTTTTTAGG AACTGCCATT ATATTCTCCT  2850
AATATAATTT TTTTAAGGTA TTCTGCAAAA TTTTATATTT ATTATATATC  2900
TTTTTTTACA GAAATTTCAA ATAATTATTA TTAATAATTT TGAATTCATT  2950
TTTTAATTAC TTCTTTTTAT AGTATAAAAT TGTATTAAAC TTTATTTTAT  3000
AACCAGTTGA TTAAAATTTA ATATAGTTAG ATTATTATAC TAGAAAAATT  3050
ACTAAAAACA TTAAAATGTT TTATATATTT TATATTTATT CAAGAGTTTT  3100
AAAATTTATC ATCTCAGAAA GAAATATAAT AATAATGTTT TAAATATGCA  3150
AATAAATTAA AAAAACTACC TTTTTATTAA TTATATATCT TTTAATAAAA  3200
AAAAACCAGT TTTCTTACAT ATTTAAAAAC TTAAATAAAT AGAAAATATA  3250
TTAGGTTATA AAGAATCCAT AACTATGTAG CCCTTTACCA AAAAAATTCA  3300
CACCTAAATA ACAAACCCAA ATAATAATAA AACCAAAACT AGCGACCCAA  3350
GCTGATTTTA TACCTTTAAA TTTACCAACA AACCGCATAT GTAAATATAT  3400
AGCAAAAACT AACCAAGTTA TAAAAGCCCA TGTTTCTTTT GGATCCCAAC  3450
TCCAATAATT TCCCCATGTT TCATTAGCCC AAACAGCCCC AGAAAGAATA  3500
CCTAATGTTA AAAAACAAAA ACCAATTCCG ATTAATCTAT AACTAAAATT  3550
ATCCAAAAAA AGTAATTGAG TTTCTTTATT TTTTTTTTGT AAAGGGTAAT  3600
TACTATTATA CATTGCATAT ATAGGTTTTG GCTCATCTTG GTTGATTAGA  3650
TTTGCAGGTG GAGTTTTATA TTCTTGAGGT TTAAAAGAAA AATTTATAAA  3700
TATATAGCTT ATTGAAAAAA GAGAACCTAA TAATAATGCC CCATAACTTA  3750
CTATCATTGC ACTAACATGC ATAAATAACC AATTAGATTG TAAAGCTGGT  3800
ACTAACGGGT GTAATTGATA GAGTGATGAA GGCAGACTAA AATAACTAAA  3850
TGCTACGATA AGTAAAACTA ACGGCGAAAC AAATAACGCA GCAAACTCTC  3900
TTTTTGTTGA AATTTCTAAA AAAGCATATA AACAAAGTAA TACCCAAGCT  3950
AAAAACAACA ATGATTCATA TAGACCACTT AAAGGGAAAT GTCCTGAAAA  4000
ATGCCAACGT AAACAGAGTT GACTAATTAT AAAAAAATTT GAAAGTAAAA  4050
TCCCAAATAC GCCGAAAACC TTATATCCTA TTTTTGGGTA GCGTAAAAAT  4100
TTTGTCCAAT AATAGACAGT AGTTAAAAAT AGGGTTATTA GATTACCATG  4150
CGATAATTCT GTTTCGATTG CTTCAAAATT CATATTTTTT TAGTTATAAT  4200
TTAATTTTTN TTTNTTNTNN ATTATTTAAA ATAAAAAAGC AAGACCTAAT  4250
AATATATTAA ATATTAATTA ATAATATTTT AAATAATTAT ATACTAAAAC  4300
ATGTAATCAA CTATTAATAT TGTTTGTTAA AATAAAATTG ATCTGTATTA  4350
TTATTTACAA ATTTACTTTT AAGGATATTA TAATTATGAA ATTAGCTGTT  4400
TATGGTAAAG GTGGTATTGG AAAATCAACA ACTAGCTGTA ATATTTCAAT  4450
AGCTTTAGCG AGACGTGGAA AAAAAGTATT ACAAATTGGT TGTGACCCAA  4500
AACATGATAG TACTTTCACA CTTACAGGTT TTTTAATTCC AACAATAATT  4550
GATACACTAC AAGCAAAAGA CTATCATTAT GAAGATGTTT GGCCTGAGGA  4600
TGTTATTTAT CAAGGATATG GTGGTGTAGA TTCTGTTGAA GCAGGGGGTC  4650
CTCCGGCGGG TGCAGGTTGT GGTGGATATG TTGTTGGTGA AACTGTTAAA  4700
TTATTAAAAG AACTTAACGC ATTTTACGAA TATGATATTA TTCTTTTTGA  4750
TGTTTTAGGT GATGTGGTGT GTGGAGGATT TGCTGCTCCA TTAAATTACG  4800
CAGATTATTG TTTAATTGTT ACTGATAATG GATTTGATGC TTTATTTGCA  4850
GCAAATAGAA TTGTAGCCTC TGTTAAAGAA AAAGCACGCA CACATCCATT  4900
ACGATTAGCT GGTTTAATTG GTAATAGAAC AGCCAAACGC GATTTAATTG  4950
ATAAATATGT AGAAGCTTGC CCAATGCCTG TATTAGAAGT TTTACCGCTG  5000
ATTGAAGATA TTAGAATTTC GCGTGTTCAA GGAAAAACTC TTTTTGAAAT  5050
GGCTGAATTA GAGCCTAATT TTAATTATAT TTGTGATTAT TATCTTAATA  5100
TAGCAGATCA ATTATTATCA CAACCCGAAG GTGTTGTTGC TAATCAATTG  5150
GCTGATCGTG AACTTTTTAC GCTATTATCA GATTTTTATT TAAATATTAA  5200
AAAAACGCCA AGTAATCCTG TTTCTCCTGA ATTAGACTTT TTAATGATTT  5250
AAATAAATCC TAAAATTGAT AAAATTATCA AAAAATAATA CTTTAATTTA  5300
CGTTTATAAA TATCTAAATA TTAATATTTA AAGAGATTAC AATTCAATAA  5350
CTATAAATTA TGACAAATAA CCTTTCGACT GATAACCTAA CTTTTGAATG  5400
TGAAACTGGC AATTATCATA CTTTTTGTCC AATTAGTTGT GTAGCTTGGC  5450
TTTATCAAAA AATAGAAGAT AGCTTTTTTT TGGTAATTGG AACAAAAACT  5500
TGTGGTTATT TCCTTCAAAA CGCATTAGGA GTTATGATTT TCGCGGAACC  5550
TAGGTACGCA ATGGCTGAAT TAGAAGAAGG CGATATTTCA GCTCAATTAA  5600
ATGACTATAA AGAATTAAAA CGTCTTTGTT TAGAAATAAA ACAAGATAGA  5650
AATCCTAGTG TAATCGTTTG GATTGGGACT TGTACAACTG AAATTATTAA  5700
AATGGATTTA GAAGGGATGG CCCCCAGATT AGAAACAGAA ATAGGAATTC  5750
CTATTGTTGT AGCACGGGCG AACGGTTTAG ATTATGCATT TACACAAGGA  5800
GAAGATACTG TTCTTGCAGC TATGGTTCAT CGATGTCCCA CTGAAACTTT  5850
AACAGAAAAT ACAAAGAAAC ACCCTTCATT AGTATTATTC GGATCTTTAC  5900
CTAATACAGT AGCAAATCAG TTAAATATGG AATTAAAACG ACATAATATT  5950
GAAGTTTCTG GTTGGCTACC ATCACATCGT TATTTAGATT TGCCTCATTT  6000
AGGCGAAAAT GTTTATGTTT GTGGTATTAA TCCTTTTTTA AGTAGAACGG  6050
CAACGACACT AATGCGACGA AAAAAATGCA AACTTATTTG TGCTCCATTT  6100
CCTATAGGCC CTGACGGTAC ACGAGCTTGG ATTGAAAAAA TTTCTTCTGT  6150
TTTTGGTATA ATTCCACAAG GGCTAGAAGA AAGAGAACAA GCTATTTGGG  6200
AAAGTTTAGA AGATTACTTA CAATTAGTAA GAGGGAAATC TGTTTTTTTT  6250
ATGGGAGATA ATCTTTTAGA AATATCATTA GCACGTTTCT TAATAAGTTG  6300
TGGTATGATT GTTTATGAAA TTGGAATTCC ATATATGGAT AAACGGTTCC  6350
AAGCTGCTGA GTTAGCTTTA TTAGAAACAA CGTGCAAGAA AATGAACGTT  6400
CCTTTGCCGC GAATCGTAGA AAAACCAGAT AATTATCACC AAATCCAACG  6450
TATTAGAGAA CTCCAACCAG ATTTAGCAAT TACTGGAATG GCGCATGCTA  6500
ATCCGTTAGA AGCGCGAGGA ATTAATACTA AATGGTCTGT TGAGTTTACA  6550
TTTGCCCAAA TCCATGGATT TACAAATGCT CGGGATATTT TAGAATTGAT  6600
CACACGACCA TTACGTCGGA ATCAAGCTTT AGGAACATTA GGTTGGTCTC  6650
AATTGATTAA AACTGATAAT TAATTTTTTT TTTGAATTTG TTAGTACACG  6700
TACATATATT AAAGATGTAC GTGTACGTGC TTGTAGCTCA GTGGACTAGA  6750
GCACATGGCT ACGAACCATG GAGTCGGGGG TTCAACTCCC TCCTAGCACA  6800
AAAAAAGCAA AATTAAATCA AAAAAACGTA AAATATTGCA CCTCTTTTTA  6850
TTATCATTAA TAAGAACTTA AATGATTTGT AATCTAAAAA CAATGAAAAT  6900
AGTATGTTTT TAATAACAAT TATTTTATTT TNNTNNTNTN TTANATNTNN  6950
NTTTNTTAAA ATTTTAAAAA TAAAATCAAT AAGTTATTTA TGGAGGTAAA  7000
TCACTATGAT AAAAATAAAA GAAATATTAA CTAAAAAAGA AAAACAATTA  7050
TTATTGCAAG TAGAAGAACC ACTACTTAAA GAAAAAGAGG AAGAAACCAA  7100
AAATAACGAA CAACCACAAA AAGGTTTAGA AAAAGATAAA CTCAAAGAAA  7150
TAAAAAAAAA AAAAATAAAA GAACCTATGA TTCCAGAATC AACCTCCGAA  7200
GAAAACTCAA CTAAAAAAGA TATTGATGAA CCAGAACCTC GGGTTATAGT  7250
AATAACTTCT GGGAAAGGAG GTGTAGGAAA AACAACAACT ACTGCTAATT  7300
TAGGAATGTC TATAGCTAGA TTTGGTTATC GTGTTGCTTT AATTGATGCG  7350
GACATAGGAT TACGAAATTT AGATTTATTA TTAGGTTTAG AAAATCGTAT  7400
TACTTTTACA GCTATGGATA TTATTGAAGG GCGTTGTCGA TTAGACCAAG  7450
CACTTGTACG AGAAAAGCGA TGGAAGAATT TAGCATTATT AGCAGTTTCT  7500
CGAAATTACC AAAAATATAA TGTAACACAA CAACATATGC GACAATTGGT  7550
TTCTTCAATA AAAAAATTAG GTTATCAATT TATATTAATC GATTGCCCAG  7600
CAGGTATTGA TGTTGGTTTT ATAAATGCTA TAGCGCCAGC ACAAGAAGCT  7650
ATTATTGTAA CAACTCCTGA AATAACTGCT ATTCGTGATG CTGATAGGGT  7700
TGCTGGTCTA TTAGAAGCAA ACACAATTGT TGATACTAAA TTATTATTAA  7750
ATCGTGTTCG TATGGATATG ATTCAAAACA GCACTATGCT ATCAGTAATG  7800
GATGTCCAAG AAACATTAGG TATTCCTTTA TTAGGTGCAA TACCTGAAGA  7850
TACTAATGTA ATTATTTCAA CTAATAAGGG AGAACCACTA GTATTAGATA  7900
AAAAATTATC TCTGTCGGGT ATTGCATTTG AAAATGCTGC TCGAAGGTTA  7950
ATAGGAAAAG AAGAATATTT CATTGATTTA GATGTACCTC CAAAAGGTGT  8000
TATTCAAAAA ATACAAGATT TTTTTTTAGG AGAAGACTAA TTCAAATTAG  8050
TTTTTTATCA AATATTTTAA TATACTAGGA GGAGTTTTAT CTTTTAGTAT  8100
ATTATACTAA CATAAAATAT TTTCAAAGAA AAATTTACAA AAAAAAAATC  8150
TTGTATTTTT T                                            8161

<210> SEQ ID NO: 153
211> 6689
212> Nucleotide Sequences
213> Chlorella protothecoides
<400>
ATGGGGTTT TATAACCTTT AAAAAATTTA AATAATTCTA TATTTGTACT    50
TGCTAGAAAA AGTTTGTTTA AGTTAATATC ATTTTCTAAG GATAAAAAAA   100
TATAAGAAAG ATAAAAAAGA TTACCAAAAT AACTCAATTT GTTATCTGAC   150
AATTTATGAT TTAAAGAAAT ATTATATGAT TTAGATTTGT TTAATTTTTC   200
TTGTAAAACC TTAAAGTAAA ATAAATTAAT AACATTTTTT TTGAAAATTT   250
TATTTGGTTT TACTTTTTTT TGAAGTCCAT TAAATTTTAT ATATTTTTTT   300
GAGAGAAAAA TAAAACCAGA GCTTTTGATT GACCAATTTT TAGGATACCA   350
AACAAATTTA AATTTTCTGT TTTTATGAGA GATAGTATTA CTCCAAAAGA   400
AGACAGATTT GTTTTTATTA TAAGAAGTAG AAGAATATGC TGTTTTGTGA   450
AACCGTATTA TATTTGTTGG TATATTTAAA ATAGGACGTT CTATAACAAT   500
TTTTGCATGT ATTCGTTTTA ATTTAGCTTG AAACAAAACA TGAAAATTTA   550
CTTGAAAAAT AGGACTTTGT GAGGATATTA AATCACCTAT TTGTAAAAAG   600
GTATTTGTCG AATATGGTTC CCTTTGGTTA TGAGCATGAA AAACCCAAAA   650
ACTACCAACT TCTAATAAAG AATGTGTCAT CGGAACAACA CCTTTAATTG   700
GAATTAGTGT TTTACTTGAA TTTAGTAAGT AGTTATTAAT TAATCCTTTA   750
GATTGAGATT CTGTTTGAGA TAAAAAACTA TTTTGTTGTA TATTATCAGT   800
AAAAGATACG TTAAATTTAG AAATAGGTTG CGATACAGAA ATTTTCATGT   850
ATTCAAACCT CACCTCACCC TCTATTTTTG ATTGGATTGG GTGTAACGAT   900
TCTGGTAAGT TTTGATTACT TGTTTTTATC TGGGGAGTTT GAGCAATAAT   950
ATCATTTATT TGAACATTTT GTCCTTGTCT AACAAGTAAA ATACTTCCCG  1000
GAGGCAATTG ACTTTCGTAT AAAGTAAAAA ACGATTTATT TGATTTTTTC  1050
GTTTTAATTT CTAAAAGAAT TCGTGTTGGA TCTATACGAT TATATTTAAT  1100
CATATATACA ATTTTTCCAT GAGGAGTTCT AATAAAGCTA CCAGTTATAT  1150
TTTCTGGAAA ATTTATTATT CCCGACGAGG GCGCTTTGAA TGTTTTTAAT  1200
GATTCATCGG AAAATACCCC AACGCCTCCT GTATGAAACG TTCTCATCGT  1250
TAGTTGTGTA CCCGGCTCAC CAATAGATTG AGCTGCAATT ACACCAACAG  1300
CTTCCCCAAT ATTAACTAAA TCACCTTTGG CTAAATCCCA GCCATAACAT  1350
AATTGACAGA TTGATTTATA TGATTGACAA GTTAATGGTG ATCTTATATA  1400
AATTTTATCT TGATTTGCGT AAATTTGTTT TGCTATTGTT GAAGATATAA  1450
TTTGATTTTT TTTAATAAAA AATGTTTCAT TTTTATTAAA ATCAAATTTT  1500
TTATCATCTA AAAAAAAAGG GTAATAGTTA TTAATCTTTT TTCTATCTAA  1550
AAAATTTTTA TTAATTTTTT TTAATAAAAC CCGCCCAATT AACCGTGTTT  1600
CGTTATCTGG ATTATTAATT AAAAGTCCTG TGCCAGTTCC ACAATCTGCG  1650
CTAGAAACTA CAATATGTTG CACCGTATCA ACTAGCCTGC GTGTAAGATA  1700
ACCTGAAGTT GCTGTACGTA AGGCTGTATC AACTAATCCT TTTCGAGCAC  1750
CGTAGCAAGA TATTAAATAC TCTGTAAGTG TAATACCTTC ACGAAAATTA  1800
CTTTGAATTG GAAATTCAAC AATGGCCCCT TGTGGATCAG CCATTAATCC  1850
GCGCATCCCT ACTAATTGTC GAACTTGCGA AACATTACCC CGAGCCCCTG  1900
AAAAAGCCAT CATAGATAAG GAATTTAAGG GGTTTTTATA CTTAAAATTT  1950
TCAATTACAT TTTCTCTTAG TAATTCACTT GTGGTATTCC AAATATCAAT  2000
TGTTCGTTGA GATTTTTCAA TACTAGTGAT GTTTCCAGCT AAGTTTTCAT  2050
ATTCTGTTGT TTGGAGTGCA ATATTTGTTT GTAATAAAAA CTCACCTTTT  2100
CTATTTGGAA TTTTTAAATC ATCTAATCCT AATGAAATAC CAGCTTGGGT  2150
TGCTTGAGAA AATCCTAGAT GTTTTAGTTT TTCCAAAAAC TCAAGAGTAT  2200
TCTTTTCTCC ATATTCATCT AAAAAACATG TTATTAGTTT TTTAAGTTTT  2250
CCTTTATCAA AAGGGCCATT AAAAAAAAAT GAATTATTAA TTTTAGATTG  2300
TAAATTACTA AATTTAGAAT TATTCATTTT GAAATATTTT TTTTTAATTT  2350
TACTCTTTTT TAAACTTAAT ATTTTAAAAA AAAATTGTTT GTTATTGTTT  2400
TGATTCATAA ACACATTAAA AATTTAGATT TAAAGCTAAT TTATAAAAAA  2450
AAAGACGACC CGGTGTTGTT CGAATATAGA GATAAGTTGA AGTTGTAGTG  2500
GTTGTTGATT TATACCTATT CTCTCGAACT AAAGTTGTAT TTCCAAAAAA  2550
ATCAATCTGG TGTTCTAAAA CTAATTCAGA AGCTTGAGCT TCATTTTGTA  2600
ATATTAAGTT TTGAAAAGGT TGAGACCAAT TTAACCATAT AGGTGTATGC  2650
ATACTTAAAC TGCCTTTTTC AAATGCTTGA TAAACATCTT GAAAATTATA  2700
AAAATGATAT TTAGTTATTT GTGCAAGTTG TTTTTTAAAT ACTTCAAAAT  2750
TATTCAATAA ATATAAGTTT TTAGAATTTT TATTTAAATT TAAATTTGAA  2800
TTTGTTAGTG AAGCCGAAGA TGTTAAATAG TAACATCCCA ATACCATATC  2850
TTGAGCAGGT AAAAGCAAAG GTTGACCAGA TGATGGTGCT AAAAGATTGT  2900
TTCTTGCCCA GATCAAATTA AGTGCTTCTG TACGTGCTAA TGTCGACAAT  2950
GGTATATGAA CTGCCATTTG ATCTCCATCA AAATCCGCAT TAAATGCAGG  3000
ACAAACAAGT GGATGTAATA AAATAGCTTT CCCATTAATT AATTTCGGGA  3050
AAAAAGCTTG AATCCCTAAA CGATGTAATG TAGGAGCTCT ATTTAAAAGT  3100
ATAGGGTGTC CTTTCATTAT TTTTTTTAAT ATATTCCAAA TTTTTGGGCG  3150
ATGTTCGTCA ATAAAAGTTT TAGCTGAAGT TGTTGTAATT GTAATATTGT  3200
TGCTTTTTAA ATTTTGGATA ATATAAGGTT GAAATAATTC AATTGCCATT  3250
TCTAATGGTA AACCGCACTC ATAAATTTTT AATTCCGGGC CAACAACAAT  3300
AACAGAACGA CCAGAATAAT CGACACGTTT TCCTAATAAA TGCTGACGAA  3350
AACGTCCTTT TTTTCCTTTT AGGGAGTCTA TAAGGGCTTT TGTGTTTTTT  3400
CCTTGTTTTG AAAAACTTGA AAATTCATTT TGGTTTGAAT CAACATTTCC  3450
GGTTTGTAAT AAACTATCTA CAGCTTCTTG GAGTAAACGT AAATTAAAAC  3500
ACCATAAATT CCAACTACTA CTTAAAGCAG GATCAAATAC AGTAAATTGC  3550
ATTTTATGCG GAATTCGTTT GTTCCGTGTT AAAACTTTGC GATAAAGGCT  3600
ATTAATATCA GAAATAAAAA TTTGTCCACC TAATTTTGTA ACTGGGCGTA  3650
AGTCAGGTGG TAGTACTGGT AAGCAACTAA GTACCATCCA GGCTGGTTGC  3700
ATATTATATA AATAAATTTT TCGAAAATAA CCTAATCTTC TTAAATATTT  3750
TTTTCGGAAT TGTTTTAAAC GATTCCATTT TTTTCGAAGA TCACTGTGTT  3800
CTTTTTCATA ATTGATACTT AAATAATAAT TCATTTGAAT TTTTATTAAA  3850
TTAAGCACAT TGTTTATTTT TTCATATTCA AACTCAAGTT GTCTTTTTAA  3900
GTTTATTGGA TTGTAATGCG CTAATATTTT TTCAATAACA CTACCACCTG  3950
TTTGAATTGG AGATGATTGA AATTGGTATA ATTCTTGTTT ACTATTTTTG  4000
TTAGATTTTT TACGAATTTG GTGTGGTAAA TTTTGGTTTT TAATTTTTTT  4050
TGTAAAAAAA TAATATGGAA TAATTGATTC ATTAGGAGAT GGCTGTATCC  4100
AACTATAAAA AATAAATACT TGAAAATGCT CTACTTGATC CCACGTCATA  4150
TCATAATTTA TACCATATAA AAAAAGCTCT TCCGGTAATG TTCTTAATTT  4200
GTAAATTTTT TTTCTTTTTA TCGCATTACT TCTTTTCAAC AAGGAATTTT  4250
TTAGAGTATT GTTAAATTTA GTAGAACGTA ATTCACGAAA TAATTTCGGT  4300
GAAGAAGACG CTAAATTTGA TTTATTATCA GACGTTACAT TTAATTTTAA  4350
ATGATGGTCG CTTTTCTGAG GATTTTTTTT TAAGGTTGTT AATATATTAA  4400
ACCAAGATGA AAAATTTAAA AAATGTGGTG ATAAATGACA AAATTCTGTA  4450
GCAAATATAA TACTTTGCAA TCGTTTATTT GACCAATTCA TACAAGCCCC  4500
CAATGGCGAG GGCTTGTATG AAGCATATAA AGGATGTACT ACCGATTGTT  4550
TTAATTTTAT ATGTCCAAAT CTATAACGGC GTACTCTAGA TAAAGTACGC  4600
TCCACGCCGC AAATATTACA AATTTTAACT TGTGGAATTT TAACTTTTTT  4650
ATCACAAGCA CAAGTGTAGT CTTTTATAGG GCCAAATATT TTCTGACAAA  4700
ACAACCCTCC CATTTCGGGT TTTAATGTTT TGTAATTAAC TGTTTCCCAA  4750
CTACTAACCT CTCCAACAAT AGTACCATCA AGTAATGAAC GTTCACCCCA  4800
TTCTTGAATT TCTCGTGGCG AGGTTAATCC AATTTTTACT TTTTTAATTC  4850
CATTAATTTT TTTTTTTACC ATAACTTTAA AAAGAACATA CAAATATTTT  4900
TTATTTATAT TTGCTTTGTT TTTAATACAT AAAAATTAAA ACATAAAGCA  4950
CAAAATTTTA AGAATATTAT AAACTTAAAT TTTTCTAAAA TATTGATGAA  5000
AAACTATTAT ACTAGTTTAC CATCTAAATC GTTTAAATCG ACTAAACAAG  5050
TTTTTTTCGT ATTATAAGAA GGTGCGAAAA TTTCTAAACT AATACAAAGT  5100
ACTTGAATTT CCCGTAATAA TAATTTAATA CTTTCGGGTT GCTTCATTTC  5150
AATAGGTTTA TTATAAAAAA TTCGCAAAAC AGCTTCATTT CTACCATCTA  5200
TATCATCCGA TTTTATAGTT ATTAGTTCTT GCAAAGTATA AGCAGCACCA  5250
TACGCTTGTA ACGCCCAAAC TTCCATCTCA CCTAAACGTT GTCCCCCATT  5300
TCTTGCTCGT CCTCGAACCG GTTGTTGAGT TAAAGCGGAA TAAGGTCCTG  5350
TTGCCCGTGC ATGAATTTTA TCATCAACCA TGTGAACTAG TTTTAACATA  5400
TAGGTATATC CTACTGTTAC TGGCTGTTGA AAAATAATAC CTGTACGTCC  5450
ATCAAAGAGT TTTATTTTTC CGGGATGTTC AGGCTCAAAT AACCAAGAAT  5500
TTTTTGATTT TAATGAAGCT TGATAAAGTT TTGAATATAC AAAACTTCTT  5550
GAAGCTTCAG TTCCAAATTT TTCATCAAAT AAATTTACTG TAAAAGACTC  5600
ATTAAGGTAT TTTCCAGCCA AACCTAAAAG GCACTCTAAA ATTTGTCCTA  5650
CATTCATTCG AGATGGTATT CCTAAAGGGT TTAGAACAAC ATCAATAGGA  5700
GTTCCATCTG GTAAATAAGG CATATCTTGA ACAGGTAATA TTTTTGAAAT  5750
AACACCTTTA TTACCATGAC GACCAGCAAT TTTATCCCCA ACTTGAATTC  5800
TTCGTCGCTG TAAAAGAGAT AATTTTACAC ATAAAATAGA ATTTTTTGGC  5850
GCTAAAAATA AAATGTCATC TTGTTTTGGA GGTAATATTT CAACATCAAC  5900
TATAAGGCCT TCTACTCCTT TTGGTACACG AAAACTAGTG TTTCTAACTG  5950
GTAATTTTTC ACGTTGTATA ATAGCATTAT AGAGTTTCTC ATATTGTGCA  6000
GAAATACCCT TTAACTGTAC ATTACTCGTT TTACCAACTA AATAATCCCC  6050
TTCTTTAACC CATGTTCCTT TTTTTATAAT ACCTCTGGCA TCTAAGTTCA  6100
ATATTTGAGA AATTTCTTTT GATGAATTTA AAGGTATATC GTTTGTTATT  6150
GTTTCTAACC CTGCTGCTGT TTTCTGAACC TCCACTTGAT AATAATCAAG  6200
ATGAAGAGAT GTAAATATAT CTTGAGATAC TAAATTTTCG CTTATTAAAA  6250
CAGCATCTTC AAAATTATAA CCTTCCCATG GTAAGTAAGC TATAAAAACA  6300
TTTTTCCCTA TCGCTAATTT TCCTTTATCA CTCGCTCCAC CGTCTGCCAA  6350
AACAGTTGAT TTTTCTACCC AATCAGTTTC TGCAACAATT GGTCGTTGAA  6400
GAGTACATGT GCTTTGGTTG CCGCGTTTAT ATAAATTTAA GTGGAAAAGT  6450
TTATAGTCTA TTTTTTTCCT TTTATTATTT AAAAAAAGTG GTTTGGGCTT  6500
ATTTTGATTT ACCTTTTCTT TAATTAATAA AAAATTCCCT GCACTTAGAA  6550
AAAAATCTTT TGTGTTTAAA TTATTTTTTA AATTATTTAA ACTGATTTTA  6600
TTTTTTTTTA TTGTTTTACT ATAAGAAGTA AATTGATTAG AATTAATGAA  6650
ATTTGAAAGA AAATTATTTT TATTCCTTTT TTGAAACTT              6689

<210> SEQ ID NO: 154
<211> 6645
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
ATAAATAATA AATATTTTTA AACAAATATT AAAGCNNNNA NNTCAGTNNN    50
NNAAGNNNTN NNTNTTNNNG ANANNNCGNN NNCNCGNGNT CNNTNCNCNC   100
NNNNNCTNNA NNANNNTNTN NCNATNNNNN NCGGAGAAAG AGGGATTCGA   150
ACCCTCGGTA TGTTTTATCA TACGACGCGT TAGCAATGCG TTACCTTCGA   200
CCACTCGGCC ATTTCTCCTG GTATGTAGTA TTTATCATAT TATTATCAGC   250
TAATAAAAAA ATAAAGATAT GTAAAAAAAT TTAGTTTTTT ATAAAATAAT   300
GCCCCCCCAT TGTGGGGAGG AGCATTAAGA AAGTTAATCT AATGGCCGCA   350
TTGATAATAC AGGCTCATTA TCACGATCAA GTCCTTTTTC AAATCCTGCA   400
GCTGCTGCAC GAGCACGTCC AGCATGCCAT AAATGCCCAA TAAAGAAAAA   450
GAAACCTAAA CAAAAATGAG AAGTAGCTAG CCAACTTCGT GGTGACACGA   500
AATTTACAGC ATTGATTTCT GTAGCTACAC CACCTACTGA ATTTAATGAT   550
CCTAATGGAG CATGAGTCAT ATATTCAGCA GCACGTCGTT CTTGCCAAGG   600
TTGGATATCA TTTTTTAATT TATTTAAATC CAACCCGTTT GGACCACGTA   650
AAGGCTCTAA CCATGAACCA CGGAAATCCC AAAAACGCAT AGTTTCACCA   700
CCAAAAATAA TTTCTCCAGT TGGTGATCTC ATTAAATATT TACCAAGACC   750
AGTAGGTCCT TGTGCAGACG CAACATTAGC GCCTAAACGT TGATCTCGTA   800
CTAAGAAAGT AAAAGCTTGT GCTTGTGAAG CTTCAGGTCC AGTAGGACCG   850
TAAAATTCAC TAGGGTATGC TGTATTATTA AACCAAGACA TACAACAAGC   900
AACAATACCC ATTATTGAAA TAGCACCTAA GCTGTAAGAA AGGTATGCTT   950
CACCAGACCA TACAAAAGCT CTACGTGCCC AAGCCCAAGG TTTCGTTAAA  1000
ATGTGCCAAA TACCACCAAA AATACATAGA GTACCAATCC AAATGTGGCC  1050
ACCTATAATA TCTTCAAGGT TATCAACACT AACAATCCAA CCATCACCAC  1100
CAAAAGGTGA TTTTAAAATA TAACCAAAAA TAACACCCGG GTTTACAGTT  1150
GGGTTAGTTA CTAAACGAAC ATCACCACCA CCTGGAGCCC ATGTATCGTA  1200
AATACCACCA AAATAAAGAG CTTTTAATAC TAAAAGCCAA GCTCCGATAC  1250
CTAGAATAAC TAAATGAATA CCTAAAATAG TAGTCATTTT GTTTTTATCT  1300
TTCCAAACGT ATCCAAAAAA AGGAAAAGAT TCTTCTAATG TTTCAGGTCC  1350
AATTAATGCA TGATATATTC CCCCAAATCC TAATACGGCA GAAGAAATTA  1400
AATGTAAAAC TCCAGATACA AAGTACGGAT AAGTATCTAA AATTTCACCA  1450
CCTGGACCAA CACCATAACC TAATGTAGCA AGATGGGGTA AAAGTATTAA  1500
ACCTTGCTCG TACATTGGTT TTTCTGGTAC AAAGTGAGCA ACTTCATATA  1550
AATTCATTGC CCCTGCCCAA AATACAATTA ATCCTGCGTG TGCAACGTGT  1600
GCACCTAATA ATTTACCAGA AAGGTTAATT AAGCGGGCAT TACCTGCCCA  1650
CCATGCAAAA CCAGTAGATT CTTGATCACG TCCACCGATT ACCAAACTAC  1700
TATTAAAGAG CGTTTCCACG TGGTAAAACC TCCTCTGGGA ATACTAATTT  1750
TTCATGAGGC TGATCTTGAG CAGCCATCCA AGCACGAATA CCTTCATTCA  1800
AAAGAATATT TTTTGTATAA AAAGTTTCAA ATTCTGGGTC TTCAGCAGCA  1850
CGAATTTCTT GTGAAACGAA ATCGTATGCA CGTAAATTTA ATGCTAAGCC  1900
TACTACACCA AAAGCACTAG TCCATAAACC AGTTACTGGT ACAAATAACA  1950
TAAAGAAATG TAACCAACGT TTATTAGAAA ACGCTACACC GAATATTTGT  2000
GACCAAAAGC GATTTGCTGT TACAAAAGAA TAAGTTTCTT CTGCTTGAGT  2050
TGGATTAAAC GCACGAAATG TGTTTGCACC GTCGCCATCT TCAAATAGAG  2100
TATTTTCTAC TGTTGCGCCA TGAATTGCAC AAAGAAGCGC AGCTCCTAGA  2150
ACACCAGCAA CACCCATCAT ATGAAAAGGA TTCAAAGTCC AGTTGTGGAA  2200
ACCTTGGAAG AATAAAATAA AACGGAAAAT AGCTGCAACA CCAAAACTCG  2250
GTGCAAAAAA CCAACTAGAT TGACCTAATG GGTAAATTAA AAATACAGCA  2300
ACAAAAACTG CAATAGGTGC AGAAAATGCA ATTGCGTTAT ATGGGCGAAT  2350
TTTAACTGAA CGTGCGATTT CGAATTGACG AAGCATAAAT CCAATTAAAG  2400
AAAAAGCACC GTGTAGTGCA ACAAATGTCC AAAGACCTCC AAGTTGGCAC  2450
CAACGAGTAA AATCACCTTG AGCTTCGGGA CCCCATAAAA ATAATAAAGA  2500
ATGCCCCATA CTATTAGCGG GTGTAGAAAC AGCAACAGTT AAAAAGTTGC  2550
AACCCTCTAA ATAAGAAGTA GCTAAGCCAT GAGTATACCA AGAACTTACA  2600
AATGTTGTTC CAGTTAACCA ACCACCTAAT GCTAAATATG CAGTAGGGAA  2650
TAATAATAGA CCTGACCAGC CAATAAATAC AAAACGATCA CGACGAAGCC  2700
AGTCATCAAC GATATCAAAC CAACCACGTT TTTCTTGATT TTTTCCAATT  2750
GCTATAGTCA TAACGAGACT ATCTCCTTGT AATAATTTAT GCAAATTATT  2800
AAATAATTTG CACAAATTTT GTTAAAATTA AATATATATT TATTAAAAAT  2850
ATTTTTAATA AATATTTGTT TTTTTGTTTA CGCTATAAAT TAGTATAGTA  2900
TGTTTTTAAA TTTTTTCAAG TTACTAGATT TTAGTAAATT TAATTAAAAA  2950
CTAATTACAA TTTTAACTGA TCACCACGAC GATATTGTAA GTACGCGGTA  3000
ACAAACAATC CTATAATTGT TACTGGAACT AAGCCAAGAA CAATCCCTGA  3050
TAATAATGGT TCTACCATAT TTAAAATTTC TCCAAACCCG ATAAATTAAA  3100
TAATTATTAA TAAAAAAATA ATAATTTTTT TATATTNNNN TAATTTTTAC  3150
TAAACTATAA TAACATATTA CTGTTATTAA NGAAGCTCCA ATTAATAAAG  3200
CGAGATAACT AAAAATTGTA ATCATTTTTT TAATAAAATT GATTTATGAA  3250
AGAATAAATT TTTTAAATTA ATTAAAAATT CATTTCAGAT AATTGAACTT  3300
TTTCGTATTG TTTCTTTTTT AATACAAAAA ATACTTGGGC AAATAAAACT  3350
AAAATAAAAA AAACTAATAA ACCGATAATT CGAGTTGGGT TTTGTAAAAC  3400
AATTTCTGTT TCACCTTGCC CAAAACCACC GACATTCGGG TTACTTGTTA  3450
ATGCTTGATC TAGCTTAACT TCATTACCAG TTTTTACAAT TAACTCGGGA  3500
CCTGGCGGTA CTGTATCTGT AATAATATCA CCATTTGCCG TTTGAATACT  3550
AATTAAAAAT CCACCTTTTT TCTCAATTGG TTCTATACTA GTGATTTTTC  3600
CGGCGCTAGA TGCAGTATAA ATAGTATTAT TACTTTTTGA GCCGTCTGGA  3650
TAAATTTGAC CACGACCACG ATTTCCACCA AAATAAATAG GGTATTTTAA  3700
ATACGATACT GATTTATTTT TAGTCGGATC GGGTGATAAA ATAGGAAAAA  3750
CAAGTTCTTT ATATTTTTTA CCTGGAACTG GCCCAACAAC TAAAATATTA  3800
GGTTTTTCCG CATTATATGA TGAAAAATAT AATTTCCCCA TTTTCGCTTT  3850
AATTTCTTCT GGTATTCTTT CCGGAGGTGC TAACTGAAAA TTTTCTGGTA  3900
AAATAAGCAC AGCGCCTACA TTTAAATCAC CTTTTTTACC GTTCCCTAAA  3950
ACTTGTTTAA TTTGTTTATC GTAAGGTATT TTTGCAACTG CCTCAAATAC  4000
TGTATTTGGT AAAACAGCTT GTGGTACTTC AATTTCGATT GGCTTTTGGG  4050
CTAAATGACA ATTCGCACAC ACAATTCGCC CATTTGCTTC CCGAGGGTTG  4100
TTGTAATTTT GTTGAGCAAA AATAGGATAG GCATTACTTG GAGCACTAAC  4150
TAGAAAAGTA ATACCCAAAA GAAATGTTAA TTTAATACAA AAATTAATTG  4200
GTAGTTTAAA TAGATAAATG GAATACTTCA TAAAATTGAT TCTATCTGTT  4250
TTTTAAATTA AAAAACATAA AAAATAAAAT TTTCTTCTCT GCATTTTTAT  4300
AATTTATAGA TATTTTAATT TCTNNNNNNN NTNNNNNNNN NCATNNNNNN  4350
NNNAAAAAAT ACTTTTAATT GAATTAATTT TGTAAAANAN NTANNNTANN  4400
ANNAATNAGA AATTAAGAGT TTTTTTATAT ATTAAGAACT ATAAAAAATC  4450
AGAATATTAA CAATTAAAAT AAAATTATTC TCTGATCATA TTATGATTTC  4500
AACAGATACA GATACAACTA ATTTAGAACA AATAAGACCC GTTTTTCCTT  4550
TTACCGCTAT TGTTGGTCAA GATGAAATGA AACTAGCTCT TATTTTGAAT  4600
GTAATTGACC CAAAAATTGG GGGCGTTATG ATTATGGGTG ACAGAGGAAC  4650
AGGAAAATCT ACTACTGTAC GCGCGCTAGT CGATTTATTA CCAGAAATAG  4700
AGAGTATAGA AGATGATCCA TTTAATTCAG ATCCGCATGA CCTTGAATTG  4750
ATGAGTAAAG AAGTTCGAGA AAAAATACAA AAAAAAGAAT CATTATCTAT  4800
CATTAAAAAA AAAATTTCGA TGGTAGATTT GCCGTTAGGG GCCACAGAAG  4850
ATCGTGTTTG TGGTACAATC GATATTGAAA AAGCTTTGAC ACAAGGTGTT  4900
AAAGCTTTTG AACCGGGTTT ATTAGCAAAA GCAAATCGTG GTATTTTATA  4950
CGTCGATGAA GTTAATCTTT TAGATGATCA TTTAGTAGAT GTTTTATTAG  5000
ATTCTGCAGC TTCTGGTTGG AATACCGTGG AACGAGAAGG TATTTCAATT  5050
AGTCACCCAG CTCGTTTTAT TCTCGTTGGC TCTGGAAATC CAGAAGAAGG  5100
TGAATGTAGA CCACAACTTT TAGATAGATT TGGGATGCAT GCTCAAATAG  5150
GAACTGTTAA AGAACCCCAT TTACGTGTGC AAATTGTAGA ACAACGAGCT  5200
GATTTTGATG CATCACCTTT AAAATTTAGA AATATTTATA AGCAATCACA  5250
AGAACAACTA GCAAATCAAA TAATTAAAGC TAGAGAGTTG CTACCGGAAG  5300
TTCAAATTGA TTATGATTTC CGTATCAAAA TTTCGCAAAT TTGTGGAGAA  5350
TTGGATGTAG ATGGTTTACG CGGAGATTTA GTAACAAATC GAGCAAGTAA  5400
AGCTTTAGCT GCTTTTTATG GTCATACTAA CGTGAATGAA AAGGATATTT  5450
TTAAAGTAAT TCCACTATCT TTACGTCATC GATTAAGAAA AGACCCGTTA  5500
GAATCTATAG ATTCTGGGGA TAAAGTACGT GAAGCCTTTA AACGTGTTTT  5550
TGGGTATAAT TAAATAACTT GACAAATCTT TTTATTGATN AAGTANNTNA  5600
ANNNNTNAAA AGATTTGTCA CGGGCTTATC GTCTAATGGA TAGGACAGGA  5650
ACCTTCTAAG TTTCTAATGT AGGTTCAATT CCTACTAAGC TCAAATTCAA  5700
TTTTAAAAAA ATTTAATAAT TATTAATAAA AATATGTTAT AANNNNNNNN  5750
TCTGTAGCCT ATAAATTAAA AACTAATTTT TAAATACATG TTAACTTTAA  5800
AAATTTTTGT ATATACTGTT GTAACATTTT TTGTTTCACT TTTCATTTTT  5850
GGATTTTTAT CTAATGACCC GGGCAGAAGT CCAGGGCAAA AAGATATTGA  5900
TTAATTTTTC TATTTTTTCA AATTCAACAT TTATTAATAT ATAATTTTTT  5950
GTATTAACTA GCTAAAAACT AGTTAATACA AAAATACTAT GATTTAAAAT  6000
ATGCCACGGT CACAACGAAA CGATAATTTT ATTGATAAAT CTTTTACTGT  6050
AATTGCTGAT ATATTATTAA AAATATTACC TACTTCTAAT CGCGAAAAAC  6100
AAGCATTTAG CTACTATCGA GATGGAATGT CAGCTCAGTC TGAAGGAGAA  6150
TACGCAGAAG CTTTGCAAAA TTATTACGAA GCTTTGCGTT TAGAAACAGA  6200
TGCGTATGAC CGAAGTTATA TCTTATATAA TATTGGATTA ATCCATACTA  6250
GTAATGGTGA TCACGCGCGC GCTTTAGACT ATTATTATCA AGCATTAGAA  6300
CGAAATCCCG CACTACCACA AGCTTTAAAT AATGTGGCTG TAATTTTTCA  6350
TTATCGGGGT GAACAAGCTA TAGAGATAGG GCAAATAGAA ATATCTAAAT  6400
TGCTATTTGA TAAAGCTGCG GACTACTGGC GTGAAGCTAT TAGAATAGCG  6450
CCGACTAACT ACATTGAAGC ACAAAATTGG CTTCAAATGA CTGGTAGAGG  6500
TTAAATTATG ATGCATTACA CATAATATAG AATAAATATT CTAAGTTTAT  6550
GGAAAACTTA AAATATTTAT TCTATATTNT NNNTAAACTT AAAGTAATAA  6600
AGCCTACTTA ACTCAATTGG CAGAGTATCG GTTTTGTAAA CCGAA       6645

<210> SEQ ID NO: 155
<211> 5843
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
AATTTCTTTA AGATTTTCAA TAATGTCAAA TAGAGTCTCG TGAACTCCTG    50
GAATAATATC AAATTCGTGG CGAGTTCCTA AAAAAGTTAT ATCTGTTAAA   100
ACTAAACCTG GAACACCTGC TATTAAATTC CGTCGAATGG CATTTGAAAA   150
ACTTAATGCA TGGTCTTTTA AAAAAAGACC TAAATGAAAC CGAGCGTACA   200
TTTGCCCAGA ATCTAATGTT TTTGATTCTA CAAACGAAAT AAATTTTTTT   250
TCTAATTTTT TTTTATTCAT ATAAATTATA GTTATTATGG TTTTTAGCGT   300
GTAAAAATTT ATATGTATAT TTTATATTTA AATTCTACGC TTTTTAGGCG   350
GTCGACAACC ATTATGTGGA ATTGGTGTAA TATCACGAAT AACAGTAATA   400
GTTAATCCAG CATCACGTAA TCCCCGTAAC GCTGCTTCTC GACCAACCCC   450
AGCCCCTTTA ATCCTAACAA TAAGTTCTTT CATACCTTGA CTAACGCAAA   500
TTTGTGCCAC ATTTTCCGCA GCTTGCTTTG CAGCTAAAGG AGTTTTTTTT   550
CGAGCACCTT TAAATCCAGA TGTACCTGAA GACGCCCAAG CTAAAACATT   600
ACCTTTAGGA TTTGTAATTG TAACTATAGT GTTATTAAAA GTCGATTGAA   650
TATGAGCAAC TCCCCGTATT ACTTTTTCTT TAAATTTTTT TTTTGAAATT   700
TTTTCAACTT TTTTTGGCAT ATTTATATAT TTTTTTTAAT AATTTAAAAT   750
TATTATCAAA TAGAATTATT AATAATAATT TTAATGTAAT ACAAAATTTT   800
ATACAATAGT ATATCATTTG AGTTTCTCAG AACTAAATGA TATAACGCTT   850
TAACCTTGCT TTTGCTTGTG TTTTGGGTTT TGGCAAATAA TACGTAAAAC   900
TCCCTTACGA CGAATTAAAC GACATTTTTC GCACATTTTT CGAACTCCAG   950
GACGAACTTT CATAAATTTA AAAATTGTAA TTGGTTTTAT TTTTTTTTTC  1000
CCGACCGAAA GCGATGAGTA ATACGCCCAC GTTTTAAATC ATATGGGCTT  1050
AATTCCACAA TAACCCGATC TCCTAATAAT ATTCGAATAT ATCTTTGTCG  1100
AATTTTGCCC GAAATGTGTG CTAATATTAA AACCCCATTA TCTTTTAATT  1150
TAACCCTAAA AAGACCATTA GATAAACATT GGGTTATCAC ACCTTCCATC  1200
TCTAAAGCAT CAAGACGTGA ACTTTTATTT AAATACACTT TAACCTCCGA  1250
TTATTTTTTT TATAAGGCAA CAAGTATTAC TGATTTTTTA TATTAAAAAA  1300
AAGCTAAAAA ATTAGGATAA TTTAAATAAA ACTAAATTTA TTTAATTTAC  1350
CATATAGAAC AAACTAATTC ACCACCAAGG TGAGAGTTTC GAGCTTCTTT  1400
ATCTGTCATT AATCCTTTTG AAGTTGAAAT AATAATAATT CCTAACCCAC  1450
CTAAAATTTT AGGAATTTCT TTATAATTTG AATAAATTCG TAAACCGGGT  1500
TTACTAATTC TTTTTAAATT TGTAATACAA GGTTTACCAG CAGAGCCATT  1550
TGATATTAAA CCCGTTTTCT TATATTTAAG ACTAACGATT AATTCGTTTT  1600
TGTTTATATC TAAAACGGAA AAACCAGCTA TAAAGCCTTC TTGTTGTAAT  1650
ATTTCGCAAA TTTTTTGATT TATTTTGGTA TTAGCAATTA ATACTGTAGG  1700
TTTAGATACT AAATTTGCGT TTCGAATTCG GGTCAACATA TCACTAATTG  1750
TATCAGTTAT CATATTTTTG AATTTTTTTA ATAAATATTT AAAAAGGGTT  1800
TAAAGATAAT TATTTAGCGA TCAACTTAGA TGATTTTTAT TTTTTAGATT  1850
TATCTTTAAA AGGCATGCCA AATGCTTGTA ACAAACTCAT AGCCTGCCCA  1900
TCTGTTGTGG ATGTAGTTAC TATTGATATA TCCATACCTC GAATTTGATT  1950
AATTTTATCA TAATCAATTT CAGGAAACAT AAGTTGTTCT TCTAAACCTA  2000
AATTATAGTT TCCATGACCA TCAAAACTTT TAGGATTAAC TCCTTGAAAA  2050
TCGCGTATAC GTGGTAATGC TAAATTAATT AACCGGTCTA AAAAAGCATA  2100
CATACGGTCA CCTCGAAGGG TAACAACAAT ACCTACTGGC ATTTTTTGTC  2150
GAAGTTTAAA AGCTGCAATT GGTTTTTTTG AACGTGTAAT TACACCTTGT  2200
TGTCCAGTTA TACTGCAAAT TTCTTTATTA CAAGATTCTA AAAGTTTTGC  2250
GTTTTGAGAA GCATCACCTA GACCACGATT AATTATAATT TTTTTAATAT  2300
GTGGTACTTG GTGTTTGTTT TTATATTGAT ATTCTTTAAA AAGTTGGGGA  2350
ACTATTTCCT TTTGATAATA TTGTTGAAGT CGTTGTACCA TATTAATTTT  2400
TTTATAAATT TTAATTAAAA GAGAATATAA GAATACTAAA ATAGCGAATT  2450
AAATAAATAA TAAAAAAAAA ATAAAATTTA TAAAACTTCT GGTGCTAATG  2500
ATATAATTTT TGTAAAATTA CCATCACGTA ATTCTCGGGC TACTGGCCCA  2550
AAAACTCGAG TACCGCGAGG ATTTGATTCT TTATTTATAA TAACGGCCGC  2600
ATTTTCATCA AACCGGATAC TTGTTCCGTT TTTTCGACGT ACATTTTTTC  2650
TTGTCCGTAC AATAACAGCA CGAACAACAT CTGATTTTTT AATTAACATA  2700
TTTGGGGCCG CATCTTTTAC TACAGCTATA ATAATATCCC CAATACCAGC  2750
AGCTTGTTGT TTAGCACCAC TTAATACTCG AATGCACATT ATTTCACGAG  2800
CCCCTGTATT ATCTGCTACT CGAAGATAAG ATTGTGGTTG AATCATAATA  2850
ATTTATTTAT TGGTTCTTTT AAAATATTTT TTAAGAAAAC AGTAATTTTT  2900
TTTTATAAAA AGTATATAGA TTTAAAAAAT CAACCTTTTT ATAAAAAAAA  2950
ATAAATTAAA ACTAAATTGA TTTTTTTAAA AGAATTTGTG TTTTAATTGG  3000
TAATTTAGAA GCTGCTAATT TTAAAGCCGT ACGGGCACTA GTCTCTGGAA  3050
CCCCTTTTAA CTCATAAAGA ACTTTACCAG GTTTTACTAC CGCAACCCAG  3100
TATTCAGGCG CTCCTTTCCC AGAACCCATT CGTGTCTCTG CAGGTCTCAT  3150
TGTTACGGGT TTATCTGGAA ATAATCTAAT CCATAATTTA CCACCACGAC  3200
GAACTTGGCG TGTTACAGCA CGTCGAGCTG CTTCAATTTG TCTAGATGTG  3250
ATCCATGAAC ACTCTAAAGC CTGTAATCCA TAGTCACCAA ACACTAATTT  3300
ATTACCGCGT AAGGCTTTTC CAGACATGCG ACCACGATGG TATTTACGAT  3350
ATTTTGTTTT TTTGGGACTA AGCATATGTT ATATAAAATT TAGAATTTAT  3400
TAATTTATAA TACTATAAAA TTTGTAACTT TATAGATTAC TTAATGGAGT  3450
TAAACGCTCA CCGCGGAAAA CCCAAATTTT AATACCTAAA AGGCCATAAA  3500
TAGTTTTTGC TGATTTTTCG GAATAATCTA TTTCAGCGCG TAATGTTTGT  3550
AATGGTACTG GTCCACTTCG TAAATCTTCA GAACGAGCAA TATCTGCCCC  3600
ATTTAAACGA CCAGAAATTT TTACTTTAAT ACCCTTAACA CCAGCTTTTA  3650
TTGCTTTCCG AACAACTTGT CTCATAGCTT TTCGATAAGG TTTTCGTTGT  3700
TGCAAATCAT CAACTAATTT TTCTGCAAGT ACCATTGCAT GTATATCTGG  3750
ATTATTAGTT GAATTTATAG AAATCGTACA AAAAATTTTT CCAGGGTCTG  3800
GCAAGTTTCT TACTTTATAA AATTTTTTAA GATGTTTTAA TAAATTATCT  3850
CTTAATTGTG ATAAACCTTC TGACCAATTT GATTTTGTAG TTTTTTTAGA  3900
AAAAGAGTTA CGATTTAAAA AAATATTTGG TCGTGCTGCA TGAATTTTAA  3950
TGCTTATAAA AGGGTATTCA ATTTCACGTC TTTGAATTTC AATATTAATA  4000
ATACCTGCTC CAAAAAAAGC TTTTTCTATA TAATTTCGAA TAAAACTTGC  4050
ATCTTGAACC CAAAGTGCTG ATGTTTGAGA TTTTGTACAC CAATAATTTT  4100
TATGTTTTTG AGTAATACCA AGACGAAATC CCAAAGGATG TACTTTTTGT  4150
CCCATATATT AATTATCTTT TTGTTTTTTT ATCTTTTTTT ACGTGCCCTT  4200
TAAAAGTGCG AGTAGGGGCA AATTCACCTA ATTTATGCCC AACCATTTGA  4250
TCTGTAATAA AAAGTGGAAT ATGTTGGCGG CCATTATGCA CAGCAATCGT  4300
ATGTCCTATC ATTAATGGAA CAATCGTTGA GGCTCGGGAC CATGTTTTTA  4350
CTACTTTTTT TTTACCTTGA CTATTTAATC GTTCAATTTT TTTAAACAAA  4400
TGATTTGCTA CAAAAGGAAT TTTTTTTAAT GATCGTGTCA TATAATTAGT  4450
GATAAATTTA AAATATTATT TATTTTTTTA CAAACAAATA TTTAAAGGTT  4500
TTTAAAAACC TTCTCTAAAA ACACAAAACG AATATAAAAT TAGAAATTAA  4550
CAATTAGTTT GTTACTAAAT TTTTTACGTT TACGGGTTTT TACACCTAAT  4600
GCTGGCTTAC CCCATAAACT TACTGGACGA GCCCGACCAA TAGGTGCTTT  4650
TCCTTCACCA CCACCATGTG GGTGATCTAC TGGATTCATA GCTGAACCTC  4700
GGACGTGGGG GCGTCGACTT AACCAACGAC TTCTGCCTGC TTTTCCTAAT  4750
GTTATATTAG AATTATCGAT ATTTCCAACA CGACCAATAG TAGCCCAGCA  4800
TTTTTCAGAA ATTAATCGAG ACTCGCCTGA TGGTAAACGA AGTGTAACCC  4850
AGCGGCCTTC TTTTGCTACG ATTTGTGCTA CAGAACCAGC TGATCTTACA  4900
AATTGACCAC CAGCTCCGGG TTGGAGTTCA ACGTTATGAA CAGAAGTCCC  4950
TAATGGAATA TTAGATAATG GTAAAGTATT ACCAATAGTT ATAGAGGCAT  5000
TAGGTGAAGA AAGTAATTTT TCTCCGATTT TTAATCCAAG AGGGGATATA  5050
ATGTAGCGTT TTTCGCCATC GTTATAAGAA ACTAATGCAA TGCGTGCAGT  5100
ACGGTTTGGA TCATATTCAA TATTTTTTAC TGTAGCAGAA ACCCCAATTT  5150
TATTTCTTGT AAAATCGATT TTTCTATATA ATCTTTTATG ACCACCACCA  5200
CGATGCCTAC TTGTAATAAT ACCTTTGTTA TTTCGACCTT GAGACCGAGA  5250
CCAACCCTTA GTTAATTTTT TTTCGGGTTT TTTTGATGAA ATCTCATCAA  5300
AATTTAATAC TGAACCGTGA CGACTTCCTG GAGTAGACGG TTTAAAAAAA  5350
CGAATAGCCA TACTTTGTTT TATATTAAAA ATTTATAATA AATACATATT  5400
AGCTTAATAA ATTAACATTT GAACTTACTG TAATAATTAC CCGTTTTGAG  5450
CGTGGTGAAG AATTTGAACG TTTTAATTTT CGTGGTGGTC GGTGTGTATT  5500
TACCGCCAAA ACTTTAATAT TAAAATAGTC TTCAACTAGC TTTTTTATTT  5550
GGGGTTTTGT AAGTTTTTTA TCTATATCAA AACTAAATTG ATTATTTTCA  5600
AGTAAACGGG TAGCTTTCGG AGTTTTTATT ACTGGATATT GCAATAAATC  5650
TAACATCATA ATTTTAAAAA TTCTCCTTAA ATAATNANAN TTNGNNTTTN  5700
TNTATNTNNA NACTAAATAA TATTAATCGT ATAAATGCTA AAATAGGCCG  5750
TTTATAATAA AATCTATTTG TATATCATCT AATACAATAA CAAGCTATCA  5800
CACTATTAAT AAATAATTAA AAATGATTAA TAATGTTAAA ATT         5843

<210> SEQ ID NO: 156
<211> 1883
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
TGCAAATTCA ATAATTTGCA ATAGTATAAA AACAAATTAA CTTCTAATTT    50
AATTTTACAA TGATGAACCT AATCCTACAT AAGATCCATA AAAAAATGTT   100
CCTAATAACC CCAATACAAC AACTCCCCCA ACAGTTCCAA TAAACCATAA   150
TGGAATACGA CCGGTTGTGC TAGTATTAGA CATAAAAAAA TCTCCTTTTT   200
TTTAAAGGAC CCCAATAAAG TACTAACTGG GGTTAATTTG TGAATTAATA   250
CGTCAATTTT ACTAAAAAAT ATAAGTAATA ATTAATAAAA TAAACTAATT   300
GAAAATATAA CTTGAAAACA AAACTGCTAA TACAAAAATT AATAATAATC   350
CCCAGTATAA ACTGGTTCTA TTTAATTCTA CAGTTTGTTT ATTAGGATTT   400
GGTTTTGCCA TAAAATAATA TAACCTTTTT CAATTTTACC TAAATAATAT   450
AAATATTATT TAACGTTGAA TAAATTGCAT AGCCGTAATA GCACCTAAAA   500
AGAAAACTGT AGGTACTGCT AATGCGTGTA TAGCAAGCCA ACGTACAGTA   550
AAAATAGGAT AAGTATAGGA TTTTCTTGCT GTCATATGAT AAAAATTCTA   600
ATTATTCTTT AAAATTAGAA ATTTTTAAAT TTCTAATTTA AAGGTATTTG   650
TTTAATTAAC TTGAGATAAT TGTTTTACTT GATCTAAAGC GTTAAAACGA   700
TCCGTAATGA GTGGGGTTTC TTGTCGATCT TCAGTAAAAT ACTCACTTGG   750
TCTTGGACTG CCAAAAATAT CATAAGCAAG ACCTGTACTC ACAAATAACC   800
ACCCGGCAAT AAATAAAGAC GGAATGGTAA TACTGTGAAT TACCCAGTAA   850
CGAATACTGG TTAAAACATC AGAAAAAGGG CGTTCTCCTG TAGCACCAGA   900
CATGCTATAA CTCCTTATAA TTTTTTTACT ATAACAAAAA GTTTTTTCTT   950
TCTTTTTTTA GTATACCAAT TTTTTCAAAG AGCTTCAAAG TTTTCAAATT  1000
GTTTTAATTC ATTTTTATAT TTGGCGAACG ACGGGGTTTG AACCCGCGAG  1050
TGATGGAGTC ACAGTCCATT GCCTTACCAC TTGGCTACGC CCGCCATGTA  1100
TTAATTGGAG CGGATAGCGG GAATTGAACC CGCACCTCTT GCTTGGAAGG  1150
CAAGGGCACT ACCTTTATGC AATATCCGCT TCTTTTTTTA ATATATATAT  1200
ATTTGTATTA AAAACTACAT TTTTAAAAAA TCCTAAAAAT GTTTAATACT  1250
CTCTACTCCT CAAGTATACT AATTAATAGT TGTTACGATA AAATTAATTA  1300
GTTAAAAAAT TTTAAAATTT TAGGAAATCA AAGCTATGCT TTCAATTTTT  1350
CAATTAGTAT TATTTAGTTT AATTTTACTA TCTTTTTTCT TAGTGGTTGG  1400
TGTACCTGTT GTTTTAGCTT TACCAGATGG CTGGGGTGAA AACAAACGTC  1450
TTGTTTTTTC AGGTCTTGGG GTTTGGTTAA TACTTGTATT TGCTGTAGGT  1500
ATTTTAAATT CATTTGTAAT TTAATGCAAT CTTAAACGTT ACTTATTTAG  1550
TAACGTTTAA GATTTTTATT AATTGATTTT TTAAATATAT AAAGGTTAAA  1600
ATATATACTA ATTAGGGTTA CTAACTCAAT GGTAGAGTAT TCGGCTTTTA  1650
ACCGACAAGC TCCGGGTTCA AATCCCGGGT AACCCATTAA AAATTTCTTA  1700
AAAATTTTGT TTTAAAATTA TATAATTTTT AGTTAATAAA GCCTAATGAA  1750
TAATGCTTTT TCTGTAATTA ATAAATATTT TAAACAAAAA TATTTATTAA  1800
AAAACAATTT AGTAACGCAA AAACCGAAGT ACCTTTGTGG TATTTCAGGG  1850
GGACAAGATT CAATATTTTT GTTTTTTTGG TTA                    1883

<210> SEQ ID NO: 157
<211> 2599
<212> Nucleotide sequence
<213> Chlorella protothecoides

<400>
AATTGCTATT TTTCATTGTA GTTGGAAAAT GCAATCAGAT GTTTGGGGTA    50
CTGTAACAGC AAACGGCGTT TCTCATATTA CTGGTGGTAA TTTTGCTCAA   100
AGTGCGAACA CTATTAATGG GTGGTTACGT GACTTTTTAT GGGCTCAGTC   150
ATCACAAGTT ATTCAATCAT ACGGGTCAGC ACTTTCTGCT TATGGACTTA   200
TTTTCTTAGG AGCACATTTT GTTTGGGCTT TTAGTTTAAT GTTTTTATTT   250
AGTGGACGTG GTTACTGGCA AGAATTGATC GAATCTATTC TTTGGGCACA   300
TAATAAATTA AAAGTGGCTC CAGCTATTCA ACCTCGTGCT TTAAGTATTA   350
CACAAGGTCG GGCTGTGGGT GTAGCTCATT ATTTATTAGG TGGTATTGCA   400
ACCACATGGT CATTCTTCTT AGCAAGAATT TTAGCTGTGG GTTAATAAAC   450
TGTAACAATT TAGAGGTTTT TGTAACCTCT AAATTGTTAA ATAAGTCGTT   500
TTTTTATTTG TTAAAATATA TGTATTAGTG CAATCTTTTT AAAGATTGCC   550
TTTAATAGGA GAATTATAAA ATGGCAACAA AATTTCCAAA ATTTAGTCAA   600
GCCTTAGCAC AAGATCCAAC TACACGTAGG CTTTGGTATG GTTTAGCCAC   650
AGCTCATGAT TTCGAAAGTC ATGATGGTAT GACTGAAGAA CGTCTTTATC   700
AAAAAATATT TGCATCACAT TTTGGCCAAT TAGCGATTAT ATTTCTTTGG   750
ACTTCTGGAA ACCTTTTTCA TGTAGCTTGG CAAGGAAATT TCGAGCAGTG   800
GGTTCAAGAT CCTCTTCATA TCCGACCAAT AGCTCATGCA ATCTGGGATC   850
CACATTTTGG GCAGCCAGCC GTGGAAGCTT TTACACGTGG CGGTGCTTCA   900
GGTCCAGTAA ATATATCTAC ATCTGGTGTT TATCAATGGT GGTTTACTAT   950
TGGGCTACGA ACTAACCAAG ATTTATATAA TGGATCTATT TTTTTAATTA  1000
TACTTTCAGC ATTATTTCTA TTTGCAGGCT GGTTACATCT TCAACCTTCT  1050
TTTCAGCCAA CGCTTTCATG GTTTAAAAAT GCAGAATCGC GATTAAATCA  1100
CCATTTAGCA GGTCTGTTTG GAATAAGTTC TCTAGCTTGG GCAGGTCATT  1150
TAATTCATGT GGCTATTCCT GAATCTCGTG GGCAACATGT ACGTTGGAAT  1200
AATTTTCTGA CTGTTTTACC ACATCCGGCT GGATTAACTC CTTTTTTTAC  1250
AGGTAATTGG GTTGTATATG CTCAAAACCC AGATGCAGCT ACTCATGTTT  1300
GGAATACTAA TGAAAACGCT GGGAGTGCTA TTTTAACATT CTTAGGAGGG  1350
TTTCATCCTG AAACTCAAAG TCTTTGGTTA ACAGATATGG CACATCATCA  1400
TCTTGCTATT GCTGTTCTTT TTATTATTGC AGGACACCAA TATCGTACTA  1450
ATTGGGGTAT TGGACATAGT ATCAAAGAAA TTCTAGAAGC ACATAGCGGT  1500
CCATTTACTG GGCAAGGTCA TAAAGGTTTA TATGAAATAT TAACAACTTC  1550
TTGGCATGCT CAATTAGCTA TTAACTTAGC TTTATTTGGT TCATTATCTA  1600
TCATTGTTGC TCAGCATATG TATTCAATGC CACCTTATCC ATATTTGGCA  1650
ACTGATTATG GTACTCAATT ATCTATTTTC ACACATCATA CATGGATTGG  1700
TGGATTTTGT ATTGTAGGTG CTGGGGCACA TGCTGCTATT TTTATGGTTC  1750
GTGATTATGA TCCTTATAAT AACTATAATA ATTTATTAGA TCGTGTTTTA  1800
CGCCATAGAG ATGCAATAAT TTCTCATTTA AATTGGGTAT GTATTTTCTT  1850
AGGTTTTCAT AGTTTTGGAT TATATATCCA TAATGATACT TTAAGTGCTT  1900
TAGGACGTCC TCAAGATATG TTTTCAGATA CTGCAATACA ACTACAACCC  1950
GTTTTTGCAC AATGGATTCA AAACATTCAT GCGACAGCTC CAGGTTTCAC  2000
TGCTCCAAAT GTATTAGATC CTACAAGTTT TACTTGGAGT AATAATATTG  2050
TAGCAATTGG TGGGAAAGTA GCAATGATGC CAATTCCATT AGGTACAGCT  2100
GATTTTTTAG TACATCATAT TCATGCTTTT ACTATTCACG TAACGGTTTT  2150
AATTTTGTTA AAAGGTGTAC TATATGCTCG TAGTTCTCGA TTAATCCCTG  2200
ATAAATCAAA TTTAGGGTTT AGATTTCCTT GTGACGGTCC GGGTCGTGGA  2250
GGAACATGTC AAGTTTCTGC TTGGGATCAC GTGTTTTTAG GTTTATTTTG  2300
GATGTATAAT TCTTTATCAA TTGCTATTTT TCATTTTAGT TGGAAAATGC  2350
AATCAGATGT TTGGGGTACT GTAACAGCAA ACGGCGTTTC TCATATTACT  2400
GGTGGTAATT TTGCTCAAAG TGCGAACACT ATTAATGGGT GGTTACGTGA  2450
CTTTTTATGG GCTCAGTCAT CACAAGTTAT TCAATCATAC GGGTCAGCAC  2500
TTTCTGCTTA TGGACTTATT TTCTTAGGAG CACATTTTGT TTGGGCTTTT  2550
AGTTTAATGT TTTTATTTAG TGGACGTGGT TACTGGCAAG AATTGATCG   2599

<210> SEQ ID NO: 158
<211> 1511
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>
CATAAAGTTC ATAGAAGCAT TTTAGACTTT GATGAATTTA ATAAACATAA    50
TAATATTAAT ACAATTAATA CAATAATAAT AATATATTAT TATGTTTATA   100
CAATACGAAT AGTTCATAGA TTTTTTTGTT GTTATGTAAA ATACTCCATA   150
ACAACAAAAA AATCAAGTAC TTAGAAAATA AAAAGCTCAA AAAAATAATA   200
ATACTTTTGG TATAAATTAA AATATGTATT ACACATCAAT TATATCAGAT   250
TTAGGAGTTT TCGGAATTAA AAATTTAACA CGTTTATACT TATTAACTTT   300
AGGAACTTTT TTAAAAGCTT TACTAAACGG CAAAGGTTTT AATTTTAATA   350
ATCCTGTTTT TTCTTCTTTT TTACCAGTAA CTAAATGAAA AAACGATTTC   400
ATTAAACCAC TAAAATTTAA GTTGGATGAA TTCTCTACTA AACGTCGTAA   450
TTGCTCTTCT TGAGAAATTG TTAAACGATG ACGCTCTTTA AAAGCAAGAA   500
GCCCTAATTC TTTTGTTGGA GCTTGTTGAT ATAAAATAAT TGTTTCAGCA   550
ATTGCTTGTT TTATAAAAAA CCTTGAAACT ATAAGATCTA TTAAACCGTG   600
ATGTAATAAA TATTCTGCTG TTTGAAAATC TTTTGGTAAT TGCTCTTGAA   650
GTGTTTGTTC AATAACTCTA CGGCCGGCAA ACCCAATAAG GGCTCTTGGT   700
TCTGCAAAAA TTAAATCACC TAACATCGCA AAACTCGCTG TAACACCACC   750
AGTGGTTGGT GAAGTTAATA TTGGAATATA AAGAAGTTTT GCAGTATTTT   800
GGTGAGCATT TAACGCAGCA GATATTTTCG CCATTTGCAT TAAACTTAAG   850
ATGCCTTCTT GCATACGGGC ACCACCAGAC GCGCAAAATA TTATTAAAGT   900
AAGTCCTTGT GCTGTTGCAT ATTCTATTAA GCGAGTAATT TTTTCTCCAA   950
CAACAGAACC CATACTTCCA CCCATAAAAG TAAAATCCAT AATTCCAACA  1000
GCAATGGGAA TACCTTCTAT AAAGCCAGTT CCTGTTTGTA TAGCATCCTG  1050
TAATCCTGTT TGTTCTTGCG ATTCTTTTAA TCTTTCACTA TATTTTTTTT  1100
GATCTTTAAA TTTTAAAGGG TCACAAGACG ATAGTGTTTC ATTAATTGGC  1150
CTCCAAGTAC CTGCATCAAT AAAACGCTCA ATACGCTCGA TGCTGTTCAT  1200
TTGTAAATGG TAATTACATC CAAAACAAAC TCGCTTATTT TGTTGTAAAT  1250
GTTTAATATA TAAAATGACT CCACAGTTTT CACACCGAAT CCATAAGCTT  1300
AAATCTCCCG GGCGGTGGCC GGATTTAGGT GCGTTAAGTA ACTTTAAATT  1350
TTTTTCGTCT TCAATCCATG AAAAAATTGA CATAATATTG AACCTCTTTG  1400
TATTTTTTAT TATTTATTGC TTTTTTTATT TGTATTTATT CCATATACTA  1450
AATAAATAAT AATCATTTTT TTTTGTTTTT TGTAGAGATT AATAAAAATT  1500
TTACGGGGGA A                                            1511

<210> SEQ ID NO: 159
<211> 1462
<212> Nucleotide Sequences
<213> Chlorella protothecoides

<400>	
GTTATAAATA ATAAATATTT TTAAACATTA AGATACTTTA ATTTTCCCGC    50
CCGCAGCTTC TAATTGTTGT TTTGCTGTTT CTGATTCTTC TTTTGACGCA   100
CCTTGTAAGA TTGTTTTTGG TAAATCTGTA ATAGCTTCTT TTGCTTGTTT   150
TAAATCTAAA GAAGTTAAAT TACGTATTAC TTTTAAAATA GGTACACGTT   200
TGTCACTTGG AACTTCTTCT AAAATAATAT CAAAAGTAGT TTTTTCTTCT   250
ATTACTTCTT CTTGTTTTGT AATTTCTGTT GAAGCTGCAC TTGTTCCACT   300
TGTTGTTTGA ATAGATGCAT CTACACCAAA AGCATCTTCA ATTTGTGATA   350
CTAATTGCGA AGCTTCTAAT AAAGTAATTG TTTTTAGTTT TTCTAAAATC   400
TCAATAGTTA TTGTAGACAT AAATAAAACT GATCTCCTGA GTTTTTGAAT   450
AATAGAGCTA ATTTTTAAAT TTTTTATTTT AGCTTTTAAT ATATTTATTT   500
ATTATATGTT TTTATAATGA TAAAATCAAT TAGTTAATTT GAAAGTTAAA   550
CTTCAAAATT AACTAAGTAA AATTAACGTT TTGAGTATTG TGGGGCTTTT   600
CGCGCTTTTT TTAATCCATA TTTACGTCGT TCTTTAATAC GAGCATCCCG   650
AGTTAAAAAA CCTTGCGCTT TAAAAGAAGC TCTTAATTCT GGAATAAGTT   700
CGATAAACGC TTTACATAAT GTTAATCTAA TAGCTTGAGC TTGTGCACTT   750
AAACCACCCC CATTGACAGA AATATAAGCA TTATATTTAG TCGAGGTTTC   800
TATTAGTTGA TATACTTGAA ATATATTATG TAATAATGCT TTATTATTTT   850
GAAAATAAAG TTCAGCTGGT TTCCCATTTA TAAAAAGATT TTCAGTTGCA   900
CTAGGACTTA ACTTTACACG AGCGGAAGCT GTTTTTCTCC CACCTGAATG   950
TATTAATTCA TGTTTAATTG ATGACATTAT TAAAAAATTC TCCTTAAATT  1000
TTTTGTAGCA TAAATACAAA AAATTATTTT TACTAATCTA GAATAATTTA  1050
TATTTTTTAC TAACTCTCTA AAAAATTAAA TTTTTTTAAA GAGATAAACT  1100
AACAAGCGTA AAATCAAAAA ATAATTTTGT AAGTTTATAT AAGCTATATT  1150
GCAAAGCTTG TTTTGGATGA ATACTACCAT CTGTTATAAT TTCTAAGATA  1200
AGATATTCAA ATTCTTGCTT TGTACTTACT TTTTGAATAA AAAAATTAGT  1250
TTGTTTTATA GGGCTTGGGA AACGTTGTAC ATTAAAAACT TTATAATCAT  1300
CTATAAATTC TTGCGGGTTT ATAAATGGTT GAAATTTATT AGGATCAACT  1350
AATGCAATTT TCAAACTAAG GGATAATTCC CCATCCCAAC TAACTGAAGC  1400
TATATGTTGA GATGGTACGA CACAACTAAT ATTTGGGGGA AATTTTATAT  1450
CACCTGCAGT CG                                           1462

<210> SEQ ID NO: 160
<211> 558
<212> Nucleotide Sequence
<213> Chlorella protothecoides

<400>
ATG GGC GTG AAG GTG CTG TTC GCC CTG ATC TGC ATC GCC GTG GCC GAG 48
GCC AAG CCC ACC GAG AAC AAC GAG GAC TTC AAC ATC GTG GCC GTG GCC 96
TCC AAC TTC GCC ACC ACC GAC CTG GAC GCC GAC CGC GGC AAG CTG CCC 144
GGC AAG AAG CTG CCC CTG GAG GTG CTG AAG GAG ATG GAG GCC AAC GCC 192
CGC AAG GCC GGC TGC ACC CGC GGC TGC CTG ATC TGC CTG TCC CAC ATC 240
AAG TGC ACC CCC AAG ATG AAG AAG TTC ATC CCC GGC CGC TGC CAC ACC 288
TAC GAG GGC GAC AAG GAG TCC GCC CAG GGC GGC ATC GGC GAG GCC ATC 336
GTG GAC ATC CCC GAG ATC CCC GGC TTC AAG GAC CTG GAG CCC ATG GAG 384
CAG TTC ATC GCC CAG GTG GAC CTG TGC GTG GAC TGC ACC ACC GGC TGC 432
CTG AAG GGC CTG GCC AAC GTG CAG TGC TCC GAC CTG CTG AAG AAG TGG 480
CTG CCC CAG CGC TGC GCC ACC TTC GCC TCC AAG ATC CAG GGC CAG GTG 528
GAC AAG ATC AAG GGC GCC GGC GGC GAC TGA                         558


<210> SEQ ID NO: 161
<211> 810
<212> Nucleotide Sequence
<213> Chlorella protothecoides

<400>
ATG GGT AAG GAA AAG ACT CAC GTT TCG AGG CCG CGA TTA AAT TCC AAC 48
ATG GAT GCT GAT TTA TAT GGG TAT AAA TGG GCT CGC GAT AAT GTC GGG 96
CAA TCA GGT GCG ACA ATC TAT CGA TTG TAT GGG AAG CCC GAT GCG CCA 144
GAG TTG TTT CTG AAA CAT GGC AAA GGT AGC GTT GCC AAT GAT GTT ACA 192
GAT GAG ATG GTC AGA CTA AAC TGG CTG ACG GAA TTT ATG CCT CTT CCG 240
ACC ATC AAG CAT TTT ATC CGT ACT CCT GAT GAT GCA TGG TTA CTC ACC 288
ACT GCG ATC CCC GGC AAA ACA GCA TTC CAG GTA TTA GAA GAA TAT CCT 336
GAT TCA GGT GAA AAT ATT GTT GAT GCG CTG GCA GTG TTC CTG CGC CGG 384
TTG CAT TCG ATT CCT GTT TGT AAT TGT CCT TTT AAC AGC GAT CGC GTA 432
TTT CGT CTC GCT CAG GCG CAA TCA CGA ATG AAT AAC GGT TTG GTT GAT 480
GCG AGT GAT TTT GAT GAC GAG CGT AAT GGC TGG CCT GTT GAA CAA GTC 528
TGG AAA GAA ATG CAT AAG CTT TTG CCA TTC TCA CCG GAT TCA GTC GTC 576
ACT CAT GGT GAT TTC TCA CTT GAT AAC CTT ATT TTT GAC GAG GGG AAA 624
TTA ATA GGT TGT ATT GAT GTT GGA CGA GTC GGA ATC GCA GAC CGA TAC 672
CAG GAT CTT GCC ATC CTA TGG AAC TGC CTC GGT GAG TTT TCT CCT TCA 720
TTA CAG AAA CGG CTT TTT CAA AAA TAT GGT ATT GAT AAT CCT GAT ATG 768
AAT AAA TTG CAG TTT CAT TTG ATG CTC GAT GAG TTT TTC TAA         810

<210> SEQ ID NO: 162
<211> 270
<212> ANino Acid Sequence
<213> Synthetic construct

Met	Gly	Lys	Glu	Lys	Thr	His	Val	Ser	Arg	Pro	Arg	Leu	Asn	Ser
1				5					10					15
Asn	Met	Asp	Ala	Asp	Leu	Tyr	Gly	Tyr	Lys	Trp	Ala	Arg	Asp	Asn
				20					25					30
Val	Gly	Gln	Ser	Gly	Ala	Thr	Ile	Tyr	Arg	Leu	Tyr	Gly	Lys	Pro
				35					40					45
Asp	Ala	Pro	Glu	Leu	Phe	Leu	Lys	His	Gly	Lys	Gly	Ser	Val	Ala
				50					55					60
Asn	Asp	Val	Thr	Asp	Glu	Met	Val	Arg	Leu	Asn	Trp	Leu	Thr	Glu
				65					70					75
Phe	Met	Pro	Leu	Pro	Thr	Ile	Lys	His	Phe	Ile	Arg	Thr	Pro	Asp
				80					85					90
Asp	Ala	Trp	Leu	Leu	Thr	Thr	Ala	Ile	Pro	Gly	Lys	Thr	Ala	Phe
				95					100					105
Gln	Val	Leu	Glu	Glu	Tyr	Pro	Asp	Ser	Gly	Glu	Asn	Ile	Val	Asp
				110					115					120
Ala	Leu	Ala	Val	Phe	Leu	Arg	Arg	Leu	His	Ser	Ile	Pro	Val	Cys
				125					130					135
Asn	Cys	Pro	Phe	Asn	Ser	Asp	Arg	Val	Phe	Arg	Leu	Ala	Gln	Ala
				140					145					150
Gln	Ser	Arg	Met	Asn	Asn	Gly	Leu	Val	Asp	Ala	Ser	Asp	Phe	Asp
				155					160					165
Asp	Glu	Arg	Asn	Gly	Trp	Pro	Val	Glu	Gln	Val	Trp	Lys	Glu	Met
				170					175					180
His	Lys	Leu	Leu	Pro	Phe	Ser	Pro	Asp	Ser	Val	Val	Thr	His	Gly
				185					190					195
Asp	Phe	Ser	Leu	Asp	Asn	Leu	Ile	Phe	Asp	Glu	Gly	Lys	Leu	Ile
				200					205					210
Gly	Cys	Ile	Asp	Val	Gly	Arg	Val	Gly	Ile	Ala	Asp	Arg	Tyr	Gln
				215					220					225
Asp	Leu	Ala	Ile	Leu	Trp	Asn	Cys	Leu	Gly	Glu	Phe	Ser	Pro	Ser
				230					235					240
Leu	Gln	Lys	Arg	Leu	Phe	Gln	Lys	Tyr	Gly	Ile	Asp	Asn	Pro	Asp
				245					250					255
Met	Asn	Lys	Leu	Gln	Phe	His	Leu	Met	Leu	Asp	Glu	Phe	Phe	Ala
				260					265					270

<210> SEQ ID NO: 163
<211> 810
<212> Nucleotide Sequence
<213> Chlorella protothecoides

<400>
ATG GGT AAA GAA AAA ACT CAC GTT TCT CGT CCG CGT TTA AAT TCT AAC 48
ATG GAT GCT GAT TTA TAT GGG TAT AAA TGG GCT CGT GAT AAT GTT GGG 96
CAA TCA GGT GCT ACA ATC TAT CGT TTA TAT GGG AAA CCC GAT GCT CCA 144
GAA TTA TTT CTA AAA CAT GGC AAA GGT AGT GTT GCT AAT GAT GTT ACA 192
GAT GAA ATG GTT CGT CTT AAC TGG CTT ACT GAA TTT ATG CCT CTT CCA 240
ACA ATC AAA CAT TTT ATC CGT ACT CCT GAT GAT GCT TGG TTA CTT ACT 288
ACT GCT ATC CCT GGC AAA ACA GCT TTC CAG GTA TTA GAA GAA TAT CCT 336
GAT TCA GGT GAA AAT ATT GTT GAT GCT CTT GCT GTT TTC CTG CGT CGT 384
TTA CAT TCT ATT CCT GTT TGT AAT TGT CCT TTT AAC AGT GAT CGT GTA 432
TTT CGT CTT GCT CAG GCT CAA TCA CGT ATG AAT AAC GGT TTA GTT GAT 480
GCT AGT GAT TTT GAT GAC GAA CGT AAT GGC TGG CCT GTT GAA CAA GTC 528
TGG AAA GAA ATG CAT AAA CTT TTA CCA TTC TCA CCT GAT TCA GTT GTT 576
ACT CAT GGT GAT TTC TCA CTT GAT AAC CTT ATT TTT GAC GAA GGG AAA 624
TTA ATT GGT TGT ATT GAT GTT GGA CGT GTT GGA ATC GCT GAC CGT TAC 672
CAG GAT CTT GCT ATC CTT TGG AAC TGT CTT GGT GAA TTT TCT CCT TCA 720
TTA CAG AAA CGT CTT TTT CAA AAA TAT GGT ATT GAT AAT CCT GAT ATG 768
AAT AAA TTA CAG TTT CAT TTG ATG CTT GAT GAA TTT TTC TAA         810

<210> SEQ ID NO: 164
<211> 270
<212> Amino Acid Sequence
<213> Chlorella protothecoides

<400>
Met	Gly	Lys	Glu	Lys	Thr	His	Val	Ser	Arg	Pro	Arg	Leu	Asn	Ser
1				5					10					15
Asn	Met	Asp	Ala	Asp	Leu	Tyr	Gly	Tyr	Lys	Trp	Ala	Arg	Asp	Asn
				20					25					30
Val	Gly	Gln	Ser	Gly	Ala	Thr	Ile	Tyr	Arg	Leu	Tyr	Gly	Lys	Pro
				35					40					45
Asp	Ala	Pro	Glu	Leu	Phe	Leu	Lys	His	Gly	Lys	Gly	Ser	Val	Ala
				50					55					60
Asn	Asp	Val	Thr	Asp	Glu	Met	Val	Arg	Leu	Asn	Trp	Leu	Thr	Glu
				65					70					75
Phe	Met	Pro	Leu	Pro	Thr	Ile	Lys	His	Phe	Ile	Arg	Thr	Pro	Asp
				80					85					90
Asp	Ala	Trp	Leu	Leu	Thr	Thr	Ala	Ile	Pro	Gly	Lys	Thr	Ala	Phe
				95					100					105
Gln	Val	Leu	Glu	Glu	Tyr	Pro	Asp	Ser	Gly	Glu	Asn	Ile	Val	Asp
				110					115					120
Ala	Leu	Ala	Val	Phe	Leu	Arg	Arg	Leu	His	Ser	Ile	Pro	Val	Cys
				125					130					135
Asn	Cys	Pro	Phe	Asn	Ser	Asp	Arg	Val	Phe	Arg	Leu	Ala	Gln	Ala
				140					145					150
Gln	Ser	Arg	Met	Asn	Asn	Gly	Leu	Val	Asp	Ala	Ser	Asp	Phe	Asp
				155					160					165
Asp	Glu	Arg	Asn	Gly	Trp	Pro	Val	Glu	Gln	Val	Trp	Lys	Glu	Met
				170					175					180
His	Lys	Leu	Leu	Pro	Phe	Ser	Pro	Asp	Ser	Val	Val	Thr	His	Gly
				185					190					195
Asp	Phe	Ser	Leu	Asp	Asn	Leu	Ile	Phe	Asp	Glu	Gly	Lys	Leu	Ile
				200					205					210
Gly	Cys	Ile	Asp	Val	Gly	Arg	Val	Gly	Ile	Ala	Asp	Arg	Tyr	Gln
				215					220					225
Asp	Leu	Ala	Ile	Leu	Trp	Asn	Cys	Leu	Gly	Glu	Phe	Ser	Pro	Ser
				230					235					240
Leu	Gln	Lys	Arg	Leu	Phe	Gln	Lys	Tyr	Gly	Ile	Asp	Asn	Pro	Asp
				245					250					255
Met	Asn	Lys	Leu	Gln	Phe	His	Leu	Met	Leu	Asp	Glu	Phe	Phe	Ala
				260					265					270


<210> SEQ ID NO: 165
<211> 1945
<212> Nucleotide Sequence
<213> Chlorella protothecoides

<400>
GGCTTTTCCA GACATGCGAC CACGATGGTA TTTACGATAT TTTGTTTTTT    50
TGGGACTAAG CATATGTTAT ATAAAATTTA GAATTTATTA ATTTATAATA   100
CTATAAAATT TGTAACTTTA TAGATTACTT AATGGAGTTA AACGCTCACC   150
GCGGAAAACC CAAATTTTAA TACCTAAAAG GCCATAAATA GTTTTTGCTG   200
ATTTTTCGGA ATAATCTATT TCAGCGCGTA ATGTTTGTAA TGGTACTGGT   250
CCACTTCGTA AATCTTCAGA ACGAGCAATA TCTGCCCCAT TTAAACGACC   300
AGAAATTTTT ACTTTAATAC CCTTAACACC AGCTTTTATT GCTTTCCGAA   350
CAACTTGTCT CATAGCTTTT CGATAAGGTT TTCGTTGTTG CAAATCATCA   400
ACTAATTTTT CTGCAAGTAC CATTGCATGT ATATCTGGAT TATTAGTTGA   450
ATTTATAGAA ATCGTACAAA AAATTTTTCC AGGGTCTGGC AAGTTTCTTA   500
CTTTATAAAA TTTTTTAAGA TGTTTTAATA AATTATCTCT TAATTGTGAT   550
AAACCTTCTG ACCAATTTGA TTTTGTAGTT TTTTTAGAAA AAGAGTTACG   600
ATTTAAAAAA ATATTTGGTC GTGCTGCATG AATTTTAATG CTTATAAAAG   650
GGTATTCAAT TTCACGTCTT TGAATTTCAA TATTAATAAT ACCTGCTCCA   700
AAAAAAGCTT TTTCTATATA ATTTCGAATA AAACTTGCAT CTTGAACCCA   750
AAGTGCTGAT GTTTGAGATT TTGTACACCA ATAATTTTTA TGTTTTTGAG   800
TAATACCAAG ACGAAATCCC AAAGGATGTA CTTTTTGTCC CATATATTAA   850
TTATCTTTTT GTTTTTTTAT CTTTTTTTAC GTGCCCTTTA AAAGTGCGAG   900
TAGGGGCAAA TTCACCTAAT TTATGCCCAA CCATTTGATC TGTAATAAAA   950
AGTGGAATAT GTTGGCGGCC ATTATGCACA GCAATCGTAT GTCCTATCAT  1000
TAATGGAACA ATCGTTGAGG CTCGGGACCA TGTTTTTACT ACTTTTTTTT  1050
TACCTTGACT ATTTAATCGT TCAATTTTTT TAAACAAATG ATTTGCTACA  1100
AAAGGAATTT TTTTTAATGA TCGTGTCATA TAATTAGTGA TAAATTTAAA  1150
ATATTATTTA TTTTTTTACA AACAAATATT TAAAGGTTTT TAAAAACCTT  1200
CTCTAAAAAC ACAAAACGAA TATAAAATTA GAAATTAACA ATTAGTTTGT  1250
TACTAAATTT TTTACGTTTA CGGGTTTTTA CACCTAATGC TGGCTTACCC  1300
CATAAACTTA CTGGACGAGC CCGACCAATA GGTGCTTTTC CTTCACCACC  1350
ACCATGTGGG TGATCTACTG GATTCATAGC TGAACCTCGG ACGTGGGGGC  1400
GTCGACTTAA CCAACGACTT CTGCCTGCTT TTCCTAATGT TATATTAGAA  1450
TTATCGATAT TTCCAACACG ACCAATAGTA GCCCAGCATT TTTCAGAAAT  1500
TAATCGAGAC TCGCCTGATG GTAAACGAAG TGTAACCCAG CGGCCTTCTT  1550
TTGCTACGAT TTGTGCTACA GAACCAGCTG ATCTTACAAA TTGACCACCA  1600
GCTCCGGGTT GGAGTTCAAC GTTATGAACA GAAGTCCCTA ATGGAATATT  1650
AGATAATGGT AAAGTATTAC CAATAGTTAT AGAGGCATTA GGTGAAGAAA  1700
GTAATTTTTC TCCGATTTTT AATCCAAGAG GGGATATAAT GTAGCGTTTT  1750
TCGCCATCGT TATAAGAAAC TAATGCAATG CGTGCAGTAC GGTTTGGATC  1800
ATATTCAATA TTTTTTACTG TAGCAGAAAC CCCAATTTTA TTTCTTGTAA  1850
AATCGATTTT TCTATATAAT CTTTTATGAC CACCACCACG ATGCCTACTT  1900
GTAATAATAC CTTTGTTATT TCGACCTTGA GACCGAGACC AACCC       1945

<210> SEQ ID NO: 166
<211> 1945
<212> Nucleotide Sequence
<213> Chlorella protothecoides

<400>
GGCTTTTCCA GACATGCGAC CACGATGGTA TTTACGATAT TTTGTTTTTT    50
TGGGACTAAG CATATGTTAT ATAAAATTTA GAATTTATTA ATTTATAATA   100
CTATAAAATT TGTAACTTTA TAGATTACTT AATGGAGTTA AACGCTCACC   150
GCGGAAAACC CAAATTTTAA TACCTAAAAG GCCATAAATA GTTTTTGCTG   200
ATTTTTCGGA ATAATCTATT TCAGCGCGTA ATGTTTGTAA TGGTACTGGT   250
CCACTTCGTA AATCTTCAGA ACGAGCAATA TCTGCCCCAT TTAAACGACC   300
AGAAATTTTT ACTTTAATAC CCTTAACACC AGCTTTTATT GCTTTCCGAA   350
CAACTTGTCT CATAGCTTTT CGATAAGGTT TTCGTTGTTG CAAATCATCA   400
ACTAATTTTT CTGCAAGTAC CATTGCATGT ATATCTGGAT TATTAGTTGA   450
ATTTATAGAA ATCGTACAAA AAATTTTTCC AGGGTCTGGC AAGTTTCTTA   500
CTTTATAAAA TTTTTTAAGA TGTTTTAATA AATTATCTCT TAATTGTGAT   550
AAACCTTCTG ACCAATTTGA TTTTGTAGTT TTTTTAGAAA AAGAGTTACG   600
ATTTAAAAAA ATATTTGGTC GTGCTGCATG AATTTTAATG CTTATAAAAG   650
GGTATTCAAT TTCACGTCTT TGAATTTCAA TATTAATAAT ACCTGCTCCA   700
AAAAAAGCTT TTTCTATATA ATTTCGAATA AAACTTGCAT CTTGAACCCA   750
AAGTGCTGAT GTTTGAGATT TTGTACACCA ATAATTTTTA TGTTTTTGAG   800
TAATACCAAG ACGAAATCCC AAAGGATGTA CTTTTTGTCC CATAAGAATT   850
CAATTTAGAA AAATTCATCA AGCATCAAAT GAAACTGTAA TTTATTCATA   900
TCAGGATTAT CAATACCATA TTTTTGAAAA AGACGTTTCT GTAATGAAGG   950
AGAAAATTCA CCAAGACAGT TCCAAAGGAT AGCAAGATCC TGGTAACGGT  1000
CAGCGATTCC AACACGTCCA ACATCAATAC AACCAATTAA TTTCCCTTCG  1050
TCAAAAATAA GGTTATCAAG TGAGAAATCA CCATGAGTAA CAACTGAATC  1100
AGGTGAGAAT GGTAAAAGTT TATGCATTTC TTTCCAGACT TGTTCAACAG  1150
GCCAGCCATT ACGTTCGTCA TCAAAATCAC TAGCATCAAC TAAACCGTTA  1200
TTCATACGTG ATTGAGCCTG AGCAAGACGA AATACACGAT CACTGTTAAA  1250
AGGACAATTA CAAACAGGAA TAGAATGTAA ACGACGCAGG AAAACAGCAA  1300
GAGCATCAAC AATATTTTCA CCTGAATCAG GATATTCTTC TAATACCTGG  1350
AAAGCTGTTT TGCCAGGGAT AGCAGTAGTA AGTAACCAAG CATCATCAGG  1400
AGTACGGATA AAATGTTTGA TTGTTGGAAG AGGCATAAAT TCAGTAAGCC  1450
AGTTAAGACG AACCATTTCA TCTGTAACAT CATTAGCAAC ACTACCTTTG  1500
CCATGTTTTA GAAATAATTC TGGAGCATCG GGTTTCCCAT ATAAACGATA  1550
GATTGTAGCA CCTGATTGCC CAACATTATC ACGAGCCCAT TTATACCCAT  1600
ATAAATCAGC ATCCATGTTA GAATTTAAAC GCGGACGAGA AACGTGAGTT  1650
TTTTCTTTAC CCATGATATC TAATTATCTT TTTGTTTTTT TATCTTTTTT  1700
TACGTGCCCT TTAAAAGTGC GAGTAGGGGC AAATTCACCT AATTTATGCC  1750
CAACCATTTG ATCTGTAATA AAAAGTGGAA TATGTTGGCG GCCATTATGC  1800
ACAGCAATCG TATGTCCTAT CATTAATGGA ACAATCGTTG AGGCTCGGGA  1850
CCATGTTTTT ACTACTTTTT TTTTACCTTG ACTATTTAAT CGTTCAATTT  1900
TTTTAAACAA ATGATTTGCT ACAAAAGGAA TTTTTTTTAA TGATCGTGTC  1950
ATATAATTAG TGATAAATTT AAAATATTAT TTATTTTTTT ACAAACAAAT  2000
ATTTAAAGGT TTTTAAAAAC CTTCTCTAAA AACACAAAAC GAATATAAAA  2050
TTAGAAATTA ACAATTAGTT TGTTACTAAA TTTTTTACGT TTACGGGTTT  2100
TTACACCTAA TGCTGGCTTA CCCCATAAAC TTACTGGACG AGCCCGACCA  2150
ATAGGTGCTT TTCCTTCACC ACCACCATGT GGGTGATCTA CTGGATTCAT  2200
AGCTGAACCT CGGACGTGGG GGCGTCGACT TAACCAACGA CTTCTGCCTG  2250
CTTTTCCTAA TGTTATATTA GAATTATCGA TATTTCCAAC ACGACCAATA  2300
GTAGCCCAGC ATTTTTCAGA AATTAATCGA GACTCGCCTG ATGGTAAACG  2350
AAGTGTAACC CAGCGGCCTT CTTTTGCTAC GATTTGTGCT ACAGAACCAG  2400
CTGATCTTAC AAATTGACCA CCAGCTCCGG GTTGGAGTTC AACGTTATGA  2450
ACAGAAGTCC CTAATGGAAT ATTAGATAAT GGTAAAGTAT TACCAATAGT  2500
TATAGAGGCA TTAGGTGAAG AAAGTAATTT TTCTCCGATT TTTAATCCAA  2550
GAGGGGATAT AATGTAGCGT TTTTCGCCAT CGTTATAAGA AACTAATGCA  2600
ATGCGTGCAG TACGGTTTGG ATCATATTCA ATATTTTTTA CTGTAGCAGA  2650
AACCCCAATT TTATTTCTTG TAAAATCGAT TTTTCTATAT AATCTTTTAT  2700
GACCACCACC ACGATGCCTA CTTGTAATAA TACCTTTGTT ATTTCGACCT  2750
TGAGACCGAG ACCAACCC                                     2768

