               SEQUENCE LISTING

<110> Max-Planck-Gesellschaft zur Foerderung der Wissenschaften e.V.

<120> Novel Fusion Genes in Lung Cancer

<130> T2396 PCT s3

<150> EP12 15 3907.6
<151> 2012-02-03

<160> 165

<170> BiSSAP 1.0

<210> 1
<211> 4002
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4002
<223> /mol_type="DNA"
      /note="CCDS nucleotide sequence of SOS1 (Gene ID: 6654)"
      /organism="Homo sapiens"

<400> 1
atgcaggcgc agcagctgcc ctacgagttt ttcagcgaag agaacgcgcc caagtggcgg       60

ggactactgg tgcctgcgct gaaaaaggtc caggggcaag ttcatcctac tctcgagtct      120

aatgatgatg ctcttcagta tgttgaagaa ttaattttgc aattattaaa tatgctatgc      180

caagctcagc cccgaagtgc ttcagatgta gaggaacgtg ttcaaaaaag tttccctcat      240

ccaattgata aatgggcaat agctgatgcc caatcagcta ttgaaaagag gaagcgaaga      300

aaccctttat ctctcccagt agaaaaaatt catcctttat taaaggaggt cctaggttat      360

aaaattgacc accaggtttc tgtttacata gtagcagtct tagaatacat ttctgcagac      420

attttaaagc tggttgggaa ttatgtaaga aatatacggc attatgaaat tacaaaacaa      480

gatattaaag tggcaatgtg tgctgacaag gtattgatgg atatgtttca tcaagatgta      540

gaagatatta atatattatc tttaactgac gaagagcctt ccacctcagg agaacaaact      600

tactatgatt tggtaaaagc atttatggca gaaattcgac aatatataag ggaactaaat      660

ctaattataa aagtttttag agagcccttt gtctccaatt caaaattgtt ttcagctaat      720

gatgtagaaa atatatttag tcgcatagta gatatacatg aacttagtgt aaagttactg      780

ggccatatag aagatacagt agaaatgaca gatgaaggca gtccccatcc actagtagga      840

agctgctttg aagacttagc agaggaactg gcatttgatc catatgaatc gtatgctcga      900

gatattttgc gacctggttt tcatgatcgt ttccttagtc agttatcaaa gcctggggca      960

gcactttatt tgcagtcaat aggcgaaggt ttcaaagaag ctgttcaata tgttttaccc     1020

aggctgcttc tggcccctgt ttaccactgt ctccattact ttgaactttt gaagcagtta     1080

gaagaaaaaa gtgaagatca agaagacaag gaatgtttaa aacaagcaat aacagctttg     1140

cttaatgttc agagtggtat ggaaaaaata tgttctaaaa gtcttgcaaa acgaagactg     1200

agtgaatctg catgtcggtt ttatagtcag caaatgaagg ggaaacaact agcaatcaag     1260

aagatgaacg agattcagaa gaatattgat ggttgggagg gaaaagacat tggacagtgt     1320

tgtaatgaat ttataatgga aggaactctt acacgtgtag gagccaaaca tgagagacac     1380

atatttctct ttgatggctt aatgatttgc tgtaaatcaa atcatgggca gccaagactt     1440

cctggtgcta gcaatgcaga atatcgtctt aaagaaaagt tttttatgcg aaaggtacaa     1500

attaatgata aagatgacac caatgaatac aagcatgctt ttgaaataat tttaaaagat     1560

gaaaatagtg ttatattttc tgccaagtca gctgaagaga aaaacaattg gatggcagca     1620

ttgatatctt tacagtaccg gagtacactg gaaaggatgc ttgatgtaac aatgctacag     1680

gaagagaaag aggagcagat gaggctgcct agtgctgatg tttatagatt tgcagagcct     1740

gactctgaag agaatattat atttgaagag aacatgcagc ccaaggctgg aattccaatt     1800

atcaaagcag gaactgttat taaacttata gagaggctta cgtaccatat gtacgcagat     1860

cccaattttg ttcggacatt tcttacaaca tacagatcct tttgcaaacc tcaagaacta     1920

ctgagtctta taatagaaag gtttgaaatt ccagagcctg agccaacaga agctgatcgc     1980

atagctatag agaatggaga tcaacccttg agtgcagaac tgaaaagatt tagaaaagaa     2040

tatatacagc ctgtgcaact gcgagtatta aatgtatgtc ggcactgggt agagcaccac     2100

ttctatgatt ttgaaagaga tgcatatctt ttgcaacgaa tggaagaatt tattggaaca     2160

gtaagaggta aagcaatgaa aaaatgggtt gaatccatca ctaaaataat ccaaaggaaa     2220

aaaattgcaa gagacaatgg accaggtcat aatattacat ttcagagttc acctcccaca     2280

gttgagtggc atataagcag acctgggcac atagagactt ttgacctgct caccttacac     2340

ccaatagaaa ttgctcgaca actcacttta cttgaatcag atctataccg agctgtacag     2400

ccatcagaat tagttggaag tgtgtggaca aaagaagaca aagaaattaa ctctcctaat     2460

cttctgaaaa tgattcgaca taccaccaac ctcactctgt ggtttgagaa atgtattgta     2520

gaaactgaaa atttagaaga aagagtagct gtggtgagtc gaattattga gattctacaa     2580

gtctttcaag agttgaacaa ctttaatggt gtccttgagg ttgtcagtgc tatgaattca     2640

tcacctgttt acagactaga ccacacattt gagcaaatac caagtcgcca gaagaaaatt     2700

ttagaagaag ctcatgaatt gagtgaagat cactataaga aatatttggc aaaactcagg     2760

tctattaatc caccatgtgt gcctttcttt ggaatttatc tcactaatat cttgaaaaca     2820

gaagaaggca accctgaggt cctaaaaaga catggaaaag agcttataaa ctttagcaaa     2880

aggaggaaag tagcagaaat aacaggagag atccagcagt accaaaatca gccttactgt     2940

ttacgagtag aatcagatat caaaaggttc tttgaaaact tgaatccgat gggaaatagc     3000

atggagaagg aatttacaga ttatcttttc aacaaatccc tagaaataga accacgaaac     3060

cctaagcctc tcccaagatt tccaaaaaaa tatagctatc ccctaaaatc tcctggtgtt     3120

cgtccatcaa acccaagacc aggtaccatg aggcatccca cacctctgca gcaggagcca     3180

aggaaaatta gttatagtag gatccctgaa agtgaaacag aaagtacagc atctgcacca     3240

aattctccaa gaacaccgtt aacacctccg cctgcttctg gtgcttccag taccacagat     3300

gtttgcagtg tatttgattc cgatcattcg agcccttttc actcaagcaa tgataccgtc     3360

tttatccaag ttactctgcc ccatggccca agatctgctt ctgtatcatc tataagttta     3420

accaaaggca ctgatgaagt gcctgtccct cctcctgttc ctccacgaag acgaccagaa     3480

tctgccccag cagaatcttc accatctaag attatgtcta agcatttgga cagtccccca     3540

gccattcctc ctaggcaacc cacatcaaaa gcctattcac cacgatattc aatatcagac     3600

cggacctcta tctcagaccc tcctgaaagc cctcccttat taccaccacg agaacctgtg     3660

aggacacctg atgttttctc aagctcacca ctacatctcc aacctccccc tttgggcaaa     3720

aaaagtgacc atggcaatgc cttcttccca aacagccctt ccccctttac accacctcct     3780

cctcaaacac cttctcctca cggcacaaga aggcatctgc catcaccacc attgacacaa     3840

gaagtggacc ttcattccat tgctgggccg cctgttcctc cacgacaaag cacttctcaa     3900

catatcccta aactccctcc aaaaacttac aaaagggagc acacacaccc atccatgcac     3960

agagatggac caccactgtt ggagaatgcc cattcttcct ga                        4002


<210> 2
<211> 1333
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1333
<223> /mol_type="protein"
      /note="SOS1 (full-length protein)"
      /organism="Homo sapiens"

<400> 2
Met Gln Ala Gln Gln Leu Pro Tyr Glu Phe Phe Ser Glu Glu Asn Ala 
1               5                   10                   15    
Pro Lys Trp Arg Gly Leu Leu Val Pro Ala Leu Lys Lys Val Gln Gly 
            20                   25                  30        
Gln Val His Pro Thr Leu Glu Ser Asn Asp Asp Ala Leu Gln Tyr Val 
        35                   40                  45            
Glu Glu Leu Ile Leu Gln Leu Leu Asn Met Leu Cys Gln Ala Gln Pro 
    50                   55                  60                
Arg Ser Ala Ser Asp Val Glu Glu Arg Val Gln Lys Ser Phe Pro His 
65                   70                  75                  80
Pro Ile Asp Lys Trp Ala Ile Ala Asp Ala Gln Ser Ala Ile Glu Lys 
                85                   90                  95    
Arg Lys Arg Arg Asn Pro Leu Ser Leu Pro Val Glu Lys Ile His Pro 
            100                  105                110        
Leu Leu Lys Glu Val Leu Gly Tyr Lys Ile Asp His Gln Val Ser Val 
        115                  120                125            
Tyr Ile Val Ala Val Leu Glu Tyr Ile Ser Ala Asp Ile Leu Lys Leu 
    130                  135                140                
Val Gly Asn Tyr Val Arg Asn Ile Arg His Tyr Glu Ile Thr Lys Gln 
145                  150                155                  160
Asp Ile Lys Val Ala Met Cys Ala Asp Lys Val Leu Met Asp Met Phe 
                165                  170                175    
His Gln Asp Val Glu Asp Ile Asn Ile Leu Ser Leu Thr Asp Glu Glu 
            180                  185                190        
Pro Ser Thr Ser Gly Glu Gln Thr Tyr Tyr Asp Leu Val Lys Ala Phe 
        195                  200                205            
Met Ala Glu Ile Arg Gln Tyr Ile Arg Glu Leu Asn Leu Ile Ile Lys 
    210                  215                220                
Val Phe Arg Glu Pro Phe Val Ser Asn Ser Lys Leu Phe Ser Ala Asn 
225                  230                235                  240
Asp Val Glu Asn Ile Phe Ser Arg Ile Val Asp Ile His Glu Leu Ser 
                245                  250                255    
Val Lys Leu Leu Gly His Ile Glu Asp Thr Val Glu Met Thr Asp Glu 
            260                  265                270        
Gly Ser Pro His Pro Leu Val Gly Ser Cys Phe Glu Asp Leu Ala Glu 
        275                  280                285            
Glu Leu Ala Phe Asp Pro Tyr Glu Ser Tyr Ala Arg Asp Ile Leu Arg 
    290                  295                300                
Pro Gly Phe His Asp Arg Phe Leu Ser Gln Leu Ser Lys Pro Gly Ala 
305                  310                315                  320
Ala Leu Tyr Leu Gln Ser Ile Gly Glu Gly Phe Lys Glu Ala Val Gln 
                325                  330                335    
Tyr Val Leu Pro Arg Leu Leu Leu Ala Pro Val Tyr His Cys Leu His 
            340                  345                350        
Tyr Phe Glu Leu Leu Lys Gln Leu Glu Glu Lys Ser Glu Asp Gln Glu 
        355                  360                365            
Asp Lys Glu Cys Leu Lys Gln Ala Ile Thr Ala Leu Leu Asn Val Gln 
    370                  375                380                
Ser Gly Met Glu Lys Ile Cys Ser Lys Ser Leu Ala Lys Arg Arg Leu 
385                  390                395                  400
Ser Glu Ser Ala Cys Arg Phe Tyr Ser Gln Gln Met Lys Gly Lys Gln 
                405                  410                415    
Leu Ala Ile Lys Lys Met Asn Glu Ile Gln Lys Asn Ile Asp Gly Trp 
            420                  425                430        
Glu Gly Lys Asp Ile Gly Gln Cys Cys Asn Glu Phe Ile Met Glu Gly 
        435                  440                445            
Thr Leu Thr Arg Val Gly Ala Lys His Glu Arg His Ile Phe Leu Phe 
    450                  455                460                
Asp Gly Leu Met Ile Cys Cys Lys Ser Asn His Gly Gln Pro Arg Leu 
465                  470                475                  480
Pro Gly Ala Ser Asn Ala Glu Tyr Arg Leu Lys Glu Lys Phe Phe Met 
                485                  490                495    
Arg Lys Val Gln Ile Asn Asp Lys Asp Asp Thr Asn Glu Tyr Lys His 
            500                  505                510        
Ala Phe Glu Ile Ile Leu Lys Asp Glu Asn Ser Val Ile Phe Ser Ala 
        515                  520                525            
Lys Ser Ala Glu Glu Lys Asn Asn Trp Met Ala Ala Leu Ile Ser Leu 
    530                  535                540                
Gln Tyr Arg Ser Thr Leu Glu Arg Met Leu Asp Val Thr Met Leu Gln 
545                  550                555                  560
Glu Glu Lys Glu Glu Gln Met Arg Leu Pro Ser Ala Asp Val Tyr Arg 
                565                  570                575    
Phe Ala Glu Pro Asp Ser Glu Glu Asn Ile Ile Phe Glu Glu Asn Met 
            580                  585                590        
Gln Pro Lys Ala Gly Ile Pro Ile Ile Lys Ala Gly Thr Val Ile Lys 
        595                  600                605            
Leu Ile Glu Arg Leu Thr Tyr His Met Tyr Ala Asp Pro Asn Phe Val 
    610                  615                620                
Arg Thr Phe Leu Thr Thr Tyr Arg Ser Phe Cys Lys Pro Gln Glu Leu 
625                  630                635                  640
Leu Ser Leu Ile Ile Glu Arg Phe Glu Ile Pro Glu Pro Glu Pro Thr 
                645                  650                655    
Glu Ala Asp Arg Ile Ala Ile Glu Asn Gly Asp Gln Pro Leu Ser Ala 
            660                  665                670        
Glu Leu Lys Arg Phe Arg Lys Glu Tyr Ile Gln Pro Val Gln Leu Arg 
        675                  680                685            
Val Leu Asn Val Cys Arg His Trp Val Glu His His Phe Tyr Asp Phe 
    690                  695                700                
Glu Arg Asp Ala Tyr Leu Leu Gln Arg Met Glu Glu Phe Ile Gly Thr 
705                  710                715                  720
Val Arg Gly Lys Ala Met Lys Lys Trp Val Glu Ser Ile Thr Lys Ile 
                725                  730                735    
Ile Gln Arg Lys Lys Ile Ala Arg Asp Asn Gly Pro Gly His Asn Ile 
            740                  745                750        
Thr Phe Gln Ser Ser Pro Pro Thr Val Glu Trp His Ile Ser Arg Pro 
        755                  760                765            
Gly His Ile Glu Thr Phe Asp Leu Leu Thr Leu His Pro Ile Glu Ile 
    770                  775                780                
Ala Arg Gln Leu Thr Leu Leu Glu Ser Asp Leu Tyr Arg Ala Val Gln 
785                  790                795                  800
Pro Ser Glu Leu Val Gly Ser Val Trp Thr Lys Glu Asp Lys Glu Ile 
                805                  810                815    
Asn Ser Pro Asn Leu Leu Lys Met Ile Arg His Thr Thr Asn Leu Thr 
            820                  825                830        
Leu Trp Phe Glu Lys Cys Ile Val Glu Thr Glu Asn Leu Glu Glu Arg 
        835                  840                845            
Val Ala Val Val Ser Arg Ile Ile Glu Ile Leu Gln Val Phe Gln Glu 
    850                  855                860                
Leu Asn Asn Phe Asn Gly Val Leu Glu Val Val Ser Ala Met Asn Ser 
865                  870                875                  880
Ser Pro Val Tyr Arg Leu Asp His Thr Phe Glu Gln Ile Pro Ser Arg 
                885                  890                895    
Gln Lys Lys Ile Leu Glu Glu Ala His Glu Leu Ser Glu Asp His Tyr 
            900                  905                910        
Lys Lys Tyr Leu Ala Lys Leu Arg Ser Ile Asn Pro Pro Cys Val Pro 
        915                  920                925            
Phe Phe Gly Ile Tyr Leu Thr Asn Ile Leu Lys Thr Glu Glu Gly Asn 
    930                  935                940                
Pro Glu Val Leu Lys Arg His Gly Lys Glu Leu Ile Asn Phe Ser Lys 
945                  950                955                  960
Arg Arg Lys Val Ala Glu Ile Thr Gly Glu Ile Gln Gln Tyr Gln Asn 
                965                  970                975    
Gln Pro Tyr Cys Leu Arg Val Glu Ser Asp Ile Lys Arg Phe Phe Glu 
            980                  985                990        
Asn Leu Asn Pro Met Gly Asn Ser Met Glu Lys Glu Phe Thr Asp Tyr 
        995                  1000                1005            
Leu Phe Asn Lys Ser Leu Glu Ile Glu Pro Arg Asn Pro Lys Pro Leu 
    1010                1015                1020                
Pro Arg Phe Pro Lys Lys Tyr Ser Tyr Pro Leu Lys Ser Pro Gly Val 
1025                1030                1035                1040
Arg Pro Ser Asn Pro Arg Pro Gly Thr Met Arg His Pro Thr Pro Leu 
                1045                1050                1055    
Gln Gln Glu Pro Arg Lys Ile Ser Tyr Ser Arg Ile Pro Glu Ser Glu 
            1060                1065                1070        
Thr Glu Ser Thr Ala Ser Ala Pro Asn Ser Pro Arg Thr Pro Leu Thr 
        1075                1080                1085            
Pro Pro Pro Ala Ser Gly Ala Ser Ser Thr Thr Asp Val Cys Ser Val 
    1090                1095                1100                
Phe Asp Ser Asp His Ser Ser Pro Phe His Ser Ser Asn Asp Thr Val 
1105                1110                1115                1120
Phe Ile Gln Val Thr Leu Pro His Gly Pro Arg Ser Ala Ser Val Ser 
                1125                1130                1135    
Ser Ile Ser Leu Thr Lys Gly Thr Asp Glu Val Pro Val Pro Pro Pro 
            1140                1145                1150        
Val Pro Pro Arg Arg Arg Pro Glu Ser Ala Pro Ala Glu Ser Ser Pro 
        1155                1160                1165            
Ser Lys Ile Met Ser Lys His Leu Asp Ser Pro Pro Ala Ile Pro Pro 
    1170                1175                1180                
Arg Gln Pro Thr Ser Lys Ala Tyr Ser Pro Arg Tyr Ser Ile Ser Asp 
1185                1190                1195                1200
Arg Thr Ser Ile Ser Asp Pro Pro Glu Ser Pro Pro Leu Leu Pro Pro 
                1205                1210                1215    
Arg Glu Pro Val Arg Thr Pro Asp Val Phe Ser Ser Ser Pro Leu His 
            1220                1225                1230        
Leu Gln Pro Pro Pro Leu Gly Lys Lys Ser Asp His Gly Asn Ala Phe 
        1235                1240                1245            
Phe Pro Asn Ser Pro Ser Pro Phe Thr Pro Pro Pro Pro Gln Thr Pro 
    1250                1255                1260                
Ser Pro His Gly Thr Arg Arg His Leu Pro Ser Pro Pro Leu Thr Gln 
1265                1270                1275                1280
Glu Val Asp Leu His Ser Ile Ala Gly Pro Pro Val Pro Pro Arg Gln 
                1285                1290                1295    
Ser Thr Ser Gln His Ile Pro Lys Leu Pro Pro Lys Thr Tyr Lys Arg 
            1300                1305                1310        
Glu His Thr His Pro Ser Met His Arg Asp Gly Pro Pro Leu Leu Glu 
        1315                1320                1325            
Asn Ala His Ser Ser 
    1330            

<210> 3
<211> 864
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..864
<223> /mol_type="DNA"
      /note="SOS1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 3
atgcaggcgc agcagctgcc ctacgagttt ttcagcgaag agaacgcgcc caagtggcgg      60

ggactactgg tgcctgcgct gaaaaaggtc caggggcaag ttcatcctac tctcgagtct     120

aatgatgatg ctcttcagta tgttgaagaa ttaattttgc aattattaaa tatgctatgc     180

caagctcagc cccgaagtgc ttcagatgta gaggaacgtg ttcaaaaaag tttccctcat     240

ccaattgata aatgggcaat agctgatgcc caatcagcta ttgaaaagag gaagcgaaga     300

aaccctttat ctctcccagt agaaaaaatt catcctttat taaaggaggt cctaggttat     360

aaaattgacc accaggtttc tgtttacata gtagcagtct tagaatacat ttctgcagac     420

attttaaagc tggttgggaa ttatgtaaga aatatacggc attatgaaat tacaaaacaa     480

gatattaaag tggcaatgtg tgctgacaag gtattgatgg atatgtttca tcaagatgta     540

gaagatatta atatattatc tttaactgac gaagagcctt ccacctcagg agaacaaact     600

tactatgatt tggtaaaagc atttatggca gaaattcgac aatatataag ggaactaaat     660

ctaattataa aagtttttag agagcccttt gtctccaatt caaaattgtt ttcagctaat     720

gatgtagaaa atatatttag tcgcatagta gatatacatg aacttagtgt aaagttactg     780

ggccatatag aagatacagt agaaatgaca gatgaaggca gtccccatcc actagtagga     840

agctgctttg aagacttagc agag                                            864


<210> 4
<211> 288
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..288
<223> /mol_type="protein"
      /note="SOS1 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 4
Met Gln Ala Gln Gln Leu Pro Tyr Glu Phe Phe Ser Glu Glu Asn Ala 
1               5                   10                   15    
Pro Lys Trp Arg Gly Leu Leu Val Pro Ala Leu Lys Lys Val Gln Gly 
            20                   25                  30        
Gln Val His Pro Thr Leu Glu Ser Asn Asp Asp Ala Leu Gln Tyr Val 
        35                   40                  45            
Glu Glu Leu Ile Leu Gln Leu Leu Asn Met Leu Cys Gln Ala Gln Pro 
    50                   55                  60                
Arg Ser Ala Ser Asp Val Glu Glu Arg Val Gln Lys Ser Phe Pro His 
65                   70                  75                  80
Pro Ile Asp Lys Trp Ala Ile Ala Asp Ala Gln Ser Ala Ile Glu Lys 
                85                   90                  95    
Arg Lys Arg Arg Asn Pro Leu Ser Leu Pro Val Glu Lys Ile His Pro 
            100                  105                110        
Leu Leu Lys Glu Val Leu Gly Tyr Lys Ile Asp His Gln Val Ser Val 
        115                  120                125            
Tyr Ile Val Ala Val Leu Glu Tyr Ile Ser Ala Asp Ile Leu Lys Leu 
    130                  135                140                
Val Gly Asn Tyr Val Arg Asn Ile Arg His Tyr Glu Ile Thr Lys Gln 
145                  150                155                  160
Asp Ile Lys Val Ala Met Cys Ala Asp Lys Val Leu Met Asp Met Phe 
                165                  170                175    
His Gln Asp Val Glu Asp Ile Asn Ile Leu Ser Leu Thr Asp Glu Glu 
            180                  185                190        
Pro Ser Thr Ser Gly Glu Gln Thr Tyr Tyr Asp Leu Val Lys Ala Phe 
        195                  200                205            
Met Ala Glu Ile Arg Gln Tyr Ile Arg Glu Leu Asn Leu Ile Ile Lys 
    210                  215                220                
Val Phe Arg Glu Pro Phe Val Ser Asn Ser Lys Leu Phe Ser Ala Asn 
225                  230                235                  240
Asp Val Glu Asn Ile Phe Ser Arg Ile Val Asp Ile His Glu Leu Ser 
                245                  250                255    
Val Lys Leu Leu Gly His Ile Glu Asp Thr Val Glu Met Thr Asp Glu 
            260                  265                270        
Gly Ser Pro His Pro Leu Val Gly Ser Cys Phe Glu Asp Leu Ala Glu 
        275                  280                285            

<210> 5
<211> 3435
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3435
<223> /mol_type="DNA"
      /note="CCDS nucleotide sequence of ADCY3 (Gene ID: 109)"
      /organism="Homo sapiens"

<400> 5
atgccgagga accagggctt ctccgagccc gaatactcgg ccgagtactc agccgagtac       60

tccgtcagcc tgccctccga ccctgaccgc ggggtgggcc ggacccatga aatctcggtc      120

cggaactcgg gctcctgcct gtgcctgcct cgcttcatgc ggctgacttt cgtgccggag      180

tccttggaga acctctacca gacctacttc aaaaggcagc gccacgagac cctgctggtg      240

ctggtggtct ttgcagccct ctttgactgc tacgtggtgg tcatgtgtgc tgtggtcttc      300

tccagcgaca agctggcttc cctcgccgtg gctggaattg gactggtgtt ggacatcatc      360

ctcttcgtgc tctgcaaaaa ggggctgctc ccggaccggg tcacccgcag agtgctgccc      420

tacgtgctgt ggctgctcat aaccgcccag atcttctcct acctgggcct gaacttcgcg      480

cgtgcccacg cggctagtga cacggtgggc tggcaggtct tctttgtctt ctccttcttc      540

atcacgctgc ccctcagcct cagccccatc gtgatcatct ccgtggtctc ctgtgtggtg      600

cacacgttgg tcctgggggt caccgtggcc cagcagcagc aggaggagct caaggggatg      660

cagctgctgc gggagatcct ggccaacgtc ttcctctacc tgtgcgccat cgctgtgggc      720

atcatgtcct actacatggc tgaccgcaag caccgcaagg ccttcctgga ggcccgccag      780

tcgctggagg tgaagatgaa cctggaagag cagagccagc agcaggagaa cctcatgctt      840

tccatcctgc ccaagcacgt ggctgacgag atgctgaaag acatgaagaa agacgagagc      900

cagaaggacc agcagcagtt caacaccatg tacatgtacc gtcacgagaa cgtcagcatc      960

ctctttgccg acatcgtggg ctttacccag ctgtcttctg cctgcagtgc ccaggagctt     1020

gtgaagctgc tcaacgagct ctttgcccgc tttgacaagc tggcagctaa ataccaccag     1080

ctgcggatta agatcctggg cgactgctac tactgcatct gcggcttgcc cgactaccgg     1140

gaggaccacg ccgtctgctc catcctcatg gggctggcca tggtggaggc catctcgtat     1200

gtgcgggaga agaccaagac tggggtggac atgcgtgtgg gggtgcacac gggcaccgtg     1260

ctggggggcg tcctgggcca gaagcgctgg cagtacgacg tgtggtcgac tgatgtcact     1320

gtagccaaca agatggaggc cggcggcatc cctgggcgcg tgcacatctc ccagagcacc     1380

atggactgcc tgaaagggga gtttgatgtg gagccaggcg atgggggcag ccgctgtgat     1440

tacctagaag agaagggtat tgaaacctac ctcatcattg cctccaagcc agaggtgaag     1500

aaaacagcca cccagaatgg cctcaatggc tcggccctgc ccaatggagc accagcttcc     1560

tcaaagtcca gctcccctgc cctcattgag accaaggagc ccaacgggag tgcccacagc     1620

agtgggtcca cgtcggagaa gcccgaggag caggatgccc aggccgacaa cccctcattc     1680

cccaacccac gccggaggct gcgcctgcag gacctggctg accgagtggt ggatgcctct     1740

gaagatgagc acgagctcaa ccagctgctc aacgaggccc tgcttgagcg agagtccgcc     1800

caagtagtaa agaagagaaa caccttcctc ttgtccatgc ggttcatgga ccccgagatg     1860

gaaacccgct actcggtgga gaaggagaag cagagtgggg ctgccttcag ctgctcctgc     1920

gtcgtcctgc tctgcacggc cctggtcgag atactcatcg acccctggct aatgacaaac     1980

tatgtgacct tcatggtggg ggagattctg ctcctcatcc tgaccatctg ctccctggct     2040

gccatctttc cccgggcctt tcctaagaag cttgtggcct tctcaacttg gattgaccgg     2100

acccgctggg ccaggaacac ctgggccatg ctcgccatct tcatcctggt gatggcaaat     2160

gtcgtggaca tgctcagctg tctccagtac tacacgggac ccagcaatgc aacggcaggg     2220

atggaaacgg agggcagctg cctggagaac cccaagtatt acaactatgt ggccgtgctg     2280

tccctcatcg ccaccatcat gctggtgcag gtcagccaca tggtgaagct cacgctcatg     2340

ctgctcgtcg caggcgccgt ggccaccatc aacctctatg cctggcgtcc cgtctttgat     2400

gaatacgacc acaagcgttt tcgggagcac gacttaccta tggtggcctt agagcagatg     2460

caaggattca accctgggct caatggcact gacaggctgc ccctggtgcc ttccaagtac     2520

tctatgacgg tgatggtgtt cctcatgatg ctcagcttct actacttctc ccgccacgta     2580

gaaaaactgg cacggacact tttcttgtgg aagattgagg tccacgacca gaaggaacgt     2640

gtctatgaga tgcgacgctg gaacgaggcc ttggtcacca acatgttgcc tgagcacgtg     2700

gcacgccatt tcctggggtc caagaagaga gatgaggagc tgtatagcca gacgtatgat     2760

gagattggag tcatgtttgc ctccctgccc aactttgctg acttctacac agaggagagc     2820

atcaacaatg gtggtattga gtgtctgcgt ttcctcaatg aaatcatctc agattttgac     2880

tctctcctgg acaatcccaa gttccgggtg atcaccaaga tcaaaaccat tggcagcacg     2940

tatatggcgg cttcaggagt cacccccgat gtcaacacca atggctttgc cagctccaac     3000

aaggaagaca agtccgagag agagcgctgg cagcacctgg ctgacctggc cgacttcgcg     3060

ctggccatga aggatacgct caccaacatc aacaaccagt ccttcaataa cttcatgctg     3120

cgcataggca tgaacaaagg cggggttctg gctggggtca tcggagcccg gaaaccacac     3180

tacgacatct ggggcaatac agtcaatgta gccagcagga tggagtccac gggggtcatg     3240

ggcaacattc aggtggtaga agaaacccaa gtcatcctcc gagagtacgg cttccgcttt     3300

gtgaggcgag gccccatctt tgtgaagggg aagggggagc tgctgacctt cttcttgaag     3360

gggcgggata agctagccac cttccccaat ggcccctctg tcacactgcc ccaccaggtg     3420

gtggacaact cctga                                                      3435


<210> 6
<211> 1144
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1144
<223> /mol_type="protein"
      /note="ADCY3 (full-length protein)"
      /organism="Homo sapiens"

<400> 6
Met Pro Arg Asn Gln Gly Phe Ser Glu Pro Glu Tyr Ser Ala Glu Tyr 
1               5                   10                   15    
Ser Ala Glu Tyr Ser Val Ser Leu Pro Ser Asp Pro Asp Arg Gly Val 
            20                   25                  30        
Gly Arg Thr His Glu Ile Ser Val Arg Asn Ser Gly Ser Cys Leu Cys 
        35                   40                  45            
Leu Pro Arg Phe Met Arg Leu Thr Phe Val Pro Glu Ser Leu Glu Asn 
    50                   55                  60                
Leu Tyr Gln Thr Tyr Phe Lys Arg Gln Arg His Glu Thr Leu Leu Val 
65                   70                  75                  80
Leu Val Val Phe Ala Ala Leu Phe Asp Cys Tyr Val Val Val Met Cys 
                85                   90                  95    
Ala Val Val Phe Ser Ser Asp Lys Leu Ala Ser Leu Ala Val Ala Gly 
            100                  105                110        
Ile Gly Leu Val Leu Asp Ile Ile Leu Phe Val Leu Cys Lys Lys Gly 
        115                  120                125            
Leu Leu Pro Asp Arg Val Thr Arg Arg Val Leu Pro Tyr Val Leu Trp 
    130                  135                140                
Leu Leu Ile Thr Ala Gln Ile Phe Ser Tyr Leu Gly Leu Asn Phe Ala 
145                  150                155                  160
Arg Ala His Ala Ala Ser Asp Thr Val Gly Trp Gln Val Phe Phe Val 
                165                  170                175    
Phe Ser Phe Phe Ile Thr Leu Pro Leu Ser Leu Ser Pro Ile Val Ile 
            180                  185                190        
Ile Ser Val Val Ser Cys Val Val His Thr Leu Val Leu Gly Val Thr 
        195                  200                205            
Val Ala Gln Gln Gln Gln Glu Glu Leu Lys Gly Met Gln Leu Leu Arg 
    210                  215                220                
Glu Ile Leu Ala Asn Val Phe Leu Tyr Leu Cys Ala Ile Ala Val Gly 
225                  230                235                  240
Ile Met Ser Tyr Tyr Met Ala Asp Arg Lys His Arg Lys Ala Phe Leu 
                245                  250                255    
Glu Ala Arg Gln Ser Leu Glu Val Lys Met Asn Leu Glu Glu Gln Ser 
            260                  265                270        
Gln Gln Gln Glu Asn Leu Met Leu Ser Ile Leu Pro Lys His Val Ala 
        275                  280                285            
Asp Glu Met Leu Lys Asp Met Lys Lys Asp Glu Ser Gln Lys Asp Gln 
    290                  295                300                
Gln Gln Phe Asn Thr Met Tyr Met Tyr Arg His Glu Asn Val Ser Ile 
305                  310                315                  320
Leu Phe Ala Asp Ile Val Gly Phe Thr Gln Leu Ser Ser Ala Cys Ser 
                325                  330                335    
Ala Gln Glu Leu Val Lys Leu Leu Asn Glu Leu Phe Ala Arg Phe Asp 
            340                  345                350        
Lys Leu Ala Ala Lys Tyr His Gln Leu Arg Ile Lys Ile Leu Gly Asp 
        355                  360                365            
Cys Tyr Tyr Cys Ile Cys Gly Leu Pro Asp Tyr Arg Glu Asp His Ala 
    370                  375                380                
Val Cys Ser Ile Leu Met Gly Leu Ala Met Val Glu Ala Ile Ser Tyr 
385                  390                395                  400
Val Arg Glu Lys Thr Lys Thr Gly Val Asp Met Arg Val Gly Val His 
                405                  410                415    
Thr Gly Thr Val Leu Gly Gly Val Leu Gly Gln Lys Arg Trp Gln Tyr 
            420                  425                430        
Asp Val Trp Ser Thr Asp Val Thr Val Ala Asn Lys Met Glu Ala Gly 
        435                  440                445            
Gly Ile Pro Gly Arg Val His Ile Ser Gln Ser Thr Met Asp Cys Leu 
    450                  455                460                
Lys Gly Glu Phe Asp Val Glu Pro Gly Asp Gly Gly Ser Arg Cys Asp 
465                  470                475                  480
Tyr Leu Glu Glu Lys Gly Ile Glu Thr Tyr Leu Ile Ile Ala Ser Lys 
                485                  490                495    
Pro Glu Val Lys Lys Thr Ala Thr Gln Asn Gly Leu Asn Gly Ser Ala 
            500                  505                510        
Leu Pro Asn Gly Ala Pro Ala Ser Ser Lys Ser Ser Ser Pro Ala Leu 
        515                  520                525            
Ile Glu Thr Lys Glu Pro Asn Gly Ser Ala His Ser Ser Gly Ser Thr 
    530                  535                540                
Ser Glu Lys Pro Glu Glu Gln Asp Ala Gln Ala Asp Asn Pro Ser Phe 
545                  550                555                  560
Pro Asn Pro Arg Arg Arg Leu Arg Leu Gln Asp Leu Ala Asp Arg Val 
                565                  570                575    
Val Asp Ala Ser Glu Asp Glu His Glu Leu Asn Gln Leu Leu Asn Glu 
            580                  585                590        
Ala Leu Leu Glu Arg Glu Ser Ala Gln Val Val Lys Lys Arg Asn Thr 
        595                  600                605            
Phe Leu Leu Ser Met Arg Phe Met Asp Pro Glu Met Glu Thr Arg Tyr 
    610                  615                620                
Ser Val Glu Lys Glu Lys Gln Ser Gly Ala Ala Phe Ser Cys Ser Cys 
625                  630                635                  640
Val Val Leu Leu Cys Thr Ala Leu Val Glu Ile Leu Ile Asp Pro Trp 
                645                  650                655    
Leu Met Thr Asn Tyr Val Thr Phe Met Val Gly Glu Ile Leu Leu Leu 
            660                  665                670        
Ile Leu Thr Ile Cys Ser Leu Ala Ala Ile Phe Pro Arg Ala Phe Pro 
        675                  680                685            
Lys Lys Leu Val Ala Phe Ser Thr Trp Ile Asp Arg Thr Arg Trp Ala 
    690                  695                700                
Arg Asn Thr Trp Ala Met Leu Ala Ile Phe Ile Leu Val Met Ala Asn 
705                  710                715                  720
Val Val Asp Met Leu Ser Cys Leu Gln Tyr Tyr Thr Gly Pro Ser Asn 
                725                  730                735    
Ala Thr Ala Gly Met Glu Thr Glu Gly Ser Cys Leu Glu Asn Pro Lys 
            740                  745                750        
Tyr Tyr Asn Tyr Val Ala Val Leu Ser Leu Ile Ala Thr Ile Met Leu 
        755                  760                765            
Val Gln Val Ser His Met Val Lys Leu Thr Leu Met Leu Leu Val Ala 
    770                  775                780                
Gly Ala Val Ala Thr Ile Asn Leu Tyr Ala Trp Arg Pro Val Phe Asp 
785                  790                795                  800
Glu Tyr Asp His Lys Arg Phe Arg Glu His Asp Leu Pro Met Val Ala 
                805                  810                815    
Leu Glu Gln Met Gln Gly Phe Asn Pro Gly Leu Asn Gly Thr Asp Arg 
            820                  825                830        
Leu Pro Leu Val Pro Ser Lys Tyr Ser Met Thr Val Met Val Phe Leu 
        835                  840                845            
Met Met Leu Ser Phe Tyr Tyr Phe Ser Arg His Val Glu Lys Leu Ala 
    850                  855                860                
Arg Thr Leu Phe Leu Trp Lys Ile Glu Val His Asp Gln Lys Glu Arg 
865                  870                875                  880
Val Tyr Glu Met Arg Arg Trp Asn Glu Ala Leu Val Thr Asn Met Leu 
                885                  890                895    
Pro Glu His Val Ala Arg His Phe Leu Gly Ser Lys Lys Arg Asp Glu 
            900                  905                910        
Glu Leu Tyr Ser Gln Thr Tyr Asp Glu Ile Gly Val Met Phe Ala Ser 
        915                  920                925            
Leu Pro Asn Phe Ala Asp Phe Tyr Thr Glu Glu Ser Ile Asn Asn Gly 
    930                  935                940                
Gly Ile Glu Cys Leu Arg Phe Leu Asn Glu Ile Ile Ser Asp Phe Asp 
945                  950                955                  960
Ser Leu Leu Asp Asn Pro Lys Phe Arg Val Ile Thr Lys Ile Lys Thr 
                965                  970                975    
Ile Gly Ser Thr Tyr Met Ala Ala Ser Gly Val Thr Pro Asp Val Asn 
            980                  985                990        
Thr Asn Gly Phe Ala Ser Ser Asn Lys Glu Asp Lys Ser Glu Arg Glu 
        995                  1000                1005            
Arg Trp Gln His Leu Ala Asp Leu Ala Asp Phe Ala Leu Ala Met Lys 
    1010                1015                1020                
Asp Thr Leu Thr Asn Ile Asn Asn Gln Ser Phe Asn Asn Phe Met Leu 
1025                1030                1035                1040
Arg Ile Gly Met Asn Lys Gly Gly Val Leu Ala Gly Val Ile Gly Ala 
                1045                1050                1055    
Arg Lys Pro His Tyr Asp Ile Trp Gly Asn Thr Val Asn Val Ala Ser 
            1060                1065                1070        
Arg Met Glu Ser Thr Gly Val Met Gly Asn Ile Gln Val Val Glu Glu 
        1075                1080                1085            
Thr Gln Val Ile Leu Arg Glu Tyr Gly Phe Arg Phe Val Arg Arg Gly 
    1090                1095                1100                
Pro Ile Phe Val Lys Gly Lys Gly Glu Leu Leu Thr Phe Phe Leu Lys 
1105                1110                1115                1120
Gly Arg Asp Lys Leu Ala Thr Phe Pro Asn Gly Pro Ser Val Thr Leu 
                1125                1130                1135    
Pro His Gln Val Val Asp Asn Ser 
            1140                

<210> 7
<211> 2760
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..2760
<223> /mol_type="DNA"
      /note="ADCY3 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 7
atcctggcca acgtcttcct ctacctgtgc gccatcgctg tgggcatcat gtcctactac      60

atggctgacc gcaagcaccg caaggccttc ctggaggccc gccagtcgct ggaggtgaag     120

atgaacctgg aagagcagag ccagcagcag gagaacctca tgctttccat cctgcccaag     180

cacgtggctg acgagatgct gaaagacatg aagaaagacg agagccagaa ggaccagcag     240

cagttcaaca ccatgtacat gtaccgtcac gagaacgtca gcatcctctt tgccgacatc     300

gtgggcttta cccagctgtc ttctgcctgc agtgcccagg agcttgtgaa gctgctcaac     360

gagctctttg cccgctttga caagctggca gctaaatacc accagctgcg gattaagatc     420

ctgggcgact gctactactg catctgcggc ttgcccgact accgggagga ccacgccgtc     480

tgctccatcc tcatggggct ggccatggtg gaggccatct cgtatgtgcg ggagaagacc     540

aagactgggg tggacatgcg tgtgggggtg cacacgggca ccgtgctggg gggcgtcctg     600

ggccagaagc gctggcagta cgacgtgtgg tcgactgatg tcactgtagc caacaagatg     660

gaggccggcg gcatccctgg gcgcgtgcac atctcccaga gcaccatgga ctgcctgaaa     720

ggggagtttg atgtggagcc aggcgatggg ggcagccgct gtgattacct agaagagaag     780

ggtattgaaa cctacctcat cattgcctcc aagccagagg tgaagaaaac agccacccag     840

aatggcctca atggctcggc cctgcccaat ggagcaccag cttcctcaaa gtccagctcc     900

cctgccctca ttgagaccaa ggagcccaac gggagtgccc acagcagtgg gtccacgtcg     960

gagaagcccg aggagcagga tgcccaggcc gacaacccct cattccccaa cccacgccgg    1020

aggctgcgcc tgcaggacct ggctgaccga gtggtggatg cctctgaaga tgagcacgag    1080

ctcaaccagc tgctcaacga ggccctgctt gagcgagagt ccgcccaagt agtaaagaag    1140

agaaacacct tcctcttgtc catgcggttc atggaccccg agatggaaac ccgctactcg    1200

gtggagaagg agaagcagag tggggctgcc ttcagctgct cctgcgtcgt cctgctctgc    1260

acggccctgg tcgagatact catcgacccc tggctaatga caaactatgt gaccttcatg    1320

gtgggggaga ttctgctcct catcctgacc atctgctccc tggctgccat ctttccccgg    1380

gcctttccta agaagcttgt ggccttctca acttggattg accggacccg ctgggccagg    1440

aacacctggg ccatgctcgc catcttcatc ctggtgatgg caaatgtcgt ggacatgctc    1500

agctgtctcc agtactacac gggacccagc aatgcaacgg cagggatgga aacggagggc    1560

agctgcctgg agaaccccaa gtattacaac tatgtggccg tgctgtccct catcgccacc    1620

atcatgctgg tgcaggtcag ccacatggtg aagctcacgc tcatgctgct cgtcgcaggc    1680

gccgtggcca ccatcaacct ctatgcctgg cgtcccgtct ttgatgaata cgaccacaag    1740

cgttttcggg agcacgactt acctatggtg gccttagagc agatgcaagg attcaaccct    1800

gggctcaatg gcactgacag gctgcccctg gtgccttcca agtactctat gacggtgatg    1860

gtgttcctca tgatgctcag cttctactac ttctcccgcc acgtagaaaa actggcacgg    1920

acacttttct tgtggaagat tgaggtccac gaccagaagg aacgtgtcta tgagatgcga    1980

cgctggaacg aggccttggt caccaacatg ttgcctgagc acgtggcacg ccatttcctg    2040

gggtccaaga agagagatga ggagctgtat agccagacgt atgatgagat tggagtcatg    2100

tttgcctccc tgcccaactt tgctgacttc tacacagagg agagcatcaa caatggtggt    2160

attgagtgtc tgcgtttcct caatgaaatc atctcagatt ttgactctct cctggacaat    2220

cccaagttcc gggtgatcac caagatcaaa accattggca gcacgtatat ggcggcttca    2280

ggagtcaccc ccgatgtcaa caccaatggc tttgccagct ccaacaagga agacaagtcc    2340

gagagagagc gctggcagca cctggctgac ctggccgact tcgcgctggc catgaaggat    2400

acgctcacca acatcaacaa ccagtccttc aataacttca tgctgcgcat aggcatgaac    2460

aaaggcgggg ttctggctgg ggtcatcgga gcccggaaac cacactacga catctggggc    2520

aatacagtca atgtagccag caggatggag tccacggggg tcatgggcaa cattcaggtg    2580

gtagaagaaa cccaagtcat cctccgagag tacggcttcc gctttgtgag gcgaggcccc    2640

atctttgtga aggggaaggg ggagctgctg accttcttct tgaaggggcg ggataagcta    2700

gccaccttcc ccaatggccc ctctgtcaca ctgccccacc aggtggtgga caactcctga    2760


<210> 8
<211> 919
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..919
<223> /mol_type="protein"
      /note="ADCY3 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 8
Ile Leu Ala Asn Val Phe Leu Tyr Leu Cys Ala Ile Ala Val Gly Ile 
1               5                   10                   15    
Met Ser Tyr Tyr Met Ala Asp Arg Lys His Arg Lys Ala Phe Leu Glu 
            20                   25                  30        
Ala Arg Gln Ser Leu Glu Val Lys Met Asn Leu Glu Glu Gln Ser Gln 
        35                   40                  45            
Gln Gln Glu Asn Leu Met Leu Ser Ile Leu Pro Lys His Val Ala Asp 
    50                   55                  60                
Glu Met Leu Lys Asp Met Lys Lys Asp Glu Ser Gln Lys Asp Gln Gln 
65                   70                  75                  80
Gln Phe Asn Thr Met Tyr Met Tyr Arg His Glu Asn Val Ser Ile Leu 
                85                   90                  95    
Phe Ala Asp Ile Val Gly Phe Thr Gln Leu Ser Ser Ala Cys Ser Ala 
            100                  105                110        
Gln Glu Leu Val Lys Leu Leu Asn Glu Leu Phe Ala Arg Phe Asp Lys 
        115                  120                125            
Leu Ala Ala Lys Tyr His Gln Leu Arg Ile Lys Ile Leu Gly Asp Cys 
    130                  135                140                
Tyr Tyr Cys Ile Cys Gly Leu Pro Asp Tyr Arg Glu Asp His Ala Val 
145                  150                155                  160
Cys Ser Ile Leu Met Gly Leu Ala Met Val Glu Ala Ile Ser Tyr Val 
                165                  170                175    
Arg Glu Lys Thr Lys Thr Gly Val Asp Met Arg Val Gly Val His Thr 
            180                  185                190        
Gly Thr Val Leu Gly Gly Val Leu Gly Gln Lys Arg Trp Gln Tyr Asp 
        195                  200                205            
Val Trp Ser Thr Asp Val Thr Val Ala Asn Lys Met Glu Ala Gly Gly 
    210                  215                220                
Ile Pro Gly Arg Val His Ile Ser Gln Ser Thr Met Asp Cys Leu Lys 
225                  230                235                  240
Gly Glu Phe Asp Val Glu Pro Gly Asp Gly Gly Ser Arg Cys Asp Tyr 
                245                  250                255    
Leu Glu Glu Lys Gly Ile Glu Thr Tyr Leu Ile Ile Ala Ser Lys Pro 
            260                  265                270        
Glu Val Lys Lys Thr Ala Thr Gln Asn Gly Leu Asn Gly Ser Ala Leu 
        275                  280                285            
Pro Asn Gly Ala Pro Ala Ser Ser Lys Ser Ser Ser Pro Ala Leu Ile 
    290                  295                300                
Glu Thr Lys Glu Pro Asn Gly Ser Ala His Ser Ser Gly Ser Thr Ser 
305                  310                315                  320
Glu Lys Pro Glu Glu Gln Asp Ala Gln Ala Asp Asn Pro Ser Phe Pro 
                325                  330                335    
Asn Pro Arg Arg Arg Leu Arg Leu Gln Asp Leu Ala Asp Arg Val Val 
            340                  345                350        
Asp Ala Ser Glu Asp Glu His Glu Leu Asn Gln Leu Leu Asn Glu Ala 
        355                  360                365            
Leu Leu Glu Arg Glu Ser Ala Gln Val Val Lys Lys Arg Asn Thr Phe 
    370                  375                380                
Leu Leu Ser Met Arg Phe Met Asp Pro Glu Met Glu Thr Arg Tyr Ser 
385                  390                395                  400
Val Glu Lys Glu Lys Gln Ser Gly Ala Ala Phe Ser Cys Ser Cys Val 
                405                  410                415    
Val Leu Leu Cys Thr Ala Leu Val Glu Ile Leu Ile Asp Pro Trp Leu 
            420                  425                430        
Met Thr Asn Tyr Val Thr Phe Met Val Gly Glu Ile Leu Leu Leu Ile 
        435                  440                445            
Leu Thr Ile Cys Ser Leu Ala Ala Ile Phe Pro Arg Ala Phe Pro Lys 
    450                  455                460                
Lys Leu Val Ala Phe Ser Thr Trp Ile Asp Arg Thr Arg Trp Ala Arg 
465                  470                475                  480
Asn Thr Trp Ala Met Leu Ala Ile Phe Ile Leu Val Met Ala Asn Val 
                485                  490                495    
Val Asp Met Leu Ser Cys Leu Gln Tyr Tyr Thr Gly Pro Ser Asn Ala 
            500                  505                510        
Thr Ala Gly Met Glu Thr Glu Gly Ser Cys Leu Glu Asn Pro Lys Tyr 
        515                  520                525            
Tyr Asn Tyr Val Ala Val Leu Ser Leu Ile Ala Thr Ile Met Leu Val 
    530                  535                540                
Gln Val Ser His Met Val Lys Leu Thr Leu Met Leu Leu Val Ala Gly 
545                  550                555                  560
Ala Val Ala Thr Ile Asn Leu Tyr Ala Trp Arg Pro Val Phe Asp Glu 
                565                  570                575    
Tyr Asp His Lys Arg Phe Arg Glu His Asp Leu Pro Met Val Ala Leu 
            580                  585                590        
Glu Gln Met Gln Gly Phe Asn Pro Gly Leu Asn Gly Thr Asp Arg Leu 
        595                  600                605            
Pro Leu Val Pro Ser Lys Tyr Ser Met Thr Val Met Val Phe Leu Met 
    610                  615                620                
Met Leu Ser Phe Tyr Tyr Phe Ser Arg His Val Glu Lys Leu Ala Arg 
625                  630                635                  640
Thr Leu Phe Leu Trp Lys Ile Glu Val His Asp Gln Lys Glu Arg Val 
                645                  650                655    
Tyr Glu Met Arg Arg Trp Asn Glu Ala Leu Val Thr Asn Met Leu Pro 
            660                  665                670        
Glu His Val Ala Arg His Phe Leu Gly Ser Lys Lys Arg Asp Glu Glu 
        675                  680                685            
Leu Tyr Ser Gln Thr Tyr Asp Glu Ile Gly Val Met Phe Ala Ser Leu 
    690                  695                700                
Pro Asn Phe Ala Asp Phe Tyr Thr Glu Glu Ser Ile Asn Asn Gly Gly 
705                  710                715                  720
Ile Glu Cys Leu Arg Phe Leu Asn Glu Ile Ile Ser Asp Phe Asp Ser 
                725                  730                735    
Leu Leu Asp Asn Pro Lys Phe Arg Val Ile Thr Lys Ile Lys Thr Ile 
            740                  745                750        
Gly Ser Thr Tyr Met Ala Ala Ser Gly Val Thr Pro Asp Val Asn Thr 
        755                  760                765            
Asn Gly Phe Ala Ser Ser Asn Lys Glu Asp Lys Ser Glu Arg Glu Arg 
    770                  775                780                
Trp Gln His Leu Ala Asp Leu Ala Asp Phe Ala Leu Ala Met Lys Asp 
785                  790                795                  800
Thr Leu Thr Asn Ile Asn Asn Gln Ser Phe Asn Asn Phe Met Leu Arg 
                805                  810                815    
Ile Gly Met Asn Lys Gly Gly Val Leu Ala Gly Val Ile Gly Ala Arg 
            820                  825                830        
Lys Pro His Tyr Asp Ile Trp Gly Asn Thr Val Asn Val Ala Ser Arg 
        835                  840                845            
Met Glu Ser Thr Gly Val Met Gly Asn Ile Gln Val Val Glu Glu Thr 
    850                  855                860                
Gln Val Ile Leu Arg Glu Tyr Gly Phe Arg Phe Val Arg Arg Gly Pro 
865                  870                875                  880
Ile Phe Val Lys Gly Lys Gly Glu Leu Leu Thr Phe Phe Leu Lys Gly 
                885                  890                895    
Arg Asp Lys Leu Ala Thr Phe Pro Asn Gly Pro Ser Val Thr Leu Pro 
            900                  905                910        
His Gln Val Val Asp Asn Ser 
        915                

<210> 9
<211> 3624
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..3624
<223> /mol_type="DNA"
      /note="SOS1-ADCY3 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 9
atgcaggcgc agcagctgcc ctacgagttt ttcagcgaag agaacgcgcc caagtggcgg       60

ggactactgg tgcctgcgct gaaaaaggtc caggggcaag ttcatcctac tctcgagtct      120

aatgatgatg ctcttcagta tgttgaagaa ttaattttgc aattattaaa tatgctatgc      180

caagctcagc cccgaagtgc ttcagatgta gaggaacgtg ttcaaaaaag tttccctcat      240

ccaattgata aatgggcaat agctgatgcc caatcagcta ttgaaaagag gaagcgaaga      300

aaccctttat ctctcccagt agaaaaaatt catcctttat taaaggaggt cctaggttat      360

aaaattgacc accaggtttc tgtttacata gtagcagtct tagaatacat ttctgcagac      420

attttaaagc tggttgggaa ttatgtaaga aatatacggc attatgaaat tacaaaacaa      480

gatattaaag tggcaatgtg tgctgacaag gtattgatgg atatgtttca tcaagatgta      540

gaagatatta atatattatc tttaactgac gaagagcctt ccacctcagg agaacaaact      600

tactatgatt tggtaaaagc atttatggca gaaattcgac aatatataag ggaactaaat      660

ctaattataa aagtttttag agagcccttt gtctccaatt caaaattgtt ttcagctaat      720

gatgtagaaa atatatttag tcgcatagta gatatacatg aacttagtgt aaagttactg      780

ggccatatag aagatacagt agaaatgaca gatgaaggca gtccccatcc actagtagga      840

agctgctttg aagacttagc agagatcctg gccaacgtct tcctctacct gtgcgccatc      900

gctgtgggca tcatgtccta ctacatggct gaccgcaagc accgcaaggc cttcctggag      960

gcccgccagt cgctggaggt gaagatgaac ctggaagagc agagccagca gcaggagaac     1020

ctcatgcttt ccatcctgcc caagcacgtg gctgacgaga tgctgaaaga catgaagaaa     1080

gacgagagcc agaaggacca gcagcagttc aacaccatgt acatgtaccg tcacgagaac     1140

gtcagcatcc tctttgccga catcgtgggc tttacccagc tgtcttctgc ctgcagtgcc     1200

caggagcttg tgaagctgct caacgagctc tttgcccgct ttgacaagct ggcagctaaa     1260

taccaccagc tgcggattaa gatcctgggc gactgctact actgcatctg cggcttgccc     1320

gactaccggg aggaccacgc cgtctgctcc atcctcatgg ggctggccat ggtggaggcc     1380

atctcgtatg tgcgggagaa gaccaagact ggggtggaca tgcgtgtggg ggtgcacacg     1440

ggcaccgtgc tggggggcgt cctgggccag aagcgctggc agtacgacgt gtggtcgact     1500

gatgtcactg tagccaacaa gatggaggcc ggcggcatcc ctgggcgcgt gcacatctcc     1560

cagagcacca tggactgcct gaaaggggag tttgatgtgg agccaggcga tgggggcagc     1620

cgctgtgatt acctagaaga gaagggtatt gaaacctacc tcatcattgc ctccaagcca     1680

gaggtgaaga aaacagccac ccagaatggc ctcaatggct cggccctgcc caatggagca     1740

ccagcttcct caaagtccag ctcccctgcc ctcattgaga ccaaggagcc caacgggagt     1800

gcccacagca gtgggtccac gtcggagaag cccgaggagc aggatgccca ggccgacaac     1860

ccctcattcc ccaacccacg ccggaggctg cgcctgcagg acctggctga ccgagtggtg     1920

gatgcctctg aagatgagca cgagctcaac cagctgctca acgaggccct gcttgagcga     1980

gagtccgccc aagtagtaaa gaagagaaac accttcctct tgtccatgcg gttcatggac     2040

cccgagatgg aaacccgcta ctcggtggag aaggagaagc agagtggggc tgccttcagc     2100

tgctcctgcg tcgtcctgct ctgcacggcc ctggtcgaga tactcatcga cccctggcta     2160

atgacaaact atgtgacctt catggtgggg gagattctgc tcctcatcct gaccatctgc     2220

tccctggctg ccatctttcc ccgggccttt cctaagaagc ttgtggcctt ctcaacttgg     2280

attgaccgga cccgctgggc caggaacacc tgggccatgc tcgccatctt catcctggtg     2340

atggcaaatg tcgtggacat gctcagctgt ctccagtact acacgggacc cagcaatgca     2400

acggcaggga tggaaacgga gggcagctgc ctggagaacc ccaagtatta caactatgtg     2460

gccgtgctgt ccctcatcgc caccatcatg ctggtgcagg tcagccacat ggtgaagctc     2520

acgctcatgc tgctcgtcgc aggcgccgtg gccaccatca acctctatgc ctggcgtccc     2580

gtctttgatg aatacgacca caagcgtttt cgggagcacg acttacctat ggtggcctta     2640

gagcagatgc aaggattcaa ccctgggctc aatggcactg acaggctgcc cctggtgcct     2700

tccaagtact ctatgacggt gatggtgttc ctcatgatgc tcagcttcta ctacttctcc     2760

cgccacgtag aaaaactggc acggacactt ttcttgtgga agattgaggt ccacgaccag     2820

aaggaacgtg tctatgagat gcgacgctgg aacgaggcct tggtcaccaa catgttgcct     2880

gagcacgtgg cacgccattt cctggggtcc aagaagagag atgaggagct gtatagccag     2940

acgtatgatg agattggagt catgtttgcc tccctgccca actttgctga cttctacaca     3000

gaggagagca tcaacaatgg tggtattgag tgtctgcgtt tcctcaatga aatcatctca     3060

gattttgact ctctcctgga caatcccaag ttccgggtga tcaccaagat caaaaccatt     3120

ggcagcacgt atatggcggc ttcaggagtc acccccgatg tcaacaccaa tggctttgcc     3180

agctccaaca aggaagacaa gtccgagaga gagcgctggc agcacctggc tgacctggcc     3240

gacttcgcgc tggccatgaa ggatacgctc accaacatca acaaccagtc cttcaataac     3300

ttcatgctgc gcataggcat gaacaaaggc ggggttctgg ctggggtcat cggagcccgg     3360

aaaccacact acgacatctg gggcaataca gtcaatgtag ccagcaggat ggagtccacg     3420

ggggtcatgg gcaacattca ggtggtagaa gaaacccaag tcatcctccg agagtacggc     3480

ttccgctttg tgaggcgagg ccccatcttt gtgaagggga agggggagct gctgaccttc     3540

ttcttgaagg ggcgggataa gctagccacc ttccccaatg gcccctctgt cacactgccc     3600

caccaggtgg tggacaactc ctga                                            3624


<210> 10
<211> 1207
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..1207
<223> /mol_type="protein"
      /note="SOS1-ADCY3 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 10
Met Gln Ala Gln Gln Leu Pro Tyr Glu Phe Phe Ser Glu Glu Asn Ala 
1               5                   10                   15    
Pro Lys Trp Arg Gly Leu Leu Val Pro Ala Leu Lys Lys Val Gln Gly 
            20                   25                  30        
Gln Val His Pro Thr Leu Glu Ser Asn Asp Asp Ala Leu Gln Tyr Val 
        35                   40                  45            
Glu Glu Leu Ile Leu Gln Leu Leu Asn Met Leu Cys Gln Ala Gln Pro 
    50                   55                  60                
Arg Ser Ala Ser Asp Val Glu Glu Arg Val Gln Lys Ser Phe Pro His 
65                   70                  75                  80
Pro Ile Asp Lys Trp Ala Ile Ala Asp Ala Gln Ser Ala Ile Glu Lys 
                85                   90                  95    
Arg Lys Arg Arg Asn Pro Leu Ser Leu Pro Val Glu Lys Ile His Pro 
            100                  105                110        
Leu Leu Lys Glu Val Leu Gly Tyr Lys Ile Asp His Gln Val Ser Val 
        115                  120                125            
Tyr Ile Val Ala Val Leu Glu Tyr Ile Ser Ala Asp Ile Leu Lys Leu 
    130                  135                140                
Val Gly Asn Tyr Val Arg Asn Ile Arg His Tyr Glu Ile Thr Lys Gln 
145                  150                155                  160
Asp Ile Lys Val Ala Met Cys Ala Asp Lys Val Leu Met Asp Met Phe 
                165                  170                175    
His Gln Asp Val Glu Asp Ile Asn Ile Leu Ser Leu Thr Asp Glu Glu 
            180                  185                190        
Pro Ser Thr Ser Gly Glu Gln Thr Tyr Tyr Asp Leu Val Lys Ala Phe 
        195                  200                205            
Met Ala Glu Ile Arg Gln Tyr Ile Arg Glu Leu Asn Leu Ile Ile Lys 
    210                  215                220                
Val Phe Arg Glu Pro Phe Val Ser Asn Ser Lys Leu Phe Ser Ala Asn 
225                  230                235                  240
Asp Val Glu Asn Ile Phe Ser Arg Ile Val Asp Ile His Glu Leu Ser 
                245                  250                255    
Val Lys Leu Leu Gly His Ile Glu Asp Thr Val Glu Met Thr Asp Glu 
            260                  265                270        
Gly Ser Pro His Pro Leu Val Gly Ser Cys Phe Glu Asp Leu Ala Glu 
        275                  280                285            
Ile Leu Ala Asn Val Phe Leu Tyr Leu Cys Ala Ile Ala Val Gly Ile 
    290                  295                300                
Met Ser Tyr Tyr Met Ala Asp Arg Lys His Arg Lys Ala Phe Leu Glu 
305                  310                315                  320
Ala Arg Gln Ser Leu Glu Val Lys Met Asn Leu Glu Glu Gln Ser Gln 
                325                  330                335    
Gln Gln Glu Asn Leu Met Leu Ser Ile Leu Pro Lys His Val Ala Asp 
            340                  345                350        
Glu Met Leu Lys Asp Met Lys Lys Asp Glu Ser Gln Lys Asp Gln Gln 
        355                  360                365            
Gln Phe Asn Thr Met Tyr Met Tyr Arg His Glu Asn Val Ser Ile Leu 
    370                  375                380                
Phe Ala Asp Ile Val Gly Phe Thr Gln Leu Ser Ser Ala Cys Ser Ala 
385                  390                395                  400
Gln Glu Leu Val Lys Leu Leu Asn Glu Leu Phe Ala Arg Phe Asp Lys 
                405                  410                415    
Leu Ala Ala Lys Tyr His Gln Leu Arg Ile Lys Ile Leu Gly Asp Cys 
            420                  425                430        
Tyr Tyr Cys Ile Cys Gly Leu Pro Asp Tyr Arg Glu Asp His Ala Val 
        435                  440                445            
Cys Ser Ile Leu Met Gly Leu Ala Met Val Glu Ala Ile Ser Tyr Val 
    450                  455                460                
Arg Glu Lys Thr Lys Thr Gly Val Asp Met Arg Val Gly Val His Thr 
465                  470                475                  480
Gly Thr Val Leu Gly Gly Val Leu Gly Gln Lys Arg Trp Gln Tyr Asp 
                485                  490                495    
Val Trp Ser Thr Asp Val Thr Val Ala Asn Lys Met Glu Ala Gly Gly 
            500                  505                510        
Ile Pro Gly Arg Val His Ile Ser Gln Ser Thr Met Asp Cys Leu Lys 
        515                  520                525            
Gly Glu Phe Asp Val Glu Pro Gly Asp Gly Gly Ser Arg Cys Asp Tyr 
    530                  535                540                
Leu Glu Glu Lys Gly Ile Glu Thr Tyr Leu Ile Ile Ala Ser Lys Pro 
545                  550                555                  560
Glu Val Lys Lys Thr Ala Thr Gln Asn Gly Leu Asn Gly Ser Ala Leu 
                565                  570                575    
Pro Asn Gly Ala Pro Ala Ser Ser Lys Ser Ser Ser Pro Ala Leu Ile 
            580                  585                590        
Glu Thr Lys Glu Pro Asn Gly Ser Ala His Ser Ser Gly Ser Thr Ser 
        595                  600                605            
Glu Lys Pro Glu Glu Gln Asp Ala Gln Ala Asp Asn Pro Ser Phe Pro 
    610                  615                620                
Asn Pro Arg Arg Arg Leu Arg Leu Gln Asp Leu Ala Asp Arg Val Val 
625                  630                635                  640
Asp Ala Ser Glu Asp Glu His Glu Leu Asn Gln Leu Leu Asn Glu Ala 
                645                  650                655    
Leu Leu Glu Arg Glu Ser Ala Gln Val Val Lys Lys Arg Asn Thr Phe 
            660                  665                670        
Leu Leu Ser Met Arg Phe Met Asp Pro Glu Met Glu Thr Arg Tyr Ser 
        675                  680                685            
Val Glu Lys Glu Lys Gln Ser Gly Ala Ala Phe Ser Cys Ser Cys Val 
    690                  695                700                
Val Leu Leu Cys Thr Ala Leu Val Glu Ile Leu Ile Asp Pro Trp Leu 
705                  710                715                  720
Met Thr Asn Tyr Val Thr Phe Met Val Gly Glu Ile Leu Leu Leu Ile 
                725                  730                735    
Leu Thr Ile Cys Ser Leu Ala Ala Ile Phe Pro Arg Ala Phe Pro Lys 
            740                  745                750        
Lys Leu Val Ala Phe Ser Thr Trp Ile Asp Arg Thr Arg Trp Ala Arg 
        755                  760                765            
Asn Thr Trp Ala Met Leu Ala Ile Phe Ile Leu Val Met Ala Asn Val 
    770                  775                780                
Val Asp Met Leu Ser Cys Leu Gln Tyr Tyr Thr Gly Pro Ser Asn Ala 
785                  790                795                  800
Thr Ala Gly Met Glu Thr Glu Gly Ser Cys Leu Glu Asn Pro Lys Tyr 
                805                  810                815    
Tyr Asn Tyr Val Ala Val Leu Ser Leu Ile Ala Thr Ile Met Leu Val 
            820                  825                830        
Gln Val Ser His Met Val Lys Leu Thr Leu Met Leu Leu Val Ala Gly 
        835                  840                845            
Ala Val Ala Thr Ile Asn Leu Tyr Ala Trp Arg Pro Val Phe Asp Glu 
    850                  855                860                
Tyr Asp His Lys Arg Phe Arg Glu His Asp Leu Pro Met Val Ala Leu 
865                  870                875                  880
Glu Gln Met Gln Gly Phe Asn Pro Gly Leu Asn Gly Thr Asp Arg Leu 
                885                  890                895    
Pro Leu Val Pro Ser Lys Tyr Ser Met Thr Val Met Val Phe Leu Met 
            900                  905                910        
Met Leu Ser Phe Tyr Tyr Phe Ser Arg His Val Glu Lys Leu Ala Arg 
        915                  920                925            
Thr Leu Phe Leu Trp Lys Ile Glu Val His Asp Gln Lys Glu Arg Val 
    930                  935                940                
Tyr Glu Met Arg Arg Trp Asn Glu Ala Leu Val Thr Asn Met Leu Pro 
945                  950                955                  960
Glu His Val Ala Arg His Phe Leu Gly Ser Lys Lys Arg Asp Glu Glu 
                965                  970                975    
Leu Tyr Ser Gln Thr Tyr Asp Glu Ile Gly Val Met Phe Ala Ser Leu 
            980                  985                990        
Pro Asn Phe Ala Asp Phe Tyr Thr Glu Glu Ser Ile Asn Asn Gly Gly 
        995                  1000                1005            
Ile Glu Cys Leu Arg Phe Leu Asn Glu Ile Ile Ser Asp Phe Asp Ser 
    1010                1015                1020                
Leu Leu Asp Asn Pro Lys Phe Arg Val Ile Thr Lys Ile Lys Thr Ile 
1025                1030                1035                1040
Gly Ser Thr Tyr Met Ala Ala Ser Gly Val Thr Pro Asp Val Asn Thr 
                1045                1050                1055    
Asn Gly Phe Ala Ser Ser Asn Lys Glu Asp Lys Ser Glu Arg Glu Arg 
            1060                1065                1070        
Trp Gln His Leu Ala Asp Leu Ala Asp Phe Ala Leu Ala Met Lys Asp 
        1075                1080                1085            
Thr Leu Thr Asn Ile Asn Asn Gln Ser Phe Asn Asn Phe Met Leu Arg 
    1090                1095                1100                
Ile Gly Met Asn Lys Gly Gly Val Leu Ala Gly Val Ile Gly Ala Arg 
1105                1110                1115                1120
Lys Pro His Tyr Asp Ile Trp Gly Asn Thr Val Asn Val Ala Ser Arg 
                1125                1130                1135    
Met Glu Ser Thr Gly Val Met Gly Asn Ile Gln Val Val Glu Glu Thr 
            1140                1145                1150        
Gln Val Ile Leu Arg Glu Tyr Gly Phe Arg Phe Val Arg Arg Gly Pro 
        1155                1160                1165            
Ile Phe Val Lys Gly Lys Gly Glu Leu Leu Thr Phe Phe Leu Lys Gly 
    1170                1175                1180                
Arg Asp Lys Leu Ala Thr Phe Pro Asn Gly Pro Ser Val Thr Leu Pro 
1185                1190                1195                1200
His Gln Val Val Asp Asn Ser 
                1205        

<210> 11
<211> 5064
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..5064
<223> /mol_type="DNA"
      /note="ZNF142 (CCDS nucleotide sequence of ZNF142 (Gene ID: 7701))"
      /organism="Homo sapiens"

<400> 11
atgacagacc cccttttgga ctcacagcca gccagtagca ccggggagat ggatggactg       60

tgccctgagc tattgctgat ccccccgcct ctctctaacc gtggaatcct ggggcctgtc      120

cagagcccct gtccttcccg ggaccctgca cctataccta ctgagccagg ctgcctgctg      180

gtagaggcca cagcaactga agagggacca gggaacatgg agatcattgt ggagacagta      240

gctggaaccc tgaccccagg tgctcctgga gagaccccag ctcccaaact gcctccagga      300

gagagagaac cttcacagga agcaggtaca cccttgcctg ggcaggagac agctgaagag      360

gagaatgtag agaaagaaga gaagagtgac acccagaagg actcccaaaa ggctgtggat      420

aaaggccaag gggctcagcg gctggaaggg gatgtggtct ctggcaccga gtccctcttc      480

aagacccata tgtgtccaga gtgtaagcgc tgctttaaga agcggactca tctggtggag      540

cacctgcatc tccacttccc agaccccagc ctccagtgcc ctaactgcca gaagttcttc      600

accagtaaga gcaagctcaa gacccatctg ctgcgggagc tgggtgaaaa ggcccaccac      660

tgcccactgt gccactacag tgcggtggag aggaatgcac tcaaccgcca catggccagc      720

atgcatgaag atatttccaa cttctactca gacacctatg cctgtcctgt ctgccgtgag      780

gaattccgcc tcagccaggc cctaaaggag cacctcaaga gccacacggc agcagccgca      840

gcagagccat taccccttcg ctgctttcag gagggctgca gctatgcagc acccgaccgc      900

aaggccttca ttaagcacct gaaggagacc catggggtgc gggctgtgga gtgccgccat      960

cactcatgtc ccatgctctt tgccacagcc gaagccatgg aggcccacca caagagtcac     1020

tacgccttcc actgccccca ctgtgatttt gcttgttcca ataagcacct attccgtaaa     1080

cacaagaagc agggccaccc tggcagtgaa gagctgcgct gcaccttctg cccctttgcc     1140

accttcaacc cagtggctta ccaggatcat gtaggcaaga tgcatgctca tgaaaagatc     1200

caccagtgtc ctgagtgcaa ctttgccact gcccacaaga gggtgctcat ccgacacatg     1260

cttctacata cgggtgagaa gccccacaag tgtgagctgt gtgacttcac atgccgagac     1320

gtgagctacc tatccaagca catgctgacc cactccaaca ccaaggatta catgtgcact     1380

gaatgtggct atgtcaccaa gtggaagcac tacctccgtg tgcacatgcg aaaacatgca     1440

ggggacctca ggtatcagtg caaccagtgc tcctatcgct gtcaccgggc tgatcagctg     1500

agcagccaca agctgcggca tcagggcaag tctctgatgt gtgaggtgtg tgccttcgcc     1560

tgcaagcgga agtatgagct gcagaagcac atggcttccc agcaccaccc tggcacaccg     1620

gccccactct acccttgcca ctactgcagt taccagagcc gccacaagca ggctgtgctg     1680

agccatgaga actgcaagca tacccgcctc cgtgagttcc actgtgccct ctgtgactac     1740

cgcaccttca gcaacaccac actcttgttc cataaacgca aggcccatgg ctatgtacct     1800

ggagaccagg cctggcagct ccgctatgca agccaggagc cagaaggggc catgcagggc     1860

ccaacacccc caccagattc agagccctca aaccagctgt cagcccgacc tgaggggcca     1920

ggtcacgaac ctgggactgt ggtggacccc agcttggacc aggccctgcc agagatgagt     1980

gaggaggtca acactggaag acaggagggc agtgaggctc cccatggggg tgacctgggt     2040

ggcagtccca gcccagcaga ggtggaggag ggcagctgca cactacacct agaggccctg     2100

ggagtagagc tggagtctgt gactgagcca ccccttgagg aggtcactga aacagcccct     2160

atggagttca ggcccctggg actggaaggg ccagatggac tggaaggacc agagctatct     2220

agctttgaag gtattgggac ttctgacttg agtgctgaag aaaatcccct tctggaaaag     2280

ccagtgtctg agccctccac aaatcctcca tccttagagg aggctcctaa caactgggta     2340

ggaaccttca agacaactcc acctgctgag acagcaccct tgcccccatt acctgagtca     2400

gagtcattac tcaaggccct aaggagacag gacaaagaac aagcagaggc attggtgcta     2460

gaggggcggg tgcagatggt agtgatccag ggagaggggc gagccttccg ctgcccacac     2520

tgccctttta tcactcgccg ggagaaggcc ctgaatctgc actccaggac tgggtgccaa     2580

ggccgccgag agcccctgct gtgccccgag tgtggggcta gcttcaagca acaacgcggc     2640

ctcagcaccc acctgctgaa gaagtgccct gttctactca gaaagaacaa gggcttgccc     2700

agaccagatt cacccatccc tctgcaacct gtgctcccag gtacccaggc ctcagaggac     2760

acagaaagtg ggaagccccc acctgcatca caagaagcag agctactgct tccaaaagat     2820

gctcctttgg agcttcccag ggagccagaa gaaacagaag agcctcttgc cacagtctct     2880

ggttccccag tccctcctgc aggaaactcc ttgcccacag aggcccctaa gaagcactgc     2940

tttgacccag tccctcctgc aggaaactcc tcacccacgg aggcccctaa gaagcaccac     3000

cttgacccag tccctcctgc aggaaactcc tcacccacag aggccctgaa gaagcaccgc     3060

tttgagcagg gcaagtttca ctgcaactcc tgcccattcc tttgttcccg gctctcctct     3120

attacctctc acgtggctga aggctgcagg gggggacgtg gcgggggagg aaaacgaggg     3180

accccccaga cccagcctga tgtgtccccg ttgagcaatg gggactctgc tcccccgaag     3240

aatgggagta cagagtccag ctctggtgat ggggatacag ttctggttca aaagcagaag     3300

ggggctcgct tctcctgccc tacatgtccc tttagctgcc agcaggaacg ggctctgagg     3360

actcaccaga tccggggctg ccccctcgag gagtctggag agctgcactg cagcctctgc     3420

ccattcactg ctcctgctgc cactgcctta aggctccacc agaagcggag gcaccccact     3480

gcagccccag cccgtgggcc ccggccccat ctacagtgtg gggactgtgg cttcacctgt     3540

aaacagagcc gttgcatgca gcagcaccgg cggctcaagc acgagggggt gaagccccat     3600

cagtgcccct tctgtgactt ttcgaccacc agacggtacc ggttagaggc tcaccagtcc     3660

cgacacacag gcattggccg catcccctgc agctcttgcc cccagacgtt tggtaccaac     3720

tcgaaactgc gcttgcaccg gttaagggta catgacaaaa cacctaccca cttctgtcca     3780

ctttgtgact atagtggcta ccttcgccat gacatcactc gtcatgtcaa cagctgccac     3840

caaggcaccc cagcctttgc ctgctcccag tgtgaagccc agttcagctc agagacagca     3900

cttaagcagc atgctctgcg ccgacacccc gagcctgcac agcctgcccc tggctctcct     3960

gcagagacca ctgagggccc cctgcactgt tcccgctgtg ggttgctgtg ccccagccct     4020

gccagcttac gaggacacac ccgtaaacag cacccacggc ttgagtgtgg ggcctgccag     4080

gaggccttcc ctagccgact ggctctggat gagcaccgga ggcagcagca tttcagccac     4140

cgctgtcagc tctgtgactt tgctgcccgg gagcgggtgg gcctggtaaa gcactacctg     4200

gaacagcatg aggagacttc agcagccgtg gcagcctcag atggggatgg ggatgctggc     4260

cagcccccgc tacactgccc cttttgtgac ttcacatgcc gccatcagct ggtactagat     4320

caccatgtga aagggcatgg gggcactcgt ctctacaagt gcaccgattg tgcttacagc     4380

accaagaacc gacagaagat cacctggcac agccgcatcc acactgggga aaagccttac     4440

cactgtcacc tctgccccta tgcctgtgct gatccctctc gtctcaagta ccacatgcgg     4500

atccacaagg aggaacggaa gtacctgtgc cctgagtgtg gctacaagtg caagtgggtc     4560

aaccagctga aataccacat gaccaagcat acaggactga agccatacca gtgtcccgag     4620

tgtgagtact gcaccaaccg ggctgatgca ctgcgtgtgc accaggagac ccggcatcga     4680

gaagcacggg ctttcatgtg tgagcagtgt ggcaaggcct tcaagacgcg cttcctgctg     4740

cgcacccacc ttcgcaagca cagtgaggcc aaaccctatg tgtgcaatgt gtgccaccgt     4800

gctttccgct gggctgctgg cctgcgccat catgccctca cccacaccga ccgccacccc     4860

ttcttttgcc gcctctgcaa ctacaaggcc aagcaaaagt tccaggtggt caagcacgta     4920

cgcaggcacc accctgacca agccgaccca aaccagggtg tgggcaaaga ccccaccacc     4980

cccacagtgc acctgcatga tgtgcagctg gaggatccca gccctcctgc tcctgccgct     5040

ccccacactg gacctgaggg ctga                                            5064


<210> 12
<211> 1687
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1687
<223> /mol_type="protein"
      /note="ZNF142 (full-length protein)"
      /organism="Homo sapiens"

<400> 12
Met Thr Asp Pro Leu Leu Asp Ser Gln Pro Ala Ser Ser Thr Gly Glu 
1               5                   10                   15    
Met Asp Gly Leu Cys Pro Glu Leu Leu Leu Ile Pro Pro Pro Leu Ser 
            20                   25                  30        
Asn Arg Gly Ile Leu Gly Pro Val Gln Ser Pro Cys Pro Ser Arg Asp 
        35                   40                  45            
Pro Ala Pro Ile Pro Thr Glu Pro Gly Cys Leu Leu Val Glu Ala Thr 
    50                   55                  60                
Ala Thr Glu Glu Gly Pro Gly Asn Met Glu Ile Ile Val Glu Thr Val 
65                   70                  75                  80
Ala Gly Thr Leu Thr Pro Gly Ala Pro Gly Glu Thr Pro Ala Pro Lys 
                85                   90                  95    
Leu Pro Pro Gly Glu Arg Glu Pro Ser Gln Glu Ala Gly Thr Pro Leu 
            100                  105                110        
Pro Gly Gln Glu Thr Ala Glu Glu Glu Asn Val Glu Lys Glu Glu Lys 
        115                  120                125            
Ser Asp Thr Gln Lys Asp Ser Gln Lys Ala Val Asp Lys Gly Gln Gly 
    130                  135                140                
Ala Gln Arg Leu Glu Gly Asp Val Val Ser Gly Thr Glu Ser Leu Phe 
145                  150                155                  160
Lys Thr His Met Cys Pro Glu Cys Lys Arg Cys Phe Lys Lys Arg Thr 
                165                  170                175    
His Leu Val Glu His Leu His Leu His Phe Pro Asp Pro Ser Leu Gln 
            180                  185                190        
Cys Pro Asn Cys Gln Lys Phe Phe Thr Ser Lys Ser Lys Leu Lys Thr 
        195                  200                205            
His Leu Leu Arg Glu Leu Gly Glu Lys Ala His His Cys Pro Leu Cys 
    210                  215                220                
His Tyr Ser Ala Val Glu Arg Asn Ala Leu Asn Arg His Met Ala Ser 
225                  230                235                  240
Met His Glu Asp Ile Ser Asn Phe Tyr Ser Asp Thr Tyr Ala Cys Pro 
                245                  250                255    
Val Cys Arg Glu Glu Phe Arg Leu Ser Gln Ala Leu Lys Glu His Leu 
            260                  265                270        
Lys Ser His Thr Ala Ala Ala Ala Ala Glu Pro Leu Pro Leu Arg Cys 
        275                  280                285            
Phe Gln Glu Gly Cys Ser Tyr Ala Ala Pro Asp Arg Lys Ala Phe Ile 
    290                  295                300                
Lys His Leu Lys Glu Thr His Gly Val Arg Ala Val Glu Cys Arg His 
305                  310                315                  320
His Ser Cys Pro Met Leu Phe Ala Thr Ala Glu Ala Met Glu Ala His 
                325                  330                335    
His Lys Ser His Tyr Ala Phe His Cys Pro His Cys Asp Phe Ala Cys 
            340                  345                350        
Ser Asn Lys His Leu Phe Arg Lys His Lys Lys Gln Gly His Pro Gly 
        355                  360                365            
Ser Glu Glu Leu Arg Cys Thr Phe Cys Pro Phe Ala Thr Phe Asn Pro 
    370                  375                380                
Val Ala Tyr Gln Asp His Val Gly Lys Met His Ala His Glu Lys Ile 
385                  390                395                  400
His Gln Cys Pro Glu Cys Asn Phe Ala Thr Ala His Lys Arg Val Leu 
                405                  410                415    
Ile Arg His Met Leu Leu His Thr Gly Glu Lys Pro His Lys Cys Glu 
            420                  425                430        
Leu Cys Asp Phe Thr Cys Arg Asp Val Ser Tyr Leu Ser Lys His Met 
        435                  440                445            
Leu Thr His Ser Asn Thr Lys Asp Tyr Met Cys Thr Glu Cys Gly Tyr 
    450                  455                460                
Val Thr Lys Trp Lys His Tyr Leu Arg Val His Met Arg Lys His Ala 
465                  470                475                  480
Gly Asp Leu Arg Tyr Gln Cys Asn Gln Cys Ser Tyr Arg Cys His Arg 
                485                  490                495    
Ala Asp Gln Leu Ser Ser His Lys Leu Arg His Gln Gly Lys Ser Leu 
            500                  505                510        
Met Cys Glu Val Cys Ala Phe Ala Cys Lys Arg Lys Tyr Glu Leu Gln 
        515                  520                525            
Lys His Met Ala Ser Gln His His Pro Gly Thr Pro Ala Pro Leu Tyr 
    530                  535                540                
Pro Cys His Tyr Cys Ser Tyr Gln Ser Arg His Lys Gln Ala Val Leu 
545                  550                555                  560
Ser His Glu Asn Cys Lys His Thr Arg Leu Arg Glu Phe His Cys Ala 
                565                  570                575    
Leu Cys Asp Tyr Arg Thr Phe Ser Asn Thr Thr Leu Leu Phe His Lys 
            580                  585                590        
Arg Lys Ala His Gly Tyr Val Pro Gly Asp Gln Ala Trp Gln Leu Arg 
        595                  600                605            
Tyr Ala Ser Gln Glu Pro Glu Gly Ala Met Gln Gly Pro Thr Pro Pro 
    610                  615                620                
Pro Asp Ser Glu Pro Ser Asn Gln Leu Ser Ala Arg Pro Glu Gly Pro 
625                  630                635                  640
Gly His Glu Pro Gly Thr Val Val Asp Pro Ser Leu Asp Gln Ala Leu 
                645                  650                655    
Pro Glu Met Ser Glu Glu Val Asn Thr Gly Arg Gln Glu Gly Ser Glu 
            660                  665                670        
Ala Pro His Gly Gly Asp Leu Gly Gly Ser Pro Ser Pro Ala Glu Val 
        675                  680                685            
Glu Glu Gly Ser Cys Thr Leu His Leu Glu Ala Leu Gly Val Glu Leu 
    690                  695                700                
Glu Ser Val Thr Glu Pro Pro Leu Glu Glu Val Thr Glu Thr Ala Pro 
705                  710                715                  720
Met Glu Phe Arg Pro Leu Gly Leu Glu Gly Pro Asp Gly Leu Glu Gly 
                725                  730                735    
Pro Glu Leu Ser Ser Phe Glu Gly Ile Gly Thr Ser Asp Leu Ser Ala 
            740                  745                750        
Glu Glu Asn Pro Leu Leu Glu Lys Pro Val Ser Glu Pro Ser Thr Asn 
        755                  760                765            
Pro Pro Ser Leu Glu Glu Ala Pro Asn Asn Trp Val Gly Thr Phe Lys 
    770                  775                780                
Thr Thr Pro Pro Ala Glu Thr Ala Pro Leu Pro Pro Leu Pro Glu Ser 
785                  790                795                  800
Glu Ser Leu Leu Lys Ala Leu Arg Arg Gln Asp Lys Glu Gln Ala Glu 
                805                  810                815    
Ala Leu Val Leu Glu Gly Arg Val Gln Met Val Val Ile Gln Gly Glu 
            820                  825                830        
Gly Arg Ala Phe Arg Cys Pro His Cys Pro Phe Ile Thr Arg Arg Glu 
        835                  840                845            
Lys Ala Leu Asn Leu His Ser Arg Thr Gly Cys Gln Gly Arg Arg Glu 
    850                  855                860                
Pro Leu Leu Cys Pro Glu Cys Gly Ala Ser Phe Lys Gln Gln Arg Gly 
865                  870                875                  880
Leu Ser Thr His Leu Leu Lys Lys Cys Pro Val Leu Leu Arg Lys Asn 
                885                  890                895    
Lys Gly Leu Pro Arg Pro Asp Ser Pro Ile Pro Leu Gln Pro Val Leu 
            900                  905                910        
Pro Gly Thr Gln Ala Ser Glu Asp Thr Glu Ser Gly Lys Pro Pro Pro 
        915                  920                925            
Ala Ser Gln Glu Ala Glu Leu Leu Leu Pro Lys Asp Ala Pro Leu Glu 
    930                  935                940                
Leu Pro Arg Glu Pro Glu Glu Thr Glu Glu Pro Leu Ala Thr Val Ser 
945                  950                955                  960
Gly Ser Pro Val Pro Pro Ala Gly Asn Ser Leu Pro Thr Glu Ala Pro 
                965                  970                975    
Lys Lys His Cys Phe Asp Pro Val Pro Pro Ala Gly Asn Ser Ser Pro 
            980                  985                990        
Thr Glu Ala Pro Lys Lys His His Leu Asp Pro Val Pro Pro Ala Gly 
        995                  1000                1005            
Asn Ser Ser Pro Thr Glu Ala Leu Lys Lys His Arg Phe Glu Gln Gly 
    1010                1015                1020                
Lys Phe His Cys Asn Ser Cys Pro Phe Leu Cys Ser Arg Leu Ser Ser 
1025                1030                1035                1040
Ile Thr Ser His Val Ala Glu Gly Cys Arg Gly Gly Arg Gly Gly Gly 
                1045                1050                1055    
Gly Lys Arg Gly Thr Pro Gln Thr Gln Pro Asp Val Ser Pro Leu Ser 
            1060                1065                1070        
Asn Gly Asp Ser Ala Pro Pro Lys Asn Gly Ser Thr Glu Ser Ser Ser 
        1075                1080                1085            
Gly Asp Gly Asp Thr Val Leu Val Gln Lys Gln Lys Gly Ala Arg Phe 
    1090                1095                1100                
Ser Cys Pro Thr Cys Pro Phe Ser Cys Gln Gln Glu Arg Ala Leu Arg 
1105                1110                1115                1120
Thr His Gln Ile Arg Gly Cys Pro Leu Glu Glu Ser Gly Glu Leu His 
                1125                1130                1135    
Cys Ser Leu Cys Pro Phe Thr Ala Pro Ala Ala Thr Ala Leu Arg Leu 
            1140                1145                1150        
His Gln Lys Arg Arg His Pro Thr Ala Ala Pro Ala Arg Gly Pro Arg 
        1155                1160                1165            
Pro His Leu Gln Cys Gly Asp Cys Gly Phe Thr Cys Lys Gln Ser Arg 
    1170                1175                1180                
Cys Met Gln Gln His Arg Arg Leu Lys His Glu Gly Val Lys Pro His 
1185                1190                1195                1200
Gln Cys Pro Phe Cys Asp Phe Ser Thr Thr Arg Arg Tyr Arg Leu Glu 
                1205                1210                1215    
Ala His Gln Ser Arg His Thr Gly Ile Gly Arg Ile Pro Cys Ser Ser 
            1220                1225                1230        
Cys Pro Gln Thr Phe Gly Thr Asn Ser Lys Leu Arg Leu His Arg Leu 
        1235                1240                1245            
Arg Val His Asp Lys Thr Pro Thr His Phe Cys Pro Leu Cys Asp Tyr 
    1250                1255                1260                
Ser Gly Tyr Leu Arg His Asp Ile Thr Arg His Val Asn Ser Cys His 
1265                1270                1275                1280
Gln Gly Thr Pro Ala Phe Ala Cys Ser Gln Cys Glu Ala Gln Phe Ser 
                1285                1290                1295    
Ser Glu Thr Ala Leu Lys Gln His Ala Leu Arg Arg His Pro Glu Pro 
            1300                1305                1310        
Ala Gln Pro Ala Pro Gly Ser Pro Ala Glu Thr Thr Glu Gly Pro Leu 
        1315                1320                1325            
His Cys Ser Arg Cys Gly Leu Leu Cys Pro Ser Pro Ala Ser Leu Arg 
    1330                1335                1340                
Gly His Thr Arg Lys Gln His Pro Arg Leu Glu Cys Gly Ala Cys Gln 
1345                1350                1355                1360
Glu Ala Phe Pro Ser Arg Leu Ala Leu Asp Glu His Arg Arg Gln Gln 
                1365                1370                1375    
His Phe Ser His Arg Cys Gln Leu Cys Asp Phe Ala Ala Arg Glu Arg 
            1380                1385                1390        
Val Gly Leu Val Lys His Tyr Leu Glu Gln His Glu Glu Thr Ser Ala 
        1395                1400                1405            
Ala Val Ala Ala Ser Asp Gly Asp Gly Asp Ala Gly Gln Pro Pro Leu 
    1410                1415                1420                
His Cys Pro Phe Cys Asp Phe Thr Cys Arg His Gln Leu Val Leu Asp 
1425                1430                1435                1440
His His Val Lys Gly His Gly Gly Thr Arg Leu Tyr Lys Cys Thr Asp 
                1445                1450                1455    
Cys Ala Tyr Ser Thr Lys Asn Arg Gln Lys Ile Thr Trp His Ser Arg 
            1460                1465                1470        
Ile His Thr Gly Glu Lys Pro Tyr His Cys His Leu Cys Pro Tyr Ala 
        1475                1480                1485            
Cys Ala Asp Pro Ser Arg Leu Lys Tyr His Met Arg Ile His Lys Glu 
    1490                1495                1500                
Glu Arg Lys Tyr Leu Cys Pro Glu Cys Gly Tyr Lys Cys Lys Trp Val 
1505                1510                1515                1520
Asn Gln Leu Lys Tyr His Met Thr Lys His Thr Gly Leu Lys Pro Tyr 
                1525                1530                1535    
Gln Cys Pro Glu Cys Glu Tyr Cys Thr Asn Arg Ala Asp Ala Leu Arg 
            1540                1545                1550        
Val His Gln Glu Thr Arg His Arg Glu Ala Arg Ala Phe Met Cys Glu 
        1555                1560                1565            
Gln Cys Gly Lys Ala Phe Lys Thr Arg Phe Leu Leu Arg Thr His Leu 
    1570                1575                1580                
Arg Lys His Ser Glu Ala Lys Pro Tyr Val Cys Asn Val Cys His Arg 
1585                1590                1595                1600
Ala Phe Arg Trp Ala Ala Gly Leu Arg His His Ala Leu Thr His Thr 
                1605                1610                1615    
Asp Arg His Pro Phe Phe Cys Arg Leu Cys Asn Tyr Lys Ala Lys Gln 
            1620                1625                1630        
Lys Phe Gln Val Val Lys His Val Arg Arg His His Pro Asp Gln Ala 
        1635                1640                1645            
Asp Pro Asn Gln Gly Val Gly Lys Asp Pro Thr Thr Pro Thr Val His 
    1650                1655                1660                
Leu His Asp Val Gln Leu Glu Asp Pro Ser Pro Pro Ala Pro Ala Ala 
1665                1670                1675                1680
Pro His Thr Gly Pro Glu Gly 
                1685        

<210> 13
<211> 4594
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..4594
<223> /mol_type="DNA"
      /note="ZNF142 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 13
atgacagacc cccttttgga ctcacagcca gccagtagca ccggggagat ggatggactg       60

tgccctgagc tattgctgat ccccccgcct ctctctaacc gtggaatcct ggggcctgtc      120

cagagcccct gtccttcccg ggaccctgca cctataccta ctgagccagg ctgcctgctg      180

gtagaggcca cagcaactga agagggacca gggaacatgg agatcattgt ggagacagta      240

gctggaaccc tgaccccagg tgctcctgga gagaccccag ctcccaaact gcctccagga      300

gagagagaac cttcacagga agcaggtaca cccttgcctg ggcaggagac agctgaagag      360

gagaatgtag agaaagaaga gaagagtgac acccagaagg actcccaaaa ggctgtggat      420

aaaggccaag gggctcagcg gctggaaggg gatgtggtct ctggcaccga gtccctcttc      480

aagacccata tgtgtccaga gtgtaagcgc tgctttaaga agcggactca tctggtggag      540

cacctgcatc tccacttccc agaccccagc ctccagtgcc ctaactgcca gaagttcttc      600

accagtaaga gcaagctcaa gacccatctg ctgcgggagc tgggtgaaaa ggcccaccac      660

tgcccactgt gccactacag tgcggtggag aggaatgcac tcaaccgcca catggccagc      720

atgcatgaag atatttccaa cttctactca gacacctatg cctgtcctgt ctgccgtgag      780

gaattccgcc tcagccaggc cctaaaggag cacctcaaga gccacacggc agcagccgca      840

gcagagccat taccccttcg ctgctttcag gagggctgca gctatgcagc acccgaccgc      900

aaggccttca ttaagcacct gaaggagacc catggggtgc gggctgtgga gtgccgccat      960

cactcatgtc ccatgctctt tgccacagcc gaagccatgg aggcccacca caagagtcac     1020

tacgccttcc actgccccca ctgtgatttt gcttgttcca ataagcacct attccgtaaa     1080

cacaagaagc agggccaccc tggcagtgaa gagctgcgct gcaccttctg cccctttgcc     1140

accttcaacc cagtggctta ccaggatcat gtaggcaaga tgcatgctca tgaaaagatc     1200

caccagtgtc ctgagtgcaa ctttgccact gcccacaaga gggtgctcat ccgacacatg     1260

cttctacata cgggtgagaa gccccacaag tgtgagctgt gtgacttcac atgccgagac     1320

gtgagctacc tatccaagca catgctgacc cactccaaca ccaaggatta catgtgcact     1380

gaatgtggct atgtcaccaa gtggaagcac tacctccgtg tgcacatgcg aaaacatgca     1440

ggggacctca ggtatcagtg caaccagtgc tcctatcgct gtcaccgggc tgatcagctg     1500

agcagccaca agctgcggca tcagggcaag tctctgatgt gtgaggtgtg tgccttcgcc     1560

tgcaagcgga agtatgagct gcagaagcac atggcttccc agcaccaccc tggcacaccg     1620

gccccactct acccttgcca ctactgcagt taccagagcc gccacaagca ggctgtgctg     1680

agccatgaga actgcaagca tacccgcctc cgtgagttcc actgtgccct ctgtgactac     1740

cgcaccttca gcaacaccac actcttgttc cataaacgca aggcccatgg ctatgtacct     1800

ggagaccagg cctggcagct ccgctatgca agccaggagc cagaaggggc catgcagggc     1860

ccaacacccc caccagattc agagccctca aaccagctgt cagcccgacc tgaggggcca     1920

ggtcacgaac ctgggactgt ggtggacccc agcttggacc aggccctgcc agagatgagt     1980

gaggaggtca acactggaag acaggagggc agtgaggctc cccatggggg tgacctgggt     2040

ggcagtccca gcccagcaga ggtggaggag ggcagctgca cactacacct agaggccctg     2100

ggagtagagc tggagtctgt gactgagcca ccccttgagg aggtcactga aacagcccct     2160

atggagttca ggcccctggg actggaaggg ccagatggac tggaaggacc agagctatct     2220

agctttgaag gtattgggac ttctgacttg agtgctgaag aaaatcccct tctggaaaag     2280

ccagtgtctg agccctccac aaatcctcca tccttagagg aggctcctaa caactgggta     2340

ggaaccttca agacaactcc acctgctgag acagcaccct tgcccccatt acctgagtca     2400

gagtcattac tcaaggccct aaggagacag gacaaagaac aagcagaggc attggtgcta     2460

gaggggcggg tgcagatggt agtgatccag ggagaggggc gagccttccg ctgcccacac     2520

tgccctttta tcactcgccg ggagaaggcc ctgaatctgc actccaggac tgggtgccaa     2580

ggccgccgag agcccctgct gtgccccgag tgtggggcta gcttcaagca acaacgcggc     2640

ctcagcaccc acctgctgaa gaagtgccct gttctactca gaaagaacaa gggcttgccc     2700

agaccagatt cacccatccc tctgcaacct gtgctcccag gtacccaggc ctcagaggac     2760

acagaaagtg ggaagccccc acctgcatca caagaagcag agctactgct tccaaaagat     2820

gctcctttgg agcttcccag ggagccagaa gaaacagaag agcctcttgc cacagtctct     2880

ggttccccag tccctcctgc aggaaactcc ttgcccacag aggcccctaa gaagcactgc     2940

tttgacccag tccctcctgc aggaaactcc tcacccacgg aggcccctaa gaagcaccac     3000

cttgacccag tccctcctgc aggaaactcc tcacccacag aggccctgaa gaagcaccgc     3060

tttgagcagg gcaagtttca ctgcaactcc tgcccattcc tttgttcccg gctctcctct     3120

attacctctc acgtggctga aggctgcagg gggggacgtg gcgggggagg aaaacgaggg     3180

accccccaga cccagcctga tgtgtccccg ttgagcaatg gggactctgc tcccccgaag     3240

aatgggagta cagagtccag ctctggtgat ggggatacag ttctggttca aaagcagaag     3300

ggggctcgct tctcctgccc tacatgtccc tttagctgcc agcaggaacg ggctctgagg     3360

actcaccaga tccggggctg ccccctcgag gagtctggag agctgcactg cagcctctgc     3420

ccattcactg ctcctgctgc cactgcctta aggctccacc agaagcggag gcaccccact     3480

gcagccccag cccgtgggcc ccggccccat ctacagtgtg gggactgtgg cttcacctgt     3540

aaacagagcc gttgcatgca gcagcaccgg cggctcaagc acgagggggt gaagccccat     3600

cagtgcccct tctgtgactt ttcgaccacc agacggtacc ggttagaggc tcaccagtcc     3660

cgacacacag gcattggccg catcccctgc agctcttgcc cccagacgtt tggtaccaac     3720

tcgaaactgc gcttgcaccg gttaagggta catgacaaaa cacctaccca cttctgtcca     3780

ctttgtgact atagtggcta ccttcgccat gacatcactc gtcatgtcaa cagctgccac     3840

caaggcaccc cagcctttgc ctgctcccag tgtgaagccc agttcagctc agagacagca     3900

cttaagcagc atgctctgcg ccgacacccc gagcctgcac agcctgcccc tggctctcct     3960

gcagagacca ctgagggccc cctgcactgt tcccgctgtg ggttgctgtg ccccagccct     4020

gccagcttac gaggacacac ccgtaaacag cacccacggc ttgagtgtgg ggcctgccag     4080

gaggccttcc ctagccgact ggctctggat gagcaccgga ggcagcagca tttcagccac     4140

cgctgtcagc tctgtgactt tgctgcccgg gagcgggtgg gcctggtaaa gcactacctg     4200

gaacagcatg aggagacttc agcagccgtg gcagcctcag atggggatgg ggatgctggc     4260

cagcccccgc tacactgccc cttttgtgac ttcacatgcc gccatcagct ggtactagat     4320

caccatgtga aagggcatgg gggcactcgt ctctacaagt gcaccgattg tgcttacagc     4380

accaagaacc gacagaagat cacctggcac agccgcatcc acactgggga aaagccttac     4440

cactgtcacc tctgccccta tgcctgtgct gatccctctc gtctcaagta ccacatgcgg     4500

atccacaagg aggaacggaa gtacctgtgc cctgagtgtg gctacaagtg caagtgggtc     4560

aaccagctga aataccacat gaccaagcat acag                                 4594


<210> 14
<211> 1531
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..1531
<223> /mol_type="protein"
      /note="ZNF142 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 14
Met Thr Asp Pro Leu Leu Asp Ser Gln Pro Ala Ser Ser Thr Gly Glu 
1               5                   10                   15    
Met Asp Gly Leu Cys Pro Glu Leu Leu Leu Ile Pro Pro Pro Leu Ser 
            20                   25                  30        
Asn Arg Gly Ile Leu Gly Pro Val Gln Ser Pro Cys Pro Ser Arg Asp 
        35                   40                  45            
Pro Ala Pro Ile Pro Thr Glu Pro Gly Cys Leu Leu Val Glu Ala Thr 
    50                   55                  60                
Ala Thr Glu Glu Gly Pro Gly Asn Met Glu Ile Ile Val Glu Thr Val 
65                   70                  75                  80
Ala Gly Thr Leu Thr Pro Gly Ala Pro Gly Glu Thr Pro Ala Pro Lys 
                85                   90                  95    
Leu Pro Pro Gly Glu Arg Glu Pro Ser Gln Glu Ala Gly Thr Pro Leu 
            100                  105                110        
Pro Gly Gln Glu Thr Ala Glu Glu Glu Asn Val Glu Lys Glu Glu Lys 
        115                  120                125            
Ser Asp Thr Gln Lys Asp Ser Gln Lys Ala Val Asp Lys Gly Gln Gly 
    130                  135                140                
Ala Gln Arg Leu Glu Gly Asp Val Val Ser Gly Thr Glu Ser Leu Phe 
145                  150                155                  160
Lys Thr His Met Cys Pro Glu Cys Lys Arg Cys Phe Lys Lys Arg Thr 
                165                  170                175    
His Leu Val Glu His Leu His Leu His Phe Pro Asp Pro Ser Leu Gln 
            180                  185                190        
Cys Pro Asn Cys Gln Lys Phe Phe Thr Ser Lys Ser Lys Leu Lys Thr 
        195                  200                205            
His Leu Leu Arg Glu Leu Gly Glu Lys Ala His His Cys Pro Leu Cys 
    210                  215                220                
His Tyr Ser Ala Val Glu Arg Asn Ala Leu Asn Arg His Met Ala Ser 
225                  230                235                  240
Met His Glu Asp Ile Ser Asn Phe Tyr Ser Asp Thr Tyr Ala Cys Pro 
                245                  250                255    
Val Cys Arg Glu Glu Phe Arg Leu Ser Gln Ala Leu Lys Glu His Leu 
            260                  265                270        
Lys Ser His Thr Ala Ala Ala Ala Ala Glu Pro Leu Pro Leu Arg Cys 
        275                  280                285            
Phe Gln Glu Gly Cys Ser Tyr Ala Ala Pro Asp Arg Lys Ala Phe Ile 
    290                  295                300                
Lys His Leu Lys Glu Thr His Gly Val Arg Ala Val Glu Cys Arg His 
305                  310                315                  320
His Ser Cys Pro Met Leu Phe Ala Thr Ala Glu Ala Met Glu Ala His 
                325                  330                335    
His Lys Ser His Tyr Ala Phe His Cys Pro His Cys Asp Phe Ala Cys 
            340                  345                350        
Ser Asn Lys His Leu Phe Arg Lys His Lys Lys Gln Gly His Pro Gly 
        355                  360                365            
Ser Glu Glu Leu Arg Cys Thr Phe Cys Pro Phe Ala Thr Phe Asn Pro 
    370                  375                380                
Val Ala Tyr Gln Asp His Val Gly Lys Met His Ala His Glu Lys Ile 
385                  390                395                  400
His Gln Cys Pro Glu Cys Asn Phe Ala Thr Ala His Lys Arg Val Leu 
                405                  410                415    
Ile Arg His Met Leu Leu His Thr Gly Glu Lys Pro His Lys Cys Glu 
            420                  425                430        
Leu Cys Asp Phe Thr Cys Arg Asp Val Ser Tyr Leu Ser Lys His Met 
        435                  440                445            
Leu Thr His Ser Asn Thr Lys Asp Tyr Met Cys Thr Glu Cys Gly Tyr 
    450                  455                460                
Val Thr Lys Trp Lys His Tyr Leu Arg Val His Met Arg Lys His Ala 
465                  470                475                  480
Gly Asp Leu Arg Tyr Gln Cys Asn Gln Cys Ser Tyr Arg Cys His Arg 
                485                  490                495    
Ala Asp Gln Leu Ser Ser His Lys Leu Arg His Gln Gly Lys Ser Leu 
            500                  505                510        
Met Cys Glu Val Cys Ala Phe Ala Cys Lys Arg Lys Tyr Glu Leu Gln 
        515                  520                525            
Lys His Met Ala Ser Gln His His Pro Gly Thr Pro Ala Pro Leu Tyr 
    530                  535                540                
Pro Cys His Tyr Cys Ser Tyr Gln Ser Arg His Lys Gln Ala Val Leu 
545                  550                555                  560
Ser His Glu Asn Cys Lys His Thr Arg Leu Arg Glu Phe His Cys Ala 
                565                  570                575    
Leu Cys Asp Tyr Arg Thr Phe Ser Asn Thr Thr Leu Leu Phe His Lys 
            580                  585                590        
Arg Lys Ala His Gly Tyr Val Pro Gly Asp Gln Ala Trp Gln Leu Arg 
        595                  600                605            
Tyr Ala Ser Gln Glu Pro Glu Gly Ala Met Gln Gly Pro Thr Pro Pro 
    610                  615                620                
Pro Asp Ser Glu Pro Ser Asn Gln Leu Ser Ala Arg Pro Glu Gly Pro 
625                  630                635                  640
Gly His Glu Pro Gly Thr Val Val Asp Pro Ser Leu Asp Gln Ala Leu 
                645                  650                655    
Pro Glu Met Ser Glu Glu Val Asn Thr Gly Arg Gln Glu Gly Ser Glu 
            660                  665                670        
Ala Pro His Gly Gly Asp Leu Gly Gly Ser Pro Ser Pro Ala Glu Val 
        675                  680                685            
Glu Glu Gly Ser Cys Thr Leu His Leu Glu Ala Leu Gly Val Glu Leu 
    690                  695                700                
Glu Ser Val Thr Glu Pro Pro Leu Glu Glu Val Thr Glu Thr Ala Pro 
705                  710                715                  720
Met Glu Phe Arg Pro Leu Gly Leu Glu Gly Pro Asp Gly Leu Glu Gly 
                725                  730                735    
Pro Glu Leu Ser Ser Phe Glu Gly Ile Gly Thr Ser Asp Leu Ser Ala 
            740                  745                750        
Glu Glu Asn Pro Leu Leu Glu Lys Pro Val Ser Glu Pro Ser Thr Asn 
        755                  760                765            
Pro Pro Ser Leu Glu Glu Ala Pro Asn Asn Trp Val Gly Thr Phe Lys 
    770                  775                780                
Thr Thr Pro Pro Ala Glu Thr Ala Pro Leu Pro Pro Leu Pro Glu Ser 
785                  790                795                  800
Glu Ser Leu Leu Lys Ala Leu Arg Arg Gln Asp Lys Glu Gln Ala Glu 
                805                  810                815    
Ala Leu Val Leu Glu Gly Arg Val Gln Met Val Val Ile Gln Gly Glu 
            820                  825                830        
Gly Arg Ala Phe Arg Cys Pro His Cys Pro Phe Ile Thr Arg Arg Glu 
        835                  840                845            
Lys Ala Leu Asn Leu His Ser Arg Thr Gly Cys Gln Gly Arg Arg Glu 
    850                  855                860                
Pro Leu Leu Cys Pro Glu Cys Gly Ala Ser Phe Lys Gln Gln Arg Gly 
865                  870                875                  880
Leu Ser Thr His Leu Leu Lys Lys Cys Pro Val Leu Leu Arg Lys Asn 
                885                  890                895    
Lys Gly Leu Pro Arg Pro Asp Ser Pro Ile Pro Leu Gln Pro Val Leu 
            900                  905                910        
Pro Gly Thr Gln Ala Ser Glu Asp Thr Glu Ser Gly Lys Pro Pro Pro 
        915                  920                925            
Ala Ser Gln Glu Ala Glu Leu Leu Leu Pro Lys Asp Ala Pro Leu Glu 
    930                  935                940                
Leu Pro Arg Glu Pro Glu Glu Thr Glu Glu Pro Leu Ala Thr Val Ser 
945                  950                955                  960
Gly Ser Pro Val Pro Pro Ala Gly Asn Ser Leu Pro Thr Glu Ala Pro 
                965                  970                975    
Lys Lys His Cys Phe Asp Pro Val Pro Pro Ala Gly Asn Ser Ser Pro 
            980                  985                990        
Thr Glu Ala Pro Lys Lys His His Leu Asp Pro Val Pro Pro Ala Gly 
        995                  1000                1005            
Asn Ser Ser Pro Thr Glu Ala Leu Lys Lys His Arg Phe Glu Gln Gly 
    1010                1015                1020                
Lys Phe His Cys Asn Ser Cys Pro Phe Leu Cys Ser Arg Leu Ser Ser 
1025                1030                1035                1040
Ile Thr Ser His Val Ala Glu Gly Cys Arg Gly Gly Arg Gly Gly Gly 
                1045                1050                1055    
Gly Lys Arg Gly Thr Pro Gln Thr Gln Pro Asp Val Ser Pro Leu Ser 
            1060                1065                1070        
Asn Gly Asp Ser Ala Pro Pro Lys Asn Gly Ser Thr Glu Ser Ser Ser 
        1075                1080                1085            
Gly Asp Gly Asp Thr Val Leu Val Gln Lys Gln Lys Gly Ala Arg Phe 
    1090                1095                1100                
Ser Cys Pro Thr Cys Pro Phe Ser Cys Gln Gln Glu Arg Ala Leu Arg 
1105                1110                1115                1120
Thr His Gln Ile Arg Gly Cys Pro Leu Glu Glu Ser Gly Glu Leu His 
                1125                1130                1135    
Cys Ser Leu Cys Pro Phe Thr Ala Pro Ala Ala Thr Ala Leu Arg Leu 
            1140                1145                1150        
His Gln Lys Arg Arg His Pro Thr Ala Ala Pro Ala Arg Gly Pro Arg 
        1155                1160                1165            
Pro His Leu Gln Cys Gly Asp Cys Gly Phe Thr Cys Lys Gln Ser Arg 
    1170                1175                1180                
Cys Met Gln Gln His Arg Arg Leu Lys His Glu Gly Val Lys Pro His 
1185                1190                1195                1200
Gln Cys Pro Phe Cys Asp Phe Ser Thr Thr Arg Arg Tyr Arg Leu Glu 
                1205                1210                1215    
Ala His Gln Ser Arg His Thr Gly Ile Gly Arg Ile Pro Cys Ser Ser 
            1220                1225                1230        
Cys Pro Gln Thr Phe Gly Thr Asn Ser Lys Leu Arg Leu His Arg Leu 
        1235                1240                1245            
Arg Val His Asp Lys Thr Pro Thr His Phe Cys Pro Leu Cys Asp Tyr 
    1250                1255                1260                
Ser Gly Tyr Leu Arg His Asp Ile Thr Arg His Val Asn Ser Cys His 
1265                1270                1275                1280
Gln Gly Thr Pro Ala Phe Ala Cys Ser Gln Cys Glu Ala Gln Phe Ser 
                1285                1290                1295    
Ser Glu Thr Ala Leu Lys Gln His Ala Leu Arg Arg His Pro Glu Pro 
            1300                1305                1310        
Ala Gln Pro Ala Pro Gly Ser Pro Ala Glu Thr Thr Glu Gly Pro Leu 
        1315                1320                1325            
His Cys Ser Arg Cys Gly Leu Leu Cys Pro Ser Pro Ala Ser Leu Arg 
    1330                1335                1340                
Gly His Thr Arg Lys Gln His Pro Arg Leu Glu Cys Gly Ala Cys Gln 
1345                1350                1355                1360
Glu Ala Phe Pro Ser Arg Leu Ala Leu Asp Glu His Arg Arg Gln Gln 
                1365                1370                1375    
His Phe Ser His Arg Cys Gln Leu Cys Asp Phe Ala Ala Arg Glu Arg 
            1380                1385                1390        
Val Gly Leu Val Lys His Tyr Leu Glu Gln His Glu Glu Thr Ser Ala 
        1395                1400                1405            
Ala Val Ala Ala Ser Asp Gly Asp Gly Asp Ala Gly Gln Pro Pro Leu 
    1410                1415                1420                
His Cys Pro Phe Cys Asp Phe Thr Cys Arg His Gln Leu Val Leu Asp 
1425                1430                1435                1440
His His Val Lys Gly His Gly Gly Thr Arg Leu Tyr Lys Cys Thr Asp 
                1445                1450                1455    
Cys Ala Tyr Ser Thr Lys Asn Arg Gln Lys Ile Thr Trp His Ser Arg 
            1460                1465                1470        
Ile His Thr Gly Glu Lys Pro Tyr His Cys His Leu Cys Pro Tyr Ala 
        1475                1480                1485            
Cys Ala Asp Pro Ser Arg Leu Lys Tyr His Met Arg Ile His Lys Glu 
    1490                1495                1500                
Glu Arg Lys Tyr Leu Cys Pro Glu Cys Gly Tyr Lys Cys Lys Trp Val 
1505                1510                1515                1520
Asn Gln Leu Lys Tyr His Met Thr Lys His Thr 
                1525                1530    

<210> 15
<211> 1356
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1356
<223> /mol_type="DNA"
      /note="PTK6 (CCDS nucleotide sequence of PTK6 (Gene ID: 5753))"
      /organism="Homo sapiens"

<400> 15
atggtgtccc gggaccaggc tcacctgggc cccaagtatg tgggcctctg ggacttcaag      60

tcccggacgg acgaggagct gagcttccgc gcgggggacg tcttccacgt ggccaggaag     120

gaggagcagt ggtggtgggc cacgctgctg gacgaggcgg gtggggccgt ggcccagggc     180

tatgtgcccc acaactacct ggccgagagg gagacggtgg agtcggaacc gtggttcttt     240

ggctgcatct cccgctcgga agctgtgcgt cggctgcagg ccgagggcaa cgccacgggc     300

gccttcctga tcagggtcag cgagaagccg agtgccgact acgtcctgtc ggtgcgggac     360

acgcaggctg tgcggcacta caagatctgg cggcgtgccg ggggccggct gcacctgaac     420

gaggcggtgt ccttcctcag cctgcccgag cttgtgaact accacagggc ccagagcctg     480

tcccacggcc tgcggctggc cgcgccctgc cggaagcacg agcctgagcc cctgccccat     540

tgggatgact gggagaggcc gagggaggag ttcacgctct gcaggaagct ggggtccggc     600

tactttgggg aggtcttcga ggggctctgg aaagaccggg tccaggtggc cattaaggtg     660

atttctcgag acaacctcct gcaccagcag atgctgcagt cggagatcca ggccatgaag     720

aagctgcggc acaaacacat cctggcgctg tacgccgtgg tgtccgtggg ggaccccgtg     780

tacatcatca cggagctcat ggccaagggc agcctgctgg agctgctccg cgactctgat     840

gagaaagtcc tgcccgtttc ggagctgctg gacatcgcct ggcaggtggc tgagggcatg     900

tgttacctgg agtcgcagaa ttacatccac cgggacctgg ccgccaggaa catcctcgtc     960

ggggaaaaca ccctctgcaa agttggggac ttcgggttag ccaggcttat caaggaggac    1020

gtctacctct cccatgacca caatatcccc tacaagtgga cggcccctga agcgctctcc    1080

cgaggccatt actccaccaa atccgacgtc tggtcctttg ggattctcct gcatgagatg    1140

ttcagcaggg gtcaggtgcc ctacccaggc atgtccaacc atgaggcctt cctgagggtg    1200

gacgccggct accgcatgcc ctgccctctg gagtgcccgc ccagcgtgca caagctgatg    1260

ctgacatgct ggtgcaggga ccccgagcag agaccctgct tcaaggccct gcgggagagg    1320

ctctccagct tcaccagcta cgagaacccg acctga                              1356


<210> 16
<211> 451
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..451
<223> /mol_type="protein"
      /note="PTK6 (full-length protein)"
      /organism="Homo sapiens"

<400> 16
Met Val Ser Arg Asp Gln Ala His Leu Gly Pro Lys Tyr Val Gly Leu 
1               5                   10                   15    
Trp Asp Phe Lys Ser Arg Thr Asp Glu Glu Leu Ser Phe Arg Ala Gly 
            20                   25                  30        
Asp Val Phe His Val Ala Arg Lys Glu Glu Gln Trp Trp Trp Ala Thr 
        35                   40                  45            
Leu Leu Asp Glu Ala Gly Gly Ala Val Ala Gln Gly Tyr Val Pro His 
    50                   55                  60                
Asn Tyr Leu Ala Glu Arg Glu Thr Val Glu Ser Glu Pro Trp Phe Phe 
65                   70                  75                  80
Gly Cys Ile Ser Arg Ser Glu Ala Val Arg Arg Leu Gln Ala Glu Gly 
                85                   90                  95    
Asn Ala Thr Gly Ala Phe Leu Ile Arg Val Ser Glu Lys Pro Ser Ala 
            100                  105                110        
Asp Tyr Val Leu Ser Val Arg Asp Thr Gln Ala Val Arg His Tyr Lys 
        115                  120                125            
Ile Trp Arg Arg Ala Gly Gly Arg Leu His Leu Asn Glu Ala Val Ser 
    130                  135                140                
Phe Leu Ser Leu Pro Glu Leu Val Asn Tyr His Arg Ala Gln Ser Leu 
145                  150                155                  160
Ser His Gly Leu Arg Leu Ala Ala Pro Cys Arg Lys His Glu Pro Glu 
                165                  170                175    
Pro Leu Pro His Trp Asp Asp Trp Glu Arg Pro Arg Glu Glu Phe Thr 
            180                  185                190        
Leu Cys Arg Lys Leu Gly Ser Gly Tyr Phe Gly Glu Val Phe Glu Gly 
        195                  200                205            
Leu Trp Lys Asp Arg Val Gln Val Ala Ile Lys Val Ile Ser Arg Asp 
    210                  215                220                
Asn Leu Leu His Gln Gln Met Leu Gln Ser Glu Ile Gln Ala Met Lys 
225                  230                235                  240
Lys Leu Arg His Lys His Ile Leu Ala Leu Tyr Ala Val Val Ser Val 
                245                  250                255    
Gly Asp Pro Val Tyr Ile Ile Thr Glu Leu Met Ala Lys Gly Ser Leu 
            260                  265                270        
Leu Glu Leu Leu Arg Asp Ser Asp Glu Lys Val Leu Pro Val Ser Glu 
        275                  280                285            
Leu Leu Asp Ile Ala Trp Gln Val Ala Glu Gly Met Cys Tyr Leu Glu 
    290                  295                300                
Ser Gln Asn Tyr Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Val 
305                  310                315                  320
Gly Glu Asn Thr Leu Cys Lys Val Gly Asp Phe Gly Leu Ala Arg Leu 
                325                  330                335    
Ile Lys Glu Asp Val Tyr Leu Ser His Asp His Asn Ile Pro Tyr Lys 
            340                  345                350        
Trp Thr Ala Pro Glu Ala Leu Ser Arg Gly His Tyr Ser Thr Lys Ser 
        355                  360                365            
Asp Val Trp Ser Phe Gly Ile Leu Leu His Glu Met Phe Ser Arg Gly 
    370                  375                380                
Gln Val Pro Tyr Pro Gly Met Ser Asn His Glu Ala Phe Leu Arg Val 
385                  390                395                  400
Asp Ala Gly Tyr Arg Met Pro Cys Pro Leu Glu Cys Pro Pro Ser Val 
                405                  410                415    
His Lys Leu Met Leu Thr Cys Trp Cys Arg Asp Pro Glu Gln Arg Pro 
            420                  425                430        
Cys Phe Lys Ala Leu Arg Glu Arg Leu Ser Ser Phe Thr Ser Tyr Glu 
        435                  440                445            
Asn Pro Thr 
    450    

<210> 17
<211> 524
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..524
<223> /mol_type="DNA"
      /note="PTK6 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 17
actctgatga gaaagtcctg cccgtttcgg agctgctgga catcgcctgg caggtggctg      60

agggcatgtg ttacctggag tcgcagaatt acatccaccg ggacctggcc gccaggaaca     120

tcctcgtcgg ggaaaacacc ctctgcaaag ttggggactt cgggttagcc aggcttatca     180

aggaggacgt ctacctctcc catgaccaca atatccccta caagtggacg gcccctgaag     240

cgctctcccg aggccattac tccaccaaat ccgacgtctg gtcctttggg attctcctgc     300

atgagatgtt cagcaggggt caggtgccct acccaggcat gtccaaccat gaggccttcc     360

tgagggtgga cgccggctac cgcatgccct gccctctgga gtgcccgccc agcgtgcaca     420

agctgatgct gacatgctgg tgcagggacc ccgagcagag accctgcttc aaggccctgc     480

gggagaggct ctccagcttc accagctacg agaacccgac ctga                      524


<210> 18
<211> 175
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..175
<223> /mol_type="protein"
      /note="PTK6 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 18
Arg Asp Ser Asp Glu Lys Val Leu Pro Val Ser Glu Leu Leu Asp Ile 
1               5                   10                   15    
Ala Trp Gln Val Ala Glu Gly Met Cys Tyr Leu Glu Ser Gln Asn Tyr 
            20                   25                  30        
Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Gly Glu Asn Thr 
        35                   40                  45            
Leu Cys Lys Val Gly Asp Phe Gly Leu Ala Arg Leu Ile Lys Glu Asp 
    50                   55                  60                
Val Tyr Leu Ser His Asp His Asn Ile Pro Tyr Lys Trp Thr Ala Pro 
65                   70                  75                  80
Glu Ala Leu Ser Arg Gly His Tyr Ser Thr Lys Ser Asp Val Trp Ser 
                85                   90                  95    
Phe Gly Ile Leu Leu His Glu Met Phe Ser Arg Gly Gln Val Pro Tyr 
            100                  105                110        
Pro Gly Met Ser Asn His Glu Ala Phe Leu Arg Val Asp Ala Gly Tyr 
        115                  120                125            
Arg Met Pro Cys Pro Leu Glu Cys Pro Pro Ser Val His Lys Leu Met 
    130                  135                140                
Leu Thr Cys Trp Cys Arg Asp Pro Glu Gln Arg Pro Cys Phe Lys Ala 
145                  150                155                  160
Leu Arg Glu Arg Leu Ser Ser Phe Thr Ser Tyr Glu Asn Pro Thr 
                165                  170                175

<210> 19
<211> 5118
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..5118
<223> /mol_type="DNA"
      /note="ZNF142-PTK6 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 19
atgacagacc cccttttgga ctcacagcca gccagtagca ccggggagat ggatggactg       60

tgccctgagc tattgctgat ccccccgcct ctctctaacc gtggaatcct ggggcctgtc      120

cagagcccct gtccttcccg ggaccctgca cctataccta ctgagccagg ctgcctgctg      180

gtagaggcca cagcaactga agagggacca gggaacatgg agatcattgt ggagacagta      240

gctggaaccc tgaccccagg tgctcctgga gagaccccag ctcccaaact gcctccagga      300

gagagagaac cttcacagga agcaggtaca cccttgcctg ggcaggagac agctgaagag      360

gagaatgtag agaaagaaga gaagagtgac acccagaagg actcccaaaa ggctgtggat      420

aaaggccaag gggctcagcg gctggaaggg gatgtggtct ctggcaccga gtccctcttc      480

aagacccata tgtgtccaga gtgtaagcgc tgctttaaga agcggactca tctggtggag      540

cacctgcatc tccacttccc agaccccagc ctccagtgcc ctaactgcca gaagttcttc      600

accagtaaga gcaagctcaa gacccatctg ctgcgggagc tgggtgaaaa ggcccaccac      660

tgcccactgt gccactacag tgcggtggag aggaatgcac tcaaccgcca catggccagc      720

atgcatgaag atatttccaa cttctactca gacacctatg cctgtcctgt ctgccgtgag      780

gaattccgcc tcagccaggc cctaaaggag cacctcaaga gccacacggc agcagccgca      840

gcagagccat taccccttcg ctgctttcag gagggctgca gctatgcagc acccgaccgc      900

aaggccttca ttaagcacct gaaggagacc catggggtgc gggctgtgga gtgccgccat      960

cactcatgtc ccatgctctt tgccacagcc gaagccatgg aggcccacca caagagtcac     1020

tacgccttcc actgccccca ctgtgatttt gcttgttcca ataagcacct attccgtaaa     1080

cacaagaagc agggccaccc tggcagtgaa gagctgcgct gcaccttctg cccctttgcc     1140

accttcaacc cagtggctta ccaggatcat gtaggcaaga tgcatgctca tgaaaagatc     1200

caccagtgtc ctgagtgcaa ctttgccact gcccacaaga gggtgctcat ccgacacatg     1260

cttctacata cgggtgagaa gccccacaag tgtgagctgt gtgacttcac atgccgagac     1320

gtgagctacc tatccaagca catgctgacc cactccaaca ccaaggatta catgtgcact     1380

gaatgtggct atgtcaccaa gtggaagcac tacctccgtg tgcacatgcg aaaacatgca     1440

ggggacctca ggtatcagtg caaccagtgc tcctatcgct gtcaccgggc tgatcagctg     1500

agcagccaca agctgcggca tcagggcaag tctctgatgt gtgaggtgtg tgccttcgcc     1560

tgcaagcgga agtatgagct gcagaagcac atggcttccc agcaccaccc tggcacaccg     1620

gccccactct acccttgcca ctactgcagt taccagagcc gccacaagca ggctgtgctg     1680

agccatgaga actgcaagca tacccgcctc cgtgagttcc actgtgccct ctgtgactac     1740

cgcaccttca gcaacaccac actcttgttc cataaacgca aggcccatgg ctatgtacct     1800

ggagaccagg cctggcagct ccgctatgca agccaggagc cagaaggggc catgcagggc     1860

ccaacacccc caccagattc agagccctca aaccagctgt cagcccgacc tgaggggcca     1920

ggtcacgaac ctgggactgt ggtggacccc agcttggacc aggccctgcc agagatgagt     1980

gaggaggtca acactggaag acaggagggc agtgaggctc cccatggggg tgacctgggt     2040

ggcagtccca gcccagcaga ggtggaggag ggcagctgca cactacacct agaggccctg     2100

ggagtagagc tggagtctgt gactgagcca ccccttgagg aggtcactga aacagcccct     2160

atggagttca ggcccctggg actggaaggg ccagatggac tggaaggacc agagctatct     2220

agctttgaag gtattgggac ttctgacttg agtgctgaag aaaatcccct tctggaaaag     2280

ccagtgtctg agccctccac aaatcctcca tccttagagg aggctcctaa caactgggta     2340

ggaaccttca agacaactcc acctgctgag acagcaccct tgcccccatt acctgagtca     2400

gagtcattac tcaaggccct aaggagacag gacaaagaac aagcagaggc attggtgcta     2460

gaggggcggg tgcagatggt agtgatccag ggagaggggc gagccttccg ctgcccacac     2520

tgccctttta tcactcgccg ggagaaggcc ctgaatctgc actccaggac tgggtgccaa     2580

ggccgccgag agcccctgct gtgccccgag tgtggggcta gcttcaagca acaacgcggc     2640

ctcagcaccc acctgctgaa gaagtgccct gttctactca gaaagaacaa gggcttgccc     2700

agaccagatt cacccatccc tctgcaacct gtgctcccag gtacccaggc ctcagaggac     2760

acagaaagtg ggaagccccc acctgcatca caagaagcag agctactgct tccaaaagat     2820

gctcctttgg agcttcccag ggagccagaa gaaacagaag agcctcatgc cacagtctct     2880

ggttccccag tccctcctgc aggaaactcc ttgcccacag aggcccctaa gaagcactgc     2940

tttgacccag tccctcctgc aggaaactcc tcacccacgg aggcccctaa gaagcaccac     3000

cttgacccag tccctcctgc aggaaactcc tcacccacag aggccctgaa gaagcaccgc     3060

tttgagcagg gcaagtttca ctgcaactcc tgcccattcc tttgttcccg gctctcctct     3120

attacctctc acgtggctga aggctgcagg gggggacgtg gcgggggagg aaaacgaggg     3180

accccccaga cccagcctga tgtgtccccg ttgagcaatg gggactctgc tcccccgaag     3240

aatgggagta cagagtccag ctctggtgat ggggatacag ttctggttca aaagcagaag     3300

ggggctcgct tctcctgccc tacatgtccc tttagctgcc agcaggaacg ggctctgagg     3360

actcaccaga tccggggctg ccccctcgag gagtctggag agctgcactg cagcctctgc     3420

ccattcactg ctcctgctgc cactgcctta aggctccacc agaagcggag gcaccccact     3480

gcagccccag cccgtgggcc ccggccccat ctacagtgtg gggactgtgg cttcacctgt     3540

aaacagagcc gttgcatgca gcagcaccgg cggctcaagc acgagggggt gaagccccat     3600

cagtgcccct tctgtgactt ttcgaccacc agacggtacc ggttagaggc tcaccagtcc     3660

cgacacacag gcattggccg catcccctgc agctcttgcc cccagacgtt tggtaccaac     3720

tcgaaactgc gcttgcaccg gttaagggta catgacaaaa cacctaccca cttctgtcca     3780

ctttgtgact atagtggcta ccttcgccat gacatcactc gtcatgtcaa cagctgccac     3840

caaggcaccc cagcctttgc ctgctcccag tgtgaagccc agttcagctc agagacagca     3900

cttaagcagc atgctctgcg ccgacacccc gagcctgcac agcctgcccc tggctctcct     3960

gcagagacca ctgagggccc cctgcactgt tcccgctgtg ggttgctgtg ccccagccct     4020

gccagcttac gaggacacac ccgtaaacag cacccacggc ttgagtgtgg ggcctgccag     4080

gaggccttcc ctagccgact ggctctggat gagcaccgga ggcagcagca tttcagccac     4140

cgctgtcagc tctgtgactt tgctgcccgg gagcgggtgg gcctggtaaa gcactacctg     4200

gaacagcatg aggagacttc agcagccgtg gcagcctcag atggggatgg ggatgctggc     4260

cagcccccgc tacactgccc cttttgtgac ttcacatgcc gccatcagct ggtactagat     4320

caccatgtga aagggcatgg gggcactcgt ctctacaagt gcaccgattg tgcttacagc     4380

accaagaacc gacagaagat cacctggcac agccgcatcc acactgggga aaagccttac     4440

cactgtcacc tctgccccta tgcctgtgct gatccctctc gtctcaagta ccacatgcgg     4500

atccacaagg aggaacggaa gtacctgtgc cctgagtgtg gctacaagtg caagtgggtc     4560

aaccagctga aataccacat gaccaagcat acagactctg atgagaaagt cctgcccgtt     4620

tcggagctgc tggacatcgc ctggcaggtg gctgagggca tgtgttacct ggagtcgcag     4680

aattacatcc accgggacct ggccgccagg aacatcctcg tcggggaaaa caccctctgc     4740

aaagttgggg acttcgggtt agccaggctt atcaaggagg acgtctacct ctcccatgac     4800

cacaatatcc cctacaagtg gacggcccct gaagcgctct cccgaggcca ttactccacc     4860

aaatccgacg tctggtcctt tgggattctc ctgcatgaga tgttcagcag gggtcaggtg     4920

ccctacccag gcatgtccaa ccatgaggcc ttcctgaggg tggacgccgg ctaccgcatg     4980

ccctgccctc tggagtgccc gcccagcgtg cacaagctga tgctgacatg ctggtgcagg     5040

gaccccgagc agagaccctg cttcaaggcc ctgcgggaga ggctctccag cttcaccagc     5100

tacgagaacc cgacctga                                                   5118


<210> 20
<211> 1705
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..1705
<223> /mol_type="protein"
      /note="ZNF142-PTK6 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 20
Met Thr Asp Pro Leu Leu Asp Ser Gln Pro Ala Ser Ser Thr Gly Glu 
1               5                   10                   15    
Met Asp Gly Leu Cys Pro Glu Leu Leu Leu Ile Pro Pro Pro Leu Ser 
            20                   25                  30        
Asn Arg Gly Ile Leu Gly Pro Val Gln Ser Pro Cys Pro Ser Arg Asp 
        35                   40                  45            
Pro Ala Pro Ile Pro Thr Glu Pro Gly Cys Leu Leu Val Glu Ala Thr 
    50                   55                  60                
Ala Thr Glu Glu Gly Pro Gly Asn Met Glu Ile Ile Val Glu Thr Val 
65                   70                  75                  80
Ala Gly Thr Leu Thr Pro Gly Ala Pro Gly Glu Thr Pro Ala Pro Lys 
                85                   90                  95    
Leu Pro Pro Gly Glu Arg Glu Pro Ser Gln Glu Ala Gly Thr Pro Leu 
            100                  105                110        
Pro Gly Gln Glu Thr Ala Glu Glu Glu Asn Val Glu Lys Glu Glu Lys 
        115                  120                125            
Ser Asp Thr Gln Lys Asp Ser Gln Lys Ala Val Asp Lys Gly Gln Gly 
    130                  135                140                
Ala Gln Arg Leu Glu Gly Asp Val Val Ser Gly Thr Glu Ser Leu Phe 
145                  150                155                  160
Lys Thr His Met Cys Pro Glu Cys Lys Arg Cys Phe Lys Lys Arg Thr 
                165                  170                175    
His Leu Val Glu His Leu His Leu His Phe Pro Asp Pro Ser Leu Gln 
            180                  185                190        
Cys Pro Asn Cys Gln Lys Phe Phe Thr Ser Lys Ser Lys Leu Lys Thr 
        195                  200                205            
His Leu Leu Arg Glu Leu Gly Glu Lys Ala His His Cys Pro Leu Cys 
    210                  215                220                
His Tyr Ser Ala Val Glu Arg Asn Ala Leu Asn Arg His Met Ala Ser 
225                  230                235                  240
Met His Glu Asp Ile Ser Asn Phe Tyr Ser Asp Thr Tyr Ala Cys Pro 
                245                  250                255    
Val Cys Arg Glu Glu Phe Arg Leu Ser Gln Ala Leu Lys Glu His Leu 
            260                  265                270        
Lys Ser His Thr Ala Ala Ala Ala Ala Glu Pro Leu Pro Leu Arg Cys 
        275                  280                285            
Phe Gln Glu Gly Cys Ser Tyr Ala Ala Pro Asp Arg Lys Ala Phe Ile 
    290                  295                300                
Lys His Leu Lys Glu Thr His Gly Val Arg Ala Val Glu Cys Arg His 
305                  310                315                  320
His Ser Cys Pro Met Leu Phe Ala Thr Ala Glu Ala Met Glu Ala His 
                325                  330                335    
His Lys Ser His Tyr Ala Phe His Cys Pro His Cys Asp Phe Ala Cys 
            340                  345                350        
Ser Asn Lys His Leu Phe Arg Lys His Lys Lys Gln Gly His Pro Gly 
        355                  360                365            
Ser Glu Glu Leu Arg Cys Thr Phe Cys Pro Phe Ala Thr Phe Asn Pro 
    370                  375                380                
Val Ala Tyr Gln Asp His Val Gly Lys Met His Ala His Glu Lys Ile 
385                  390                395                  400
His Gln Cys Pro Glu Cys Asn Phe Ala Thr Ala His Lys Arg Val Leu 
                405                  410                415    
Ile Arg His Met Leu Leu His Thr Gly Glu Lys Pro His Lys Cys Glu 
            420                  425                430        
Leu Cys Asp Phe Thr Cys Arg Asp Val Ser Tyr Leu Ser Lys His Met 
        435                  440                445            
Leu Thr His Ser Asn Thr Lys Asp Tyr Met Cys Thr Glu Cys Gly Tyr 
    450                  455                460                
Val Thr Lys Trp Lys His Tyr Leu Arg Val His Met Arg Lys His Ala 
465                  470                475                  480
Gly Asp Leu Arg Tyr Gln Cys Asn Gln Cys Ser Tyr Arg Cys His Arg 
                485                  490                495    
Ala Asp Gln Leu Ser Ser His Lys Leu Arg His Gln Gly Lys Ser Leu 
            500                  505                510        
Met Cys Glu Val Cys Ala Phe Ala Cys Lys Arg Lys Tyr Glu Leu Gln 
        515                  520                525            
Lys His Met Ala Ser Gln His His Pro Gly Thr Pro Ala Pro Leu Tyr 
    530                  535                540                
Pro Cys His Tyr Cys Ser Tyr Gln Ser Arg His Lys Gln Ala Val Leu 
545                  550                555                  560
Ser His Glu Asn Cys Lys His Thr Arg Leu Arg Glu Phe His Cys Ala 
                565                  570                575    
Leu Cys Asp Tyr Arg Thr Phe Ser Asn Thr Thr Leu Leu Phe His Lys 
            580                  585                590        
Arg Lys Ala His Gly Tyr Val Pro Gly Asp Gln Ala Trp Gln Leu Arg 
        595                  600                605            
Tyr Ala Ser Gln Glu Pro Glu Gly Ala Met Gln Gly Pro Thr Pro Pro 
    610                  615                620                
Pro Asp Ser Glu Pro Ser Asn Gln Leu Ser Ala Arg Pro Glu Gly Pro 
625                  630                635                  640
Gly His Glu Pro Gly Thr Val Val Asp Pro Ser Leu Asp Gln Ala Leu 
                645                  650                655    
Pro Glu Met Ser Glu Glu Val Asn Thr Gly Arg Gln Glu Gly Ser Glu 
            660                  665                670        
Ala Pro His Gly Gly Asp Leu Gly Gly Ser Pro Ser Pro Ala Glu Val 
        675                  680                685            
Glu Glu Gly Ser Cys Thr Leu His Leu Glu Ala Leu Gly Val Glu Leu 
    690                  695                700                
Glu Ser Val Thr Glu Pro Pro Leu Glu Glu Val Thr Glu Thr Ala Pro 
705                  710                715                  720
Met Glu Phe Arg Pro Leu Gly Leu Glu Gly Pro Asp Gly Leu Glu Gly 
                725                  730                735    
Pro Glu Leu Ser Ser Phe Glu Gly Ile Gly Thr Ser Asp Leu Ser Ala 
            740                  745                750        
Glu Glu Asn Pro Leu Leu Glu Lys Pro Val Ser Glu Pro Ser Thr Asn 
        755                  760                765            
Pro Pro Ser Leu Glu Glu Ala Pro Asn Asn Trp Val Gly Thr Phe Lys 
    770                  775                780                
Thr Thr Pro Pro Ala Glu Thr Ala Pro Leu Pro Pro Leu Pro Glu Ser 
785                  790                795                  800
Glu Ser Leu Leu Lys Ala Leu Arg Arg Gln Asp Lys Glu Gln Ala Glu 
                805                  810                815    
Ala Leu Val Leu Glu Gly Arg Val Gln Met Val Val Ile Gln Gly Glu 
            820                  825                830        
Gly Arg Ala Phe Arg Cys Pro His Cys Pro Phe Ile Thr Arg Arg Glu 
        835                  840                845            
Lys Ala Leu Asn Leu His Ser Arg Thr Gly Cys Gln Gly Arg Arg Glu 
    850                  855                860                
Pro Leu Leu Cys Pro Glu Cys Gly Ala Ser Phe Lys Gln Gln Arg Gly 
865                  870                875                  880
Leu Ser Thr His Leu Leu Lys Lys Cys Pro Val Leu Leu Arg Lys Asn 
                885                  890                895    
Lys Gly Leu Pro Arg Pro Asp Ser Pro Ile Pro Leu Gln Pro Val Leu 
            900                  905                910        
Pro Gly Thr Gln Ala Ser Glu Asp Thr Glu Ser Gly Lys Pro Pro Pro 
        915                  920                925            
Ala Ser Gln Glu Ala Glu Leu Leu Leu Pro Lys Asp Ala Pro Leu Glu 
    930                  935                940                
Leu Pro Arg Glu Pro Glu Glu Thr Glu Glu Pro His Ala Thr Val Ser 
945                  950                955                  960
Gly Ser Pro Val Pro Pro Ala Gly Asn Ser Leu Pro Thr Glu Ala Pro 
                965                  970                975    
Lys Lys His Cys Phe Asp Pro Val Pro Pro Ala Gly Asn Ser Ser Pro 
            980                  985                990        
Thr Glu Ala Pro Lys Lys His His Leu Asp Pro Val Pro Pro Ala Gly 
        995                  1000                1005            
Asn Ser Ser Pro Thr Glu Ala Leu Lys Lys His Arg Phe Glu Gln Gly 
    1010                1015                1020                
Lys Phe His Cys Asn Ser Cys Pro Phe Leu Cys Ser Arg Leu Ser Ser 
1025                1030                1035                1040
Ile Thr Ser His Val Ala Glu Gly Cys Arg Gly Gly Arg Gly Gly Gly 
                1045                1050                1055    
Gly Lys Arg Gly Thr Pro Gln Thr Gln Pro Asp Val Ser Pro Leu Ser 
            1060                1065                1070        
Asn Gly Asp Ser Ala Pro Pro Lys Asn Gly Ser Thr Glu Ser Ser Ser 
        1075                1080                1085            
Gly Asp Gly Asp Thr Val Leu Val Gln Lys Gln Lys Gly Ala Arg Phe 
    1090                1095                1100                
Ser Cys Pro Thr Cys Pro Phe Ser Cys Gln Gln Glu Arg Ala Leu Arg 
1105                1110                1115                1120
Thr His Gln Ile Arg Gly Cys Pro Leu Glu Glu Ser Gly Glu Leu His 
                1125                1130                1135    
Cys Ser Leu Cys Pro Phe Thr Ala Pro Ala Ala Thr Ala Leu Arg Leu 
            1140                1145                1150        
His Gln Lys Arg Arg His Pro Thr Ala Ala Pro Ala Arg Gly Pro Arg 
        1155                1160                1165            
Pro His Leu Gln Cys Gly Asp Cys Gly Phe Thr Cys Lys Gln Ser Arg 
    1170                1175                1180                
Cys Met Gln Gln His Arg Arg Leu Lys His Glu Gly Val Lys Pro His 
1185                1190                1195                1200
Gln Cys Pro Phe Cys Asp Phe Ser Thr Thr Arg Arg Tyr Arg Leu Glu 
                1205                1210                1215    
Ala His Gln Ser Arg His Thr Gly Ile Gly Arg Ile Pro Cys Ser Ser 
            1220                1225                1230        
Cys Pro Gln Thr Phe Gly Thr Asn Ser Lys Leu Arg Leu His Arg Leu 
        1235                1240                1245            
Arg Val His Asp Lys Thr Pro Thr His Phe Cys Pro Leu Cys Asp Tyr 
    1250                1255                1260                
Ser Gly Tyr Leu Arg His Asp Ile Thr Arg His Val Asn Ser Cys His 
1265                1270                1275                1280
Gln Gly Thr Pro Ala Phe Ala Cys Ser Gln Cys Glu Ala Gln Phe Ser 
                1285                1290                1295    
Ser Glu Thr Ala Leu Lys Gln His Ala Leu Arg Arg His Pro Glu Pro 
            1300                1305                1310        
Ala Gln Pro Ala Pro Gly Ser Pro Ala Glu Thr Thr Glu Gly Pro Leu 
        1315                1320                1325            
His Cys Ser Arg Cys Gly Leu Leu Cys Pro Ser Pro Ala Ser Leu Arg 
    1330                1335                1340                
Gly His Thr Arg Lys Gln His Pro Arg Leu Glu Cys Gly Ala Cys Gln 
1345                1350                1355                1360
Glu Ala Phe Pro Ser Arg Leu Ala Leu Asp Glu His Arg Arg Gln Gln 
                1365                1370                1375    
His Phe Ser His Arg Cys Gln Leu Cys Asp Phe Ala Ala Arg Glu Arg 
            1380                1385                1390        
Val Gly Leu Val Lys His Tyr Leu Glu Gln His Glu Glu Thr Ser Ala 
        1395                1400                1405            
Ala Val Ala Ala Ser Asp Gly Asp Gly Asp Ala Gly Gln Pro Pro Leu 
    1410                1415                1420                
His Cys Pro Phe Cys Asp Phe Thr Cys Arg His Gln Leu Val Leu Asp 
1425                1430                1435                1440
His His Val Lys Gly His Gly Gly Thr Arg Leu Tyr Lys Cys Thr Asp 
                1445                1450                1455    
Cys Ala Tyr Ser Thr Lys Asn Arg Gln Lys Ile Thr Trp His Ser Arg 
            1460                1465                1470        
Ile His Thr Gly Glu Lys Pro Tyr His Cys His Leu Cys Pro Tyr Ala 
        1475                1480                1485            
Cys Ala Asp Pro Ser Arg Leu Lys Tyr His Met Arg Ile His Lys Glu 
    1490                1495                1500                
Glu Arg Lys Tyr Leu Cys Pro Glu Cys Gly Tyr Lys Cys Lys Trp Val 
1505                1510                1515                1520
Asn Gln Leu Lys Tyr His Met Thr Lys His Thr Asp Ser Asp Glu Lys 
                1525                1530                1535    
Val Leu Pro Val Ser Glu Leu Leu Asp Ile Ala Trp Gln Val Ala Glu 
            1540                1545                1550        
Gly Met Cys Tyr Leu Glu Ser Gln Asn Tyr Ile His Arg Asp Leu Ala 
        1555                1560                1565            
Ala Arg Asn Ile Leu Val Gly Glu Asn Thr Leu Cys Lys Val Gly Asp 
    1570                1575                1580                
Phe Gly Leu Ala Arg Leu Ile Lys Glu Asp Val Tyr Leu Ser His Asp 
1585                1590                1595                1600
His Asn Ile Pro Tyr Lys Trp Thr Ala Pro Glu Ala Leu Ser Arg Gly 
                1605                1610                1615    
His Tyr Ser Thr Lys Ser Asp Val Trp Ser Phe Gly Ile Leu Leu His 
            1620                1625                1630        
Glu Met Phe Ser Arg Gly Gln Val Pro Tyr Pro Gly Met Ser Asn His 
        1635                1640                1645            
Glu Ala Phe Leu Arg Val Asp Ala Gly Tyr Arg Met Pro Cys Pro Leu 
    1650                1655                1660                
Glu Cys Pro Pro Ser Val His Lys Leu Met Leu Thr Cys Trp Cys Arg 
1665                1670                1675                1680
Asp Pro Glu Gln Arg Pro Cys Phe Lys Ala Leu Arg Glu Arg Leu Ser 
                1685                1690                1695    
Ser Phe Thr Ser Tyr Glu Asn Pro Thr 
            1700                1705

<210> 21
<211> 2574
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2574
<223> /mol_type="DNA"
      /note="AGAP1 (nucleotide sequence of AGAP1 (Gene ID: 116987))"
      /organism="Homo sapiens"

<400> 21
atgaactacc agcagcagct ggccaactcg gctgccatcc gggccgagat ccagcgcttc      60

gagtcggtcc accccaacat ctactccatc tacgagctgc tggagcgcgt ggaggagccg     120

gtgctgcaga accagatccg ggagcacgtc atcgccatcg aagatgcctt cgtgaacagc     180

caggaatgga cgctgagtcg atctgtcccg gagctcaaag tgggaattgt gggtaacttg     240

gccagcggca agtctgccct ggtgcaccgg tacctgacgg gcacatatgt ccaggaggag     300

tctccggaag gtggcaggtt caagaaagag attgtcgttg atggacagag ctatctgctg     360

ctgatcagag atgaaggggg ccccccggag gcgcagtttg ccatgtgggt ggacgctgtt     420

atatttgtct tcagcttgga ggatgaaata agtttccaga ccgtttacca ctactacagt     480

cgaatggcca actatcggaa cacgagcgag attcctctgg ttctggtggg aacccaggat     540

gccataagtt ctgctaaccc gagggtcatc gatgacgcca gggcgaggaa gctctccaac     600

gacctgaaac ggtgcacgta ctacgagacg tgtgctacat acgggctgaa tgtggagagg     660

gtcttccagg acgttgccca gaagattgtt gccacaagga agaagcagca gctgtccata     720

ggaccctgca agtcgctacc taattctccc agccattcct ccgtctgttc cgcgcaggtg     780

tctgccgtgc acatcagcca gacaagtaat ggaggtggga gtttaagcga ctattcctcc     840

tccgttccat cgactcccag caccagccag aaggaacttc ggatcgatgt tcctcccact     900

gccaacacgc ccacgcccgt tcgcaagcag tctaagcgcc ggtccaacct gttcacctct     960

cggaaaggga gcgacccaga caaagagaag aaaggcctgg agagtcgtgc ggacagcatt    1020

gggagcggcc gagccatccc aattaaacag ggcatgctgt tgaagcgaag tggcaaatcg    1080

ttgaataaag agtggaaaaa gaaatatgtc accctgtgtg acaatggcgt gctgacctat    1140

catcccagtt tacatgatta catgcagaat gttcatggta aggagattga ccttctgaga    1200

accactgtga aagtcccagg gaagaggcca ccccgagcca cgtcagcctg cgcacccatc    1260

tccagcccta aaaccaatgg cctatccaag gacatgagca gtttacacat ctcacccaat    1320

tcagggaatg tcactagtgc atctgggtct cagatggcaa gcggcatcag cctggtctcc    1380

ttcaacagcc gacccgacgg catgcaccag cgctcctact cagtctccag tgccgaccag    1440

tggagtgagg ctacggtcat tgcaaactcg gccatcagca gtgacacagg gctgggtgac    1500

tccgtatgct ccagccccag tatctccagc accaccagcc ccaagctcga cccgcccccc    1560

tcccctcacg ccaacagaaa gaagcaccga aggaagaaaa gcactagcaa cttcaaagcc    1620

gacggcctgt ccggcactgc tgaagaacaa gaagaaaatt ttgagtttat cattgtgtcc    1680

ctcactggcc aaacatggca ctttgaagcc acgacgtatg aggagcggga cgcctgggtc    1740

caagccatcg agagccagat cctggccagc ctgcagtcgt gcgagagcag caagaacaag    1800

tcccggctga cgagccagag cgaggccatg gccctgcagt cgatccggaa catgcgcggg    1860

aactcccact gtgtggactg cgagacccag aatcccaact gggccagttt gaacttggga    1920

gccctcatgt gcatcgaatg ctcagggatc caccggaatc ttggcaccca cctttcccga    1980

gtccgatctc tggacctgga tgactggcca gtcgagctca tcaaggtgat gtcatccatc    2040

gggaacgagc tagccaacag cgtctgggaa gagagcagcc aggggcggac gaaaccatcg    2100

gtagactcca caagggaaga gaaggaacgg tggatccgtg ccaagtacga gcagaagctc    2160

ttcctggccc cgctgccctg cacggagctg tccctgggcc agcacctgct gcgggccacc    2220

gccgacgagg acctgcggac ggccatcctg ctgctggcac acggctcccg ggacgaggtg    2280

aacgagacct gcggggaggg agacggccgc acggcgctgc atctggcctg ccgcaagggg    2340

aatgtggtcc tggcgcagct cctgatctgg tacggagtgg acgtcacggc ccgagatgcc    2400

cacgggaaca cagctctggc ctacgcccgg caggcctcca gccaggagtg catcgacgtg    2460

ctgctgcagt acggctgccc cgacgagcgc ttcgtgctca tggccacccc taacctgtcc    2520

aggagaaaca ataaccggaa caacagcagt gggagggtgc ccaccatcat ctga          2574


<210> 22
<211> 857
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..857
<223> /mol_type="protein"
      /note="AGAP1 (full-length protein)"
      /organism="Homo sapiens"

<400> 22
Met Asn Tyr Gln Gln Gln Leu Ala Asn Ser Ala Ala Ile Arg Ala Glu 
1               5                   10                   15    
Ile Gln Arg Phe Glu Ser Val His Pro Asn Ile Tyr Ser Ile Tyr Glu 
            20                   25                  30        
Leu Leu Glu Arg Val Glu Glu Pro Val Leu Gln Asn Gln Ile Arg Glu 
        35                   40                  45            
His Val Ile Ala Ile Glu Asp Ala Phe Val Asn Ser Gln Glu Trp Thr 
    50                   55                  60                
Leu Ser Arg Ser Val Pro Glu Leu Lys Val Gly Ile Val Gly Asn Leu 
65                   70                  75                  80
Ala Ser Gly Lys Ser Ala Leu Val His Arg Tyr Leu Thr Gly Thr Tyr 
                85                   90                  95    
Val Gln Glu Glu Ser Pro Glu Gly Gly Arg Phe Lys Lys Glu Ile Val 
            100                  105                110        
Val Asp Gly Gln Ser Tyr Leu Leu Leu Ile Arg Asp Glu Gly Gly Pro 
        115                  120                125            
Pro Glu Ala Gln Phe Ala Met Trp Val Asp Ala Val Ile Phe Val Phe 
    130                  135                140                
Ser Leu Glu Asp Glu Ile Ser Phe Gln Thr Val Tyr His Tyr Tyr Ser 
145                  150                155                  160
Arg Met Ala Asn Tyr Arg Asn Thr Ser Glu Ile Pro Leu Val Leu Val 
                165                  170                175    
Gly Thr Gln Asp Ala Ile Ser Ser Ala Asn Pro Arg Val Ile Asp Asp 
            180                  185                190        
Ala Arg Ala Arg Lys Leu Ser Asn Asp Leu Lys Arg Cys Thr Tyr Tyr 
        195                  200                205            
Glu Thr Cys Ala Thr Tyr Gly Leu Asn Val Glu Arg Val Phe Gln Asp 
    210                  215                220                
Val Ala Gln Lys Ile Val Ala Thr Arg Lys Lys Gln Gln Leu Ser Ile 
225                  230                235                  240
Gly Pro Cys Lys Ser Leu Pro Asn Ser Pro Ser His Ser Ser Val Cys 
                245                  250                255    
Ser Ala Gln Val Ser Ala Val His Ile Ser Gln Thr Ser Asn Gly Gly 
            260                  265                270        
Gly Ser Leu Ser Asp Tyr Ser Ser Ser Val Pro Ser Thr Pro Ser Thr 
        275                  280                285            
Ser Gln Lys Glu Leu Arg Ile Asp Val Pro Pro Thr Ala Asn Thr Pro 
    290                  295                300                
Thr Pro Val Arg Lys Gln Ser Lys Arg Arg Ser Asn Leu Phe Thr Ser 
305                  310                315                  320
Arg Lys Gly Ser Asp Pro Asp Lys Glu Lys Lys Gly Leu Glu Ser Arg 
                325                  330                335    
Ala Asp Ser Ile Gly Ser Gly Arg Ala Ile Pro Ile Lys Gln Gly Met 
            340                  345                350        
Leu Leu Lys Arg Ser Gly Lys Ser Leu Asn Lys Glu Trp Lys Lys Lys 
        355                  360                365            
Tyr Val Thr Leu Cys Asp Asn Gly Val Leu Thr Tyr His Pro Ser Leu 
    370                  375                380                
His Asp Tyr Met Gln Asn Val His Gly Lys Glu Ile Asp Leu Leu Arg 
385                  390                395                  400
Thr Thr Val Lys Val Pro Gly Lys Arg Pro Pro Arg Ala Thr Ser Ala 
                405                  410                415    
Cys Ala Pro Ile Ser Ser Pro Lys Thr Asn Gly Leu Ser Lys Asp Met 
            420                  425                430        
Ser Ser Leu His Ile Ser Pro Asn Ser Gly Asn Val Thr Ser Ala Ser 
        435                  440                445            
Gly Ser Gln Met Ala Ser Gly Ile Ser Leu Val Ser Phe Asn Ser Arg 
    450                  455                460                
Pro Asp Gly Met His Gln Arg Ser Tyr Ser Val Ser Ser Ala Asp Gln 
465                  470                475                  480
Trp Ser Glu Ala Thr Val Ile Ala Asn Ser Ala Ile Ser Ser Asp Thr 
                485                  490                495    
Gly Leu Gly Asp Ser Val Cys Ser Ser Pro Ser Ile Ser Ser Thr Thr 
            500                  505                510        
Ser Pro Lys Leu Asp Pro Pro Pro Ser Pro His Ala Asn Arg Lys Lys 
        515                  520                525            
His Arg Arg Lys Lys Ser Thr Ser Asn Phe Lys Ala Asp Gly Leu Ser 
    530                  535                540                
Gly Thr Ala Glu Glu Gln Glu Glu Asn Phe Glu Phe Ile Ile Val Ser 
545                  550                555                  560
Leu Thr Gly Gln Thr Trp His Phe Glu Ala Thr Thr Tyr Glu Glu Arg 
                565                  570                575    
Asp Ala Trp Val Gln Ala Ile Glu Ser Gln Ile Leu Ala Ser Leu Gln 
            580                  585                590        
Ser Cys Glu Ser Ser Lys Asn Lys Ser Arg Leu Thr Ser Gln Ser Glu 
        595                  600                605            
Ala Met Ala Leu Gln Ser Ile Arg Asn Met Arg Gly Asn Ser His Cys 
    610                  615                620                
Val Asp Cys Glu Thr Gln Asn Pro Asn Trp Ala Ser Leu Asn Leu Gly 
625                  630                635                  640
Ala Leu Met Cys Ile Glu Cys Ser Gly Ile His Arg Asn Leu Gly Thr 
                645                  650                655    
His Leu Ser Arg Val Arg Ser Leu Asp Leu Asp Asp Trp Pro Val Glu 
            660                  665                670        
Leu Ile Lys Val Met Ser Ser Ile Gly Asn Glu Leu Ala Asn Ser Val 
        675                  680                685            
Trp Glu Glu Ser Ser Gln Gly Arg Thr Lys Pro Ser Val Asp Ser Thr 
    690                  695                700                
Arg Glu Glu Lys Glu Arg Trp Ile Arg Ala Lys Tyr Glu Gln Lys Leu 
705                  710                715                  720
Phe Leu Ala Pro Leu Pro Cys Thr Glu Leu Ser Leu Gly Gln His Leu 
                725                  730                735    
Leu Arg Ala Thr Ala Asp Glu Asp Leu Arg Thr Ala Ile Leu Leu Leu 
            740                  745                750        
Ala His Gly Ser Arg Asp Glu Val Asn Glu Thr Cys Gly Glu Gly Asp 
        755                  760                765            
Gly Arg Thr Ala Leu His Leu Ala Cys Arg Lys Gly Asn Val Val Leu 
    770                  775                780                
Ala Gln Leu Leu Ile Trp Tyr Gly Val Asp Val Thr Ala Arg Asp Ala 
785                  790                795                  800
His Gly Asn Thr Ala Leu Ala Tyr Ala Arg Gln Ala Ser Ser Gln Glu 
                805                  810                815    
Cys Ile Asp Val Leu Leu Gln Tyr Gly Cys Pro Asp Glu Arg Phe Val 
            820                  825                830        
Leu Met Ala Thr Pro Asn Leu Ser Arg Arg Asn Asn Asn Arg Asn Asn 
        835                  840                845            
Ser Ser Gly Arg Val Pro Thr Ile Ile 
    850                  855        

<210> 23
<211> 163
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..163
<223> /mol_type="DNA"
      /note="AGAP1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 23
atgaactacc agcagcagct ggccaactcg gctgccatcc gggccgagat ccagcgcttc     60

gagtcggtcc accccaacat ctactccatc tacgagctgc tggagcgcgt ggaggagccg    120

gtgctgcaga accagatccg ggagcacgtc atcgccatcg aag                      163


<210> 24
<211> 54
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..54
<223> /mol_type="protein"
      /note="AGAP1 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 24
Met Asn Tyr Gln Gln Gln Leu Ala Asn Ser Ala Ala Ile Arg Ala Glu 
1               5                   10                   15    
Ile Gln Arg Phe Glu Ser Val His Pro Asn Ile Tyr Ser Ile Tyr Glu 
            20                   25                  30        
Leu Leu Glu Arg Val Glu Glu Pro Val Leu Gln Asn Gln Ile Arg Glu 
        35                   40                  45            
His Val Ile Ala Ile Glu 
    50                

<210> 25
<211> 987
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..987
<223> /mol_type="DNA"
      /note="IGFBP2 (CCDS nucleotide sequence of IGFBP2 (Gene ID: 3485))"
      /organism="Homo sapiens"

<400> 25
atgctgccga gagtgggctg ccccgcgctg ccgctgccgc cgccgccgct gctgccgctg      60

ctgccgctgc tgctgctgct actgggcgcg agtggcggcg gcggcggggc gcgcgcggag     120

gtgctgttcc gctgcccgcc ctgcacaccc gagcgcctgg ccgcctgcgg gcccccgccg     180

gttgcgccgc ccgccgcggt ggccgcagtg gccggaggcg cccgcatgcc atgcgcggag     240

ctcgtccggg agccgggctg cggctgctgc tcggtgtgcg cccggctgga gggcgaggcg     300

tgcggcgtct acaccccgcg ctgcggccag gggctgcgct gctatcccca cccgggctcc     360

gagctgcccc tgcaggcgct ggtcatgggc gagggcactt gtgagaagcg ccgggacgcc     420

gagtatggcg ccagcccgga gcaggttgca gacaatggcg atgaccactc agaaggaggc     480

ctggtggaga accacgtgga cagcaccatg aacatgttgg gcgggggagg cagtgctggc     540

cggaagcccc tcaagtcggg tatgaaggag ctggccgtgt tccgggagaa ggtcactgag     600

cagcaccggc agatgggcaa gggtggcaag catcaccttg gcctggagga gcccaagaag     660

ctgcgaccac cccctgccag gactccctgc caacaggaac tggaccaggt cctggagcgg     720

atctccacca tgcgccttcc ggatgagcgg ggccctctgg agcacctcta ctccctgcac     780

atccccaact gtgacaagca tggcctgtac aacctcaaac agtgcaagat gtctctgaac     840

gggcagcgtg gggagtgctg gtgtgtgaac cccaacaccg ggaagctgat ccagggagcc     900

cccaccatcc ggggggaccc cgagtgtcat ctcttctaca atgagcagca ggaggctcgc     960

ggggtgcaca cccagcggat gcagtag                                         987


<210> 26
<211> 325
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..325
<223> /mol_type="protein"
      /note="IGFBP2 (full-length protein)"
      /organism="Homo sapiens"

<400> 26
Met Leu Pro Arg Val Gly Cys Pro Ala Leu Pro Leu Pro Pro Pro Pro 
1               5                   10                   15    
Leu Leu Pro Leu Leu Leu Leu Leu Leu Gly Ala Ser Gly Gly Gly Gly 
            20                   25                  30        
Gly Ala Arg Ala Glu Val Leu Phe Arg Cys Pro Pro Cys Thr Pro Glu 
        35                   40                  45            
Arg Leu Ala Ala Cys Gly Pro Pro Pro Val Ala Pro Pro Ala Ala Val 
    50                   55                  60                
Ala Ala Val Ala Gly Gly Ala Arg Met Pro Cys Ala Glu Leu Val Arg 
65                   70                  75                  80
Glu Pro Gly Cys Gly Cys Cys Ser Val Cys Ala Arg Leu Glu Gly Glu 
                85                   90                  95    
Ala Cys Gly Val Tyr Thr Pro Arg Cys Gly Gln Gly Leu Arg Cys Tyr 
            100                  105                110        
Pro His Pro Gly Ser Glu Leu Pro Leu Gln Ala Leu Val Met Gly Glu 
        115                  120                125            
Gly Thr Cys Glu Lys Arg Arg Asp Ala Glu Tyr Gly Ala Ser Pro Glu 
    130                  135                140                
Gln Val Ala Asp Asn Gly Asp Asp His Ser Glu Gly Gly Leu Val Glu 
145                  150                155                  160
Asn His Val Asp Ser Thr Met Asn Met Leu Gly Gly Gly Gly Ser Ala 
                165                  170                175    
Gly Arg Lys Pro Leu Lys Ser Gly Met Lys Glu Leu Ala Val Phe Arg 
            180                  185                190        
Glu Lys Val Thr Glu Gln His Arg Gln Met Gly Lys Gly Gly Lys His 
        195                  200                205            
His Leu Gly Leu Glu Glu Pro Lys Lys Leu Arg Pro Pro Pro Ala Arg 
    210                  215                220                
Thr Pro Cys Gln Gln Glu Leu Asp Gln Val Leu Glu Arg Ile Ser Thr 
225                  230                235                  240
Met Arg Leu Pro Asp Glu Arg Gly Pro Leu Glu His Leu Tyr Ser Leu 
                245                  250                255    
His Ile Pro Asn Cys Asp Lys His Gly Leu Tyr Asn Leu Lys Gln Cys 
            260                  265                270        
Lys Met Ser Leu Asn Gly Gln Arg Gly Glu Cys Trp Cys Val Asn Pro 
        275                  280                285            
Asn Thr Gly Lys Leu Ile Gln Gly Ala Pro Thr Ile Arg Gly Asp Pro 
    290                  295                300                
Glu Cys His Leu Phe Tyr Asn Glu Gln Gln Glu Ala Arg Gly Val His 
305                  310                315                  320
Thr Gln Arg Met Gln 
                325

<210> 27
<211> 536
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..536
<223> /mol_type="DNA"
      /note="IGFBP2 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 27
acaatggcga tgaccactca gaaggaggcc tggtggagaa ccacgtggac agcaccatga      60

acatgttggg cgggggaggc agtgctggcc ggaagcccct caagtcgggt atgaaggagc     120

tggccgtgtt ccgggagaag gtcactgagc agcaccggca gatgggcaag ggtggcaagc     180

atcaccttgg cctggaggag cccaagaagc tgcgaccacc ccctgccagg actccctgcc     240

aacaggaact ggaccaggtc ctggagcgga tctccaccat gcgccttccg gatgagcggg     300

gccctctgga gcacctctac tccctgcaca tccccaactg tgacaagcat ggcctgtaca     360

acctcaaaca gtgcaagatg tctctgaacg ggcagcgtgg ggagtgctgg tgtgtgaacc     420

ccaacaccgg gaagctgatc cagggagccc ccaccatccg gggggacccc gagtgtcatc     480

tcttctacaa tgagcagcag gaggctcgcg gggtgcacac ccagcggatg cagtag         536


<210> 28
<211> 178
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..178
<223> /mol_type="protein"
      /note="IGFBP2 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 28
Asp Asn Gly Asp Asp His Ser Glu Gly Gly Leu Val Glu Asn His Val 
1               5                   10                   15    
Asp Ser Thr Met Asn Met Leu Gly Gly Gly Gly Ser Ala Gly Arg Lys 
            20                   25                  30        
Pro Leu Lys Ser Gly Met Lys Glu Leu Ala Val Phe Arg Glu Lys Val 
        35                   40                  45            
Thr Glu Gln His Arg Gln Met Gly Lys Gly Gly Lys His His Leu Gly 
    50                   55                  60                
Leu Glu Glu Pro Lys Lys Leu Arg Pro Pro Pro Ala Arg Thr Pro Cys 
65                   70                  75                  80
Gln Gln Glu Leu Asp Gln Val Leu Glu Arg Ile Ser Thr Met Arg Leu 
                85                   90                  95    
Pro Asp Glu Arg Gly Pro Leu Glu His Leu Tyr Ser Leu His Ile Pro 
            100                  105                110        
Asn Cys Asp Lys His Gly Leu Tyr Asn Leu Lys Gln Cys Lys Met Ser 
        115                  120                125            
Leu Asn Gly Gln Arg Gly Glu Cys Trp Cys Val Asn Pro Asn Thr Gly 
    130                  135                140                
Lys Leu Ile Gln Gly Ala Pro Thr Ile Arg Gly Asp Pro Glu Cys His 
145                  150                155                  160
Leu Phe Tyr Asn Glu Gln Gln Glu Ala Arg Gly Val His Thr Gln Arg 
                165                  170                175    
Met Gln 
        

<210> 29
<211> 699
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..699
<223> /mol_type="DNA"
      /note="AGAP1-IGFBP2 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 29
atgaactacc agcagcagct ggccaactcg gctgccatcc gggccgagat ccagcgcttc      60

gagtcggtcc accccaacat ctactccatc tacgagctgc tggagcgcgt ggaggagccg     120

gtgctgcaga accagatccg ggagcacgtc atcgccatcg aagacaatgg cgatgaccac     180

tcagaaggag gcctggtgga gaaccacgtg gacagcacca tgaacatgtt gggcggggga     240

ggcagtgctg gccggaagcc cctcaagtcg ggtatgaagg agctggccgt gttccgggag     300

aaggtcactg agcagcaccg gcagatgggc aagggtggca agcatcacct tggcctggag     360

gagcccaaga agctgcgacc accccctgcc aggactccct gccaacagga actggaccag     420

gtcctggagc ggatctccac catgcgcctt ccggatgagc ggggccctct ggagcacctc     480

tactccctgc acatccccaa ctgtgacaag catggcctgt acaacctcaa acagtgcaag     540

atgtctctga acgggcagcg tggggagtgc tggtgtgtga accccaacac cgggaagctg     600

atccagggag cccccaccat ccggggggac cccgagtgtc atctcttcta caatgagcag     660

caggaggctc gcggggtgca cacccagcgg atgcagtag                            699


<210> 30
<211> 232
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..232
<223> /mol_type="protein"
      /note="AGAP1-IGFBP2 (preferred fusion-protein)"
      /organism="artificial sequences"

<400> 30
Met Asn Tyr Gln Gln Gln Leu Ala Asn Ser Ala Ala Ile Arg Ala Glu 
1               5                   10                   15    
Ile Gln Arg Phe Glu Ser Val His Pro Asn Ile Tyr Ser Ile Tyr Glu 
            20                   25                  30        
Leu Leu Glu Arg Val Glu Glu Pro Val Leu Gln Asn Gln Ile Arg Glu 
        35                   40                  45            
His Val Ile Ala Ile Glu Asp Asn Gly Asp Asp His Ser Glu Gly Gly 
    50                   55                  60                
Leu Val Glu Asn His Val Asp Ser Thr Met Asn Met Leu Gly Gly Gly 
65                   70                  75                  80
Gly Ser Ala Gly Arg Lys Pro Leu Lys Ser Gly Met Lys Glu Leu Ala 
                85                   90                  95    
Val Phe Arg Glu Lys Val Thr Glu Gln His Arg Gln Met Gly Lys Gly 
            100                  105                110        
Gly Lys His His Leu Gly Leu Glu Glu Pro Lys Lys Leu Arg Pro Pro 
        115                  120                125            
Pro Ala Arg Thr Pro Cys Gln Gln Glu Leu Asp Gln Val Leu Glu Arg 
    130                  135                140                
Ile Ser Thr Met Arg Leu Pro Asp Glu Arg Gly Pro Leu Glu His Leu 
145                  150                155                  160
Tyr Ser Leu His Ile Pro Asn Cys Asp Lys His Gly Leu Tyr Asn Leu 
                165                  170                175    
Lys Gln Cys Lys Met Ser Leu Asn Gly Gln Arg Gly Glu Cys Trp Cys 
            180                  185                190        
Val Asn Pro Asn Thr Gly Lys Leu Ile Gln Gly Ala Pro Thr Ile Arg 
        195                  200                205            
Gly Asp Pro Glu Cys His Leu Phe Tyr Asn Glu Gln Gln Glu Ala Arg 
    210                  215                220                
Gly Val His Thr Gln Arg Met Gln 
225                  230        

<210> 31
<211> 804
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..804
<223> /mol_type="DNA"
      /note="POMC (CCDS nucleotide sequence of POMC (Gene ID: 5443) plus 3’UTR)"
      /organism="Homo sapiens"

<400> 31
atgccgagat cgtgctgcag ccgctcgggg gccctgttgc tggccttgct gcttcaggcc      60

tccatggaag tgcgtggctg gtgcctggag agcagccagt gtcaggacct caccacggaa     120

agcaacctgc tggagtgcat ccgggcctgc aagcccgacc tctcggccga gactcccatg     180

ttcccgggaa atggcgacga gcagcctctg accgagaacc cccggaagta cgtcatgggc     240

cacttccgct gggaccgatt cggccgccgc aacagcagca gcagcggcag cagcggcgca     300

gggcagaagc gcgaggacgt ctcagcgggc gaagactgcg gcccgctgcc tgagggcggc     360

cccgagcccc gcagcgatgg tgccaagccg ggcccgcgcg agggcaagcg ctcctactcc     420

atggagcact tccgctgggg caagccggtg ggcaagaagc ggcgcccagt gaaggtgtac     480

cctaacggcg ccgaggacga gtcggccgag gccttccccc tggagttcaa gagggagctg     540

actggccagc gactccggga gggagatggc cccgacggcc ctgccgatga cggcgcaggg     600

gcccaggccg acctggagca cagcctgctg gtggcggccg agaagaagga cgagggcccc     660

tacaggatgg agcacttccg ctggggcagc ccgcccaagg acaagcgcta cggcggtttc     720

atgacctccg agaagagcca gacgcccctg gtgacgctgt tcaaaaacgc catcatcaag     780

aacgcctaca agaagggcga gtga                                            804


<210> 32
<211> 267
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..267
<223> /mol_type="protein"
      /note="POMC (full-length protein)"
      /organism="Homo sapiens"

<400> 32
Met Pro Arg Ser Cys Cys Ser Arg Ser Gly Ala Leu Leu Leu Ala Leu 
1               5                   10                   15    
Leu Leu Gln Ala Ser Met Glu Val Arg Gly Trp Cys Leu Glu Ser Ser 
            20                   25                  30        
Gln Cys Gln Asp Leu Thr Thr Glu Ser Asn Leu Leu Glu Cys Ile Arg 
        35                   40                  45            
Ala Cys Lys Pro Asp Leu Ser Ala Glu Thr Pro Met Phe Pro Gly Asn 
    50                   55                  60                
Gly Asp Glu Gln Pro Leu Thr Glu Asn Pro Arg Lys Tyr Val Met Gly 
65                   70                  75                  80
His Phe Arg Trp Asp Arg Phe Gly Arg Arg Asn Ser Ser Ser Ser Gly 
                85                   90                  95    
Ser Ser Gly Ala Gly Gln Lys Arg Glu Asp Val Ser Ala Gly Glu Asp 
            100                  105                110        
Cys Gly Pro Leu Pro Glu Gly Gly Pro Glu Pro Arg Ser Asp Gly Ala 
        115                  120                125            
Lys Pro Gly Pro Arg Glu Gly Lys Arg Ser Tyr Ser Met Glu His Phe 
    130                  135                140                
Arg Trp Gly Lys Pro Val Gly Lys Lys Arg Arg Pro Val Lys Val Tyr 
145                  150                155                  160
Pro Asn Gly Ala Glu Asp Glu Ser Ala Glu Ala Phe Pro Leu Glu Phe 
                165                  170                175    
Lys Arg Glu Leu Thr Gly Gln Arg Leu Arg Glu Gly Asp Gly Pro Asp 
            180                  185                190        
Gly Pro Ala Asp Asp Gly Ala Gly Ala Gln Ala Asp Leu Glu His Ser 
        195                  200                205            
Leu Leu Val Ala Ala Glu Lys Lys Asp Glu Gly Pro Tyr Arg Met Glu 
    210                  215                220                
His Phe Arg Trp Gly Ser Pro Pro Lys Asp Lys Arg Tyr Gly Gly Phe 
225                  230                235                  240
Met Thr Ser Glu Lys Ser Gln Thr Pro Leu Val Thr Leu Phe Lys Asn 
                245                  250                255    
Ala Ile Ile Lys Asn Ala Tyr Lys Lys Gly Glu 
            260                  265        

<210> 33
<211> 2214
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..2214
<223> /mol_type="DNA"
      /note="POMC (preferred gene fragment)"
      /organism="artificial sequences"

<400> 33
atgaatggtg attctcgtgc tgcggtggtg acctcaccac ccccgaccac agcccctcac      60

aaggagaggt acttcgaccg agtagatgag aacaacccag agtacttgag ggagaggaac     120

atggcaccag accttcgcca ggacttcaac atgatggagc aaaagaagag ggtgtccatg     180

attctgcaaa gccctgcttt ctgtgaagaa ttggaatcaa tgatacagga gcaatttaag     240

aaggggaaga accccacagg cctattggca ttacagcaga ttgcagattt tatgaccacg     300

aatgtaccaa atgtctaccc agcagctccg caaggaggga tggctgcctt aaacatgagt     360

cttggtatgg tgactcctgt gaacgatctt agaggatctg attctattgc gtatgacaaa     420

ggagagaagt tattacggtg taaattggca gcgttttata gactagcaga tctctttggg     480

tggtctcagc ttatctacaa tcatatcaca accagagtga actccgagca ggaacacttc     540

ctcattgtcc cttttgggct tctttacagt gaagtgactg catccagttt ggttaagatc     600

aatctacaag gagatatagt agatcgtgga agcactaatc tgggagtgaa tcaggccggc     660

ttcaccttac actctgcaat ttatgctgca cgcccggacg tgaagtgcgt cgtgcacatt     720

cacaccccag caggggctgc ggtctctgca atgaaatgtg gcctcttgcc aatctccccg     780

gaggcgcttt cccttggaga agtggcttat catgactacc atggcattct ggttgatgaa     840

gaggaaaaag ttttgattca gaaaaatctg gggcctaaaa gcaaggttct tattctccgg     900

aaccatgggc tcgtgtcagt tggagagagc gttgaggagg ccttctatta catccataac     960

cttgtggttg cctgtgagat ccaggttcga actctggcca gtgcaggagg accagacaac    1020

ttagtcctgc tgaatcctga gaagtacaaa gccaagtccc gttccccagg gtctccggta    1080

ggggaaggca ctggatcgcc tcccaagtgg cagattggtg agcaggaatt tgaagccctc    1140

atgcggatgc tcgataatct gggctacaga actggctacc cttatcgata ccctgctctg    1200

agagagaagt ctaaaaaata cagcgatgtg gaggttcctg ctagtgtcac aggttactcc    1260

tttgctagtg acggtgattc gggcacttgc tccccactca gacacagttt tcagaagcag    1320

cagcgggaga agacaagatg gctgaactct ggccggggcg acgaagcttc cgaggaaggg    1380

cagaatggaa gcagtcccaa gtcgaagact aagtggacta aagaggatgg acatagaact    1440

tccacctctg ctgtccctaa cctgtttgtt ccattgaaca ctaacccaaa agaggtccag    1500

gagatgagga acaagatccg agagcagaat ttacaggaca ttaagacggc tggccctcag    1560

tcccaggttt tgtgtggtgt agtgatggac aggagcctcg tccagggaga gctggtgacg    1620

gcctccaagg ccatcattga aaaggagtac cagccccacg tcattgtgag caccacgggc    1680

cccaacccct tcaccacact cacagaccgt gagctggagg agtaccgcag ggaggtggag    1740

aggaagcaga agggctctga agagaatctg gacgaggcta gagaacagaa agaaaagagt    1800

cctccagacc agcctgcggt cccccacccg cctcccagca ctcccatcaa gctggaggaa    1860

gaccttgtgc cggagccgac tactggagat gacagtgatg ctgccacctt taagccaact    1920

ctccccgatc tgtcccctga tgaaccttca gaagcactcg gcttcccaat gttagagaag    1980

gaggaggaag cccatagacc cccaagcccc actgaggccc ctactgaggc cagccccgag    2040

ccagccccag acccagcccc ggtggctgaa gaggctgccc cctcagctgt cgaggagggg    2100

gccgccgcgg accctggcag cgatgggtct ccaggcaagt ccccgtccaa aaagaagaag    2160

aagttccgta ccccgtcctt tctgaagaag agcaagaaga agagtgactc ctga          2214


<210> 34
<211> 737
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..737
<223> /mol_type="protein"
      /note="POMC (preferred protein fragment)"
      /organism="artificial sequences"

<400> 34
Met Asn Gly Asp Ser Arg Ala Ala Val Val Thr Ser Pro Pro Pro Thr 
1               5                   10                   15    
Thr Ala Pro His Lys Glu Arg Tyr Phe Asp Arg Val Asp Glu Asn Asn 
            20                   25                  30        
Pro Glu Tyr Leu Arg Glu Arg Asn Met Ala Pro Asp Leu Arg Gln Asp 
        35                   40                  45            
Phe Asn Met Met Glu Gln Lys Lys Arg Val Ser Met Ile Leu Gln Ser 
    50                   55                  60                
Pro Ala Phe Cys Glu Glu Leu Glu Ser Met Ile Gln Glu Gln Phe Lys 
65                   70                  75                  80
Lys Gly Lys Asn Pro Thr Gly Leu Leu Ala Leu Gln Gln Ile Ala Asp 
                85                   90                  95    
Phe Met Thr Thr Asn Val Pro Asn Val Tyr Pro Ala Ala Pro Gln Gly 
            100                  105                110        
Gly Met Ala Ala Leu Asn Met Ser Leu Gly Met Val Thr Pro Val Asn 
        115                  120                125            
Asp Leu Arg Gly Ser Asp Ser Ile Ala Tyr Asp Lys Gly Glu Lys Leu 
    130                  135                140                
Leu Arg Cys Lys Leu Ala Ala Phe Tyr Arg Leu Ala Asp Leu Phe Gly 
145                  150                155                  160
Trp Ser Gln Leu Ile Tyr Asn His Ile Thr Thr Arg Val Asn Ser Glu 
                165                  170                175    
Gln Glu His Phe Leu Ile Val Pro Phe Gly Leu Leu Tyr Ser Glu Val 
            180                  185                190        
Thr Ala Ser Ser Leu Val Lys Ile Asn Leu Gln Gly Asp Ile Val Asp 
        195                  200                205            
Arg Gly Ser Thr Asn Leu Gly Val Asn Gln Ala Gly Phe Thr Leu His 
    210                  215                220                
Ser Ala Ile Tyr Ala Ala Arg Pro Asp Val Lys Cys Val Val His Ile 
225                  230                235                  240
His Thr Pro Ala Gly Ala Ala Val Ser Ala Met Lys Cys Gly Leu Leu 
                245                  250                255    
Pro Ile Ser Pro Glu Ala Leu Ser Leu Gly Glu Val Ala Tyr His Asp 
            260                  265                270        
Tyr His Gly Ile Leu Val Asp Glu Glu Glu Lys Val Leu Ile Gln Lys 
        275                  280                285            
Asn Leu Gly Pro Lys Ser Lys Val Leu Ile Leu Arg Asn His Gly Leu 
    290                  295                300                
Val Ser Val Gly Glu Ser Val Glu Glu Ala Phe Tyr Tyr Ile His Asn 
305                  310                315                  320
Leu Val Val Ala Cys Glu Ile Gln Val Arg Thr Leu Ala Ser Ala Gly 
                325                  330                335    
Gly Pro Asp Asn Leu Val Leu Leu Asn Pro Glu Lys Tyr Lys Ala Lys 
            340                  345                350        
Ser Arg Ser Pro Gly Ser Pro Val Gly Glu Gly Thr Gly Ser Pro Pro 
        355                  360                365            
Lys Trp Gln Ile Gly Glu Gln Glu Phe Glu Ala Leu Met Arg Met Leu 
    370                  375                380                
Asp Asn Leu Gly Tyr Arg Thr Gly Tyr Pro Tyr Arg Tyr Pro Ala Leu 
385                  390                395                  400
Arg Glu Lys Ser Lys Lys Tyr Ser Asp Val Glu Val Pro Ala Ser Val 
                405                  410                415    
Thr Gly Tyr Ser Phe Ala Ser Asp Gly Asp Ser Gly Thr Cys Ser Pro 
            420                  425                430        
Leu Arg His Ser Phe Gln Lys Gln Gln Arg Glu Lys Thr Arg Trp Leu 
        435                  440                445            
Asn Ser Gly Arg Gly Asp Glu Ala Ser Glu Glu Gly Gln Asn Gly Ser 
    450                  455                460                
Ser Pro Lys Ser Lys Thr Lys Trp Thr Lys Glu Asp Gly His Arg Thr 
465                  470                475                  480
Ser Thr Ser Ala Val Pro Asn Leu Phe Val Pro Leu Asn Thr Asn Pro 
                485                  490                495    
Lys Glu Val Gln Glu Met Arg Asn Lys Ile Arg Glu Gln Asn Leu Gln 
            500                  505                510        
Asp Ile Lys Thr Ala Gly Pro Gln Ser Gln Val Leu Cys Gly Val Val 
        515                  520                525            
Met Asp Arg Ser Leu Val Gln Gly Glu Leu Val Thr Ala Ser Lys Ala 
    530                  535                540                
Ile Ile Glu Lys Glu Tyr Gln Pro His Val Ile Val Ser Thr Thr Gly 
545                  550                555                  560
Pro Asn Pro Phe Thr Thr Leu Thr Asp Arg Glu Leu Glu Glu Tyr Arg 
                565                  570                575    
Arg Glu Val Glu Arg Lys Gln Lys Gly Ser Glu Glu Asn Leu Asp Glu 
            580                  585                590        
Ala Arg Glu Gln Lys Glu Lys Ser Pro Pro Asp Gln Pro Ala Val Pro 
        595                  600                605            
His Pro Pro Pro Ser Thr Pro Ile Lys Leu Glu Glu Asp Leu Val Pro 
    610                  615                620                
Glu Pro Thr Thr Gly Asp Asp Ser Asp Ala Ala Thr Phe Lys Pro Thr 
625                  630                635                  640
Leu Pro Asp Leu Ser Pro Asp Glu Pro Ser Glu Ala Leu Gly Phe Pro 
                645                  650                655    
Met Leu Glu Lys Glu Glu Glu Ala His Arg Pro Pro Ser Pro Thr Glu 
            660                  665                670        
Ala Pro Thr Glu Ala Ser Pro Glu Pro Ala Pro Asp Pro Ala Pro Val 
        675                  680                685            
Ala Glu Glu Ala Ala Pro Ser Ala Val Glu Glu Gly Ala Ala Ala Asp 
    690                  695                700                
Pro Gly Ser Asp Gly Ser Pro Gly Lys Ser Pro Ser Lys Lys Lys Lys 
705                  710                715                  720
Lys Phe Arg Thr Pro Ser Phe Leu Lys Lys Ser Lys Lys Lys Ser Asp 
                725                  730                735    
Ser 
    

<210> 35
<211> 2214
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2214
<223> /mol_type="DNA"
      /note="ADD1 (CCDS nucleotide sequence of ADD1 (Gene ID: 118))"
      /organism="Homo sapiens"

<400> 35
atgaatggtg attctcgtgc tgcggtggtg acctcaccac ccccgaccac agcccctcac      60

aaggagaggt acttcgaccg agtagatgag aacaacccag agtacttgag ggagaggaac     120

atggcaccag accttcgcca ggacttcaac atgatggagc aaaagaagag ggtgtccatg     180

attctgcaaa gccctgcttt ctgtgaagaa ttggaatcaa tgatacagga gcaatttaag     240

aaggggaaga accccacagg cctattggca ttacagcaga ttgcagattt tatgaccacg     300

aatgtaccaa atgtctaccc agcagctccg caaggaggga tggctgcctt aaacatgagt     360

cttggtatgg tgactcctgt gaacgatctt agaggatctg attctattgc gtatgacaaa     420

ggagagaagt tattacggtg taaattggca gcgttttata gactagcaga tctctttggg     480

tggtctcagc ttatctacaa tcatatcaca accagagtga actccgagca ggaacacttc     540

ctcattgtcc cttttgggct tctttacagt gaagtgactg catccagttt ggttaagatc     600

aatctacaag gagatatagt agatcgtgga agcactaatc tgggagtgaa tcaggccggc     660

ttcaccttac actctgcaat ttatgctgca cgcccggacg tgaagtgcgt cgtgcacatt     720

cacaccccag caggggctgc ggtctctgca atgaaatgtg gcctcttgcc aatctccccg     780

gaggcgcttt cccttggaga agtggcttat catgactacc atggcattct ggttgatgaa     840

gaggaaaaag ttttgattca gaaaaatctg gggcctaaaa gcaaggttct tattctccgg     900

aaccatgggc tcgtgtcagt tggagagagc gttgaggagg ccttctatta catccataac     960

cttgtggttg cctgtgagat ccaggttcga actctggcca gtgcaggagg accagacaac    1020

ttagtcctgc tgaatcctga gaagtacaaa gccaagtccc gttccccagg gtctccggta    1080

ggggaaggca ctggatcgcc tcccaagtgg cagattggtg agcaggaatt tgaagccctc    1140

atgcggatgc tcgataatct gggctacaga actggctacc cttatcgata ccctgctctg    1200

agagagaagt ctaaaaaata cagcgatgtg gaggttcctg ctagtgtcac aggttactcc    1260

tttgctagtg acggtgattc gggcacttgc tccccactca gacacagttt tcagaagcag    1320

cagcgggaga agacaagatg gctgaactct ggccggggcg acgaagcttc cgaggaaggg    1380

cagaatggaa gcagtcccaa gtcgaagact aagtggacta aagaggatgg acatagaact    1440

tccacctctg ctgtccctaa cctgtttgtt ccattgaaca ctaacccaaa agaggtccag    1500

gagatgagga acaagatccg agagcagaat ttacaggaca ttaagacggc tggccctcag    1560

tcccaggttt tgtgtggtgt agtgatggac aggagcctcg tccagggaga gctggtgacg    1620

gcctccaagg ccatcattga aaaggagtac cagccccacg tcattgtgag caccacgggc    1680

cccaacccct tcaccacact cacagaccgt gagctggagg agtaccgcag ggaggtggag    1740

aggaagcaga agggctctga agagaatctg gacgaggcta gagaacagaa agaaaagagt    1800

cctccagacc agcctgcggt cccccacccg cctcccagca ctcccatcaa gctggaggaa    1860

gaccttgtgc cggagccgac tactggagat gacagtgatg ctgccacctt taagccaact    1920

ctccccgatc tgtcccctga tgaaccttca gaagcactcg gcttcccaat gttagagaag    1980

gaggaggaag cccatagacc cccaagcccc actgaggccc ctactgaggc cagccccgag    2040

ccagccccag acccagcccc ggtggctgaa gaggctgccc cctcagctgt cgaggagggg    2100

gccgccgcgg accctggcag cgatgggtct ccaggcaagt ccccgtccaa aaagaagaag    2160

aagttccgta ccccgtcctt tctgaagaag agcaagaaga agagtgactc ctga          2214


<210> 36
<211> 737
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..737
<223> /mol_type="protein"
      /note="ADD1 (full-length protein)"
      /organism="Homo sapiens"

<400> 36
Met Asn Gly Asp Ser Arg Ala Ala Val Val Thr Ser Pro Pro Pro Thr 
1               5                   10                   15    
Thr Ala Pro His Lys Glu Arg Tyr Phe Asp Arg Val Asp Glu Asn Asn 
            20                   25                  30        
Pro Glu Tyr Leu Arg Glu Arg Asn Met Ala Pro Asp Leu Arg Gln Asp 
        35                   40                  45            
Phe Asn Met Met Glu Gln Lys Lys Arg Val Ser Met Ile Leu Gln Ser 
    50                   55                  60                
Pro Ala Phe Cys Glu Glu Leu Glu Ser Met Ile Gln Glu Gln Phe Lys 
65                   70                  75                  80
Lys Gly Lys Asn Pro Thr Gly Leu Leu Ala Leu Gln Gln Ile Ala Asp 
                85                   90                  95    
Phe Met Thr Thr Asn Val Pro Asn Val Tyr Pro Ala Ala Pro Gln Gly 
            100                  105                110        
Gly Met Ala Ala Leu Asn Met Ser Leu Gly Met Val Thr Pro Val Asn 
        115                  120                125            
Asp Leu Arg Gly Ser Asp Ser Ile Ala Tyr Asp Lys Gly Glu Lys Leu 
    130                  135                140                
Leu Arg Cys Lys Leu Ala Ala Phe Tyr Arg Leu Ala Asp Leu Phe Gly 
145                  150                155                  160
Trp Ser Gln Leu Ile Tyr Asn His Ile Thr Thr Arg Val Asn Ser Glu 
                165                  170                175    
Gln Glu His Phe Leu Ile Val Pro Phe Gly Leu Leu Tyr Ser Glu Val 
            180                  185                190        
Thr Ala Ser Ser Leu Val Lys Ile Asn Leu Gln Gly Asp Ile Val Asp 
        195                  200                205            
Arg Gly Ser Thr Asn Leu Gly Val Asn Gln Ala Gly Phe Thr Leu His 
    210                  215                220                
Ser Ala Ile Tyr Ala Ala Arg Pro Asp Val Lys Cys Val Val His Ile 
225                  230                235                  240
His Thr Pro Ala Gly Ala Ala Val Ser Ala Met Lys Cys Gly Leu Leu 
                245                  250                255    
Pro Ile Ser Pro Glu Ala Leu Ser Leu Gly Glu Val Ala Tyr His Asp 
            260                  265                270        
Tyr His Gly Ile Leu Val Asp Glu Glu Glu Lys Val Leu Ile Gln Lys 
        275                  280                285            
Asn Leu Gly Pro Lys Ser Lys Val Leu Ile Leu Arg Asn His Gly Leu 
    290                  295                300                
Val Ser Val Gly Glu Ser Val Glu Glu Ala Phe Tyr Tyr Ile His Asn 
305                  310                315                  320
Leu Val Val Ala Cys Glu Ile Gln Val Arg Thr Leu Ala Ser Ala Gly 
                325                  330                335    
Gly Pro Asp Asn Leu Val Leu Leu Asn Pro Glu Lys Tyr Lys Ala Lys 
            340                  345                350        
Ser Arg Ser Pro Gly Ser Pro Val Gly Glu Gly Thr Gly Ser Pro Pro 
        355                  360                365            
Lys Trp Gln Ile Gly Glu Gln Glu Phe Glu Ala Leu Met Arg Met Leu 
    370                  375                380                
Asp Asn Leu Gly Tyr Arg Thr Gly Tyr Pro Tyr Arg Tyr Pro Ala Leu 
385                  390                395                  400
Arg Glu Lys Ser Lys Lys Tyr Ser Asp Val Glu Val Pro Ala Ser Val 
                405                  410                415    
Thr Gly Tyr Ser Phe Ala Ser Asp Gly Asp Ser Gly Thr Cys Ser Pro 
            420                  425                430        
Leu Arg His Ser Phe Gln Lys Gln Gln Arg Glu Lys Thr Arg Trp Leu 
        435                  440                445            
Asn Ser Gly Arg Gly Asp Glu Ala Ser Glu Glu Gly Gln Asn Gly Ser 
    450                  455                460                
Ser Pro Lys Ser Lys Thr Lys Trp Thr Lys Glu Asp Gly His Arg Thr 
465                  470                475                  480
Ser Thr Ser Ala Val Pro Asn Leu Phe Val Pro Leu Asn Thr Asn Pro 
                485                  490                495    
Lys Glu Val Gln Glu Met Arg Asn Lys Ile Arg Glu Gln Asn Leu Gln 
            500                  505                510        
Asp Ile Lys Thr Ala Gly Pro Gln Ser Gln Val Leu Cys Gly Val Val 
        515                  520                525            
Met Asp Arg Ser Leu Val Gln Gly Glu Leu Val Thr Ala Ser Lys Ala 
    530                  535                540                
Ile Ile Glu Lys Glu Tyr Gln Pro His Val Ile Val Ser Thr Thr Gly 
545                  550                555                  560
Pro Asn Pro Phe Thr Thr Leu Thr Asp Arg Glu Leu Glu Glu Tyr Arg 
                565                  570                575    
Arg Glu Val Glu Arg Lys Gln Lys Gly Ser Glu Glu Asn Leu Asp Glu 
            580                  585                590        
Ala Arg Glu Gln Lys Glu Lys Ser Pro Pro Asp Gln Pro Ala Val Pro 
        595                  600                605            
His Pro Pro Pro Ser Thr Pro Ile Lys Leu Glu Glu Asp Leu Val Pro 
    610                  615                620                
Glu Pro Thr Thr Gly Asp Asp Ser Asp Ala Ala Thr Phe Lys Pro Thr 
625                  630                635                  640
Leu Pro Asp Leu Ser Pro Asp Glu Pro Ser Glu Ala Leu Gly Phe Pro 
                645                  650                655    
Met Leu Glu Lys Glu Glu Glu Ala His Arg Pro Pro Ser Pro Thr Glu 
            660                  665                670        
Ala Pro Thr Glu Ala Ser Pro Glu Pro Ala Pro Asp Pro Ala Pro Val 
        675                  680                685            
Ala Glu Glu Ala Ala Pro Ser Ala Val Glu Glu Gly Ala Ala Ala Asp 
    690                  695                700                
Pro Gly Ser Asp Gly Ser Pro Gly Lys Ser Pro Ser Lys Lys Lys Lys 
705                  710                715                  720
Lys Phe Arg Thr Pro Ser Phe Leu Lys Lys Ser Lys Lys Lys Ser Asp 
                725                  730                735    
Ser 
    

<210> 37
<211> 1050
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1050
<223> /mol_type="DNA"
      /note="ADD1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 37
aagccctgcg ctaacactgt cctgtccgga gcgaccctgg ctctgccagc gtccccggcc      60

acgtctgtgc tctgtccttg tgtaatggaa tgcaaaaaag ccaagccctc cgcctagagg     120

tcccctcacg tgaccagccc cgtgtagccc cgggctgacc cagtgtgtgc tcagcagccc     180

caccccaccc tgccccttgt cctctcagag cctcagcttc tgggggagac atgctctccc     240

cacagggggg aggcactaag tcatggtcct ggctggaagg tactgaaggc ttctgcagct     300

ttggctgcac gtcaccctcc tgagcctcac ctttcctgcc gtccctcctg ttgtgaaatc     360

accacattct gtctctgctt ggcttcccct ccaccctaaa gtctcaggtg acggactcag     420

actcctggct tcatgtggca ttctctctgc tcagtgatct cacttaaatc tatatacaaa     480

gccttggtcc cgtgaaaaca ctcgtgtgcc caccagcggc cttgaagagg caggtctggg     540

ccagatgctg ggcaggaaac cccagcggca gatgggcctg tgtgcaccca acgtgatgct     600

atgcatgtct gaccgacgat ccctcgacca gaatcagatt caggagctca gtttcttttt     660

cacttgggtc tctggattcc tgtcataggg aaggtatatc aggaggggaa gaggcctttc     720

tagaattttc tttgagcagg tttacaattt agcttacatt tttcgactgt gaacgtgaat     780

aggctgcttt ttgctttctt ctttccagac cccacagtag agcacttttc acttatttgg     840

gggaggcttc aggggactgt tctcacctta actcagccag aaagatgccc tagttgtgat     900

caaaggtacc ctcccccagg aggtcgaccc caaagcccct tgctctcccc tgccctgctg     960

ccgcctccca gcctgggggg tcgtggcaga taatcagcct cttaaagctg cctgtagtta    1020

ggaaataaaa cctttcaaat ttcacatcca                                     1050


<210> 38
<211> 269
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..269
<223> /mol_type="protein"
      /note="Novel Kinase Domain"
      /organism="artificial sequences"

<400> 38
Leu Val Leu Asp His His Val Lys Gly His Gly Gly Thr Arg Leu Tyr 
1               5                   10                   15    
Lys Cys Thr Asp Cys Ala Tyr Ser Thr Lys Asn Arg Gln Lys Ile Thr 
            20                   25                  30        
Trp His Ser Arg Ile His Thr Gly Glu Lys Pro Tyr His Cys His Leu 
        35                   40                  45            
Cys Pro Tyr Ala Cys Ala Asp Pro Ser Arg Leu Lys Tyr His Met Arg 
    50                   55                  60                
Ile His Lys Glu Glu Arg Lys Tyr Leu Cys Pro Glu Cys Gly Tyr Lys 
65                   70                  75                  80
Cys Lys Trp Val Asn Gln Leu Lys Tyr His Met Thr Lys His Thr Asp 
                85                   90                  95    
Ser Asp Glu Lys Val Leu Pro Val Ser Glu Leu Leu Asp Ile Ala Trp 
            100                  105                110        
Gln Val Ala Glu Gly Met Cys Tyr Leu Glu Ser Gln Asn Tyr Ile His 
        115                  120                125            
Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Gly Glu Asn Thr Leu Cys 
    130                  135                140                
Lys Val Gly Asp Phe Gly Leu Ala Arg Leu Ile Lys Glu Asp Val Tyr 
145                  150                155                  160
Leu Ser His Asp His Asn Ile Pro Tyr Lys Trp Thr Ala Pro Glu Ala 
                165                  170                175    
Leu Ser Arg Gly His Tyr Ser Thr Lys Ser Asp Val Trp Ser Phe Gly 
            180                  185                190        
Ile Leu Leu His Glu Met Phe Ser Arg Gly Gln Val Pro Tyr Pro Gly 
        195                  200                205            
Met Ser Asn His Glu Ala Phe Leu Arg Val Asp Ala Gly Tyr Arg Met 
    210                  215                220                
Pro Cys Pro Leu Glu Cys Pro Pro Ser Val His Lys Leu Met Leu Thr 
225                  230                235                  240
Cys Trp Cys Arg Asp Pro Glu Gln Arg Pro Cys Phe Lys Ala Leu Arg 
                245                  250                255    
Glu Arg Leu Ser Ser Phe Thr Ser Tyr Glu Asn Pro Thr 
            260                  265                

<210> 39
<211> 1407
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1407
<223> /mol_type="DNA"
      /note="POMC-ADD1 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 39
atgccgagat cgtgctgcag ccgctcgggg gccctgttgc tggccttgct gcttcaggcc      60

tccatggaag tgcgtggctg gtgcctggag agcagccagt gtcaggacct caccacggaa     120

agcaacctgc tggagtgcat ccgggcctgc aagcccgacc tctcggccga gactcccatg     180

ttcccgggaa atggcgacga gcagcctctg accgagaacc cccggaagta cgtcatgggc     240

cacttccgct gggaccgatt cggccgccgc aacagcagca gcagcggcag cagcggcgca     300

gggcagaagc gcgaggacgt ctcagcgggc gaagactgcg gcccgctgcc tgagggcggc     360

cccgagcccc gcagcgatgg tgccaagccg ggcccgcgcg agggcaagcg ctcctactcc     420

atggagcact tccgctgggg caagccggtg ggcaagaagc ggcgcccagt gaaggtgtac     480

cctaacggcg ccgaggacga gtcggccgag gccttccccc tggagttcaa gagggagctg     540

actggccagc gactccggga gggagatggc cccgacggcc ctgccgatga cggcgcaggg     600

gcccaggccg acctggagca cagcctgctg gtggcggccg agaagaagga cgagggcccc     660

tacaggatgg agcacttccg ctggggcagc ccgcccaagg acaagcgcta cggcggtttc     720

atgacctccg agaagagcca gacgcccctg gtgacgctgt tcaaaaacgc catcatcaag     780

aacgcctaca agaagggcga gtgagggccc ctcgacatca ccgtcattga tggagcctga     840

accgtgtgct cctcggcaga tgctgttgtt gttacttccc tccaagaggc tggaaaaggg     900

ctcagagctg ctgagcagga accggaggtg acccatttca ggaggtgccg gtaccagcct     960

gactaggtac aggcaagctt gtgtgggccc aacaggccct tggtagagct ggtgccagat    1020

gtgggctcag atcctgggca tgatgggccg agccactcgg atcccactga ttggccagcc    1080

gagcgagaac caggctgctg catggcactg accgccgctt ccagcttcct ctgagccgca    1140

gggcctgcta cgcgggcaag cgtgctgcct ctcttctgtg tcttttgttg ccaaggcaga    1200

atgaaaagtc cttaaccgtg gactcttcct ttatcccctc ctttacccca catatgcaat    1260

gacttttaat tttcactttt gtagtttaat cctttgtatt acaacatgaa tatagttgca    1320

tatatggaca ccgacttggg aggacaggtc ctgaatgtcc tttctccagt gtaacatgtt    1380

ttactcacaa ataaaattct ttcagca                                        1407


<210> 40
<211> 3615
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3615
<223> /mol_type="DNA"
      /note="FNDC3B (CCDS nucleotide sequence of FNDC3B (Gene ID:
      64778))"
      /organism="Homo sapiens"

<400> 40
atgtacgtca caatgatgat gaccgaccaa atccctctgg aactgccacc attgctgaac       60

ggagaggtag ccatgatgcc ccacttggtg aatggagatg cagctcagca ggttattctc      120

gttcaagtta atccaggtga gactttcaca ataagagcag aggatggaac acttcagtgc      180

attcaaggac ctgctgaagt tcccatgatg tcacccaatg gatccattcc tcccattcat      240

gtgcctccag gttatatctc acaggtgatt gaagatagta ctggagtccg ccgggtggtg      300

gtcacacccc agtctcctga gtgttatccc ccaagctacc cctcagccat gtctccaacc      360

catcatctcc ctccctatct gactcaccat ccacatttta ttcataactc acacacggct      420

tactacccac ctgttaccgg acctggagat atgccgcctc agttttttcc ccagcatcat      480

cttccccaca caatatatgg tgagcaagaa attataccat tttatggaat gtcaacctac      540

atcacccgag aagaccagta cagcaagcct ccgcacaaaa aactgaaaga ccgccagatc      600

gatcgccaga accgcctcaa cagccctcct tcttctatct acaaaagcag ctgcacaaca      660

gtatacaatg gctatgggaa gggccatagt ggtggaagtg gcggaggcgg cagcggtagt      720

ggtcccggaa ttaagaaaac agagcgacga gcaagaagca gcccaaagtc gaatgattca      780

gacttgcaag aatatgagtt ggaagtaaag agggtgcaag acattctttc gggaatagag      840

aaaccacagg tttctaatat tcaggcaaga gcagttgtgt tgtcctgggc tccccctgtt      900

ggactttcct gtggacccca cagtggtctt tccttcccct acagttacga ggtggcctta      960

tcagacaaag gacgagatgg aaaatacaag ataatttaca gtggagaaga attagaatgt     1020

aacctgaaag atcttagacc agcaacagat tatcatgtga gggtgtatgc catgtacaat     1080

tccgtaaagg gatcctgctc cgagcctgtt agcttcacca cccacagctg tgcacccgag     1140

tgtcctttcc cccctaagct ggcacatagg agcaaaagtt cactaaccct gcagtggaag     1200

gcaccaattg acaacggttc aaaaatcacc aactaccttt tagagtggga tgagggaaaa     1260

agaaatagtg gtttcagaca gtgcttcttc gggagccaga agcactgcaa gttgacaaag     1320

ctttgtccgg caatggggta cacattcagg ctggccgctc gaaacgacat tggtaccagt     1380

ggttatagcc aagaggtggt gtgctacaca ttaggaaata tccctcagat gccttctgca     1440

ccaaggctgg ttcgagctgg catcacatgg gtcacgttgc agtggagtaa gccagaaggc     1500

tgttcacccg aggaagtgat cacctacacc ttggaaattc aggaggatga aaatgataac     1560

cttttccacc caaaatacac tggagaggat ttaacctgta ctgtgaaaaa tctcaaaaga     1620

agcacacagt ataaattcag gctgactgct tctaatacgg aaggaaaaag ctgtccaagc     1680

gaagttcttg tttgtacgac gagtcctgac aggcctggac ctcctaccag accgcttgtc     1740

aaaggcccag ttacatctca tggctttagt gtcaaatggg atccccctaa ggacaatggt     1800

ggttcagaaa tcctcaagta cttgctagag attactgatg gaaattctga agcgaatcag     1860

tgggaagtgg cctacagtgg gtcggctacc gaatacacct tcacccactt gaaaccaggc     1920

actttgtaca aactccgagc atgctgcatc agtaccggcg gacacagcca gtgttctgaa     1980

agtctccctg ttcgcacact aagcattgca ccaggtcaat gtcgaccacc gagggttttg     2040

ggtagaccaa agcacaaaga agtccactta gagtgggatg ttcctgcatc ggaaagtggc     2100

tgtgaggtct cagagtacag cgtggagatg acggagcccg aagacgtagc ctcggaagtg     2160

taccatggcc cagagctgga gtgcaccgtc ggcaacctgc ttcctggaac cgtgtatcgc     2220

ttccgggtga gggctctgaa tgatggaggg tatggtccct attctgatgt ctcagaaatt     2280

accactgctg cagggcctcc tggacaatgc aaagcacctt gtatttcttg tacacctgat     2340

ggatgtgtct tagtgggttg ggagagtcct gatagttctg gtgctgacat ctcagagtac     2400

aggttggaat ggggagaaga tgaagaatcc ttagaactca tttatcatgg gacagacacc     2460

cgttttgaaa taagagacct gttgcctgct gcacagtatt gctgtagact acaggccttc     2520

aatcaagcag gggcagggcc gtacagtgaa cttgtccttt gccagacgcc agcgtctgcc     2580

cctgaccccg tctccactct ctgtgtcctg gaggaggagc cccttgatgc ctaccctgat     2640

tcaccttctg cgtgccttgt actgaactgg gaagagccgt gcaataacgg atctgaaatc     2700

cttgcttaca ccattgatct aggagacact agcattaccg tgggcaacac caccatgcat     2760

gttatgaaag atctccttcc agaaaccacc taccggatca gaattcaggc tataaatgaa     2820

attggagctg gaccatttag tcagttcatt aaagcaaaaa ctcggccatt accacccttg     2880

cctcctaggc tagaatgtgc tgctgctggt cctcagagcc tgaagctaaa atggggagac     2940

agtaactcca agacacatgc tgctgaggac attgtgtaca cactacagct ggaggacaga     3000

aacaagaggt ttatttcaat ctacagagga cccagccaca cctacaaggt ccagagactg     3060

acggaattca catgctactc cttcagaatc caggcagcaa gcgaggctgg agaagggccc     3120

ttctcagaaa cctatacctt cagcacaacc aaaagtgtcc cccccaccat caaagcacct     3180

cgagtaacac agttagaagg aaattcatgt gaaattttat gggagacggt accatcaatg     3240

aaaggtgacc ctgttaacta cattctgcag gtattggttg gaagagaatc tgagtacaaa     3300

caggtgtaca agggagaaga agccacattc caaatctcag gcctccagac caacacagac     3360

tacaggttcc gcgtatgtgc gtgtcgtcgc tgtttagaca cctctcagga gctaagcgga     3420

gccttcagcc cctctgcggc ttttgtatta caacgaagtg aggtcatgct tacaggggac     3480

atggggagct tagatgatcc caaaatgaag agcatgatgc ctactgatga acagtttgca     3540

gccatcattg tgcttggctt tgcaactttg tccattttat ttgcctttat attacagtac     3600

ttcttaatga agtaa                                                      3615


<210> 41
<211> 1204
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1204
<223> /mol_type="protein"
      /note="FNDC3B (full-length protein)"
      /organism="Homo sapiens"

<400> 41
Met Tyr Val Thr Met Met Met Thr Asp Gln Ile Pro Leu Glu Leu Pro 
1               5                   10                   15    
Pro Leu Leu Asn Gly Glu Val Ala Met Met Pro His Leu Val Asn Gly 
            20                   25                  30        
Asp Ala Ala Gln Gln Val Ile Leu Val Gln Val Asn Pro Gly Glu Thr 
        35                   40                  45            
Phe Thr Ile Arg Ala Glu Asp Gly Thr Leu Gln Cys Ile Gln Gly Pro 
    50                   55                  60                
Ala Glu Val Pro Met Met Ser Pro Asn Gly Ser Ile Pro Pro Ile His 
65                   70                  75                  80
Val Pro Pro Gly Tyr Ile Ser Gln Val Ile Glu Asp Ser Thr Gly Val 
                85                   90                  95    
Arg Arg Val Val Val Thr Pro Gln Ser Pro Glu Cys Tyr Pro Pro Ser 
            100                  105                110        
Tyr Pro Ser Ala Met Ser Pro Thr His His Leu Pro Pro Tyr Leu Thr 
        115                  120                125            
His His Pro His Phe Ile His Asn Ser His Thr Ala Tyr Tyr Pro Pro 
    130                  135                140                
Val Thr Gly Pro Gly Asp Met Pro Pro Gln Phe Phe Pro Gln His His 
145                  150                155                  160
Leu Pro His Thr Ile Tyr Gly Glu Gln Glu Ile Ile Pro Phe Tyr Gly 
                165                  170                175    
Met Ser Thr Tyr Ile Thr Arg Glu Asp Gln Tyr Ser Lys Pro Pro His 
            180                  185                190        
Lys Lys Leu Lys Asp Arg Gln Ile Asp Arg Gln Asn Arg Leu Asn Ser 
        195                  200                205            
Pro Pro Ser Ser Ile Tyr Lys Ser Ser Cys Thr Thr Val Tyr Asn Gly 
    210                  215                220                
Tyr Gly Lys Gly His Ser Gly Gly Ser Gly Gly Gly Gly Ser Gly Ser 
225                  230                235                  240
Gly Pro Gly Ile Lys Lys Thr Glu Arg Arg Ala Arg Ser Ser Pro Lys 
                245                  250                255    
Ser Asn Asp Ser Asp Leu Gln Glu Tyr Glu Leu Glu Val Lys Arg Val 
            260                  265                270        
Gln Asp Ile Leu Ser Gly Ile Glu Lys Pro Gln Val Ser Asn Ile Gln 
        275                  280                285            
Ala Arg Ala Val Val Leu Ser Trp Ala Pro Pro Val Gly Leu Ser Cys 
    290                  295                300                
Gly Pro His Ser Gly Leu Ser Phe Pro Tyr Ser Tyr Glu Val Ala Leu 
305                  310                315                  320
Ser Asp Lys Gly Arg Asp Gly Lys Tyr Lys Ile Ile Tyr Ser Gly Glu 
                325                  330                335    
Glu Leu Glu Cys Asn Leu Lys Asp Leu Arg Pro Ala Thr Asp Tyr His 
            340                  345                350        
Val Arg Val Tyr Ala Met Tyr Asn Ser Val Lys Gly Ser Cys Ser Glu 
        355                  360                365            
Pro Val Ser Phe Thr Thr His Ser Cys Ala Pro Glu Cys Pro Phe Pro 
    370                  375                380                
Pro Lys Leu Ala His Arg Ser Lys Ser Ser Leu Thr Leu Gln Trp Lys 
385                  390                395                  400
Ala Pro Ile Asp Asn Gly Ser Lys Ile Thr Asn Tyr Leu Leu Glu Trp 
                405                  410                415    
Asp Glu Gly Lys Arg Asn Ser Gly Phe Arg Gln Cys Phe Phe Gly Ser 
            420                  425                430        
Gln Lys His Cys Lys Leu Thr Lys Leu Cys Pro Ala Met Gly Tyr Thr 
        435                  440                445            
Phe Arg Leu Ala Ala Arg Asn Asp Ile Gly Thr Ser Gly Tyr Ser Gln 
    450                  455                460                
Glu Val Val Cys Tyr Thr Leu Gly Asn Ile Pro Gln Met Pro Ser Ala 
465                  470                475                  480
Pro Arg Leu Val Arg Ala Gly Ile Thr Trp Val Thr Leu Gln Trp Ser 
                485                  490                495    
Lys Pro Glu Gly Cys Ser Pro Glu Glu Val Ile Thr Tyr Thr Leu Glu 
            500                  505                510        
Ile Gln Glu Asp Glu Asn Asp Asn Leu Phe His Pro Lys Tyr Thr Gly 
        515                  520                525            
Glu Asp Leu Thr Cys Thr Val Lys Asn Leu Lys Arg Ser Thr Gln Tyr 
    530                  535                540                
Lys Phe Arg Leu Thr Ala Ser Asn Thr Glu Gly Lys Ser Cys Pro Ser 
545                  550                555                  560
Glu Val Leu Val Cys Thr Thr Ser Pro Asp Arg Pro Gly Pro Pro Thr 
                565                  570                575    
Arg Pro Leu Val Lys Gly Pro Val Thr Ser His Gly Phe Ser Val Lys 
            580                  585                590        
Trp Asp Pro Pro Lys Asp Asn Gly Gly Ser Glu Ile Leu Lys Tyr Leu 
        595                  600                605            
Leu Glu Ile Thr Asp Gly Asn Ser Glu Ala Asn Gln Trp Glu Val Ala 
    610                  615                620                
Tyr Ser Gly Ser Ala Thr Glu Tyr Thr Phe Thr His Leu Lys Pro Gly 
625                  630                635                  640
Thr Leu Tyr Lys Leu Arg Ala Cys Cys Ile Ser Thr Gly Gly His Ser 
                645                  650                655    
Gln Cys Ser Glu Ser Leu Pro Val Arg Thr Leu Ser Ile Ala Pro Gly 
            660                  665                670        
Gln Cys Arg Pro Pro Arg Val Leu Gly Arg Pro Lys His Lys Glu Val 
        675                  680                685            
His Leu Glu Trp Asp Val Pro Ala Ser Glu Ser Gly Cys Glu Val Ser 
    690                  695                700                
Glu Tyr Ser Val Glu Met Thr Glu Pro Glu Asp Val Ala Ser Glu Val 
705                  710                715                  720
Tyr His Gly Pro Glu Leu Glu Cys Thr Val Gly Asn Leu Leu Pro Gly 
                725                  730                735    
Thr Val Tyr Arg Phe Arg Val Arg Ala Leu Asn Asp Gly Gly Tyr Gly 
            740                  745                750        
Pro Tyr Ser Asp Val Ser Glu Ile Thr Thr Ala Ala Gly Pro Pro Gly 
        755                  760                765            
Gln Cys Lys Ala Pro Cys Ile Ser Cys Thr Pro Asp Gly Cys Val Leu 
    770                  775                780                
Val Gly Trp Glu Ser Pro Asp Ser Ser Gly Ala Asp Ile Ser Glu Tyr 
785                  790                795                  800
Arg Leu Glu Trp Gly Glu Asp Glu Glu Ser Leu Glu Leu Ile Tyr His 
                805                  810                815    
Gly Thr Asp Thr Arg Phe Glu Ile Arg Asp Leu Leu Pro Ala Ala Gln 
            820                  825                830        
Tyr Cys Cys Arg Leu Gln Ala Phe Asn Gln Ala Gly Ala Gly Pro Tyr 
        835                  840                845            
Ser Glu Leu Val Leu Cys Gln Thr Pro Ala Ser Ala Pro Asp Pro Val 
    850                  855                860                
Ser Thr Leu Cys Val Leu Glu Glu Glu Pro Leu Asp Ala Tyr Pro Asp 
865                  870                875                  880
Ser Pro Ser Ala Cys Leu Val Leu Asn Trp Glu Glu Pro Cys Asn Asn 
                885                  890                895    
Gly Ser Glu Ile Leu Ala Tyr Thr Ile Asp Leu Gly Asp Thr Ser Ile 
            900                  905                910        
Thr Val Gly Asn Thr Thr Met His Val Met Lys Asp Leu Leu Pro Glu 
        915                  920                925            
Thr Thr Tyr Arg Ile Arg Ile Gln Ala Ile Asn Glu Ile Gly Ala Gly 
    930                  935                940                
Pro Phe Ser Gln Phe Ile Lys Ala Lys Thr Arg Pro Leu Pro Pro Leu 
945                  950                955                  960
Pro Pro Arg Leu Glu Cys Ala Ala Ala Gly Pro Gln Ser Leu Lys Leu 
                965                  970                975    
Lys Trp Gly Asp Ser Asn Ser Lys Thr His Ala Ala Glu Asp Ile Val 
            980                  985                990        
Tyr Thr Leu Gln Leu Glu Asp Arg Asn Lys Arg Phe Ile Ser Ile Tyr 
        995                  1000                1005            
Arg Gly Pro Ser His Thr Tyr Lys Val Gln Arg Leu Thr Glu Phe Thr 
    1010                1015                1020                
Cys Tyr Ser Phe Arg Ile Gln Ala Ala Ser Glu Ala Gly Glu Gly Pro 
1025                1030                1035                1040
Phe Ser Glu Thr Tyr Thr Phe Ser Thr Thr Lys Ser Val Pro Pro Thr 
                1045                1050                1055    
Ile Lys Ala Pro Arg Val Thr Gln Leu Glu Gly Asn Ser Cys Glu Ile 
            1060                1065                1070        
Leu Trp Glu Thr Val Pro Ser Met Lys Gly Asp Pro Val Asn Tyr Ile 
        1075                1080                1085            
Leu Gln Val Leu Val Gly Arg Glu Ser Glu Tyr Lys Gln Val Tyr Lys 
    1090                1095                1100                
Gly Glu Glu Ala Thr Phe Gln Ile Ser Gly Leu Gln Thr Asn Thr Asp 
1105                1110                1115                1120
Tyr Arg Phe Arg Val Cys Ala Cys Arg Arg Cys Leu Asp Thr Ser Gln 
                1125                1130                1135    
Glu Leu Ser Gly Ala Phe Ser Pro Ser Ala Ala Phe Val Leu Gln Arg 
            1140                1145                1150        
Ser Glu Val Met Leu Thr Gly Asp Met Gly Ser Leu Asp Asp Pro Lys 
        1155                1160                1165            
Met Lys Ser Met Met Pro Thr Asp Glu Gln Phe Ala Ala Ile Ile Val 
    1170                1175                1180                
Leu Gly Phe Ala Thr Leu Ser Ile Leu Phe Ala Phe Ile Leu Gln Tyr 
1185                1190                1195                1200
Phe Leu Met Lys 
                

<210> 42
<211> 264
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..264
<223> /mol_type="DNA"
      /note="FNDC3B (preferred gene fragment)"
      /organism="artificial sequences"

<400> 42
atgtacgtca caatgatgat gaccgaccaa atccctctgg aactgccacc attgctgaac     60

ggagaggtag ccatgatgcc ccacttggtg aatggagatg cagctcagca ggttattctc    120

gttcaagtta atccaggtga gactttcaca ataagagcag aggatggaac acttcagtgc    180

attcaaggac ctgctgaagt tcccatgatg tcacccaatg gatccattcc tcccattcat    240

gtgcctccag gttatatctc acag                                           264


<210> 43
<211> 88
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..88
<223> /mol_type="protein"
      /note="FNDC3B (preferred protein fragment)"
      /organism="artificial sequences"

<400> 43
Met Tyr Val Thr Met Met Met Thr Asp Gln Ile Pro Leu Glu Leu Pro 
1               5                   10                   15    
Pro Leu Leu Asn Gly Glu Val Ala Met Met Pro His Leu Val Asn Gly 
            20                   25                  30        
Asp Ala Ala Gln Gln Val Ile Leu Val Gln Val Asn Pro Gly Glu Thr 
        35                   40                  45            
Phe Thr Ile Arg Ala Glu Asp Gly Thr Leu Gln Cys Ile Gln Gly Pro 
    50                   55                  60                
Ala Glu Val Pro Met Met Ser Pro Asn Gly Ser Ile Pro Pro Ile His 
65                   70                  75                  80
Val Pro Pro Gly Tyr Ile Ser Gln 
                85            

<210> 44
<211> 4059
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4059
<223> /mol_type="DNA"
      /note="TNIK (CCDS nucleotide sequence of TNIK (Gene ID: 23043))"
      /organism="Homo sapiens"

<400> 44
atggcgagcg actccccggc tcgaagcctg gatgaaatag atctctcggc tctgagggac       60

cccgcaggga tctttgaatt ggtggaactt gttggaaatg gaacatacgg gcaagtttat      120

aagggtcgtc atgtcaaaac gggccagctt gcagccatca aggttatgga tgtcacaggg      180

gatgaagagg aagaaatcaa acaagaaatt aacatgttga agaaatattc tcatcaccgg      240

aatattgcta catactatgg tgcttttatc aaaaagaacc caccaggcat ggatgaccaa      300

ctttggttgg tgatggagtt ttgtggtgct ggctctgtca ccgacctgat caagaacaca      360

aaaggtaaca cgttgaaaga ggagtggatt gcatacatct gcagggaaat cttacggggg      420

ctgagtcacc tgcaccagca taaagtgatt catcgagata ttaaagggca aaatgtcttg      480

ctgactgaaa atgcagaagt taaactagtg gactttggag tcagtgctca gcttgatcga      540

acagtgggca ggaggaatac tttcattgga actccctact ggatggcacc agaagttatt      600

gcctgtgatg aaaacccaga tgccacatat gatttcaaga gtgacttgtg gtctttgggt      660

atcaccgcca ttgaaatggc agaaggtgct ccccctctct gtgacatgca ccccatgaga      720

gctctcttcc tcatcccccg gaacccagcg cctcggctga agtctaagaa gtggtcaaaa      780

aaattccagt catttattga gagctgcttg gtaaagaatc acagccagcg accagcaaca      840

gaacaattga tgaagcatcc atttatacga gaccaaccta atgagcgaca ggtccgcatt      900

caactcaagg accatattga tagaacaaag aagaagcgag gagaaaaaga tgagacagag      960

tatgagtaca gtggaagtga ggaagaagag gaggagaatg actcaggaga gcccagctcc     1020

atcctgaatc tgccagggga gtcgacgctg cggagggact ttctgaggct gcagctggcc     1080

aacaaggagc gttctgaggc cctacggagg cagcagctgg agcagcagca gcgggagaat     1140

gaggagcaca agcggcagct gctggccgag cgtcagaagc gcatcgagga gcagaaagag     1200

cagaggcggc ggctggagga gcaacaaagg cgagagaagg agctgcggaa gcagcaggag     1260

agggagcagc gccggcacta tgaggagcag atgcgccggg aggaggagag gaggcgtgcg     1320

gagcatgaac aggaatacat caggcgacag ttagaggagg agcagagaca gttagagatc     1380

ttgcagcagc agctactgca tgaacaagct ctacttctgg aatataagcg caaacaattg     1440

gaagaacaga gacaagcaga aagactgcag aggcagctaa agcaagaaag agactactta     1500

gtttcccttc agcatcagcg gcaggagcag aggcctgtgg agaagaagcc actgtaccat     1560

tacaaagaag gaatgagtcc tagtgagaag ccagcatggg ccaaggaggt agaagaacgg     1620

tcaaggctca accggcaaag ttcccctgcc atgcctcaca aggttgccaa caggatatct     1680

gaccccaacc tgcccccaag gtcggagtcc ttcagcatta gtggagttca gcctgctcga     1740

acacccccca tgctcagacc agtcgatccc cagatcccac atctggtagc tgtaaaatcc     1800

cagggacctg ccttgaccgc ctcccagtca gtgcacgagc agcccacaaa gggcctctct     1860

gggtttcagg aggctctgaa cgtgacctcc caccgcgtgg agatgccacg ccagaactca     1920

gatcccacct cggaaaatcc tcctctcccc actcgcattg aaaagtttga ccgaagctct     1980

tggttacgac aggaagaaga cattccacca aaggtgcctc aaagaacaac ttctatatcc     2040

ccagcattag ccagaaagaa ttctcctggg aatggtagtg ctctgggacc cagactagga     2100

tctcaaccca tcagagcaag caaccctgat ctccggagaa ctgagcccat cttggagagc     2160

cccttgcaga ggaccagcag tggcagttcc tccagctcca gcacccctag ctcccagccc     2220

agctcccaag gaggctccca gcctggatca caagcaggat ccagtgaacg caccagagtt     2280

cgagccaaca gtaagtcaga aggatcacct gtgctccccc atgagcctgc caaggtgaaa     2340

ccagaagaat ccagggacat tacccggccc agtcgaccag ctgatctgac ggcattagcc     2400

aaagaactaa gagaactccg gattgaagaa acaaaccgcc caatgaagaa ggtgactgat     2460

tactcctcct ccagtgagga gtcagaaagt agcgaggaag aggaggaaga tggagagagc     2520

gagacccatg atgggacagt ggctgtcagc gacataccca gactgatacc aacaggagct     2580

ccaggcagca acgagcagta caatgtggga atggtgggga cgcatgggct ggagacctct     2640

catgcggaca gtttcagcgg cagtatttca agagaaggaa ccttgatgat tagagagacg     2700

tctggagaga agaagcgatc tggccacagt gacagcaatg gctttgctgg ccacatcaac     2760

ctccctgacc tggtgcagca gagccattct ccagctggaa ccccgactga gggactgggg     2820

cgcgtctcaa cccattccca ggagatggac tctgggactg aatatggcat ggggagcagc     2880

accaaagcct ccttcacccc ctttgtggac cccagagtat accagacgtc tcccactgat     2940

gaagatgaag aggatgagga atcatcagcc gcagctctgt ttactagcga acttcttagg     3000

caagaacagg ccaaactcaa tgaagcaaga aagatttcgg tggtaaatgt aaacccaacc     3060

aacattcggc ctcatagcga cacaccagaa atcagaaaat acaagaaacg attcaactca     3120

gaaatacttt gtgcagctct gtggggtgta aaccttctgg tggggactga aaatggcctg     3180

atgcttttgg accgaagtgg gcaaggcaaa gtctataatc tgatcaaccg gaggcgattt     3240

cagcagatgg atgtgctaga gggactgaat gtccttgtga caatttcagg aaagaagaat     3300

aagctacgag tttactatct ttcatggtta agaaacagaa tactacataa tgacccagaa     3360

gtagaaaaga aacaaggctg gatcactgtt ggggacttgg aaggctgtat acattataaa     3420

gttgttaaat atgaaaggat caaatttttg gtgattgcct taaagaatgc tgtggaaata     3480

tatgcttggg ctcctaaacc gtatcataaa ttcatggcat ttaagtcttt tgcagatctc     3540

cagcacaagc ctctgctagt tgatctcacg gtagaagaag gtcaaagatt aaaggttatt     3600

tttggttcac acactggttt ccatgtaatt gatgttgatt caggaaactc ttatgatatc     3660

tacataccat ctcatattca gggcaatatc actcctcatg ctattgtcat cttgcctaaa     3720

acagatggaa tggaaatgct tgtttgctat gaggatgagg gggtgtatgt aaacacctat     3780

ggccggataa ctaaggatgt ggtgctccaa tggggagaaa tgcccacgtc tgtggcctac     3840

attcattcca atcagataat gggctggggc gagaaagcta ttgagatccg gtcagtggaa     3900

acaggacatt tggatggagt atttatgcat aagcgagctc aaaggttaaa gtttctatgt     3960

gaaagaaatg ataaggtatt ttttgcatcc gtgcgatctg gaggaagtag ccaagtgttt     4020

ttcatgaccc tcaacagaaa ttccatgatg aactggtaa                            4059


<210> 45
<211> 1352
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1352
<223> /mol_type="protein"
      /note="TNIK (full-length protein)"
      /organism="Homo sapiens"

<400> 45
Met Ala Ser Asp Ser Pro Ala Arg Ser Leu Asp Glu Ile Asp Leu Ser 
1               5                   10                   15    
Ala Leu Arg Asp Pro Ala Gly Ile Phe Glu Leu Val Glu Leu Val Gly 
            20                   25                  30        
Asn Gly Thr Tyr Gly Gln Val Tyr Lys Gly Arg His Val Lys Thr Gly 
        35                   40                  45            
Gln Leu Ala Ala Ile Lys Val Met Asp Val Thr Gly Asp Glu Glu Glu 
    50                   55                  60                
Glu Ile Lys Gln Glu Ile Asn Met Leu Lys Lys Tyr Ser His His Arg 
65                   70                  75                  80
Asn Ile Ala Thr Tyr Tyr Gly Ala Phe Ile Lys Lys Asn Pro Pro Gly 
                85                   90                  95    
Met Asp Asp Gln Leu Trp Leu Val Met Glu Phe Cys Gly Ala Gly Ser 
            100                  105                110        
Val Thr Asp Leu Ile Lys Asn Thr Lys Gly Asn Thr Leu Lys Glu Glu 
        115                  120                125            
Trp Ile Ala Tyr Ile Cys Arg Glu Ile Leu Arg Gly Leu Ser His Leu 
    130                  135                140                
His Gln His Lys Val Ile His Arg Asp Ile Lys Gly Gln Asn Val Leu 
145                  150                155                  160
Leu Thr Glu Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala 
                165                  170                175    
Gln Leu Asp Arg Thr Val Gly Arg Arg Asn Thr Phe Ile Gly Thr Pro 
            180                  185                190        
Tyr Trp Met Ala Pro Glu Val Ile Ala Cys Asp Glu Asn Pro Asp Ala 
        195                  200                205            
Thr Tyr Asp Phe Lys Ser Asp Leu Trp Ser Leu Gly Ile Thr Ala Ile 
    210                  215                220                
Glu Met Ala Glu Gly Ala Pro Pro Leu Cys Asp Met His Pro Met Arg 
225                  230                235                  240
Ala Leu Phe Leu Ile Pro Arg Asn Pro Ala Pro Arg Leu Lys Ser Lys 
                245                  250                255    
Lys Trp Ser Lys Lys Phe Gln Ser Phe Ile Glu Ser Cys Leu Val Lys 
            260                  265                270        
Asn His Ser Gln Arg Pro Ala Thr Glu Gln Leu Met Lys His Pro Phe 
        275                  280                285            
Ile Arg Asp Gln Pro Asn Glu Arg Gln Val Arg Ile Gln Leu Lys Asp 
    290                  295                300                
His Ile Asp Arg Thr Lys Lys Lys Arg Gly Glu Lys Asp Glu Thr Glu 
305                  310                315                  320
Tyr Glu Tyr Ser Gly Ser Glu Glu Glu Glu Glu Glu Asn Asp Ser Gly 
                325                  330                335    
Glu Pro Ser Ser Ile Leu Asn Leu Pro Gly Glu Ser Thr Leu Arg Arg 
            340                  345                350        
Asp Phe Leu Arg Leu Gln Leu Ala Asn Lys Glu Arg Ser Glu Ala Leu 
        355                  360                365            
Arg Arg Gln Gln Leu Glu Gln Gln Gln Arg Glu Asn Glu Glu His Lys 
    370                  375                380                
Arg Gln Leu Leu Ala Glu Arg Gln Lys Arg Ile Glu Glu Gln Lys Glu 
385                  390                395                  400
Gln Arg Arg Arg Leu Glu Glu Gln Gln Arg Arg Glu Lys Glu Leu Arg 
                405                  410                415    
Lys Gln Gln Glu Arg Glu Gln Arg Arg His Tyr Glu Glu Gln Met Arg 
            420                  425                430        
Arg Glu Glu Glu Arg Arg Arg Ala Glu His Glu Gln Glu Tyr Ile Arg 
        435                  440                445            
Arg Gln Leu Glu Glu Glu Gln Arg Gln Leu Glu Ile Leu Gln Gln Gln 
    450                  455                460                
Leu Leu His Glu Gln Ala Leu Leu Leu Glu Tyr Lys Arg Lys Gln Leu 
465                  470                475                  480
Glu Glu Gln Arg Gln Ala Glu Arg Leu Gln Arg Gln Leu Lys Gln Glu 
                485                  490                495    
Arg Asp Tyr Leu Val Ser Leu Gln His Gln Arg Gln Glu Gln Arg Pro 
            500                  505                510        
Val Glu Lys Lys Pro Leu Tyr His Tyr Lys Glu Gly Met Ser Pro Ser 
        515                  520                525            
Glu Lys Pro Ala Trp Ala Lys Glu Val Glu Glu Arg Ser Arg Leu Asn 
    530                  535                540                
Arg Gln Ser Ser Pro Ala Met Pro His Lys Val Ala Asn Arg Ile Ser 
545                  550                555                  560
Asp Pro Asn Leu Pro Pro Arg Ser Glu Ser Phe Ser Ile Ser Gly Val 
                565                  570                575    
Gln Pro Ala Arg Thr Pro Pro Met Leu Arg Pro Val Asp Pro Gln Ile 
            580                  585                590        
Pro His Leu Val Ala Val Lys Ser Gln Gly Pro Ala Leu Thr Ala Ser 
        595                  600                605            
Gln Ser Val His Glu Gln Pro Thr Lys Gly Leu Ser Gly Phe Gln Glu 
    610                  615                620                
Ala Leu Asn Val Thr Ser His Arg Val Glu Met Pro Arg Gln Asn Ser 
625                  630                635                  640
Asp Pro Thr Ser Glu Asn Pro Pro Leu Pro Thr Arg Ile Glu Lys Phe 
                645                  650                655    
Asp Arg Ser Ser Trp Leu Arg Gln Glu Glu Asp Ile Pro Pro Lys Val 
            660                  665                670        
Pro Gln Arg Thr Thr Ser Ile Ser Pro Ala Leu Ala Arg Lys Asn Ser 
        675                  680                685            
Pro Gly Asn Gly Ser Ala Leu Gly Pro Arg Leu Gly Ser Gln Pro Ile 
    690                  695                700                
Arg Ala Ser Asn Pro Asp Leu Arg Arg Thr Glu Pro Ile Leu Glu Ser 
705                  710                715                  720
Pro Leu Gln Arg Thr Ser Ser Gly Ser Ser Ser Ser Ser Ser Thr Pro 
                725                  730                735    
Ser Ser Gln Pro Ser Ser Gln Gly Gly Ser Gln Pro Gly Ser Gln Ala 
            740                  745                750        
Gly Ser Ser Glu Arg Thr Arg Val Arg Ala Asn Ser Lys Ser Glu Gly 
        755                  760                765            
Ser Pro Val Leu Pro His Glu Pro Ala Lys Val Lys Pro Glu Glu Ser 
    770                  775                780                
Arg Asp Ile Thr Arg Pro Ser Arg Pro Ala Asp Leu Thr Ala Leu Ala 
785                  790                795                  800
Lys Glu Leu Arg Glu Leu Arg Ile Glu Glu Thr Asn Arg Pro Met Lys 
                805                  810                815    
Lys Val Thr Asp Tyr Ser Ser Ser Ser Glu Glu Ser Glu Ser Ser Glu 
            820                  825                830        
Glu Glu Glu Glu Asp Gly Glu Ser Glu Thr His Asp Gly Thr Val Ala 
        835                  840                845            
Val Ser Asp Ile Pro Arg Leu Ile Pro Thr Gly Ala Pro Gly Ser Asn 
    850                  855                860                
Glu Gln Tyr Asn Val Gly Met Val Gly Thr His Gly Leu Glu Thr Ser 
865                  870                875                  880
His Ala Asp Ser Phe Ser Gly Ser Ile Ser Arg Glu Gly Thr Leu Met 
                885                  890                895    
Ile Arg Glu Thr Ser Gly Glu Lys Lys Arg Ser Gly His Ser Asp Ser 
            900                  905                910        
Asn Gly Phe Ala Gly His Ile Asn Leu Pro Asp Leu Val Gln Gln Ser 
        915                  920                925            
His Ser Pro Ala Gly Thr Pro Thr Glu Gly Leu Gly Arg Val Ser Thr 
    930                  935                940                
His Ser Gln Glu Met Asp Ser Gly Thr Glu Tyr Gly Met Gly Ser Ser 
945                  950                955                  960
Thr Lys Ala Ser Phe Thr Pro Phe Val Asp Pro Arg Val Tyr Gln Thr 
                965                  970                975    
Ser Pro Thr Asp Glu Asp Glu Glu Asp Glu Glu Ser Ser Ala Ala Ala 
            980                  985                990        
Leu Phe Thr Ser Glu Leu Leu Arg Gln Glu Gln Ala Lys Leu Asn Glu 
        995                  1000                1005            
Ala Arg Lys Ile Ser Val Val Asn Val Asn Pro Thr Asn Ile Arg Pro 
    1010                1015                1020                
His Ser Asp Thr Pro Glu Ile Arg Lys Tyr Lys Lys Arg Phe Asn Ser 
1025                1030                1035                1040
Glu Ile Leu Cys Ala Ala Leu Trp Gly Val Asn Leu Leu Val Gly Thr 
                1045                1050                1055    
Glu Asn Gly Leu Met Leu Leu Asp Arg Ser Gly Gln Gly Lys Val Tyr 
            1060                1065                1070        
Asn Leu Ile Asn Arg Arg Arg Phe Gln Gln Met Asp Val Leu Glu Gly 
        1075                1080                1085            
Leu Asn Val Leu Val Thr Ile Ser Gly Lys Lys Asn Lys Leu Arg Val 
    1090                1095                1100                
Tyr Tyr Leu Ser Trp Leu Arg Asn Arg Ile Leu His Asn Asp Pro Glu 
1105                1110                1115                1120
Val Glu Lys Lys Gln Gly Trp Ile Thr Val Gly Asp Leu Glu Gly Cys 
                1125                1130                1135    
Ile His Tyr Lys Val Val Lys Tyr Glu Arg Ile Lys Phe Leu Val Ile 
            1140                1145                1150        
Ala Leu Lys Asn Ala Val Glu Ile Tyr Ala Trp Ala Pro Lys Pro Tyr 
        1155                1160                1165            
His Lys Phe Met Ala Phe Lys Ser Phe Ala Asp Leu Gln His Lys Pro 
    1170                1175                1180                
Leu Leu Val Asp Leu Thr Val Glu Glu Gly Gln Arg Leu Lys Val Ile 
1185                1190                1195                1200
Phe Gly Ser His Thr Gly Phe His Val Ile Asp Val Asp Ser Gly Asn 
                1205                1210                1215    
Ser Tyr Asp Ile Tyr Ile Pro Ser His Ile Gln Gly Asn Ile Thr Pro 
            1220                1225                1230        
His Ala Ile Val Ile Leu Pro Lys Thr Asp Gly Met Glu Met Leu Val 
        1235                1240                1245            
Cys Tyr Glu Asp Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg Ile Thr 
    1250                1255                1260                
Lys Asp Val Val Leu Gln Trp Gly Glu Met Pro Thr Ser Val Ala Tyr 
1265                1270                1275                1280
Ile His Ser Asn Gln Ile Met Gly Trp Gly Glu Lys Ala Ile Glu Ile 
                1285                1290                1295    
Arg Ser Val Glu Thr Gly His Leu Asp Gly Val Phe Met His Lys Arg 
            1300                1305                1310        
Ala Gln Arg Leu Lys Phe Leu Cys Glu Arg Asn Asp Lys Val Phe Phe 
        1315                1320                1325            
Ala Ser Val Arg Ser Gly Gly Ser Ser Gln Val Phe Phe Met Thr Leu 
    1330                1335                1340                
Asn Arg Asn Ser Met Met Asn Trp 
1345                1350        

<210> 46
<211> 1362
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1362
<223> /mol_type="DNA"
      /note="TNIK (preferred gene fragment)"
      /organism="artificial sequences"

<400> 46
acgtctggag agaagaagcg atctggccac agtgacagca atggctttgc tggccacatc      60

aacctccctg acctggtgca gcagagccat tctccagctg gaaccccgac tgagggactg     120

gggcgcgtct caacccattc ccaggagatg gactctggga ctgaatatgg catggggagc     180

agcaccaaag cctccttcac cccctttgtg gaccccagag tataccagac gtctcccact     240

gatgaagatg aagaggatga ggaatcatca gccgcagctc tgtttactag cgaacttctt     300

aggcaagaac aggccaaact caatgaagca agaaagattt cggtggtaaa tgtaaaccca     360

accaacattc ggcctcatag cgacacacca gaaatcagaa aatacaagaa acgattcaac     420

tcagaaatac tttgtgcagc tctgtggggt gtaaaccttc tggtggggac tgaaaatggc     480

ctgatgcttt tggaccgaag tgggcaaggc aaagtctata atctgatcaa ccggaggcga     540

tttcagcaga tggatgtgct agagggactg aatgtccttg tgacaatttc aggaaagaag     600

aataagctac gagtttacta tctttcatgg ttaagaaaca gaatactaca taatgaccca     660

gaagtagaaa agaaacaagg ctggatcact gttggggact tggaaggctg tatacattat     720

aaagttgtta aatatgaaag gatcaaattt ttggtgattg ccttaaagaa tgctgtggaa     780

atatatgctt gggctcctaa accgtatcat aaattcatgg catttaagtc ttttgcagat     840

ctccagcaca agcctctgct agttgatctc acggtagaag aaggtcaaag attaaaggtt     900

atttttggtt cacacactgg tttccatgta attgatgttg attcaggaaa ctcttatgat     960

atctacatac catctcatat tcagggcaat atcactcctc atgctattgt catcttgcct    1020

aaaacagatg gaatggaaat gcttgtttgc tatgaggatg agggggtgta tgtaaacacc    1080

tatggccgga taactaagga tgtggtgctc caatggggag aaatgcccac gtctgtggcc    1140

tacattcatt ccaatcagat aatgggctgg ggcgagaaag ctattgagat ccggtcagtg    1200

gaaacaggac atttggatgg agtatttatg cataagcgag ctcaaaggtt aaagtttcta    1260

tgtgaaagaa atgataaggt attttttgca tccgtgcgat ctggaggaag tagccaagtg    1320

tttttcatga ccctcaacag aaattccatg atgaactggt aa                       1362


<210> 47
<211> 453
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..453
<223> /mol_type="protein"
      /note="TNIK (preferred protein fragment)"
      /organism="artificial sequences"

<400> 47
Thr Ser Gly Glu Lys Lys Arg Ser Gly His Ser Asp Ser Asn Gly Phe 
1               5                   10                   15    
Ala Gly His Ile Asn Leu Pro Asp Leu Val Gln Gln Ser His Ser Pro 
            20                   25                  30        
Ala Gly Thr Pro Thr Glu Gly Leu Gly Arg Val Ser Thr His Ser Gln 
        35                   40                  45            
Glu Met Asp Ser Gly Thr Glu Tyr Gly Met Gly Ser Ser Thr Lys Ala 
    50                   55                  60                
Ser Phe Thr Pro Phe Val Asp Pro Arg Val Tyr Gln Thr Ser Pro Thr 
65                   70                  75                  80
Asp Glu Asp Glu Glu Asp Glu Glu Ser Ser Ala Ala Ala Leu Phe Thr 
                85                   90                  95    
Ser Glu Leu Leu Arg Gln Glu Gln Ala Lys Leu Asn Glu Ala Arg Lys 
            100                  105                110        
Ile Ser Val Val Asn Val Asn Pro Thr Asn Ile Arg Pro His Ser Asp 
        115                  120                125            
Thr Pro Glu Ile Arg Lys Tyr Lys Lys Arg Phe Asn Ser Glu Ile Leu 
    130                  135                140                
Cys Ala Ala Leu Trp Gly Val Asn Leu Leu Val Gly Thr Glu Asn Gly 
145                  150                155                  160
Leu Met Leu Leu Asp Arg Ser Gly Gln Gly Lys Val Tyr Asn Leu Ile 
                165                  170                175    
Asn Arg Arg Arg Phe Gln Gln Met Asp Val Leu Glu Gly Leu Asn Val 
            180                  185                190        
Leu Val Thr Ile Ser Gly Lys Lys Asn Lys Leu Arg Val Tyr Tyr Leu 
        195                  200                205            
Ser Trp Leu Arg Asn Arg Ile Leu His Asn Asp Pro Glu Val Glu Lys 
    210                  215                220                
Lys Gln Gly Trp Ile Thr Val Gly Asp Leu Glu Gly Cys Ile His Tyr 
225                  230                235                  240
Lys Val Val Lys Tyr Glu Arg Ile Lys Phe Leu Val Ile Ala Leu Lys 
                245                  250                255    
Asn Ala Val Glu Ile Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe 
            260                  265                270        
Met Ala Phe Lys Ser Phe Ala Asp Leu Gln His Lys Pro Leu Leu Val 
        275                  280                285            
Asp Leu Thr Val Glu Glu Gly Gln Arg Leu Lys Val Ile Phe Gly Ser 
    290                  295                300                
His Thr Gly Phe His Val Ile Asp Val Asp Ser Gly Asn Ser Tyr Asp 
305                  310                315                  320
Ile Tyr Ile Pro Ser His Ile Gln Gly Asn Ile Thr Pro His Ala Ile 
                325                  330                335    
Val Ile Leu Pro Lys Thr Asp Gly Met Glu Met Leu Val Cys Tyr Glu 
            340                  345                350        
Asp Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg Ile Thr Lys Asp Val 
        355                  360                365            
Val Leu Gln Trp Gly Glu Met Pro Thr Ser Val Ala Tyr Ile His Ser 
    370                  375                380                
Asn Gln Ile Met Gly Trp Gly Glu Lys Ala Ile Glu Ile Arg Ser Val 
385                  390                395                  400
Glu Thr Gly His Leu Asp Gly Val Phe Met His Lys Arg Ala Gln Arg 
                405                  410                415    
Leu Lys Phe Leu Cys Glu Arg Asn Asp Lys Val Phe Phe Ala Ser Val 
            420                  425                430        
Arg Ser Gly Gly Ser Ser Gln Val Phe Phe Met Thr Leu Asn Arg Asn 
        435                  440                445            
Ser Met Met Asn Trp 
    450            

<210> 48
<211> 1626
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1626
<223> /mol_type="DNA"
      /note="FNDC3B-TNIK (preferred fusion gene)"
      /organism="artificial sequences"

<400> 48
atgtacgtca caatgatgat gaccgaccaa atccctctgg aactgccacc attgctgaac      60

ggagaggtag ccatgatgcc ccacttggtg aatggagatg cagctcagca ggttattctc     120

gttcaagtta atccaggtga gactttcaca ataagagcag aggatggaac acttcagtgc     180

attcaaggac ctgctgaagt tcccatgatg tcacccaatg gatccattcc tcccattcat     240

gtgcctccag gttatatctc acagacgtct ggagagaaga agcgatctgg ccacagtgac     300

agcaatggct ttgctggcca catcaacctc cctgacctgg tgcagcagag ccattctcca     360

gctggaaccc cgactgaggg actggggcgc gtctcaaccc attcccagga gatggactct     420

gggactgaat atggcatggg gagcagcacc aaagcctcct tcaccccctt tgtggacccc     480

agagtatacc agacgtctcc cactgatgaa gatgaagagg atgaggaatc atcagccgca     540

gctctgttta ctagcgaact tcttaggcaa gaacaggcca aactcaatga agcaagaaag     600

atttcggtgg taaatgtaaa cccaaccaac attcggcctc atagcgacac accagaaatc     660

agaaaataca agaaacgatt caactcagaa atactttgtg cagctctgtg gggtgtaaac     720

cttctggtgg ggactgaaaa tggcctgatg cttttggacc gaagtgggca aggcaaagtc     780

tataatctga tcaaccggag gcgatttcag cagatggatg tgctagaggg actgaatgtc     840

cttgtgacaa tttcaggaaa gaagaataag ctacgagttt actatctttc atggttaaga     900

aacagaatac tacataatga cccagaagta gaaaagaaac aaggctggat cactgttggg     960

gacttggaag gctgtataca ttataaagtt gttaaatatg aaaggatcaa atttttggtg    1020

attgccttaa agaatgctgt ggaaatatat gcttgggctc ctaaaccgta tcataaattc    1080

atggcattta agtcttttgc agatctccag cacaagcctc tgctagttga tctcacggta    1140

gaagaaggtc aaagattaaa ggttattttt ggttcacaca ctggtttcca tgtaattgat    1200

gttgattcag gaaactctta tgatatctac ataccatctc atattcaggg caatatcact    1260

cctcatgcta ttgtcatctt gcctaaaaca gatggaatgg aaatgcttgt ttgctatgag    1320

gatgaggggg tgtatgtaaa cacctatggc cggataacta aggatgtggt gctccaatgg    1380

ggagaaatgc ccacgtctgt ggcctacatt cattccaatc agataatggg ctggggcgag    1440

aaagctattg agatccggtc agtggaaaca ggacatttgg atggagtatt tatgcataag    1500

cgagctcaaa ggttaaagtt tctatgtgaa agaaatgata aggtattttt tgcatccgtg    1560

cgatctggag gaagtagcca agtgtttttc atgaccctca acagaaattc catgatgaac    1620

tggtaa                                                               1626


<210> 49
<211> 541
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..541
<223> /mol_type="protein"
      /note="FNDC3B-TNIK (preferred fusion protein)"
      /organism="artificial sequences"

<400> 49
Met Tyr Val Thr Met Met Met Thr Asp Gln Ile Pro Leu Glu Leu Pro 
1               5                   10                   15    
Pro Leu Leu Asn Gly Glu Val Ala Met Met Pro His Leu Val Asn Gly 
            20                   25                  30        
Asp Ala Ala Gln Gln Val Ile Leu Val Gln Val Asn Pro Gly Glu Thr 
        35                   40                  45            
Phe Thr Ile Arg Ala Glu Asp Gly Thr Leu Gln Cys Ile Gln Gly Pro 
    50                   55                  60                
Ala Glu Val Pro Met Met Ser Pro Asn Gly Ser Ile Pro Pro Ile His 
65                   70                  75                  80
Val Pro Pro Gly Tyr Ile Ser Gln Thr Ser Gly Glu Lys Lys Arg Ser 
                85                   90                  95    
Gly His Ser Asp Ser Asn Gly Phe Ala Gly His Ile Asn Leu Pro Asp 
            100                  105                110        
Leu Val Gln Gln Ser His Ser Pro Ala Gly Thr Pro Thr Glu Gly Leu 
        115                  120                125            
Gly Arg Val Ser Thr His Ser Gln Glu Met Asp Ser Gly Thr Glu Tyr 
    130                  135                140                
Gly Met Gly Ser Ser Thr Lys Ala Ser Phe Thr Pro Phe Val Asp Pro 
145                  150                155                  160
Arg Val Tyr Gln Thr Ser Pro Thr Asp Glu Asp Glu Glu Asp Glu Glu 
                165                  170                175    
Ser Ser Ala Ala Ala Leu Phe Thr Ser Glu Leu Leu Arg Gln Glu Gln 
            180                  185                190        
Ala Lys Leu Asn Glu Ala Arg Lys Ile Ser Val Val Asn Val Asn Pro 
        195                  200                205            
Thr Asn Ile Arg Pro His Ser Asp Thr Pro Glu Ile Arg Lys Tyr Lys 
    210                  215                220                
Lys Arg Phe Asn Ser Glu Ile Leu Cys Ala Ala Leu Trp Gly Val Asn 
225                  230                235                  240
Leu Leu Val Gly Thr Glu Asn Gly Leu Met Leu Leu Asp Arg Ser Gly 
                245                  250                255    
Gln Gly Lys Val Tyr Asn Leu Ile Asn Arg Arg Arg Phe Gln Gln Met 
            260                  265                270        
Asp Val Leu Glu Gly Leu Asn Val Leu Val Thr Ile Ser Gly Lys Lys 
        275                  280                285            
Asn Lys Leu Arg Val Tyr Tyr Leu Ser Trp Leu Arg Asn Arg Ile Leu 
    290                  295                300                
His Asn Asp Pro Glu Val Glu Lys Lys Gln Gly Trp Ile Thr Val Gly 
305                  310                315                  320
Asp Leu Glu Gly Cys Ile His Tyr Lys Val Val Lys Tyr Glu Arg Ile 
                325                  330                335    
Lys Phe Leu Val Ile Ala Leu Lys Asn Ala Val Glu Ile Tyr Ala Trp 
            340                  345                350        
Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys Ser Phe Ala Asp 
        355                  360                365            
Leu Gln His Lys Pro Leu Leu Val Asp Leu Thr Val Glu Glu Gly Gln 
    370                  375                380                
Arg Leu Lys Val Ile Phe Gly Ser His Thr Gly Phe His Val Ile Asp 
385                  390                395                  400
Val Asp Ser Gly Asn Ser Tyr Asp Ile Tyr Ile Pro Ser His Ile Gln 
                405                  410                415    
Gly Asn Ile Thr Pro His Ala Ile Val Ile Leu Pro Lys Thr Asp Gly 
            420                  425                430        
Met Glu Met Leu Val Cys Tyr Glu Asp Glu Gly Val Tyr Val Asn Thr 
        435                  440                445            
Tyr Gly Arg Ile Thr Lys Asp Val Val Leu Gln Trp Gly Glu Met Pro 
    450                  455                460                
Thr Ser Val Ala Tyr Ile His Ser Asn Gln Ile Met Gly Trp Gly Glu 
465                  470                475                  480
Lys Ala Ile Glu Ile Arg Ser Val Glu Thr Gly His Leu Asp Gly Val 
                485                  490                495    
Phe Met His Lys Arg Ala Gln Arg Leu Lys Phe Leu Cys Glu Arg Asn 
            500                  505                510        
Asp Lys Val Phe Phe Ala Ser Val Arg Ser Gly Gly Ser Ser Gln Val 
        515                  520                525            
Phe Phe Met Thr Leu Asn Arg Asn Ser Met Met Asn Trp 
    530                  535                540    

<210> 50
<211> 4242
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4242
<223> /mol_type="DNA"
      /note="C12orf11 (CCDS nucleotide sequence of C12orf11 (Gene ID: 55726))"
      /organism="Homo sapiens"

<400> 50
atgaagattt tttctgaatc tcataaaaca gtgtttgttg tggatcactg cccttatatg       60

gcagaatctt gcaggcagca tgtcgagttt gatatgctgg tgaagaatag aacccaagga      120

atcattcctt tggcccccat atctaaatca ttgtggactt gctcagtaga atcttccatg      180

gaatattgta gaataatgta tgatatattt cctttcaaaa agctggtgaa ttttattgtg      240

agtgactctg gagcacatgt tttaaattct tggactcaag aagaccaaaa tttacaggag      300

ctaatggcag cattagccgc tgttgggcct cctaatcctc gggcagatcc agagtgctgc      360

agtattctgc atggccttgt tgcagcagtg gaaactctct gcaaaattac tgaataccaa      420

catgaggctc gtactctact catggagaat gcagaacgtg ttggaaatag aggacgaata      480

atctgtatta ctaatgcaaa aagtgatagt catgtgcgaa tgcttgaaga ctgtgtccag      540

gaaacgattc atgaacataa caagcttgct gcaaattcag atcatctcat gcagattcaa      600

aaatgtgagt tggtcttgat ccacacctac ccagttggtg aagacagcct tgtatctgat      660

cgttctaaaa aagagttgtc cccggtttta accagtgaag ttcatagtgt tcgtgcagga      720

cggcatcttg ctaccaaatt gaatatttta gtacagcaac attttgactt ggcttcaact      780

actattacaa atattccaat gaaggaagaa cagcatgcta acacatctgc caattatgat      840

gtggagctac ttcatcacaa agatgcacat gtagatttcc tgaaaagtgg tgattcgcat      900

ctaggtggcg gcagtcgaga aggctcgttt aaagaaacaa taacattaaa gtggtgtaca      960

ccaaggacaa ataacattga attacactat tgtactggag cttatcggat ttcacctgta     1020

gatgtaaata gtagaccttc ctcctgcctt actaattttc ttctaaatgg tcgttctgtt     1080

ttattggaac aaccacgaaa gtcaggttct aaagtcatta gtcatatgct tagtagccat     1140

ggaggagaga tttttttgca cgtccttagc agttctcgat ccattctaga agatccacct     1200

tcaattagtg aaggatgtgg aggaagagtt acagactacc ggattacaga ttttggtgaa     1260

tttatgaggg aaaacagatt aactcctttt ctagacccca gatataaaat cgatggaagt     1320

cttgaggtcc ctttggaacg agcaaaagat cagttagaaa aacatacccg ttactggcct     1380

atgatcattt cacaaaccac catttttaac atgcaagcgg tagttccatt agccagtgtt     1440

attgtgaaag aatctctgac agaagaagat gtgttaaact gtcaaaaaac aatatacaac     1500

ttagttgata tggaaagaaa aaatgatcct ctacctattt ccacagttgg tacaagagga     1560

aagggcccta aaagagatga acaataccgt atcatgtgga atgaattaga aacccttgtc     1620

agagcccata tcaacaactc agagaaacat caaagagtct tggaatgtct gatggcatgc     1680

aggagcaaac ccccagaaga ggaagaacga aagaaacgag gaagaaagag ggaagacaaa     1740

gaggacaagt cagagaaagc agtgaaagat tatgaacagg aaaagtcttg gcaagactca     1800

gagagattaa aaggaatctt agagcgtgga aaagaagaat tggctgaagc tgagattata     1860

aaagattcgc ctgattcccc agaacctcca aacaaaaaac cccttgttga aatggatgaa     1920

actccacaag tggaaaaatc aaaagggcca gtgtcgttat tatccttgtg gagtaataga     1980

atcaatactg ccaattccag aaaacatcag gaatttgctg gacgtttgaa ctctgttaat     2040

aacagagctg aactatatca acatcttaaa gaggaaaatg ggatggagac aacagaaaat     2100

ggaaaagcca gccggcagtg aatgaagatt ttttctgaat ctcataaaac agtgtttgtt     2160

gtggatcact gcccttatat ggcagaatct tgcaggcagc atgtcgagtt tgatatgctg     2220

gtgaagaata gaacccaagg aatcattcct ttggccccca tatctaaatc attgtggact     2280

tgctcagtag aatcttccat ggaatattgt agaataatgt atgatatatt tcctttcaaa     2340

aagctggtga attttattgt gagtgactct ggagcacatg ttttaaattc ttggactcaa     2400

gaagaccaaa atttacagga gctaatggca gcattagccg ctgttgggcc tcctaatcct     2460

cgggcagatc cagagtgctg cagtattctg catggccttg ttgcagcagt ggaaactctc     2520

tgcaaaatta ctgaatacca acatgaggct cgtactctac tcatggagaa tgcagaacgt     2580

gttggaaata gaggacgaat aatctgtatt actaatgcaa aaagtgatag tcatgtgcga     2640

atgcttgaag actgtgtcca ggaaacgatt catgaacata acaagcttgc tgcaaattca     2700

gatcatctca tgcagattca aaaatgtgag ttggtcttga tccacaccta cccagttggt     2760

gaagacagcc ttgtatctga tcgttctaaa aaagagttgt ccccggtttt aaccagtgaa     2820

gttcatagtg ttcgtgcagg acggcatctt gctaccaaat tgaatatttt agtacagcaa     2880

cattttgact tggcttcaac tactattaca aatattccaa tgaaggaaga acagcatgct     2940

aacacatctg ccaattatga tgtggagcta cttcatcaca aagatgcaca tgtagatttc     3000

ctgaaaagtg gtgattcgca tctaggtggc ggcagtcgag aaggctcgtt taaagaaaca     3060

ataacattaa agtggtgtac accaaggaca aataacattg aattacacta ttgtactgga     3120

gcttatcgga tttcacctgt agatgtaaat agtagacctt cctcctgcct tactaatttt     3180

cttctaaatg gtcgttctgt tttattggaa caaccacgaa agtcaggttc taaagtcatt     3240

agtcatatgc ttagtagcca tggaggagag atttttttgc acgtccttag cagttctcga     3300

tccattctag aagatccacc ttcaattagt gaaggatgtg gaggaagagt tacagactac     3360

cggattacag attttggtga atttatgagg gaaaacagat taactccttt tctagacccc     3420

agatataaaa tcgatggaag tcttgaggtc cctttggaac gagcaaaaga tcagttagaa     3480

aaacataccc gttactggcc tatgatcatt tcacaaacca ccatttttaa catgcaagcg     3540

gtagttccat tagccagtgt tattgtgaaa gaatctctga cagaagaaga tgtgttaaac     3600

tgtcaaaaaa caatatacaa cttagttgat atggaaagaa aaaatgatcc tctacctatt     3660

tccacagttg gtacaagagg aaagggccct aaaagagatg aacaataccg tatcatgtgg     3720

aatgaattag aaacccttgt cagagcccat atcaacaact cagagaaaca tcaaagagtc     3780

ttggaatgtc tgatggcatg caggagcaaa cccccagaag aggaagaacg aaagaaacga     3840

ggaagaaaga gggaagacaa agaggacaag tcagagaaag cagtgaaaga ttatgaacag     3900

gaaaagtctt ggcaagactc agagagatta aaaggaatct tagagcgtgg aaaagaagaa     3960

ttggctgaag ctgagattat aaaagattcg cctgattccc cagaacctcc aaacaaaaaa     4020

ccccttgttg aaatggatga aactccacaa gtggaaaaat caaaagggcc agtgtcgtta     4080

ttatccttgt ggagtaatag aatcaatact gccaattcca gaaaacatca ggaatttgct     4140

ggacgtttga actctgttaa taacagagct gaactatatc aacatcttaa agaggaaaat     4200

gggatggaga caacagaaaa tggaaaagcc agccggcagt ga                        4242


<210> 51
<211> 706
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..706
<223> /mol_type="protein"
      /note="C12orf11 (full-length protein)"
      /organism="Homo sapiens"

<400> 51
Met Lys Ile Phe Ser Glu Ser His Lys Thr Val Phe Val Val Asp His 
1               5                   10                   15    
Cys Pro Tyr Met Ala Glu Ser Cys Arg Gln His Val Glu Phe Asp Met 
            20                   25                  30        
Leu Val Lys Asn Arg Thr Gln Gly Ile Ile Pro Leu Ala Pro Ile Ser 
        35                   40                  45            
Lys Ser Leu Trp Thr Cys Ser Val Glu Ser Ser Met Glu Tyr Cys Arg 
    50                   55                  60                
Ile Met Tyr Asp Ile Phe Pro Phe Lys Lys Leu Val Asn Phe Ile Val 
65                   70                  75                  80
Ser Asp Ser Gly Ala His Val Leu Asn Ser Trp Thr Gln Glu Asp Gln 
                85                   90                  95    
Asn Leu Gln Glu Leu Met Ala Ala Leu Ala Ala Val Gly Pro Pro Asn 
            100                  105                110        
Pro Arg Ala Asp Pro Glu Cys Cys Ser Ile Leu His Gly Leu Val Ala 
        115                  120                125            
Ala Val Glu Thr Leu Cys Lys Ile Thr Glu Tyr Gln His Glu Ala Arg 
    130                  135                140                
Thr Leu Leu Met Glu Asn Ala Glu Arg Val Gly Asn Arg Gly Arg Ile 
145                  150                155                  160
Ile Cys Ile Thr Asn Ala Lys Ser Asp Ser His Val Arg Met Leu Glu 
                165                  170                175    
Asp Cys Val Gln Glu Thr Ile His Glu His Asn Lys Leu Ala Ala Asn 
            180                  185                190        
Ser Asp His Leu Met Gln Ile Gln Lys Cys Glu Leu Val Leu Ile His 
        195                  200                205            
Thr Tyr Pro Val Gly Glu Asp Ser Leu Val Ser Asp Arg Ser Lys Lys 
    210                  215                220                
Glu Leu Ser Pro Val Leu Thr Ser Glu Val His Ser Val Arg Ala Gly 
225                  230                235                  240
Arg His Leu Ala Thr Lys Leu Asn Ile Leu Val Gln Gln His Phe Asp 
                245                  250                255    
Leu Ala Ser Thr Thr Ile Thr Asn Ile Pro Met Lys Glu Glu Gln His 
            260                  265                270        
Ala Asn Thr Ser Ala Asn Tyr Asp Val Glu Leu Leu His His Lys Asp 
        275                  280                285            
Ala His Val Asp Phe Leu Lys Ser Gly Asp Ser His Leu Gly Gly Gly 
    290                  295                300                
Ser Arg Glu Gly Ser Phe Lys Glu Thr Ile Thr Leu Lys Trp Cys Thr 
305                  310                315                  320
Pro Arg Thr Asn Asn Ile Glu Leu His Tyr Cys Thr Gly Ala Tyr Arg 
                325                  330                335    
Ile Ser Pro Val Asp Val Asn Ser Arg Pro Ser Ser Cys Leu Thr Asn 
            340                  345                350        
Phe Leu Leu Asn Gly Arg Ser Val Leu Leu Glu Gln Pro Arg Lys Ser 
        355                  360                365            
Gly Ser Lys Val Ile Ser His Met Leu Ser Ser His Gly Gly Glu Ile 
    370                  375                380                
Phe Leu His Val Leu Ser Ser Ser Arg Ser Ile Leu Glu Asp Pro Pro 
385                  390                395                  400
Ser Ile Ser Glu Gly Cys Gly Gly Arg Val Thr Asp Tyr Arg Ile Thr 
                405                  410                415    
Asp Phe Gly Glu Phe Met Arg Glu Asn Arg Leu Thr Pro Phe Leu Asp 
            420                  425                430        
Pro Arg Tyr Lys Ile Asp Gly Ser Leu Glu Val Pro Leu Glu Arg Ala 
        435                  440                445            
Lys Asp Gln Leu Glu Lys His Thr Arg Tyr Trp Pro Met Ile Ile Ser 
    450                  455                460                
Gln Thr Thr Ile Phe Asn Met Gln Ala Val Val Pro Leu Ala Ser Val 
465                  470                475                  480
Ile Val Lys Glu Ser Leu Thr Glu Glu Asp Val Leu Asn Cys Gln Lys 
                485                  490                495    
Thr Ile Tyr Asn Leu Val Asp Met Glu Arg Lys Asn Asp Pro Leu Pro 
            500                  505                510        
Ile Ser Thr Val Gly Thr Arg Gly Lys Gly Pro Lys Arg Asp Glu Gln 
        515                  520                525            
Tyr Arg Ile Met Trp Asn Glu Leu Glu Thr Leu Val Arg Ala His Ile 
    530                  535                540                
Asn Asn Ser Glu Lys His Gln Arg Val Leu Glu Cys Leu Met Ala Cys 
545                  550                555                  560
Arg Ser Lys Pro Pro Glu Glu Glu Glu Arg Lys Lys Arg Gly Arg Lys 
                565                  570                575    
Arg Glu Asp Lys Glu Asp Lys Ser Glu Lys Ala Val Lys Asp Tyr Glu 
            580                  585                590        
Gln Glu Lys Ser Trp Gln Asp Ser Glu Arg Leu Lys Gly Ile Leu Glu 
        595                  600                605            
Arg Gly Lys Glu Glu Leu Ala Glu Ala Glu Ile Ile Lys Asp Ser Pro 
    610                  615                620                
Asp Ser Pro Glu Pro Pro Asn Lys Lys Pro Leu Val Glu Met Asp Glu 
625                  630                635                  640
Thr Pro Gln Val Glu Lys Ser Lys Gly Pro Val Ser Leu Leu Ser Leu 
                645                  650                655    
Trp Ser Asn Arg Ile Asn Thr Ala Asn Ser Arg Lys His Gln Glu Phe 
            660                  665                670        
Ala Gly Arg Leu Asn Ser Val Asn Asn Arg Ala Glu Leu Tyr Gln His 
        675                  680                685            
Leu Lys Glu Glu Asn Gly Met Glu Thr Thr Glu Asn Gly Lys Ala Ser 
    690                  695                700                
Arg Gln 
705    

<210> 52
<211> 225
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..225
<223> /mol_type="DNA"
      /note="C12orf11 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 52
atgaagattt tttctgaatc tcataaaaca gtgtttgttg tggatcactg cccttatatg     60

gcagaatctt gcaggcagca tgtcgagttt gatatgctgg tgaagaatag aacccaagga    120

atcattcctt tggcccccat atctaaatca ttgtggactt gctcagtaga atcttccatg    180

gaatattgta gaataatgta tgatatattt cctttcaaaa agctg                    225


<210> 53
<211> 75
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..75
<223> /mol_type="protein"
      /note="C12orf11 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 53
Met Lys Ile Phe Ser Glu Ser His Lys Thr Val Phe Val Val Asp His 
1               5                   10                   15    
Cys Pro Tyr Met Ala Glu Ser Cys Arg Gln His Val Glu Phe Asp Met 
            20                   25                  30        
Leu Val Lys Asn Arg Thr Gln Gly Ile Ile Pro Leu Ala Pro Ile Ser 
        35                   40                  45            
Lys Ser Leu Trp Thr Cys Ser Val Glu Ser Ser Met Glu Tyr Cys Arg 
    50                   55                  60                
Ile Met Tyr Asp Ile Phe Pro Phe Lys Lys Leu 
65                   70                  75

<210> 54
<211> 1260
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1260
<223> /mol_type="DNA"
      /note="RASSF8 (CCDS nucleotide sequence of RASSF8 (Gene ID: 11228))"
      /organism="Homo sapiens"

<400> 54
atggaactta aagtatgggt ggatggagtt cagaggattg tttgtggagt cactgaagtc      60

acaacttgcc aggaggttgt catagcctta gctcaagcaa taggtcgaac tggaaggtac     120

acccttatag agaaatggag agatactgaa agacacttag cacctcatga aaatcctatc     180

atatccttaa acaaatgggg gcagtatgct agtgatgtgc agctcattct acgacgaact     240

gggccgtctc tcagtgagcg acccacttca gacagtgtgg ctcgaattcc tgaaagaact     300

ttatacaggc agagtctgcc ccccttagct aaactgaggc ctcagattga caaatcaatc     360

aaaaggaggg aaccgaaaag gaaatcactg acatttacag gaggtgccaa aggattaatg     420

gacatttttg gaaaaggtaa agaaactgag tttaagcaaa aggtgctgaa taactgcaaa     480

acaacagcag atgagttgaa gaagctaatc cgtctgcaga cagagaagct tcaatccatt     540

gagaaacagc tggaatctaa tgaaatagaa ataagatttt gggagcaaaa gtataattcc     600

aaccttgaag aggaaattgt ccgtctagag caaaagatca aaagaaacga tgtagaaatt     660

gaggaggaag aattctggga aaatgaatta cagattgaac aggaaaatga aaaacagctg     720

aaggatcaac ttcaagaaat aagacagaaa ataacagaat gtgaaaacaa attaaaggac     780

tatttggcac agatccggac tatggaaagt ggtcttgaag cagaaaaatt gcaacgggaa     840

gttcaagagg cacaggtcaa tgaggaagag gttaaaggaa agatcggtaa ggtcaaaggg     900

gagattgaca ttcaaggcca gcagagtctg aggttggaaa atggcatcaa agctgtggaa     960

agatctcttg gacaagccac caaacgctta caggacaaag aacaggaact ggagcagttg    1020

actaaggagt tgcggcaagt caatctccag cagttcatcc agcagacagg gacaaaagtt    1080

accgttttgc cagcggagcc cattgaaata gaggcctcac atgcagacat tgaaagggag    1140

gcaccattcc agtctgggtc cctgaagcga cctggttcat ctcggcagct ccccagtaat    1200

ctccgcattc tgcagaatcc tatctcatct ggttttaatc ctgaaggcat atatgtatga    1260


<210> 55
<211> 419
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..419
<223> /mol_type="protein"
      /note="RASSF8 (full-length protein)"
      /organism="Homo sapiens"

<400> 55
Met Glu Leu Lys Val Trp Val Asp Gly Val Gln Arg Ile Val Cys Gly 
1               5                   10                   15    
Val Thr Glu Val Thr Thr Cys Gln Glu Val Val Ile Ala Leu Ala Gln 
            20                   25                  30        
Ala Ile Gly Arg Thr Gly Arg Tyr Thr Leu Ile Glu Lys Trp Arg Asp 
        35                   40                  45            
Thr Glu Arg His Leu Ala Pro His Glu Asn Pro Ile Ile Ser Leu Asn 
    50                   55                  60                
Lys Trp Gly Gln Tyr Ala Ser Asp Val Gln Leu Ile Leu Arg Arg Thr 
65                   70                  75                  80
Gly Pro Ser Leu Ser Glu Arg Pro Thr Ser Asp Ser Val Ala Arg Ile 
                85                   90                  95    
Pro Glu Arg Thr Leu Tyr Arg Gln Ser Leu Pro Pro Leu Ala Lys Leu 
            100                  105                110        
Arg Pro Gln Ile Asp Lys Ser Ile Lys Arg Arg Glu Pro Lys Arg Lys 
        115                  120                125            
Ser Leu Thr Phe Thr Gly Gly Ala Lys Gly Leu Met Asp Ile Phe Gly 
    130                  135                140                
Lys Gly Lys Glu Thr Glu Phe Lys Gln Lys Val Leu Asn Asn Cys Lys 
145                  150                155                  160
Thr Thr Ala Asp Glu Leu Lys Lys Leu Ile Arg Leu Gln Thr Glu Lys 
                165                  170                175    
Leu Gln Ser Ile Glu Lys Gln Leu Glu Ser Asn Glu Ile Glu Ile Arg 
            180                  185                190        
Phe Trp Glu Gln Lys Tyr Asn Ser Asn Leu Glu Glu Glu Ile Val Arg 
        195                  200                205            
Leu Glu Gln Lys Ile Lys Arg Asn Asp Val Glu Ile Glu Glu Glu Glu 
    210                  215                220                
Phe Trp Glu Asn Glu Leu Gln Ile Glu Gln Glu Asn Glu Lys Gln Leu 
225                  230                235                  240
Lys Asp Gln Leu Gln Glu Ile Arg Gln Lys Ile Thr Glu Cys Glu Asn 
                245                  250                255    
Lys Leu Lys Asp Tyr Leu Ala Gln Ile Arg Thr Met Glu Ser Gly Leu 
            260                  265                270        
Glu Ala Glu Lys Leu Gln Arg Glu Val Gln Glu Ala Gln Val Asn Glu 
        275                  280                285            
Glu Glu Val Lys Gly Lys Ile Gly Lys Val Lys Gly Glu Ile Asp Ile 
    290                  295                300                
Gln Gly Gln Gln Ser Leu Arg Leu Glu Asn Gly Ile Lys Ala Val Glu 
305                  310                315                  320
Arg Ser Leu Gly Gln Ala Thr Lys Arg Leu Gln Asp Lys Glu Gln Glu 
                325                  330                335    
Leu Glu Gln Leu Thr Lys Glu Leu Arg Gln Val Asn Leu Gln Gln Phe 
            340                  345                350        
Ile Gln Gln Thr Gly Thr Lys Val Thr Val Leu Pro Ala Glu Pro Ile 
        355                  360                365            
Glu Ile Glu Ala Ser His Ala Asp Ile Glu Arg Glu Ala Pro Phe Gln 
    370                  375                380                
Ser Gly Ser Leu Lys Arg Pro Gly Ser Ser Arg Gln Leu Pro Ser Asn 
385                  390                395                  400
Leu Arg Ile Leu Gln Asn Pro Ile Ser Ser Gly Phe Asn Pro Glu Gly 
                405                  410                415    
Ile Tyr Val 
            

<210> 56
<211> 267
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..267
<223> /mol_type="DNA"
      /note="RASSF8 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 56
gacaaagaac aggaactgga gcagttgact aaggagttgc ggcaagtcaa tctccagcag     60

ttcatccagc agacagggac aaaagttacc gttttgccag cggagcccat tgaaatagag    120

gcctcacatg cagacattga aagggaggca ccattccagt ctgggtccct gaagcgacct    180

ggttcatctc ggcagctccc cagtaatctc cgcattctgc agaatcctat ctcatctggt    240

tttaatcctg aaggcatata tgtatga                                        267


<210> 57
<211> 88
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..88
<223> /mol_type="protein"
      /note="RASSF8 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 57
Asp Lys Glu Gln Glu Leu Glu Gln Leu Thr Lys Glu Leu Arg Gln Val 
1               5                   10                   15    
Asn Leu Gln Gln Phe Ile Gln Gln Thr Gly Thr Lys Val Thr Val Leu 
            20                   25                  30        
Pro Ala Glu Pro Ile Glu Ile Glu Ala Ser His Ala Asp Ile Glu Arg 
        35                   40                  45            
Glu Ala Pro Phe Gln Ser Gly Ser Leu Lys Arg Pro Gly Ser Ser Arg 
    50                   55                  60                
Gln Leu Pro Ser Asn Leu Arg Ile Leu Gln Asn Pro Ile Ser Ser Gly 
65                   70                  75                  80
Phe Asn Pro Glu Gly Ile Tyr Val 
                85            

<210> 58
<211> 492
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..492
<223> /mol_type="DNA"
      /note="C12orf11-RASSF8 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 58
atgaagattt tttctgaatc tcataaaaca gtgtttgttg tggatcactg cccttatatg      60

gcagaatctt gcaggcagca tgtcgagttt gatatgctgg tgaagaatag aacccaagga     120

atcattcctt tggcccccat atctaaatca ttgtggactt gctcagtaga atcttccatg     180

gaatattgta gaataatgta tgatatattt cctttcaaaa agctggacaa agaacaggaa     240

ctggagcagt tgactaagga gttgcggcaa gtcaatctcc agcagttcat ccagcagaca     300

gggacaaaag ttaccgtttt gccagcggag cccattgaaa tagaggcctc acatgcagac     360

attgaaaggg aggcaccatt ccagtctggg tccctgaagc gacctggttc atctcggcag     420

ctccccagta atctccgcat tctgcagaat cctatctcat ctggttttaa tcctgaaggc     480

atatatgtat ga                                                         492


<210> 59
<211> 163
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..163
<223> /mol_type="protein"
      /note="C12orf11-RASSF8 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 59
Met Lys Ile Phe Ser Glu Ser His Lys Thr Val Phe Val Val Asp His 
1               5                   10                   15    
Cys Pro Tyr Met Ala Glu Ser Cys Arg Gln His Val Glu Phe Asp Met 
            20                   25                  30        
Leu Val Lys Asn Arg Thr Gln Gly Ile Ile Pro Leu Ala Pro Ile Ser 
        35                   40                  45            
Lys Ser Leu Trp Thr Cys Ser Val Glu Ser Ser Met Glu Tyr Cys Arg 
    50                   55                  60                
Ile Met Tyr Asp Ile Phe Pro Phe Lys Lys Leu Asp Lys Glu Gln Glu 
65                   70                  75                  80
Leu Glu Gln Leu Thr Lys Glu Leu Arg Gln Val Asn Leu Gln Gln Phe 
                85                   90                  95    
Ile Gln Gln Thr Gly Thr Lys Val Thr Val Leu Pro Ala Glu Pro Ile 
            100                  105                110        
Glu Ile Glu Ala Ser His Ala Asp Ile Glu Arg Glu Ala Pro Phe Gln 
        115                  120                125            
Ser Gly Ser Leu Lys Arg Pro Gly Ser Ser Arg Gln Leu Pro Ser Asn 
    130                  135                140                
Leu Arg Ile Leu Gln Asn Pro Ile Ser Ser Gly Phe Asn Pro Glu Gly 
145                  150                155                  160
Ile Tyr Val 
            

<210> 60
<211> 1242
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1242
<223> /mol_type="DNA"
      /note="E2F4 (CCDS nucleotide sequence of E2F4 (Gene ID: 1874))"
      /organism="Homo sapiens"

<400> 60
atggcggagg ccgggccaca ggcgccgccg cccccgggca ctccaagccg gcacgaaaag      60

agcctgggac tgctcaccac caagttcgtg tcccttctgc aggaggccaa ggacggcgtg     120

cttgacctca agctggcagc tgacacccta gctgtacgcc agaagcggcg gatttacgac     180

attaccaatg ttttggaagg tatcgggcta atcgagaaaa agtccaagaa cagcatccag     240

tggaagggtg tggggcctgg ctgcaatacc cgggagattg ctgacaaact gattgagctc     300

aaggcagaga tcgaggagct gcagcagcgg gagcaagaac tagaccagca caaggtgtgg     360

gtgcagcaga gcatccggaa cgtcacagag gacgtgcaga acagctgttt ggcctacgtc     420

actcatgagg acatctgcag atgctttgct ggagataccc tcttggccat ccgggcccca     480

tcaggcacca gcctggaggt gcccatccca gagggtctca atgggcagaa gaagtaccag     540

attcacctga agagtgtgag tggtcccatt gaggttctgc tggtgaacaa ggaggcatgg     600

agctcacccc ctgtggctgt gcctgtgcca ccacctgaag atttgctcca gagcccatct     660

gctgtttcta cacctccacc tctgcccaag cctgccctag cccagtccca ggaagcctca     720

cgtccaaata gtcctcagct cactcccact gctgtccctg gcagtgcaga agtccaggga     780

atggctggcc cagcagctga gatcacagtg agtggcggcc ctgggactga tagcaaggac     840

agtggtgagc tcagttcact cccactgggc ccaacaacac tggacacccg gccactgcag     900

tcttctgccc tgctggacag cagcagcagc agcagcagca gcagcagcag cagcagcaac     960

agtaacagca gcagttcgtc cggacccaac ccttctacct cctttgagcc catcaaggca    1020

gaccccacag gtgttttgga actccccaaa gagctgtcag aaatctttga tcccacacga    1080

gagtgcatga gctcggagct gctggaggag ttgatgtcct cagaagtgtt tgcccctctg    1140

cttcgtcttt ctccaccccc gggagaccac gattatatct acaacctgga cgagagtgaa    1200

ggtgtctgtg acctctttga tgtgcctgtt ctcaacctct ga                       1242


<210> 61
<211> 413
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..413
<223> /mol_type="protein"
      /note="E2F4 (full-length protein)"
      /organism="Homo sapiens"

<400> 61
Met Ala Glu Ala Gly Pro Gln Ala Pro Pro Pro Pro Gly Thr Pro Ser 
1               5                   10                   15    
Arg His Glu Lys Ser Leu Gly Leu Leu Thr Thr Lys Phe Val Ser Leu 
            20                   25                  30        
Leu Gln Glu Ala Lys Asp Gly Val Leu Asp Leu Lys Leu Ala Ala Asp 
        35                   40                  45            
Thr Leu Ala Val Arg Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    50                   55                  60                
Leu Glu Gly Ile Gly Leu Ile Glu Lys Lys Ser Lys Asn Ser Ile Gln 
65                   70                  75                  80
Trp Lys Gly Val Gly Pro Gly Cys Asn Thr Arg Glu Ile Ala Asp Lys 
                85                   90                  95    
Leu Ile Glu Leu Lys Ala Glu Ile Glu Glu Leu Gln Gln Arg Glu Gln 
            100                  105                110        
Glu Leu Asp Gln His Lys Val Trp Val Gln Gln Ser Ile Arg Asn Val 
        115                  120                125            
Thr Glu Asp Val Gln Asn Ser Cys Leu Ala Tyr Val Thr His Glu Asp 
    130                  135                140                
Ile Cys Arg Cys Phe Ala Gly Asp Thr Leu Leu Ala Ile Arg Ala Pro 
145                  150                155                  160
Ser Gly Thr Ser Leu Glu Val Pro Ile Pro Glu Gly Leu Asn Gly Gln 
                165                  170                175    
Lys Lys Tyr Gln Ile His Leu Lys Ser Val Ser Gly Pro Ile Glu Val 
            180                  185                190        
Leu Leu Val Asn Lys Glu Ala Trp Ser Ser Pro Pro Val Ala Val Pro 
        195                  200                205            
Val Pro Pro Pro Glu Asp Leu Leu Gln Ser Pro Ser Ala Val Ser Thr 
    210                  215                220                
Pro Pro Pro Leu Pro Lys Pro Ala Leu Ala Gln Ser Gln Glu Ala Ser 
225                  230                235                  240
Arg Pro Asn Ser Pro Gln Leu Thr Pro Thr Ala Val Pro Gly Ser Ala 
                245                  250                255    
Glu Val Gln Gly Met Ala Gly Pro Ala Ala Glu Ile Thr Val Ser Gly 
            260                  265                270        
Gly Pro Gly Thr Asp Ser Lys Asp Ser Gly Glu Leu Ser Ser Leu Pro 
        275                  280                285            
Leu Gly Pro Thr Thr Leu Asp Thr Arg Pro Leu Gln Ser Ser Ala Leu 
    290                  295                300                
Leu Asp Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn 
305                  310                315                  320
Ser Asn Ser Ser Ser Ser Ser Gly Pro Asn Pro Ser Thr Ser Phe Glu 
                325                  330                335    
Pro Ile Lys Ala Asp Pro Thr Gly Val Leu Glu Leu Pro Lys Glu Leu 
            340                  345                350        
Ser Glu Ile Phe Asp Pro Thr Arg Glu Cys Met Ser Ser Glu Leu Leu 
        355                  360                365            
Glu Glu Leu Met Ser Ser Glu Val Phe Ala Pro Leu Leu Arg Leu Ser 
    370                  375                380                
Pro Pro Pro Gly Asp His Asp Tyr Ile Tyr Asn Leu Asp Glu Ser Glu 
385                  390                395                  400
Gly Val Cys Asp Leu Phe Asp Val Pro Val Leu Asn Leu 
                405                  410            

<210> 62
<211> 911
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..911
<223> /mol_type="DNA"
      /note="E2F4 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 62
atggcggagg ccgggccaca ggcgccgccg cccccgggca ctccaagccg gcacgaaaag      60

agcctgggac tgctcaccac caagttcgtg tcccttctgc aggaggccaa ggacggcgtg     120

cttgacctca agctggcagc tgacacccta gctgtacgcc agaagcggcg gatttacgac     180

attaccaatg ttttggaagg tatcgggcta atcgagaaaa agtccaagaa cagcatccag     240

tggaagggtg tggggcctgg ctgcaatacc cgggagattg ctgacaaact gattgagctc     300

aaggcagaga tcgaggagct gcagcagcgg gagcaagaac tagaccagca caaggtgtgg     360

gtgcagcaga gcatccggaa cgtcacagag gacgtgcaga acagctgttt ggcctacgtc     420

actcatgagg acatctgcag atgctttgct ggagataccc tcttggccat ccgggcccca     480

tcaggcacca gcctggaggt gcccatccca gagggtctca atgggcagaa gaagtaccag     540

attcacctga agagtgtgag tggtcccatt gaggttctgc tggtgaacaa ggaggcatgg     600

agctcacccc ctgtggctgt gcctgtgcca ccacctgaag atttgctcca gagcccatct     660

gctgtttcta cacctccacc tctgcccaag cctgccctag cccagtccca ggaagcctca     720

cgtccaaata gtcctcagct cactcccact gctgtccctg gcagtgcaga agtccaggga     780

atggctggcc cagcagctga gatcacagtg agtggcggcc ctgggactga tagcaaggac     840

agtggtgagc tcagttcact cccactgggc ccaacaacac tggacacccg gccactgcag     900

tcttctgccc t                                                          911


<210> 63
<211> 303
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..303
<223> /mol_type="protein"
      /note="E2F4 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 63
Met Ala Glu Ala Gly Pro Gln Ala Pro Pro Pro Pro Gly Thr Pro Ser 
1               5                   10                   15    
Arg His Glu Lys Ser Leu Gly Leu Leu Thr Thr Lys Phe Val Ser Leu 
            20                   25                  30        
Leu Gln Glu Ala Lys Asp Gly Val Leu Asp Leu Lys Leu Ala Ala Asp 
        35                   40                  45            
Thr Leu Ala Val Arg Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    50                   55                  60                
Leu Glu Gly Ile Gly Leu Ile Glu Lys Lys Ser Lys Asn Ser Ile Gln 
65                   70                  75                  80
Trp Lys Gly Val Gly Pro Gly Cys Asn Thr Arg Glu Ile Ala Asp Lys 
                85                   90                  95    
Leu Ile Glu Leu Lys Ala Glu Ile Glu Glu Leu Gln Gln Arg Glu Gln 
            100                  105                110        
Glu Leu Asp Gln His Lys Val Trp Val Gln Gln Ser Ile Arg Asn Val 
        115                  120                125            
Thr Glu Asp Val Gln Asn Ser Cys Leu Ala Tyr Val Thr His Glu Asp 
    130                  135                140                
Ile Cys Arg Cys Phe Ala Gly Asp Thr Leu Leu Ala Ile Arg Ala Pro 
145                  150                155                  160
Ser Gly Thr Ser Leu Glu Val Pro Ile Pro Glu Gly Leu Asn Gly Gln 
                165                  170                175    
Lys Lys Tyr Gln Ile His Leu Lys Ser Val Ser Gly Pro Ile Glu Val 
            180                  185                190        
Leu Leu Val Asn Lys Glu Ala Trp Ser Ser Pro Pro Val Ala Val Pro 
        195                  200                205            
Val Pro Pro Pro Glu Asp Leu Leu Gln Ser Pro Ser Ala Val Ser Thr 
    210                  215                220                
Pro Pro Pro Leu Pro Lys Pro Ala Leu Ala Gln Ser Gln Glu Ala Ser 
225                  230                235                  240
Arg Pro Asn Ser Pro Gln Leu Thr Pro Thr Ala Val Pro Gly Ser Ala 
                245                  250                255    
Glu Val Gln Gly Met Ala Gly Pro Ala Ala Glu Ile Thr Val Ser Gly 
            260                  265                270        
Gly Pro Gly Thr Asp Ser Lys Asp Ser Gly Glu Leu Ser Ser Leu Pro 
        275                  280                285            
Leu Gly Pro Thr Thr Leu Asp Thr Arg Pro Leu Gln Ser Ser Ala 
    290                  295                300            

<210> 64
<211> 648
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..648
<223> /mol_type="DNA"
      /note="RPL14 (CCDS nucleotide sequence of RPL14 (Gene ID: 9045))"
      /organism="Homo sapiens"

<400> 64
atggtgttca ggcgcttcgt ggaggttggc cgggtggcct atgtctcctt tggacctcat      60

gccggaaaat tggtcgcgat tgtagatgtt attgatcaga acagggcttt ggtcgatgga     120

ccttgcactc aagtgaggag acaggccatg cctttcaagt gcatgcagct cactgatttc     180

atcctcaagt ttccgcacag tgcccaccag aagtatgtcc gacaagcctg gcagaaggca     240

gacatcaata caaaatgggc agccacacga tgggccaaga agattgaagc cagagaaagg     300

aaagccaaga tgacagattt tgatcgtttt aaagttatga aggcaaagaa aatgaggaac     360

agaataatca agaatgaagt taagaagctt caaaaggcag ctctcctgaa agcttctccc     420

aaaaaagcac ctggtactaa gggtactgct gctgctgctg ctgctgctgc tgctgctaaa     480

gttccagcaa aaaagatcac cgccgcgagt aaaaaggctc cagcccagaa ggttcctgcc     540

cagaaagcca caggccagaa agcagcgcct gctccaaaag ctcagaaggg tcaaaaagct     600

ccagcccaga aagcacctgc tccaaaggca tctggcaaga aagcataa                  648


<210> 65
<211> 215
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..215
<223> /mol_type="protein"
      /note="RPL14 (full-length protein)"
      /organism="Homo sapiens"

<400> 65
Met Val Phe Arg Arg Phe Val Glu Val Gly Arg Val Ala Tyr Val Ser 
1               5                   10                   15    
Phe Gly Pro His Ala Gly Lys Leu Val Ala Ile Val Asp Val Ile Asp 
            20                   25                  30        
Gln Asn Arg Ala Leu Val Asp Gly Pro Cys Thr Gln Val Arg Arg Gln 
        35                   40                  45            
Ala Met Pro Phe Lys Cys Met Gln Leu Thr Asp Phe Ile Leu Lys Phe 
    50                   55                  60                
Pro His Ser Ala His Gln Lys Tyr Val Arg Gln Ala Trp Gln Lys Ala 
65                   70                  75                  80
Asp Ile Asn Thr Lys Trp Ala Ala Thr Arg Trp Ala Lys Lys Ile Glu 
                85                   90                  95    
Ala Arg Glu Arg Lys Ala Lys Met Thr Asp Phe Asp Arg Phe Lys Val 
            100                  105                110        
Met Lys Ala Lys Lys Met Arg Asn Arg Ile Ile Lys Asn Glu Val Lys 
        115                  120                125            
Lys Leu Gln Lys Ala Ala Leu Leu Lys Ala Ser Pro Lys Lys Ala Pro 
    130                  135                140                
Gly Thr Lys Gly Thr Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Lys 
145                  150                155                  160
Val Pro Ala Lys Lys Ile Thr Ala Ala Ser Lys Lys Ala Pro Ala Gln 
                165                  170                175    
Lys Val Pro Ala Gln Lys Ala Thr Gly Gln Lys Ala Ala Pro Ala Pro 
            180                  185                190        
Lys Ala Gln Lys Gly Gln Lys Ala Pro Ala Gln Lys Ala Pro Ala Pro 
        195                  200                205            
Lys Ala Ser Gly Lys Lys Ala 
    210                  215

<210> 66
<211> 180
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..180
<223> /mol_type="DNA"
      /note="RPL14 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 66
gctgctgcta aagttccagc aaaaaagatc accgccgcga gtaaaaaggc tccagcccag     60

aaggttcctg cccagaaagc cacaggccag aaagcagcgc ctgctccaaa agctcagaag    120

ggtcaaaaag ctccagccca gaaagcacct gctccaaagg catctggcaa gaaagcataa    180


<210> 67
<211> 59
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..59
<223> /mol_type="protein"
      /note="RPL14 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 67
Ala Ala Ala Lys Val Pro Ala Lys Lys Ile Thr Ala Ala Ser Lys Lys 
1               5                   10                   15    
Ala Pro Ala Gln Lys Val Pro Ala Gln Lys Ala Thr Gly Gln Lys Ala 
            20                   25                  30        
Ala Pro Ala Pro Lys Ala Gln Lys Gly Gln Lys Ala Pro Ala Gln Lys 
        35                   40                  45            
Ala Pro Ala Pro Lys Ala Ser Gly Lys Lys Ala 
    50                   55                

<210> 68
<211> 1229
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1229
<223> /mol_type="DNA"
      /note="E2F4-RPL14 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 68
atggcggagg ccgggccaca ggcgccgccg cccccgggca ctccaagccg gcacgaaaag      60

agcctgggac tgctcaccac caagttcgtg tcccttctgc aggaggccaa ggacggcgtg     120

cttgacctca agctggcagc tgacacccta gctgtacgcc agaagcggcg gatttacgac     180

attaccaatg ttttggaagg tatcgggcta atcgagaaaa agtccaagaa cagcatccag     240

tggaagggtg tggggcctgg ctgcaatacc cgggagattg ctgacaaact gattgagctc     300

aaggcagaga tcgaggagct gcagcagcgg gagcaagaac tagaccagca caaggtgtgg     360

gtgcagcaga gcatccggaa cgtcacagag gacgtgcaga acagctgttt ggcctacgtc     420

actcatgagg acatctgcag atgctttgct ggagataccc tcttggccat ccgggcccca     480

tcaggcacca gcctggaggt gcccatccca gagggtctca atgggcagaa gaagtaccag     540

attcacctga agagtgtgag tggtcccatt gaggttctgc tggtgaacaa ggaggcatgg     600

agctcacccc ctgtggctgt gcctgtgcca ccacctgaag atttgctcca gagcccatct     660

gctgtttcta cacctccacc tctgcccaag cctgccctag cccagtccca ggaagcctca     720

cgtccaaata gtcctcagct cactcccact gctgtccctg gcagtgcaga agtccaggga     780

atggctggcc cagcagctga gatcacagtg agtggcggcc ctgggactga tagcaaggac     840

agtggtgagc tcagttcact cccactgggc ccaacaacac tggacacccg gccactgcag     900

tcttctgccc tgctgctgct aaagttccag caaaaaagat caccgccgcg agtaaaaagg     960

ctccagccca gaaggttcct gcccagaaag ccacaggcca gaaagcagcg cctgctccaa    1020

aagctcagaa gggtcaaaaa gctccagccc agaaagcacc tgctccaaag gcatctggca    1080

agaaagcata agtggcaatc ataaaaagta ataaaggttc tttttgacct gttgacaaat    1140

gtatttaagc ctttggattt aaagcctgtt gaggctggag ttaggaggca gattgatagt    1200

aggattataa taaacattaa ataatcagt                                      1229


<210> 69
<211> 367
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..367
<223> /mol_type="protein"
      /note="E2F4-RPL14 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 69
Met Ala Glu Ala Gly Pro Gln Ala Pro Pro Pro Pro Gly Thr Pro Ser 
1               5                   10                   15    
Arg His Glu Lys Ser Leu Gly Leu Leu Thr Thr Lys Phe Val Ser Leu 
            20                   25                  30        
Leu Gln Glu Ala Lys Asp Gly Val Leu Asp Leu Lys Leu Ala Ala Asp 
        35                   40                  45            
Thr Leu Ala Val Arg Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    50                   55                  60                
Leu Glu Gly Ile Gly Leu Ile Glu Lys Lys Ser Lys Asn Ser Ile Gln 
65                   70                  75                  80
Trp Lys Gly Val Gly Pro Gly Cys Asn Thr Arg Glu Ile Ala Asp Lys 
                85                   90                  95    
Leu Ile Glu Leu Lys Ala Glu Ile Glu Glu Leu Gln Gln Arg Glu Gln 
            100                  105                110        
Glu Leu Asp Gln His Lys Val Trp Val Gln Gln Ser Ile Arg Asn Val 
        115                  120                125            
Thr Glu Asp Val Gln Asn Ser Cys Leu Ala Tyr Val Thr His Glu Asp 
    130                  135                140                
Ile Cys Arg Cys Phe Ala Gly Asp Thr Leu Leu Ala Ile Arg Ala Pro 
145                  150                155                  160
Ser Gly Thr Ser Leu Glu Val Pro Ile Pro Glu Gly Leu Asn Gly Gln 
                165                  170                175    
Lys Lys Tyr Gln Ile His Leu Lys Ser Val Ser Gly Pro Ile Glu Val 
            180                  185                190        
Leu Leu Val Asn Lys Glu Ala Trp Ser Ser Pro Pro Val Ala Val Pro 
        195                  200                205            
Val Pro Pro Pro Glu Asp Leu Leu Gln Ser Pro Ser Ala Val Ser Thr 
    210                  215                220                
Pro Pro Pro Leu Pro Lys Pro Ala Leu Ala Gln Ser Gln Glu Ala Ser 
225                  230                235                  240
Arg Pro Asn Ser Pro Gln Leu Thr Pro Thr Ala Val Pro Gly Ser Ala 
                245                  250                255    
Glu Val Gln Gly Met Ala Gly Pro Ala Ala Glu Ile Thr Val Ser Gly 
            260                  265                270        
Gly Pro Gly Thr Asp Ser Lys Asp Ser Gly Glu Leu Ser Ser Leu Pro 
        275                  280                285            
Leu Gly Pro Thr Thr Leu Asp Thr Arg Pro Leu Gln Ser Ser Ala Leu 
    290                  295                300                
Leu Leu Leu Lys Phe Gln Gln Lys Arg Ser Pro Pro Arg Val Lys Arg 
305                  310                315                  320
Leu Gln Pro Arg Arg Phe Leu Pro Arg Lys Pro Gln Ala Arg Lys Gln 
                325                  330                335    
Arg Leu Leu Gln Lys Leu Arg Arg Val Lys Lys Leu Gln Pro Arg Lys 
            340                  345                350        
His Leu Leu Gln Arg His Leu Ala Arg Lys His Lys Trp Gln Ser 
        355                  360                365        

<210> 70
<211> 1761
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1761
<223> /mol_type="DNA"
      /note="CHEK2 (CCDS nucleotide sequence of CHEK2 (Gene ID: 11200))"
      /organism="Homo sapiens"

<400> 70
atgtctcggg agtcggatgt tgaggctcag cagtctcatg gcagcagtgc ctgttcacag      60

ccccatggca gcgttaccca gtcccaaggc tcctcctcac agtcccaggg catatccagc     120

tcctctacca gcacgatgcc aaactccagc cagtcctctc actccagctc tgggacactg     180

agctccttag agacagtgtc cactcaggaa ctctattcta ttcctgagga ccaagaacct     240

gaggaccaag aacctgagga gcctacccct gccccctggg ctcgattatg ggcccttcag     300

gatggatttg ccaatcttga gacagagtct ggccatgtta cccaatctga tcttgaactc     360

ctgctgtcat ctgatcctcc tgcctcagcc tcccaaagtg ctgggataag aggtgtgagg     420

caccatcccc ggccagtttg cagtctaaaa tgtgtgaatg acaactactg gtttgggagg     480

gacaaaagct gtgaatattg ctttgatgaa ccactgctga aaagaacaga taaataccga     540

acatacagca agaaacactt tcggattttc agggaagtgg gtcctaaaaa ctcttacatt     600

gcatacatag aagatcacag tggcaatgga acctttgtaa atacagagct tgtagggaaa     660

ggaaaacgcc gtcctttgaa taacaattct gaaattgcac tgtcactaag cagaaataaa     720

gtttttgtct tttttgatct gactgtagat gatcagtcag tttatcctaa ggcattaaga     780

gatgaataca tcatgtcaaa aactcttgga agtggtgcct gtggagaggt aaagctggct     840

ttcgagagga aaacatgtaa gaaagtagcc ataaagatca tcagcaaaag gaagtttgct     900

attggttcag caagagaggc agacccagct ctcaatgttg aaacagaaat agaaattttg     960

aaaaagctaa atcatccttg catcatcaag attaaaaact tttttgatgc agaagattat    1020

tatattgttt tggaattgat ggaaggggga gagctgtttg acaaagtggt ggggaataaa    1080

cgcctgaaag aagctacctg caagctctat ttttaccaga tgctcttggc tgtgcagtac    1140

cttcatgaaa acggtattat acaccgtgac ttaaagccag agaatgtttt actgtcatct    1200

caagaagagg actgtcttat aaagattact gattttgggc actccaagat tttgggagag    1260

acctctctca tgagaacctt atgtggaacc cccacctact tggcgcctga agttcttgtt    1320

tctgttggga ctgctgggta taaccgtgct gtggactgct ggagtttagg agttattctt    1380

tttatctgcc ttagtgggta tccacctttc tctgagcata ggactcaagt gtcactgaag    1440

gatcagatca ccagtggaaa atacaacttc attcctgaag tctgggcaga agtctcagag    1500

aaagctctgg accttgtcaa gaagttgttg gtagtggatc caaaggcacg ttttacgaca    1560

gaagaagcct taagacaccc gtggcttcag gatgaagaca tgaagagaaa gtttcaagat    1620

cttctgtctg aggaaaatga atccacagct ctaccccagg ttctagccca gccttctact    1680

agtcgaaagc ggccccgtga aggggaagcc gagggtgccg agaccacaaa gcgcccagct    1740

gtgtgtgctg ctgtgttgtg a                                              1761


<210> 71
<211> 586
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..586
<223> /mol_type="protein"
      /note="CHEK2 (full-length protein)"
      /organism="Homo sapiens"

<400> 71
Met Ser Arg Glu Ser Asp Val Glu Ala Gln Gln Ser His Gly Ser Ser 
1               5                   10                   15    
Ala Cys Ser Gln Pro His Gly Ser Val Thr Gln Ser Gln Gly Ser Ser 
            20                   25                  30        
Ser Gln Ser Gln Gly Ile Ser Ser Ser Ser Thr Ser Thr Met Pro Asn 
        35                   40                  45            
Ser Ser Gln Ser Ser His Ser Ser Ser Gly Thr Leu Ser Ser Leu Glu 
    50                   55                  60                
Thr Val Ser Thr Gln Glu Leu Tyr Ser Ile Pro Glu Asp Gln Glu Pro 
65                   70                  75                  80
Glu Asp Gln Glu Pro Glu Glu Pro Thr Pro Ala Pro Trp Ala Arg Leu 
                85                   90                  95    
Trp Ala Leu Gln Asp Gly Phe Ala Asn Leu Glu Thr Glu Ser Gly His 
            100                  105                110        
Val Thr Gln Ser Asp Leu Glu Leu Leu Leu Ser Ser Asp Pro Pro Ala 
        115                  120                125            
Ser Ala Ser Gln Ser Ala Gly Ile Arg Gly Val Arg His His Pro Arg 
    130                  135                140                
Pro Val Cys Ser Leu Lys Cys Val Asn Asp Asn Tyr Trp Phe Gly Arg 
145                  150                155                  160
Asp Lys Ser Cys Glu Tyr Cys Phe Asp Glu Pro Leu Leu Lys Arg Thr 
                165                  170                175    
Asp Lys Tyr Arg Thr Tyr Ser Lys Lys His Phe Arg Ile Phe Arg Glu 
            180                  185                190        
Val Gly Pro Lys Asn Ser Tyr Ile Ala Tyr Ile Glu Asp His Ser Gly 
        195                  200                205            
Asn Gly Thr Phe Val Asn Thr Glu Leu Val Gly Lys Gly Lys Arg Arg 
    210                  215                220                
Pro Leu Asn Asn Asn Ser Glu Ile Ala Leu Ser Leu Ser Arg Asn Lys 
225                  230                235                  240
Val Phe Val Phe Phe Asp Leu Thr Val Asp Asp Gln Ser Val Tyr Pro 
                245                  250                255    
Lys Ala Leu Arg Asp Glu Tyr Ile Met Ser Lys Thr Leu Gly Ser Gly 
            260                  265                270        
Ala Cys Gly Glu Val Lys Leu Ala Phe Glu Arg Lys Thr Cys Lys Lys 
        275                  280                285            
Val Ala Ile Lys Ile Ile Ser Lys Arg Lys Phe Ala Ile Gly Ser Ala 
    290                  295                300                
Arg Glu Ala Asp Pro Ala Leu Asn Val Glu Thr Glu Ile Glu Ile Leu 
305                  310                315                  320
Lys Lys Leu Asn His Pro Cys Ile Ile Lys Ile Lys Asn Phe Phe Asp 
                325                  330                335    
Ala Glu Asp Tyr Tyr Ile Val Leu Glu Leu Met Glu Gly Gly Glu Leu 
            340                  345                350        
Phe Asp Lys Val Val Gly Asn Lys Arg Leu Lys Glu Ala Thr Cys Lys 
        355                  360                365            
Leu Tyr Phe Tyr Gln Met Leu Leu Ala Val Gln Tyr Leu His Glu Asn 
    370                  375                380                
Gly Ile Ile His Arg Asp Leu Lys Pro Glu Asn Val Leu Leu Ser Ser 
385                  390                395                  400
Gln Glu Glu Asp Cys Leu Ile Lys Ile Thr Asp Phe Gly His Ser Lys 
                405                  410                415    
Ile Leu Gly Glu Thr Ser Leu Met Arg Thr Leu Cys Gly Thr Pro Thr 
            420                  425                430        
Tyr Leu Ala Pro Glu Val Leu Val Ser Val Gly Thr Ala Gly Tyr Asn 
        435                  440                445            
Arg Ala Val Asp Cys Trp Ser Leu Gly Val Ile Leu Phe Ile Cys Leu 
    450                  455                460                
Ser Gly Tyr Pro Pro Phe Ser Glu His Arg Thr Gln Val Ser Leu Lys 
465                  470                475                  480
Asp Gln Ile Thr Ser Gly Lys Tyr Asn Phe Ile Pro Glu Val Trp Ala 
                485                  490                495    
Glu Val Ser Glu Lys Ala Leu Asp Leu Val Lys Lys Leu Leu Val Val 
            500                  505                510        
Asp Pro Lys Ala Arg Phe Thr Thr Glu Glu Ala Leu Arg His Pro Trp 
        515                  520                525            
Leu Gln Asp Glu Asp Met Lys Arg Lys Phe Gln Asp Leu Leu Ser Glu 
    530                  535                540                
Glu Asn Glu Ser Thr Ala Leu Pro Gln Val Leu Ala Gln Pro Ser Thr 
545                  550                555                  560
Ser Arg Lys Arg Pro Arg Glu Gly Glu Ala Glu Gly Ala Glu Thr Thr 
                565                  570                575    
Lys Arg Pro Ala Val Cys Ala Ala Val Leu 
            580                  585    

<210> 72
<211> 319
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..319
<223> /mol_type="DNA"
      /note="CHEK2 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 72
atgtctcggg agtcggatgt tgaggctcag cagtctcatg gcagcagtgc ctgttcacag      60

ccccatggca gcgttaccca gtcccaaggc tcctcctcac agtcccaggg catatccagc     120

tcctctacca gcacgatgcc aaactccagc cagtcctctc actccagctc tgggacactg     180

agctccttag agacagtgtc cactcaggaa ctctattcta ttcctgagga ccaagaacct     240

gaggaccaag aacctgagga gcctacccct gccccctggg ctcgattatg ggcccttcag     300

gatggatttg ccaatcttg                                                  319


<210> 73
<211> 106
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..106
<223> /mol_type="protein"
      /note="CHEK2 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 73
Met Ser Arg Glu Ser Asp Val Glu Ala Gln Gln Ser His Gly Ser Ser 
1               5                   10                   15    
Ala Cys Ser Gln Pro His Gly Ser Val Thr Gln Ser Gln Gly Ser Ser 
            20                   25                  30        
Ser Gln Ser Gln Gly Ile Ser Ser Ser Ser Thr Ser Thr Met Pro Asn 
        35                   40                  45            
Ser Ser Gln Ser Ser His Ser Ser Ser Gly Thr Leu Ser Ser Leu Glu 
    50                   55                  60                
Thr Val Ser Thr Gln Glu Leu Tyr Ser Ile Pro Glu Asp Gln Glu Pro 
65                   70                  75                  80
Glu Asp Gln Glu Pro Glu Glu Pro Thr Pro Ala Pro Trp Ala Arg Leu 
                85                   90                  95    
Trp Ala Leu Gln Asp Gly Phe Ala Asn Leu 
            100                  105    

<210> 74
<211> 2052
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2052
<223> /mol_type="DNA"
      /note="THOC5 (CCDS nucleotide sequence of THOC5 (Gene ID: 8563))"
      /organism="Homo sapiens"

<400> 74
atgtcatcag aatcgagcaa aaaacggaag cccaaagtga tccgaagcga tggagcccca      60

gctgaaggaa agcggaatcg atctgacacc gagcaggaag gtaaatacta cagtgaggag     120

gccgaggtgg atctgcggga ccctggcaga gactatgagt tatacaagta cacctgccag     180

gagctacaga ggctgatggc tgagatccaa gacctgaaga gcaggggtgg caaggatgtg     240

gcaatagaaa tagaagaacg gaggatccag agctgtgtgc atttcatgac tctaaagaag     300

cttaaccgat tagcccacat caggttgaag aaaggaagag atcagaccca cgaggctaag     360

cagaaagtag atgcctatca tctgcagctc cagaacctgt tgtatgaggt gatgcaccta     420

cagaaggaga tcaccaaatg tttggagttt aagtcaaagc atgaagaaat tgatctggtc     480

agtttagagg agttttataa ggaggctcca ccagatatca gcaaggccga agtcaccatg     540

ggagaccctc accagcaaac actggcacgt ctggactggg agctggagca gcggaaaagg     600

ctggcagaga agtaccgaga gtgcctatct aacaaggaga agattctcaa ggagattgag     660

gtgaagaagg agtacctgag cagcctccag ccccgcctca acagcatcat gcaggcttcc     720

cttccggtgc aggagtacct gtttatgcca ttcgaccagg ctcacaagca gtatgagaca     780

gccagacacc tgccgcctcc cctctatgtc ctctttgttc aggccactgc gtatgggcag     840

gcctgtgata agacgttatc tgtggcaatc gaaggcagtg tggatgaagc caaggctctg     900

ttcaaacctc cagaggactc ccaagatgac gagagtgact cagatgccga ggaggagcag     960

actacgaagc gccggagacc cacactgggg gttcagttgg acgacaaacg caaggagatg    1020

ctgaagaggc acccactgtc tgtcatgctc gacctgaagt gcaaagatga cagtgtgctt    1080

cacctgactt tctactacct catgaacctc aacatcatga cagtaaaagc caaagtgaca    1140

actgccatgg agctgatcac ccccatcagt gcaggtgact tgctgtctcc tgactcagtc    1200

ctgagttgct tgtatcctgg ggatcatgga aagaaaactc cgaatccagc caatcagtat    1260

cagtttgata aagttggcat cctgactttg agcgactatg tacttgagct aggtcacccc    1320

tatttgtggg tgcagaagct gggtggcctc cacttcccca aagagcagcc ccagcaaaca    1380

gtgattgctg accactcgct gagcgccagc cacatggaga ccaccatgaa acttctgaag    1440

accagggtgc agtcccgcct ggccctccac aaacagtttg catccctaga acatggcatt    1500

gtgccagtta ccagtgattg ccagtacctc ttccctgcca aggttgtctc tcgcctggtg    1560

aaatgggtga cagttgccca tgaggattac atggagctgc acttcaccaa agacattgtg    1620

gatgcgggac tggctgggga caccaatctc tactacatgg cgctcatcga aaggggcaca    1680

gccaaactgc aggccgctgt ggtgttgaac cctggctact cctccatccc acctgttttc    1740

cagctctgtt tgaactggaa aggggagaaa accaacagca acgatgacaa cattcgggcc    1800

atggagggcg aagtcaatgt gtgctacaag gagctgtgtg gcccttggcc cagccaccag    1860

ctgttgacca accagctgca gcggctgtgt gtgctgctgg atgtttacct ggagaccgag    1920

agccatgacg acagtgtgga ggggcccaag gaatttcccc aggagaagat gtgtctgcgg    1980

ctcttcaggg gtcctagcag gatgaagcca tttaaataca accatcctca gggattcttc    2040

agccatcgct ga                                                        2052


<210> 75
<211> 683
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..683
<223> /mol_type="protein"
      /note="THOC5 (full-length protein)"
      /organism="Homo sapiens"

<400> 75
Met Ser Ser Glu Ser Ser Lys Lys Arg Lys Pro Lys Val Ile Arg Ser 
1               5                   10                   15    
Asp Gly Ala Pro Ala Glu Gly Lys Arg Asn Arg Ser Asp Thr Glu Gln 
            20                   25                  30        
Glu Gly Lys Tyr Tyr Ser Glu Glu Ala Glu Val Asp Leu Arg Asp Pro 
        35                   40                  45            
Gly Arg Asp Tyr Glu Leu Tyr Lys Tyr Thr Cys Gln Glu Leu Gln Arg 
    50                   55                  60                
Leu Met Ala Glu Ile Gln Asp Leu Lys Ser Arg Gly Gly Lys Asp Val 
65                   70                  75                  80
Ala Ile Glu Ile Glu Glu Arg Arg Ile Gln Ser Cys Val His Phe Met 
                85                   90                  95    
Thr Leu Lys Lys Leu Asn Arg Leu Ala His Ile Arg Leu Lys Lys Gly 
            100                  105                110        
Arg Asp Gln Thr His Glu Ala Lys Gln Lys Val Asp Ala Tyr His Leu 
        115                  120                125            
Gln Leu Gln Asn Leu Leu Tyr Glu Val Met His Leu Gln Lys Glu Ile 
    130                  135                140                
Thr Lys Cys Leu Glu Phe Lys Ser Lys His Glu Glu Ile Asp Leu Val 
145                  150                155                  160
Ser Leu Glu Glu Phe Tyr Lys Glu Ala Pro Pro Asp Ile Ser Lys Ala 
                165                  170                175    
Glu Val Thr Met Gly Asp Pro His Gln Gln Thr Leu Ala Arg Leu Asp 
            180                  185                190        
Trp Glu Leu Glu Gln Arg Lys Arg Leu Ala Glu Lys Tyr Arg Glu Cys 
        195                  200                205            
Leu Ser Asn Lys Glu Lys Ile Leu Lys Glu Ile Glu Val Lys Lys Glu 
    210                  215                220                
Tyr Leu Ser Ser Leu Gln Pro Arg Leu Asn Ser Ile Met Gln Ala Ser 
225                  230                235                  240
Leu Pro Val Gln Glu Tyr Leu Phe Met Pro Phe Asp Gln Ala His Lys 
                245                  250                255    
Gln Tyr Glu Thr Ala Arg His Leu Pro Pro Pro Leu Tyr Val Leu Phe 
            260                  265                270        
Val Gln Ala Thr Ala Tyr Gly Gln Ala Cys Asp Lys Thr Leu Ser Val 
        275                  280                285            
Ala Ile Glu Gly Ser Val Asp Glu Ala Lys Ala Leu Phe Lys Pro Pro 
    290                  295                300                
Glu Asp Ser Gln Asp Asp Glu Ser Asp Ser Asp Ala Glu Glu Glu Gln 
305                  310                315                  320
Thr Thr Lys Arg Arg Arg Pro Thr Leu Gly Val Gln Leu Asp Asp Lys 
                325                  330                335    
Arg Lys Glu Met Leu Lys Arg His Pro Leu Ser Val Met Leu Asp Leu 
            340                  345                350        
Lys Cys Lys Asp Asp Ser Val Leu His Leu Thr Phe Tyr Tyr Leu Met 
        355                  360                365            
Asn Leu Asn Ile Met Thr Val Lys Ala Lys Val Thr Thr Ala Met Glu 
    370                  375                380                
Leu Ile Thr Pro Ile Ser Ala Gly Asp Leu Leu Ser Pro Asp Ser Val 
385                  390                395                  400
Leu Ser Cys Leu Tyr Pro Gly Asp His Gly Lys Lys Thr Pro Asn Pro 
                405                  410                415    
Ala Asn Gln Tyr Gln Phe Asp Lys Val Gly Ile Leu Thr Leu Ser Asp 
            420                  425                430        
Tyr Val Leu Glu Leu Gly His Pro Tyr Leu Trp Val Gln Lys Leu Gly 
        435                  440                445            
Gly Leu His Phe Pro Lys Glu Gln Pro Gln Gln Thr Val Ile Ala Asp 
    450                  455                460                
His Ser Leu Ser Ala Ser His Met Glu Thr Thr Met Lys Leu Leu Lys 
465                  470                475                  480
Thr Arg Val Gln Ser Arg Leu Ala Leu His Lys Gln Phe Ala Ser Leu 
                485                  490                495    
Glu His Gly Ile Val Pro Val Thr Ser Asp Cys Gln Tyr Leu Phe Pro 
            500                  505                510        
Ala Lys Val Val Ser Arg Leu Val Lys Trp Val Thr Val Ala His Glu 
        515                  520                525            
Asp Tyr Met Glu Leu His Phe Thr Lys Asp Ile Val Asp Ala Gly Leu 
    530                  535                540                
Ala Gly Asp Thr Asn Leu Tyr Tyr Met Ala Leu Ile Glu Arg Gly Thr 
545                  550                555                  560
Ala Lys Leu Gln Ala Ala Val Val Leu Asn Pro Gly Tyr Ser Ser Ile 
                565                  570                575    
Pro Pro Val Phe Gln Leu Cys Leu Asn Trp Lys Gly Glu Lys Thr Asn 
            580                  585                590        
Ser Asn Asp Asp Asn Ile Arg Ala Met Glu Gly Glu Val Asn Val Cys 
        595                  600                605            
Tyr Lys Glu Leu Cys Gly Pro Trp Pro Ser His Gln Leu Leu Thr Asn 
    610                  615                620                
Gln Leu Gln Arg Leu Cys Val Leu Leu Asp Val Tyr Leu Glu Thr Glu 
625                  630                635                  640
Ser His Asp Asp Ser Val Glu Gly Pro Lys Glu Phe Pro Gln Glu Lys 
                645                  650                655    
Met Cys Leu Arg Leu Phe Arg Gly Pro Ser Arg Met Lys Pro Phe Lys 
            660                  665                670        
Tyr Asn His Pro Gln Gly Phe Phe Ser His Arg 
        675                  680            

<210> 76
<211> 64
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..64
<223> /mol_type="DNA"
      /note="THOC5 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 76
gggtcctagc aggatgaagc catttaaata caaccatcct cagggattct tcagccatcg     60

ctga                                                                  64


<210> 77
<211> 383
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..383
<223> /mol_type="DNA"
      /note="CHEK2-THOC5 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 77
atgtctcggg agtcggatgt tgaggctcag cagtctcatg gcagcagtgc ctgttcacag      60

ccccatggca gcgttaccca gtcccaaggc tcctcctcac agtcccaggg catatccagc     120

tcctctacca gcacgatgcc aaactccagc cagtcctctc actccagctc tgggacactg     180

agctccttag agacagtgtc cactcaggaa ctctattcta ttcctgagga ccaagaacct     240

gaggaccaag aacctgagga gcctacccct gccccctggg ctcgattatg ggcccttcag     300

gatggatttg ccaatcttgg ggtcctagca ggatgaagcc atttaaatac aaccatcctc     360

agggattctt cagccatcgc tga                                             383


<210> 78
<211> 111
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..111
<223> /mol_type="protein"
      /note="CHEK2-THOC5 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 78
Met Ser Arg Glu Ser Asp Val Glu Ala Gln Gln Ser His Gly Ser Ser 
1               5                   10                   15    
Ala Cys Ser Gln Pro His Gly Ser Val Thr Gln Ser Gln Gly Ser Ser 
            20                   25                  30        
Ser Gln Ser Gln Gly Ile Ser Ser Ser Ser Thr Ser Thr Met Pro Asn 
        35                   40                  45            
Ser Ser Gln Ser Ser His Ser Ser Ser Gly Thr Leu Ser Ser Leu Glu 
    50                   55                  60                
Thr Val Ser Thr Gln Glu Leu Tyr Ser Ile Pro Glu Asp Gln Glu Pro 
65                   70                  75                  80
Glu Asp Gln Glu Pro Glu Glu Pro Thr Pro Ala Pro Trp Ala Arg Leu 
                85                   90                  95    
Trp Ala Leu Gln Asp Gly Phe Ala Asn Leu Gly Val Leu Ala Gly 
            100                  105                110    

<210> 79
<211> 834
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..834
<223> /mol_type="DNA"
      /note="M6PR (CCDS nucleotide sequence of M6PR (Gene ID: 4074))"
      /organism="Homo sapiens"

<400> 79
atgttccctt tctacagctg ctggaggact ggactgctac tactactcct ggctgtggca      60

gtgagagaat cctggcagac agaagaaaaa acttgcgact tggtaggaga aaagggtaaa     120

gagtcagaga aagagttggc tctagtgaag aggctgaaac cactgtttaa taaaagcttt     180

gagagcactg tgggccaggg ttcagacaca tacatctaca tcttcagggt gtgccgggaa     240

gctggcaacc acacttctgg ggcaggcctg gtgcaaatca acaaaagtaa tgggaaggag     300

acagtggtag ggagactcaa cgagactcac atcttcaacg gaagtaattg gatcatgctg     360

atctataaag ggggtgatga atatgacaac cactgtggca aggagcagcg tcgtgcagtg     420

gtgatgatct cctgcaatcg acacacccta gcggacaatt ttaaccctgt gtctgaggag     480

cgtggcaaag tccaagattg tttctacctc tttgagatgg atagcagcct ggcctgttca     540

ccagagatct cccacctcag tgtgggttcc atcttacttg tcacgtttgc atcactggtt     600

gctgtttatg ttgttggggg gttcctatac cagcgactgg tagtgggagc caaaggaatg     660

gagcagtttc cccacttagc cttctggcag gatcttggca acctggtagc agatggctgt     720

gactttgtct gccgttctaa acctcgaaat gtgcctgcag catatcgtgg tgtgggggat     780

gaccagctgg gggaggagtc agaagaaagg gatgaccatt tattaccaat gtag           834


<210> 80
<211> 277
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..277
<223> /mol_type="protein"
      /note="M6PR (full-length protein)"
      /organism="Homo sapiens"

<400> 80
Met Phe Pro Phe Tyr Ser Cys Trp Arg Thr Gly Leu Leu Leu Leu Leu 
1               5                   10                   15    
Leu Ala Val Ala Val Arg Glu Ser Trp Gln Thr Glu Glu Lys Thr Cys 
            20                   25                  30        
Asp Leu Val Gly Glu Lys Gly Lys Glu Ser Glu Lys Glu Leu Ala Leu 
        35                   40                  45            
Val Lys Arg Leu Lys Pro Leu Phe Asn Lys Ser Phe Glu Ser Thr Val 
    50                   55                  60                
Gly Gln Gly Ser Asp Thr Tyr Ile Tyr Ile Phe Arg Val Cys Arg Glu 
65                   70                  75                  80
Ala Gly Asn His Thr Ser Gly Ala Gly Leu Val Gln Ile Asn Lys Ser 
                85                   90                  95    
Asn Gly Lys Glu Thr Val Val Gly Arg Leu Asn Glu Thr His Ile Phe 
            100                  105                110        
Asn Gly Ser Asn Trp Ile Met Leu Ile Tyr Lys Gly Gly Asp Glu Tyr 
        115                  120                125            
Asp Asn His Cys Gly Lys Glu Gln Arg Arg Ala Val Val Met Ile Ser 
    130                  135                140                
Cys Asn Arg His Thr Leu Ala Asp Asn Phe Asn Pro Val Ser Glu Glu 
145                  150                155                  160
Arg Gly Lys Val Gln Asp Cys Phe Tyr Leu Phe Glu Met Asp Ser Ser 
                165                  170                175    
Leu Ala Cys Ser Pro Glu Ile Ser His Leu Ser Val Gly Ser Ile Leu 
            180                  185                190        
Leu Val Thr Phe Ala Ser Leu Val Ala Val Tyr Val Val Gly Gly Phe 
        195                  200                205            
Leu Tyr Gln Arg Leu Val Val Gly Ala Lys Gly Met Glu Gln Phe Pro 
    210                  215                220                
His Leu Ala Phe Trp Gln Asp Leu Gly Asn Leu Val Ala Asp Gly Cys 
225                  230                235                  240
Asp Phe Val Cys Arg Ser Lys Pro Arg Asn Val Pro Ala Ala Tyr Arg 
                245                  250                255    
Gly Val Gly Asp Asp Gln Leu Gly Glu Glu Ser Glu Glu Arg Asp Asp 
            260                  265                270        
His Leu Leu Pro Met 
        275        

<210> 81
<211> 176
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..176
<223> /mol_type="DNA"
      /note="M6PR (preferred gene fragment)"
      /organism="artificial sequences"

<400> 81
atgttccctt tctacagctg ctggaggact ggactgctac tactactcct ggctgtggca     60

gtgagagaat cctggcagac agaagaaaaa acttgcgact tggtaggaga aaagggtaaa    120

gagtcagaga aagagttggc tctagtgaag aggctgaaac cactgtttaa taaaag        176


<210> 82
<211> 58
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..58
<223> /mol_type="protein"
      /note="M6PR (preferred protein fragment)"
      /organism="artificial sequences"

<400> 82
Met Phe Pro Phe Tyr Ser Cys Trp Arg Thr Gly Leu Leu Leu Leu Leu 
1               5                   10                   15    
Leu Ala Val Ala Val Arg Glu Ser Trp Gln Thr Glu Glu Lys Thr Cys 
            20                   25                  30        
Asp Leu Val Gly Glu Lys Gly Lys Glu Ser Glu Lys Glu Leu Ala Leu 
        35                   40                  45            
Val Lys Arg Leu Lys Pro Leu Phe Asn Lys 
    50                   55            

<210> 83
<211> 1377
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1377
<223> /mol_type="DNA"
      /note="CD4 (CCDS nucleotide sequence of CD4 (Gene ID: 920))"
      /organism="Homo sapiens"

<400> 83
atgaaccggg gagtcccttt taggcacttg cttctggtgc tgcaactggc gctcctccca      60

gcagccactc agggaaagaa agtggtgctg ggcaaaaaag gggatacagt ggaactgacc     120

tgtacagctt cccagaagaa gagcatacaa ttccactgga aaaactccaa ccagataaag     180

attctgggaa atcagggctc cttcttaact aaaggtccat ccaagctgaa tgatcgcgct     240

gactcaagaa gaagcctttg ggaccaagga aactttcccc tgatcatcaa gaatcttaag     300

atagaagact cagatactta catctgtgaa gtggaggacc agaaggagga ggtgcaattg     360

ctagtgttcg gattgactgc caactctgac acccacctgc ttcaggggca gagcctgacc     420

ctgaccttgg agagcccccc tggtagtagc ccctcagtgc aatgtaggag tccaaggggt     480

aaaaacatac agggggggaa gaccctctcc gtgtctcagc tggagctcca ggatagtggc     540

acctggacat gcactgtctt gcagaaccag aagaaggtgg agttcaaaat agacatcgtg     600

gtgctagctt tccagaaggc ctccagcata gtctataaga aagaggggga acaggtggag     660

ttctccttcc cactcgcctt tacagttgaa aagctgacgg gcagtggcga gctgtggtgg     720

caggcggaga gggcttcctc ctccaagtct tggatcacct ttgacctgaa gaacaaggaa     780

gtgtctgtaa aacgggttac ccaggaccct aagctccaga tgggcaagaa gctcccgctc     840

cacctcaccc tgccccaggc cttgcctcag tatgctggct ctggaaacct caccctggcc     900

cttgaagcga aaacaggaaa gttgcatcag gaagtgaacc tggtggtgat gagagccact     960

cagctccaga aaaatttgac ctgtgaggtg tggggaccca cctcccctaa gctgatgctg    1020

agtttgaaac tggagaacaa ggaggcaaag gtctcgaagc gggagaaggc ggtgtgggtg    1080

ctgaaccctg aggcggggat gtggcagtgt ctgctgagtg actcgggaca ggtcctgctg    1140

gaatccaaca tcaaggttct gcccacatgg tccaccccgg tgcagccaat ggccctgatt    1200

gtgctggggg gcgtcgccgg cctcctgctt ttcattgggc taggcatctt cttctgtgtc    1260

aggtgccggc accgaaggcg ccaagcagag cggatgtctc agatcaagag actcctcagt    1320

gagaagaaga cctgccagtg tcctcaccgg tttcagaaga catgtagccc catttga       1377


<210> 84
<211> 458
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..458
<223> /mol_type="protein"
      /note="CD4 (full-length protein)"
      /organism="Homo sapiens"

<400> 84
Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gln Leu 
1               5                   10                   15    
Ala Leu Leu Pro Ala Ala Thr Gln Gly Lys Lys Val Val Leu Gly Lys 
            20                   25                  30        
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gln Lys Lys Ser 
        35                   40                  45            
Ile Gln Phe His Trp Lys Asn Ser Asn Gln Ile Lys Ile Leu Gly Asn 
    50                   55                  60                
Gln Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65                   70                  75                  80
Asp Ser Arg Arg Ser Leu Trp Asp Gln Gly Asn Phe Pro Leu Ile Ile 
                85                   90                  95    
Lys Asn Leu Lys Ile Glu Asp Ser Asp Thr Tyr Ile Cys Glu Val Glu 
            100                  105                110        
Asp Gln Lys Glu Glu Val Gln Leu Leu Val Phe Gly Leu Thr Ala Asn 
        115                  120                125            
Ser Asp Thr His Leu Leu Gln Gly Gln Ser Leu Thr Leu Thr Leu Glu 
    130                  135                140                
Ser Pro Pro Gly Ser Ser Pro Ser Val Gln Cys Arg Ser Pro Arg Gly 
145                  150                155                  160
Lys Asn Ile Gln Gly Gly Lys Thr Leu Ser Val Ser Gln Leu Glu Leu 
                165                  170                175    
Gln Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gln Asn Gln Lys Lys 
            180                  185                190        
Val Glu Phe Lys Ile Asp Ile Val Val Leu Ala Phe Gln Lys Ala Ser 
        195                  200                205            
Ser Ile Val Tyr Lys Lys Glu Gly Glu Gln Val Glu Phe Ser Phe Pro 
    210                  215                220                
Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225                  230                235                  240
Gln Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp Ile Thr Phe Asp Leu 
                245                  250                255    
Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gln Asp Pro Lys Leu 
            260                  265                270        
Gln Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gln Ala Leu 
        275                  280                285            
Pro Gln Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
    290                  295                300                
Thr Gly Lys Leu His Gln Glu Val Asn Leu Val Val Met Arg Ala Thr 
305                  310                315                  320
Gln Leu Gln Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
                325                  330                335    
Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
            340                  345                350        
Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
        355                  360                365            
Gln Cys Leu Leu Ser Asp Ser Gly Gln Val Leu Leu Glu Ser Asn Ile 
    370                  375                380                
Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gln Pro Met Ala Leu Ile 
385                  390                395                  400
Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe Ile Gly Leu Gly Ile 
                405                  410                415    
Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gln Ala Glu Arg Met 
            420                  425                430        
Ser Gln Ile Lys Arg Leu Leu Ser Glu Lys Lys Thr Cys Gln Cys Pro 
        435                  440                445            
His Arg Phe Gln Lys Thr Cys Ser Pro Ile 
    450                  455            

<210> 85
<211> 221
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..221
<223> /mol_type="DNA"
      /note="CD4 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 85
ttctgcccac atggtccacc ccggtgcagc caatggccct gattgtgctg gggggcgtcg     60

ccggcctcct gcttttcatt gggctaggca tcttcttctg tgtcaggtgc cggcaccgaa    120

ggcgccaagc agagcggatg tctcagatca agagactcct cagtgagaag aagacctgcc    180

agtgtcctca ccggtttcag aagacatgta gccccatttg a                        221


<210> 86
<211> 397
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..397
<223> /mol_type="DNA"
      /note="M6PR-CD4 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 86
atgttccctt tctacagctg ctggaggact ggactgctac tactactcct ggctgtggca      60

gtgagagaat cctggcagac agaagaaaaa acttgcgact tggtaggaga aaagggtaaa     120

gagtcagaga aagagttggc tctagtgaag aggctgaaac cactgtttaa taaaagttct     180

gcccacatgg tccaccccgg tgcagccaat ggccctgatt gtgctggggg gcgtcgccgg     240

cctcctgctt ttcattgggc taggcatctt cttctgtgtc aggtgccggc accgaaggcg     300

ccaagcagag cggatgtctc agatcaagag actcctcagt gagaagaaga cctgccagtg     360

tcctcaccgg tttcagaaga catgtagccc catttga                              397


<210> 87
<211> 113
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..113
<223> /mol_type="protein"
      /note="M6PR-CD4 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 87
Met Phe Pro Phe Tyr Ser Cys Trp Arg Thr Gly Leu Leu Leu Leu Leu 
1               5                   10                   15    
Leu Ala Val Ala Val Arg Glu Ser Trp Gln Thr Glu Glu Lys Thr Cys 
            20                   25                  30        
Asp Leu Val Gly Glu Lys Gly Lys Glu Ser Glu Lys Glu Leu Ala Leu 
        35                   40                  45            
Val Lys Arg Leu Lys Pro Leu Phe Asn Lys Ser Ser Ala His Met Val 
    50                   55                  60                
His Pro Gly Ala Ala Asn Gly Pro Asp Cys Ala Gly Gly Arg Arg Arg 
65                   70                  75                  80
Pro Pro Ala Phe His Trp Ala Arg His Leu Leu Leu Cys Gln Val Pro 
                85                   90                  95    
Ala Pro Lys Ala Pro Ser Arg Ala Asp Val Ser Asp Gln Glu Thr Pro 
            100                  105                110        
Gln 
    

<210> 88
<211> 1065
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1065
<223> /mol_type="DNA"
      /note="ZBP1 (CCDS nucleotide sequence of ZBP1 (Gene ID: 81030))"
      /organism="Homo sapiens"

<400> 88
atggcccagg ctcctgctga cccgggcaga gaagccgaga ggccccagca acatgcagct      60

acaattccag agacccctgg ccctcagttc agccaacaac gggaggaaga catctacagg     120

tttctcaaag acaatggtcc ccagagggcc ctggtcatcg cccaagcact gggaatgagg     180

acagcaaaag atgtgaaccg agacttgtac aggatgaaga gcaggcacct tctggacatg     240

gatgagcagt ccaaagcatg gacgatttac cgcccagaag attctggaag aagagcaaag     300

tcagcctcaa ttatttacca gcacaatcca atcaacatga tctgccagaa tggacccaac     360

agctggattt ccattgcaaa ctccgaagcc atccagattg gacacgggaa catcattaca     420

agacagacag tctccaggga ggacggttcc gccggtccac gccacctccc ttcaatggca     480

ccaggtgatt cctcaacttg ggggacccta gttgatccct gggggcccca ggacatccac     540

atggagcagt ccatactgag acgggtgcag ctgggacaca gcaatgagat gaggctccac     600

ggcgtcccgt ccgagggccc tgcccacatc ccccctggca gccccccagt ctctgccact     660

gctgccggcc cagaagcttc gtttgaagca agaattccca gtccaggaac tcaccctgag     720

ggggaagccg cccagagaat ccacatgaaa tcgtgctttc tcgaggacgc caccatcggc     780

aacagcaaca aaatgtctat cagcccaggg gtggctggcc caggaggagt cgcagggtct     840

ggagaggggg agccagggga ggacgcaggt cgtcgtcccg cagacacaca atccagaagt     900

cactttcctc gagacattgg tcagcccatc actcccagcc actcgaagct cacccccaag     960

ctggaaacta tgactcttgg aaacaggagt cacaaagctg cagaaggcag ccactatgtg    1020

gatgaagcct cacacgaggg gagctggtgg ggaggtggga tttag                    1065


<210> 89
<211> 354
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..354
<223> /mol_type="protein"
      /note="ZBP1 (full-length protein)"
      /organism="Homo sapiens"

<400> 89
Met Ala Gln Ala Pro Ala Asp Pro Gly Arg Glu Ala Glu Arg Pro Gln 
1               5                   10                   15    
Gln His Ala Ala Thr Ile Pro Glu Thr Pro Gly Pro Gln Phe Ser Gln 
            20                   25                  30        
Gln Arg Glu Glu Asp Ile Tyr Arg Phe Leu Lys Asp Asn Gly Pro Gln 
        35                   40                  45            
Arg Ala Leu Val Ile Ala Gln Ala Leu Gly Met Arg Thr Ala Lys Asp 
    50                   55                  60                
Val Asn Arg Asp Leu Tyr Arg Met Lys Ser Arg His Leu Leu Asp Met 
65                   70                  75                  80
Asp Glu Gln Ser Lys Ala Trp Thr Ile Tyr Arg Pro Glu Asp Ser Gly 
                85                   90                  95    
Arg Arg Ala Lys Ser Ala Ser Ile Ile Tyr Gln His Asn Pro Ile Asn 
            100                  105                110        
Met Ile Cys Gln Asn Gly Pro Asn Ser Trp Ile Ser Ile Ala Asn Ser 
        115                  120                125            
Glu Ala Ile Gln Ile Gly His Gly Asn Ile Ile Thr Arg Gln Thr Val 
    130                  135                140                
Ser Arg Glu Asp Gly Ser Ala Gly Pro Arg His Leu Pro Ser Met Ala 
145                  150                155                  160
Pro Gly Asp Ser Ser Thr Trp Gly Thr Leu Val Asp Pro Trp Gly Pro 
                165                  170                175    
Gln Asp Ile His Met Glu Gln Ser Ile Leu Arg Arg Val Gln Leu Gly 
            180                  185                190        
His Ser Asn Glu Met Arg Leu His Gly Val Pro Ser Glu Gly Pro Ala 
        195                  200                205            
His Ile Pro Pro Gly Ser Pro Pro Val Ser Ala Thr Ala Ala Gly Pro 
    210                  215                220                
Glu Ala Ser Phe Glu Ala Arg Ile Pro Ser Pro Gly Thr His Pro Glu 
225                  230                235                  240
Gly Glu Ala Ala Gln Arg Ile His Met Lys Ser Cys Phe Leu Glu Asp 
                245                  250                255    
Ala Thr Ile Gly Asn Ser Asn Lys Met Ser Ile Ser Pro Gly Val Ala 
            260                  265                270        
Gly Pro Gly Gly Val Ala Gly Ser Gly Glu Gly Glu Pro Gly Glu Asp 
        275                  280                285            
Ala Gly Arg Arg Pro Ala Asp Thr Gln Ser Arg Ser His Phe Pro Arg 
    290                  295                300                
Asp Ile Gly Gln Pro Ile Thr Pro Ser His Ser Lys Leu Thr Pro Lys 
305                  310                315                  320
Leu Glu Thr Met Thr Leu Gly Asn Arg Ser His Lys Ala Ala Glu Gly 
                325                  330                335    
Ser His Tyr Val Asp Glu Ala Ser His Glu Gly Ser Trp Trp Gly Gly 
            340                  345                350        
Gly Ile 
        

<210> 90
<211> 34
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..34
<223> /mol_type="DNA"
      /note="ZBP1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 90
atggcccagg ctcctgctga cccgggcaga gaag                                 34


<210> 91
<211> 11
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..11
<223> /mol_type="protein"
      /note="ZBP1 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 91
Met Ala Gln Ala Pro Ala Asp Pro Gly Arg Glu 
1               5                   10    

<210> 92
<211> 1755
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1755
<223> /mol_type="DNA"
      /note="BCAS1 (CCDS nucleotide sequence of BCAS1 (Gene ID: 8537))"
      /organism="Homo sapiens"

<400> 92
atgggtaacc aaatgagtgt tccccaaaga gttgaagacc aagagaatga accagaagca      60

gagacttacc aggacaacgc gtctgctctg aacggggttc cagtggtggt gtcgacccac     120

acagttcagc acttagagga agtcgacttg ggaataagtg tcaagacgga taatgtggcc     180

acttcttccc ccgagacaac ggagataagt gctgttgcgg atgccaacgg aaagaatctt     240

gggaaagagg ccaaacccga ggcaccagct gctaaatctc gttttttctt gatgctctct     300

cggcctgtac caggacgtac cggagaccaa gccgcagatt catcccttgg atcagtgaag     360

cttgatgtca gctccaataa agctccagcg aacaaagacc caagtgagag ctggacactt     420

ccggtggcag ctggaccggg gcaggacaca gataaaaccc cagggcacgc cccggcccaa     480

gacaaggtcc tctctgccgc cagggatccc acgcttctcc cacctgagac agggggagca     540

ggaggagaag ctccctccaa gcccaaggac tccagctttt ttgacaaatt cttcaagctg     600

gacaagggac aggaaaaggt gccaggtgac agccaacagg aagccaagag ggcagagcat     660

caagacaagg tggatgaggt tcctggctta tcagggcagt ccgatgatgt ccctgcaggg     720

aaggacatag ttgacggcaa ggaaaaagaa ggacaagaac ttggaactgc ggattgctct     780

gtccctgggg acccagaagg actggagact gcaaaggacg attcccaggc agcagctata     840

gcagagaata ataattccat catgagtttc tttaaaactc tggtttcacc taacaaagct     900

gaaacaaaaa aggacccaga agacacgggt gctgaaaagt cacccaccac ttcagctgac     960

cttaagtcag acaaagccaa ctttacatcc caggagaccc aaggggctgg caagaattcc    1020

aaaggatgca acccatcggg gcacacacag tccgtgacaa cccctgaacc tgcgaaggaa    1080

ggcaccaagg agaaatcagg acccacctct ctgcctctgg gcaaactgtt ttggaaaaag    1140

tcagttaaag aggactcagt ccccacaggt gcggaggaga atgtggtgtg tgagtcacca    1200

gtagagatta taaagtccaa ggaagtagaa tcagccttac aaacagtgga cctcaacgaa    1260

ggagatgctg cacctgaacc cacagaagcg aaactcaaaa gagaagaaag caaaccaaga    1320

acctctctga tggcgtttct cagacaaatg tcagtgaaag gggatggagg gatcacccac    1380

tcagaagaaa taaatgggaa agactccagc tgccaaacat cagactccac agaaaagact    1440

atcacaccgc cagagcctga accaacagga gcaccacaga agggtaaaga gggctcctcg    1500

aaggacaaga agtcagcagc cgagatgaac aagcagaaga gcaacaagca ggaagccaaa    1560

gaaccagccc agtgcacaga gcaggccacg gtggacacga actcactgca gaatggggac    1620

aagctccaaa agagacctga gaagcggcag cagtcccttg ggggcttctt taaaggcctg    1680

ggaccaaagc ggatgttgga tgctcaagtg caaacagacc cagtatccat cggaccagtt    1740

ggcaaatcca agtaa                                                     1755


<210> 93
<211> 584
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..584
<223> /mol_type="protein"
      /note="BCAS1 (full-length protein)"
      /organism="Homo sapiens"

<400> 93
Met Gly Asn Gln Met Ser Val Pro Gln Arg Val Glu Asp Gln Glu Asn 
1               5                   10                   15    
Glu Pro Glu Ala Glu Thr Tyr Gln Asp Asn Ala Ser Ala Leu Asn Gly 
            20                   25                  30        
Val Pro Val Val Val Ser Thr His Thr Val Gln His Leu Glu Glu Val 
        35                   40                  45            
Asp Leu Gly Ile Ser Val Lys Thr Asp Asn Val Ala Thr Ser Ser Pro 
    50                   55                  60                
Glu Thr Thr Glu Ile Ser Ala Val Ala Asp Ala Asn Gly Lys Asn Leu 
65                   70                  75                  80
Gly Lys Glu Ala Lys Pro Glu Ala Pro Ala Ala Lys Ser Arg Phe Phe 
                85                   90                  95    
Leu Met Leu Ser Arg Pro Val Pro Gly Arg Thr Gly Asp Gln Ala Ala 
            100                  105                110        
Asp Ser Ser Leu Gly Ser Val Lys Leu Asp Val Ser Ser Asn Lys Ala 
        115                  120                125            
Pro Ala Asn Lys Asp Pro Ser Glu Ser Trp Thr Leu Pro Val Ala Ala 
    130                  135                140                
Gly Pro Gly Gln Asp Thr Asp Lys Thr Pro Gly His Ala Pro Ala Gln 
145                  150                155                  160
Asp Lys Val Leu Ser Ala Ala Arg Asp Pro Thr Leu Leu Pro Pro Glu 
                165                  170                175    
Thr Gly Gly Ala Gly Gly Glu Ala Pro Ser Lys Pro Lys Asp Ser Ser 
            180                  185                190        
Phe Phe Asp Lys Phe Phe Lys Leu Asp Lys Gly Gln Glu Lys Val Pro 
        195                  200                205            
Gly Asp Ser Gln Gln Glu Ala Lys Arg Ala Glu His Gln Asp Lys Val 
    210                  215                220                
Asp Glu Val Pro Gly Leu Ser Gly Gln Ser Asp Asp Val Pro Ala Gly 
225                  230                235                  240
Lys Asp Ile Val Asp Gly Lys Glu Lys Glu Gly Gln Glu Leu Gly Thr 
                245                  250                255    
Ala Asp Cys Ser Val Pro Gly Asp Pro Glu Gly Leu Glu Thr Ala Lys 
            260                  265                270        
Asp Asp Ser Gln Ala Ala Ala Ile Ala Glu Asn Asn Asn Ser Ile Met 
        275                  280                285            
Ser Phe Phe Lys Thr Leu Val Ser Pro Asn Lys Ala Glu Thr Lys Lys 
    290                  295                300                
Asp Pro Glu Asp Thr Gly Ala Glu Lys Ser Pro Thr Thr Ser Ala Asp 
305                  310                315                  320
Leu Lys Ser Asp Lys Ala Asn Phe Thr Ser Gln Glu Thr Gln Gly Ala 
                325                  330                335    
Gly Lys Asn Ser Lys Gly Cys Asn Pro Ser Gly His Thr Gln Ser Val 
            340                  345                350        
Thr Thr Pro Glu Pro Ala Lys Glu Gly Thr Lys Glu Lys Ser Gly Pro 
        355                  360                365            
Thr Ser Leu Pro Leu Gly Lys Leu Phe Trp Lys Lys Ser Val Lys Glu 
    370                  375                380                
Asp Ser Val Pro Thr Gly Ala Glu Glu Asn Val Val Cys Glu Ser Pro 
385                  390                395                  400
Val Glu Ile Ile Lys Ser Lys Glu Val Glu Ser Ala Leu Gln Thr Val 
                405                  410                415    
Asp Leu Asn Glu Gly Asp Ala Ala Pro Glu Pro Thr Glu Ala Lys Leu 
            420                  425                430        
Lys Arg Glu Glu Ser Lys Pro Arg Thr Ser Leu Met Ala Phe Leu Arg 
        435                  440                445            
Gln Met Ser Val Lys Gly Asp Gly Gly Ile Thr His Ser Glu Glu Ile 
    450                  455                460                
Asn Gly Lys Asp Ser Ser Cys Gln Thr Ser Asp Ser Thr Glu Lys Thr 
465                  470                475                  480
Ile Thr Pro Pro Glu Pro Glu Pro Thr Gly Ala Pro Gln Lys Gly Lys 
                485                  490                495    
Glu Gly Ser Ser Lys Asp Lys Lys Ser Ala Ala Glu Met Asn Lys Gln 
            500                  505                510        
Lys Ser Asn Lys Gln Glu Ala Lys Glu Pro Ala Gln Cys Thr Glu Gln 
        515                  520                525            
Ala Thr Val Asp Thr Asn Ser Leu Gln Asn Gly Asp Lys Leu Gln Lys 
    530                  535                540                
Arg Pro Glu Lys Arg Gln Gln Ser Leu Gly Gly Phe Phe Lys Gly Leu 
545                  550                555                  560
Gly Pro Lys Arg Met Leu Asp Ala Gln Val Gln Thr Asp Pro Val Ser 
                565                  570                575    
Ile Gly Pro Val Gly Lys Ser Lys 
            580                

<210> 94
<211> 339
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..339
<223> /mol_type="DNA"
      /note="BCAS1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 94
acatcagact ccacagaaaa gactatcaca ccgccagagc ctgaaccaac aggagcacca      60

cagaagggta aagagggctc ctcgaaggac aagaagtcag cagccgagat gaacaagcag     120

aagagcaaca agcaggaagc caaagaacca gcccagtgca cagagcaggc cacggtggac     180

acgaactcac tgcagaatgg ggacaagctc caaaagagac ctgagaagcg gcagcagtcc     240

cttgggggct tctttaaagg cctgggacca aagcggatgt tggatgctca agtgcaaaca     300

gacccagtat ccatcggacc agttggcaaa tccaagtaa                            339


<210> 95
<211> 373
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..373
<223> /mol_type="DNA"
      /note="ZBP1-BCAS1 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 95
atggcccagg ctcctgctga cccgggcaga gaagacatca gactccacag aaaagactat      60

cacaccgcca gagcctgaac caacaggagc accacagaag ggtaaagagg gctcctcgaa     120

ggacaagaag tcagcagccg agatgaacaa gcagaagagc aacaagcagg aagccaaaga     180

accagcccag tgcacagagc aggccacggt ggacacgaac tcactgcaga atggggacaa     240

gctccaaaag agacctgaga agcggcagca gtcccttggg ggcttcttta aaggcctggg     300

accaaagcgg atgttggatg ctcaagtgca aacagaccca gtatccatcg gaccagttgg     360

caaatccaag taa                                                        373


<210> 96
<211> 25
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..25
<223> /mol_type="protein"
      /note="ZBP1-BCAS1 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 96
Met Ala Gln Ala Pro Ala Asp Pro Gly Arg Glu Asp Ile Arg Leu His 
1               5                   10                   15    
Arg Lys Asp Tyr His Thr Ala Arg Ala 
            20                   25

<210> 97
<211> 1776
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1776
<223> /mol_type="DNA"
      /note="EBF1 (CCDS nucleotide sequence of EBF1 (Gene ID: 1879))"
      /organism="Homo sapiens"

<400> 97
atgtttggga ttcaggaaag catccaacgg agtggaagca gcatgaagga agagccgctg      60

ggcagcggca tgaacgcggt gcggacgtgg atgcagggcg ccggggtgct ggacgccaac     120

acggcggcgc agagcggggt gggtctggcc cgggctcact ttgagaagca gccgccttcc     180

aatctgcgga aatccaactt cttccacttc gtcctggccc tctacgacag acagggccag     240

cccgtggaga tcgagaggac agcgtttgtg gggttcgtgg agaaggaaaa agaagccaac     300

agcgaaaaga ccaataacgg aattcactac cggcttcagc ttctctacag caatgggata     360

aggacggagc aggatttcta cgtgcgcctc attgactcca tgacaaaaca agccatagtg     420

tatgaaggcc aagacaagaa cccagaaatg tgccgagtct tgctcacaca tgagatcatg     480

tgcagccgct gttgtgacaa gaaaagctgt ggcaaccgaa atgagactcc ctcagatcca     540

gtgataattg acaggttctt cttgaaattt ttcctcaaat gtaaccaaaa ttgcctaaag     600

aatgcgggaa acccacgtga catgcggaga ttccaggtcg tggtgtctac gacagtcaat     660

gtggatggcc atgtcctggc agtctctgat aacatgtttg tccataataa ttccaagcat     720

gggcggaggg ctcggaggct tgacccctcg gaaggtacgc cctcttatct ggaacatgct     780

actccctgta tcaaagccat cagcccgagt gaaggatgga cgacgggagg tgcgactgtg     840

atcatcatag gggacaattt ctttgatggg ttacaggtca tattcggtac catgctggtc     900

tggagtgagt tgatcactcc tcatgccatc cgtgtgcaga cccctcctcg gcacatccct     960

ggtgttgtgg aagtcacact gtcctacaaa tctaagcagt tctgcaaagg aacaccaggc    1020

agattcattt atacagcgct caacgaaccc accatcgatt atggtttcca gaggttacag    1080

aaggtcattc ctcggcaccc tggtgaccct gagcgtttgc caaaggaagt aatactcaaa    1140

agggctgcgg atctggtaga agcactgtat gggatgccac acaacaacca ggaaatcatt    1200

ctgaagagag cggccgacat tgccgaggcc ctgtacagtg ttccccgcaa ccacaaccaa    1260

ctcccggccc ttgctaacac ctcggtccac gcagggatga tgggcgtgaa ttcgttcagt    1320

ggacaactgg ccgtgaatgt ctccgaggca tcacaagcca ccaatcaggg tttcacccgc    1380

aactcaagca gcgtatcacc acacgggtac gtgccgagca ccactcccca gcagaccaac    1440

tataactccg tcaccacgag catgaacgga tacggctctg ccgcaatgtc caatttgggc    1500

ggctccccca ccttcctcaa cggctcagct gccaactccc cctatgccat agtgccatcc    1560

agccccacca tggcctcctc cacaagcctc ccctccaact gcagcagctc ctcgggcatc    1620

ttctccttct caccagccaa catggtctca gccgtgaaac agaagagtgc tttcgcacca    1680

gtcgtcagac cccagacctc cccacctccc acctgcacca gcaccaacgg gaacagcctg    1740

caagcgatat ctggcatgat tgttcctcct atgtga                              1776


<210> 98
<211> 591
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..591
<223> /mol_type="protein"
      /note="EBF1 (full-length protein)"
      /organism="Homo sapiens"

<400> 98
Met Phe Gly Ile Gln Glu Ser Ile Gln Arg Ser Gly Ser Ser Met Lys 
1               5                   10                   15    
Glu Glu Pro Leu Gly Ser Gly Met Asn Ala Val Arg Thr Trp Met Gln 
            20                   25                  30        
Gly Ala Gly Val Leu Asp Ala Asn Thr Ala Ala Gln Ser Gly Val Gly 
        35                   40                  45            
Leu Ala Arg Ala His Phe Glu Lys Gln Pro Pro Ser Asn Leu Arg Lys 
    50                   55                  60                
Ser Asn Phe Phe His Phe Val Leu Ala Leu Tyr Asp Arg Gln Gly Gln 
65                   70                  75                  80
Pro Val Glu Ile Glu Arg Thr Ala Phe Val Gly Phe Val Glu Lys Glu 
                85                   90                  95    
Lys Glu Ala Asn Ser Glu Lys Thr Asn Asn Gly Ile His Tyr Arg Leu 
            100                  105                110        
Gln Leu Leu Tyr Ser Asn Gly Ile Arg Thr Glu Gln Asp Phe Tyr Val 
        115                  120                125            
Arg Leu Ile Asp Ser Met Thr Lys Gln Ala Ile Val Tyr Glu Gly Gln 
    130                  135                140                
Asp Lys Asn Pro Glu Met Cys Arg Val Leu Leu Thr His Glu Ile Met 
145                  150                155                  160
Cys Ser Arg Cys Cys Asp Lys Lys Ser Cys Gly Asn Arg Asn Glu Thr 
                165                  170                175    
Pro Ser Asp Pro Val Ile Ile Asp Arg Phe Phe Leu Lys Phe Phe Leu 
            180                  185                190        
Lys Cys Asn Gln Asn Cys Leu Lys Asn Ala Gly Asn Pro Arg Asp Met 
        195                  200                205            
Arg Arg Phe Gln Val Val Val Ser Thr Thr Val Asn Val Asp Gly His 
    210                  215                220                
Val Leu Ala Val Ser Asp Asn Met Phe Val His Asn Asn Ser Lys His 
225                  230                235                  240
Gly Arg Arg Ala Arg Arg Leu Asp Pro Ser Glu Gly Thr Pro Ser Tyr 
                245                  250                255    
Leu Glu His Ala Thr Pro Cys Ile Lys Ala Ile Ser Pro Ser Glu Gly 
            260                  265                270        
Trp Thr Thr Gly Gly Ala Thr Val Ile Ile Ile Gly Asp Asn Phe Phe 
        275                  280                285            
Asp Gly Leu Gln Val Ile Phe Gly Thr Met Leu Val Trp Ser Glu Leu 
    290                  295                300                
Ile Thr Pro His Ala Ile Arg Val Gln Thr Pro Pro Arg His Ile Pro 
305                  310                315                  320
Gly Val Val Glu Val Thr Leu Ser Tyr Lys Ser Lys Gln Phe Cys Lys 
                325                  330                335    
Gly Thr Pro Gly Arg Phe Ile Tyr Thr Ala Leu Asn Glu Pro Thr Ile 
            340                  345                350        
Asp Tyr Gly Phe Gln Arg Leu Gln Lys Val Ile Pro Arg His Pro Gly 
        355                  360                365            
Asp Pro Glu Arg Leu Pro Lys Glu Val Ile Leu Lys Arg Ala Ala Asp 
    370                  375                380                
Leu Val Glu Ala Leu Tyr Gly Met Pro His Asn Asn Gln Glu Ile Ile 
385                  390                395                  400
Leu Lys Arg Ala Ala Asp Ile Ala Glu Ala Leu Tyr Ser Val Pro Arg 
                405                  410                415    
Asn His Asn Gln Leu Pro Ala Leu Ala Asn Thr Ser Val His Ala Gly 
            420                  425                430        
Met Met Gly Val Asn Ser Phe Ser Gly Gln Leu Ala Val Asn Val Ser 
        435                  440                445            
Glu Ala Ser Gln Ala Thr Asn Gln Gly Phe Thr Arg Asn Ser Ser Ser 
    450                  455                460                
Val Ser Pro His Gly Tyr Val Pro Ser Thr Thr Pro Gln Gln Thr Asn 
465                  470                475                  480
Tyr Asn Ser Val Thr Thr Ser Met Asn Gly Tyr Gly Ser Ala Ala Met 
                485                  490                495    
Ser Asn Leu Gly Gly Ser Pro Thr Phe Leu Asn Gly Ser Ala Ala Asn 
            500                  505                510        
Ser Pro Tyr Ala Ile Val Pro Ser Ser Pro Thr Met Ala Ser Ser Thr 
        515                  520                525            
Ser Leu Pro Ser Asn Cys Ser Ser Ser Ser Gly Ile Phe Ser Phe Ser 
    530                  535                540                
Pro Ala Asn Met Val Ser Ala Val Lys Gln Lys Ser Ala Phe Ala Pro 
545                  550                555                  560
Val Val Arg Pro Gln Thr Ser Pro Pro Pro Thr Cys Thr Ser Thr Asn 
                565                  570                575    
Gly Asn Ser Leu Gln Ala Ile Ser Gly Met Ile Val Pro Pro Met 
            580                  585                590    

<210> 99
<211> 554
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..554
<223> /mol_type="DNA"
      /note="EBF1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 99
atgtttggga ttcaggaaag catccaacgg agtggaagca gcatgaagga agagccgctg      60

ggcagcggca tgaacgcggt gcggacgtgg atgcagggcg ccggggtgct ggacgccaac     120

acggcggcgc agagcggggt gggtctggcc cgggctcact ttgagaagca gccgccttcc     180

aatctgcgga aatccaactt cttccacttc gtcctggccc tctacgacag acagggccag     240

cccgtggaga tcgagaggac agcgtttgtg gggttcgtgg agaaggaaaa agaagccaac     300

agcgaaaaga ccaataacgg aattcactac cggcttcagc ttctctacag caatgggata     360

aggacggagc aggatttcta cgtgcgcctc attgactcca tgacaaaaca agccatagtg     420

tatgaaggcc aagacaagaa cccagaaatg tgccgagtct tgctcacaca tgagatcatg     480

tgcagccgct gttgtgacaa gaaaagctgt ggcaaccgaa atgagactcc ctcagatcca     540

gtgataattg acag                                                       554


<210> 100
<211> 184
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..184
<223> /mol_type="protein"
      /note="EBF1 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 100
Met Phe Gly Ile Gln Glu Ser Ile Gln Arg Ser Gly Ser Ser Met Lys 
1               5                   10                   15    
Glu Glu Pro Leu Gly Ser Gly Met Asn Ala Val Arg Thr Trp Met Gln 
            20                   25                  30        
Gly Ala Gly Val Leu Asp Ala Asn Thr Ala Ala Gln Ser Gly Val Gly 
        35                   40                  45            
Leu Ala Arg Ala His Phe Glu Lys Gln Pro Pro Ser Asn Leu Arg Lys 
    50                   55                  60                
Ser Asn Phe Phe His Phe Val Leu Ala Leu Tyr Asp Arg Gln Gly Gln 
65                   70                  75                  80
Pro Val Glu Ile Glu Arg Thr Ala Phe Val Gly Phe Val Glu Lys Glu 
                85                   90                  95    
Lys Glu Ala Asn Ser Glu Lys Thr Asn Asn Gly Ile His Tyr Arg Leu 
            100                  105                110        
Gln Leu Leu Tyr Ser Asn Gly Ile Arg Thr Glu Gln Asp Phe Tyr Val 
        115                  120                125            
Arg Leu Ile Asp Ser Met Thr Lys Gln Ala Ile Val Tyr Glu Gly Gln 
    130                  135                140                
Asp Lys Asn Pro Glu Met Cys Arg Val Leu Leu Thr His Glu Ile Met 
145                  150                155                  160
Cys Ser Arg Cys Cys Asp Lys Lys Ser Cys Gly Asn Arg Asn Glu Thr 
                165                  170                175    
Pro Ser Asp Pro Val Ile Ile Asp 
            180                

<210> 101
<211> 4789
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4789
<223> /mol_type="DNA"
      /note="ATP10B (CCDS nucleotide sequence of ATP10B (Gene ID: 23120)
      plus 395 nucleotides of the 5’UTR)"
      /organism="Homo sapiens"

<400> 101
caaccgaaat gagactccct cagatccagt gataattgac aggcttcctt tgacaagaga       60

atcagaaccg actggtgaca tttgtttcaa tgaaagcaac agtgtgaaga aacttgcaat      120

ttattctatc aacaatttgc ccagctaaag tactacatct gccccctctt cctgtggctg      180

taggggcaca gcaaaggtca ctggtctaac ctccttaaag ggactccgct aacagaaacc      240

accaaatgga gtgggaaaaa gaaaagggat cccctatccc caactccagc catctagatt      300

aagaaaagcc agctgactgg acagtagcac agcccagtca cctcatggac aaatttccta      360

ggaaagaacc tctcccatct atctacttat cactctcctt tgaggttccg gccacagatc      420

ttcgcctgct gctggaaatg gccctctcag tggactcatc gtggcatcgg tggcagtgga      480

gagtcagaga tggcttcccc cattgtcctc ggaaaccaca ccgctgctct ctccagagaa      540

agggagacag agctacaact tgacacagca gcgggtcgtg ttccccaaca acagcatatt      600

ccatcaagat tgggaagagg tctccaggag atacctggca acagaacctg cacaaccaaa      660

tacaccctct tcaccttcct gccccggaat ctctttgagc aatttcatag atgggctaac      720

ctctatttcc tgttcctggt gattttgaac tggatgccct cctggaagtc ttccacagag      780

aaatcaccat gttaccattg gccattgtcc tgttcgtcat catgatcaag gatggcatgg      840

aggacttcaa gagacaccgc tttgataaag caataaactg ctccaacatc gaatttatga      900

aagaaaagag cagacctatg tgcagaagtg ctggaaggat gtgcgcgtgg gagacttcat      960

ccaaatgaaa tgcaatgaga ttgtcccagc agacatactc ctcctttttt cctctgcccc     1020

aatgggatat gccatctgga aactgccagc ttggatggag agacaaacct caagcaaaga     1080

tgtgtcgtga agggcttctc acagcaggag gtacagttcg aaccagagct tttccacaat     1140

acctcgtgtg tgagaaaccc aacaaccacc tcaacaaatt taagggttat atggagcatc     1200

ctgaccagac caggactggc tttggctgtg agagtcttct gcttcgaggc tgcaccatca     1260

gaaacaccga atggctgttg gcattgtcat ctatgcaggc catgagacga aagccatgct     1320

gaacaacagt ggcccccggt acaaacgcag caagattgag cggcgcatga atatagacat     1380

cttcttctgc attgggacct catcctcatg tgccttattg gagctgtagg tcacagcatc     1440

tggaatggga cctttgaaga acaccctccc ttcgatgtgc cagatgccaa tggcagcttc     1500

cttcccagtg cccttggggg cttcacatgt tcctcacaat gatcatcctg ctccaggtgc     1560

tgatccccat ctctttgtat gtctccattg agctggtgaa gctcgggcaa gtgttcttct     1620

tgagcaatga ccttgacctg tatgatgaag aaccgattta tccattcaat gtcgagccct     1680

caacatcgca gaggacttgg gccagatcca gtacatcttc tccgataaga cggggaccct     1740

gacagagaac aagatggtgt tccgacgttg caccatcagg gcagcgagta ttctcaccaa     1800

gaaaatgcta agcgactgga gaccccaaag gagctggact cagatggtga agagtggacc     1860

caataccaat gcctgtcctt ctcggctaga tgggcccagg atccacaact atgagaagcc     1920

aaaaaggtgc tcagcctctg aggaggagcc agagtgcccg ggtgcccatc cagggccact     1980

accggcaaag gtctatgggg caccgtgaaa gctcacagcc tcctgtggcc ttagcagctc     2040

catagaaaaa gatgtaactc cagataaaaa cctactgacc aaggttcgag atgctgccct     2100

gtggttggag accttgtcag acagcagacc tgccaaggct tccctctcca ccacctcctc     2160

attgctgatt tcttccttgc cttaaccatc tgcaactctg tcatggtgtc cacaaccacc     2220

gagcccaggc agagggtcac catcaaaccc tcaagcaagg ctctggggac gtccctggag     2280

aagattagca gctcttccag aagttgaagc tattgagcct cagccagtca ttctcatcca     2340

ctgcaccctc tgacacagac ctcggggaga gcttaggggc caacgtggcc accacagact     2400

cggatgagag agagatgcat ctgtgtgcag tggaggtgac tccactgatg acggtggcta     2460

caggagcagc atgtgggacc agggcgacat cctggagtct gggtcaggca cttccttgga     2520

ggaggcattg gaggccccag cacagacctg gccaggcctg agttctgtta cgaggctgag     2580

agccctgatg aggccgccct ggtgcacgct gcccatgcct acagcttcac actagtgtcc     2640

cggacacctg agcaggtgac tgtgcgctgc cccagggcac ctgcctcacc ttcagcctcc     2700

tctgcaccct gggctttgac tctgtcagga agagaatgtc tgtggttgtg aggcacccac     2760

tgactggcga gattgttgtc tacaccaagg gtgcgactcg gtcatcatgg acctgctgga     2820

agacccagcc tgcgtacctg acattaatat ggaaaagaag ctgagaaaaa tccgagcccg     2880

gacccaaaag catctagact tgtatgcaag agatggcctg ccacactatg cattgccaag     2940

aaggttgtaa gcgaagagga cttccggaga tgggccagtt tccggcgtga ggctgaggca     3000

tccctcgaca accgagatga gcttctcatg gaaactgcac agcatctgag aatcaactca     3060

ccttacttgg agccactggg atcgaagacc ggctgcagga aggagttcca gatacgattg     3120

ccactctgcg ggaggctggg atccagctct gggtcctgac tggagataag caggaacagc     3180

ggtcaacatt gcccattcct gcagactgtt aaatcagacc gacactgttt ataccatcaa     3240

tacagagaat caggagacct gtgaatccat cctcaattgt gcattggaag agctaaagca     3300

attcgtgaac tacagaagcc agaccgcaag ctctttggat tccgcttacc ttccaagaca     3360

ccatccatca cctcagaagc tgtggttcca gaagctggat tggtcatcga tgggaagaca     3420

ttgaatgcct cttccaggga aagctagaga agaagtttct ggaattgacc cagtattgtc     3480

ggtccgtcct gtgctgccgc tccacgccac tccagaagag tatgatagtc aagctggtgc     3540

gagacaagtt gcgcgtatga ccctttccat aggtgatgga gcaaatgatg taagcatgat     3600

tcaagctgct gatattggaa ttggaatatc tggacaggaa ggcatgcagg ctgtcatgtc     3660

cagcgacttt gccatcaccc gcttaagcat ctcaagaagt tgctgctcgt gcatggccac     3720

tggtgttact cgcgcctggc caggatggtg gtgtactacc tctacaagaa cgtgtgctac     3780

gtcaacctgc tcttctggta tcagttcttc gtggtttctc cagctccacc atgattgatt     3840

actggcagat gatattcttc aatctcttct ttacctcctt gcctcctctt gtctttggag     3900

tccttgacaa agacatctct gcagaaacac tcctggcttg cctgagctat acaagagtgg     3960

ccagaactct gagtgctata acctgtcgac tttctggatt tctatggtgg atgcattcta     4020

ccagagcctc atctgtttct ttatccctta cctggcctat aaggctctga tatagatgtc     4080

tttacctttg ggacaccaat caacaccatc tccctcacca caatcctttt gcaccaggca     4140

atggaaatga agacatggac cattttccac ggagtcgtgc tcctcggcag ctcctgatgt     4200

actttctggt atccctcctg tacaatgcca cctgcgtcat ctgcaacagc cccaccaatc     4260

cctattgggt gatggaaggc cagctctcaa accccacttt ctacctcgtc tgctttctac     4320

accagttgtt gctcttctcc caagatactt tttcctgtct ctgcaaggaa cttgtgggaa     4380

gtctctaatc tcaaaagctc agaaaattga caaactcccc ccagacaaaa gaaacctgga     4440

aatccgagtt ggagaagcag acagaggcct gcccctgtcc ccgaagtggc tcgaccaact     4500

caccacccag tgtcatctat cacaggacag gacttcagtg ccagcacccc aaagagctct     4560

aaccctccca agggaagcat gtggaagagt cagtactcca cgaacagaga tgtggcacgg     4620

agtgcatgag ggatgactca tgctcagggg actcctcagc tcaactctca tccggggagc     4680

acctgctggg acctaacaga taatggccta ctcaagagga cagactgata tgtgccggtg     4740

ctcaaagagg agcagccatc gccgatccca gagttcactg accatatga                 4789


<210> 102
<211> 1461
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1461
<223> /mol_type="protein"
      /note="ATP10B (full-length protein)"
      /organism="Homo sapiens"

<400> 102
Met Ala Leu Ser Val Asp Ser Ser Trp His Arg Trp Gln Trp Arg Val 
1               5                   10                   15    
Arg Asp Gly Phe Pro His Cys Pro Ser Glu Thr Thr Pro Leu Leu Ser 
            20                   25                  30        
Pro Glu Lys Gly Arg Gln Ser Tyr Asn Leu Thr Gln Gln Arg Val Val 
        35                   40                  45            
Phe Pro Asn Asn Ser Ile Phe His Gln Asp Trp Glu Glu Val Ser Arg 
    50                   55                  60                
Arg Tyr Pro Gly Asn Arg Thr Cys Thr Thr Lys Tyr Thr Leu Phe Thr 
65                   70                  75                  80
Phe Leu Pro Arg Asn Leu Phe Glu Gln Phe His Arg Trp Ala Asn Leu 
                85                   90                  95    
Tyr Phe Leu Phe Leu Val Ile Leu Asn Trp Met Pro Ser Met Glu Val 
            100                  105                110        
Phe His Arg Glu Ile Thr Met Leu Pro Leu Ala Ile Val Leu Phe Val 
        115                  120                125            
Ile Met Ile Lys Asp Gly Met Glu Asp Phe Lys Arg His Arg Phe Asp 
    130                  135                140                
Lys Ala Ile Asn Cys Ser Asn Ile Arg Ile Tyr Glu Arg Lys Glu Gln 
145                  150                155                  160
Thr Tyr Val Gln Lys Cys Trp Lys Asp Val Arg Val Gly Asp Phe Ile 
                165                  170                175    
Gln Met Lys Cys Asn Glu Ile Val Pro Ala Asp Ile Leu Leu Leu Phe 
            180                  185                190        
Ser Ser Asp Pro Asn Gly Ile Cys His Leu Glu Thr Ala Ser Leu Asp 
        195                  200                205            
Gly Glu Thr Asn Leu Lys Gln Arg Cys Val Val Lys Gly Phe Ser Gln 
    210                  215                220                
Gln Glu Val Gln Phe Glu Pro Glu Leu Phe His Asn Thr Ile Val Cys 
225                  230                235                  240
Glu Lys Pro Asn Asn His Leu Asn Lys Phe Lys Gly Tyr Met Glu His 
                245                  250                255    
Pro Asp Gln Thr Arg Thr Gly Phe Gly Cys Glu Ser Leu Leu Leu Arg 
            260                  265                270        
Gly Cys Thr Ile Arg Asn Thr Glu Met Ala Val Gly Ile Val Ile Tyr 
        275                  280                285            
Ala Gly His Glu Thr Lys Ala Met Leu Asn Asn Ser Gly Pro Arg Tyr 
    290                  295                300                
Lys Arg Ser Lys Ile Glu Arg Arg Met Asn Ile Asp Ile Phe Phe Cys 
305                  310                315                  320
Ile Gly Ile Leu Ile Leu Met Cys Leu Ile Gly Ala Val Gly His Ser 
                325                  330                335    
Ile Trp Asn Gly Thr Phe Glu Glu His Pro Pro Phe Asp Val Pro Asp 
            340                  345                350        
Ala Asn Gly Ser Phe Leu Pro Ser Ala Leu Gly Gly Phe Tyr Met Phe 
        355                  360                365            
Leu Thr Met Ile Ile Leu Leu Gln Val Leu Ile Pro Ile Ser Leu Tyr 
    370                  375                380                
Val Ser Ile Glu Leu Val Lys Leu Gly Gln Val Phe Phe Leu Ser Asn 
385                  390                395                  400
Asp Leu Asp Leu Tyr Asp Glu Glu Thr Asp Leu Ser Ile Gln Cys Arg 
                405                  410                415    
Ala Leu Asn Ile Ala Glu Asp Leu Gly Gln Ile Gln Tyr Ile Phe Ser 
            420                  425                430        
Asp Lys Thr Gly Thr Leu Thr Glu Asn Lys Met Val Phe Arg Arg Cys 
        435                  440                445            
Thr Ile Met Gly Ser Glu Tyr Ser His Gln Glu Asn Ala Lys Arg Leu 
    450                  455                460                
Glu Thr Pro Lys Glu Leu Asp Ser Asp Gly Glu Glu Trp Thr Gln Tyr 
465                  470                475                  480
Gln Cys Leu Ser Phe Ser Ala Arg Trp Ala Gln Asp Pro Ala Thr Met 
                485                  490                495    
Arg Ser Gln Lys Gly Ala Gln Pro Leu Arg Arg Ser Gln Ser Ala Arg 
            500                  505                510        
Val Pro Ile Gln Gly His Tyr Arg Gln Arg Ser Met Gly His Arg Glu 
        515                  520                525            
Ser Ser Gln Pro Pro Val Ala Phe Ser Ser Ser Ile Glu Lys Asp Val 
    530                  535                540                
Thr Pro Asp Lys Asn Leu Leu Thr Lys Val Arg Asp Ala Ala Leu Trp 
545                  550                555                  560
Leu Glu Thr Leu Ser Asp Ser Arg Pro Ala Lys Ala Ser Leu Ser Thr 
                565                  570                575    
Thr Ser Ser Ile Ala Asp Phe Phe Leu Ala Leu Thr Ile Cys Asn Ser 
            580                  585                590        
Val Met Val Ser Thr Thr Thr Glu Pro Arg Gln Arg Val Thr Ile Lys 
        595                  600                605            
Pro Ser Ser Lys Ala Leu Gly Thr Ser Leu Glu Lys Ile Gln Gln Leu 
    610                  615                620                
Phe Gln Lys Leu Lys Leu Leu Ser Leu Ser Gln Ser Phe Ser Ser Thr 
625                  630                635                  640
Ala Pro Ser Asp Thr Asp Leu Gly Glu Ser Leu Gly Ala Asn Val Ala 
                645                  650                655    
Thr Thr Asp Ser Asp Glu Arg Asp Asp Ala Ser Val Cys Ser Gly Gly 
            660                  665                670        
Asp Ser Thr Asp Asp Gly Gly Tyr Arg Ser Ser Met Trp Asp Gln Gly 
        675                  680                685            
Asp Ile Leu Glu Ser Gly Ser Gly Thr Ser Leu Glu Glu Ala Leu Glu 
    690                  695                700                
Ala Pro Ala Thr Asp Leu Ala Arg Pro Glu Phe Cys Tyr Glu Ala Glu 
705                  710                715                  720
Ser Pro Asp Glu Ala Ala Leu Val His Ala Ala His Ala Tyr Ser Phe 
                725                  730                735    
Thr Leu Val Ser Arg Thr Pro Glu Gln Val Thr Val Arg Leu Pro Gln 
            740                  745                750        
Gly Thr Cys Leu Thr Phe Ser Leu Leu Cys Thr Leu Gly Phe Asp Ser 
        755                  760                765            
Val Arg Lys Arg Met Ser Val Val Val Arg His Pro Leu Thr Gly Glu 
    770                  775                780                
Ile Val Val Tyr Thr Lys Gly Ala Asp Ser Val Ile Met Asp Leu Leu 
785                  790                795                  800
Glu Asp Pro Ala Cys Val Pro Asp Ile Asn Met Glu Lys Lys Leu Arg 
                805                  810                815    
Lys Ile Arg Ala Arg Thr Gln Lys His Leu Asp Leu Tyr Ala Arg Asp 
            820                  825                830        
Gly Leu Arg Thr Leu Cys Ile Ala Lys Lys Val Val Ser Glu Glu Asp 
        835                  840                845            
Phe Arg Arg Trp Ala Ser Phe Arg Arg Glu Ala Glu Ala Ser Leu Asp 
    850                  855                860                
Asn Arg Asp Glu Leu Leu Met Glu Thr Ala Gln His Leu Glu Asn Gln 
865                  870                875                  880
Leu Thr Leu Leu Gly Ala Thr Gly Ile Glu Asp Arg Leu Gln Glu Gly 
                885                  890                895    
Val Pro Asp Thr Ile Ala Thr Leu Arg Glu Ala Gly Ile Gln Leu Trp 
            900                  905                910        
Val Leu Thr Gly Asp Lys Gln Glu Thr Ala Val Asn Ile Ala His Ser 
        915                  920                925            
Cys Arg Leu Leu Asn Gln Thr Asp Thr Val Tyr Thr Ile Asn Thr Glu 
    930                  935                940                
Asn Gln Glu Thr Cys Glu Ser Ile Leu Asn Cys Ala Leu Glu Glu Leu 
945                  950                955                  960
Lys Gln Phe Arg Glu Leu Gln Lys Pro Asp Arg Lys Leu Phe Gly Phe 
                965                  970                975    
Arg Leu Pro Ser Lys Thr Pro Ser Ile Thr Ser Glu Ala Val Val Pro 
            980                  985                990        
Glu Ala Gly Leu Val Ile Asp Gly Lys Thr Leu Asn Ala Ile Phe Gln 
        995                  1000                1005            
Gly Lys Leu Glu Lys Lys Phe Leu Glu Leu Thr Gln Tyr Cys Arg Ser 
    1010                1015                1020                
Val Leu Cys Cys Arg Ser Thr Pro Leu Gln Lys Ser Met Ile Val Lys 
1025                1030                1035                1040
Leu Val Arg Asp Lys Leu Arg Val Met Thr Leu Ser Ile Gly Asp Gly 
                1045                1050                1055    
Ala Asn Asp Val Ser Met Ile Gln Ala Ala Asp Ile Gly Ile Gly Ile 
            1060                1065                1070        
Ser Gly Gln Glu Gly Met Gln Ala Val Met Ser Ser Asp Phe Ala Ile 
        1075                1080                1085            
Thr Arg Phe Lys His Leu Lys Lys Leu Leu Leu Val His Gly His Trp 
    1090                1095                1100                
Cys Tyr Ser Arg Leu Ala Arg Met Val Val Tyr Tyr Leu Tyr Lys Asn 
1105                1110                1115                1120
Val Cys Tyr Val Asn Leu Leu Phe Trp Tyr Gln Phe Phe Cys Gly Phe 
                1125                1130                1135    
Ser Ser Ser Thr Met Ile Asp Tyr Trp Gln Met Ile Phe Phe Asn Leu 
            1140                1145                1150        
Phe Phe Thr Ser Leu Pro Pro Leu Val Phe Gly Val Leu Asp Lys Asp 
        1155                1160                1165            
Ile Ser Ala Glu Thr Leu Leu Ala Leu Pro Glu Leu Tyr Lys Ser Gly 
    1170                1175                1180                
Gln Asn Ser Glu Cys Tyr Asn Leu Ser Thr Phe Trp Ile Ser Met Val 
1185                1190                1195                1200
Asp Ala Phe Tyr Gln Ser Leu Ile Cys Phe Phe Ile Pro Tyr Leu Ala 
                1205                1210                1215    
Tyr Lys Gly Ser Asp Ile Asp Val Phe Thr Phe Gly Thr Pro Ile Asn 
            1220                1225                1230        
Thr Ile Ser Leu Thr Thr Ile Leu Leu His Gln Ala Met Glu Met Lys 
        1235                1240                1245            
Thr Trp Thr Ile Phe His Gly Val Val Leu Leu Gly Ser Phe Leu Met 
    1250                1255                1260                
Tyr Phe Leu Val Ser Leu Leu Tyr Asn Ala Thr Cys Val Ile Cys Asn 
1265                1270                1275                1280
Ser Pro Thr Asn Pro Tyr Trp Val Met Glu Gly Gln Leu Ser Asn Pro 
                1285                1290                1295    
Thr Phe Tyr Leu Val Cys Phe Leu Thr Pro Val Val Ala Leu Leu Pro 
            1300                1305                1310        
Arg Tyr Phe Phe Leu Ser Leu Gln Gly Thr Cys Gly Lys Ser Leu Ile 
        1315                1320                1325            
Ser Lys Ala Gln Lys Ile Asp Lys Leu Pro Pro Asp Lys Arg Asn Leu 
    1330                1335                1340                
Glu Ile Gln Ser Trp Arg Ser Arg Gln Arg Pro Ala Pro Val Pro Glu 
1345                1350                1355                1360
Val Ala Arg Pro Thr His His Pro Val Ser Ser Ile Thr Gly Gln Asp 
                1365                1370                1375    
Phe Ser Ala Ser Thr Pro Lys Ser Ser Asn Pro Pro Lys Arg Lys His 
            1380                1385                1390        
Val Glu Glu Ser Val Leu His Glu Gln Arg Cys Gly Thr Glu Cys Met 
        1395                1400                1405            
Arg Asp Asp Ser Cys Ser Gly Asp Ser Ser Ala Gln Leu Ser Ser Gly 
    1410                1415                1420                
Glu His Leu Leu Gly Pro Asn Arg Ile Met Ala Tyr Ser Arg Gly Gln 
1425                1430                1435                1440
Thr Asp Met Cys Arg Cys Ser Lys Arg Ser Ser His Arg Arg Ser Gln 
                1445                1450                1455    
Ser Ser Leu Thr Ile 
            1460    

<210> 103
<211> 4747
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..4747
<223> /mol_type="DNA"
      /note="ATP10B (preferred gene fragment)"
      /organism="artificial sequences"

<400> 103
gcttcctttg acaagagaat cagaaccgac tggtgacatt tgtttcaatg aaagcaacag       60

tgtgaagaaa cttgcaattt attctatcaa caatttgccc agctaaagta ctacatctgc      120

cccctcttcc tgtggctgta ggggcacagc aaaggtcact ggtctaacct ccttaaaggg      180

actccgctaa cagaaaccac caaatggagt gggaaaaaga aaagggatcc cctatcccca      240

actccagcca tctagattaa gaaaagccag ctgactggac agtagcacag cccagtcacc      300

tcatggacaa atttcctagg aaagaacctc tcccatctat ctacttatca ctctcctttg      360

aggttccggc cacagatctt cgcctgctgc tggaaatggc cctctcagtg gactcatcgt      420

ggcatcggtg gcagtggaga gtcagagatg gcttccccca ttgtcctcgg aaaccacacc      480

gctgctctct ccagagaaag ggagacagag ctacaacttg acacagcagc gggtcgtgtt      540

ccccaacaac agcatattcc atcaagattg ggaagaggtc tccaggagat acctggcaac      600

agaacctgca caaccaaata caccctcttc accttcctgc cccggaatct ctttgagcaa      660

tttcatagat gggctaacct ctatttcctg ttcctggtga ttttgaactg gatgccctcc      720

tggaagtctt ccacagagaa atcaccatgt taccattggc cattgtcctg ttcgtcatca      780

tgatcaagga tggcatggag gacttcaaga gacaccgctt tgataaagca ataaactgct      840

ccaacatcga atttatgaaa gaaaagagca gacctatgtg cagaagtgct ggaaggatgt      900

gcgcgtggga gacttcatcc aaatgaaatg caatgagatt gtcccagcag acatactcct      960

ccttttttcc tctgccccaa tgggatatgc catctggaaa ctgccagctt ggatggagag     1020

acaaacctca agcaaagatg tgtcgtgaag ggcttctcac agcaggaggt acagttcgaa     1080

ccagagcttt tccacaatac ctcgtgtgtg agaaacccaa caaccacctc aacaaattta     1140

agggttatat ggagcatcct gaccagacca ggactggctt tggctgtgag agtcttctgc     1200

ttcgaggctg caccatcaga aacaccgaat ggctgttggc attgtcatct atgcaggcca     1260

tgagacgaaa gccatgctga acaacagtgg cccccggtac aaacgcagca agattgagcg     1320

gcgcatgaat atagacatct tcttctgcat tgggacctca tcctcatgtg ccttattgga     1380

gctgtaggtc acagcatctg gaatgggacc tttgaagaac accctccctt cgatgtgcca     1440

gatgccaatg gcagcttcct tcccagtgcc cttgggggct tcacatgttc ctcacaatga     1500

tcatcctgct ccaggtgctg atccccatct ctttgtatgt ctccattgag ctggtgaagc     1560

tcgggcaagt gttcttcttg agcaatgacc ttgacctgta tgatgaagaa ccgatttatc     1620

cattcaatgt cgagccctca acatcgcaga ggacttgggc cagatccagt acatcttctc     1680

cgataagacg gggaccctga cagagaacaa gatggtgttc cgacgttgca ccatcagggc     1740

agcgagtatt ctcaccaaga aaatgctaag cgactggaga ccccaaagga gctggactca     1800

gatggtgaag agtggaccca ataccaatgc ctgtccttct cggctagatg ggcccaggat     1860

ccacaactat gagaagccaa aaaggtgctc agcctctgag gaggagccag agtgcccggg     1920

tgcccatcca gggccactac cggcaaaggt ctatggggca ccgtgaaagc tcacagcctc     1980

ctgtggcctt agcagctcca tagaaaaaga tgtaactcca gataaaaacc tactgaccaa     2040

ggttcgagat gctgccctgt ggttggagac cttgtcagac agcagacctg ccaaggcttc     2100

cctctccacc acctcctcat tgctgatttc ttccttgcct taaccatctg caactctgtc     2160

atggtgtcca caaccaccga gcccaggcag agggtcacca tcaaaccctc aagcaaggct     2220

ctggggacgt ccctggagaa gattagcagc tcttccagaa gttgaagcta ttgagcctca     2280

gccagtcatt ctcatccact gcaccctctg acacagacct cggggagagc ttaggggcca     2340

acgtggccac cacagactcg gatgagagag agatgcatct gtgtgcagtg gaggtgactc     2400

cactgatgac ggtggctaca ggagcagcat gtgggaccag ggcgacatcc tggagtctgg     2460

gtcaggcact tccttggagg aggcattgga ggccccagca cagacctggc caggcctgag     2520

ttctgttacg aggctgagag ccctgatgag gccgccctgg tgcacgctgc ccatgcctac     2580

agcttcacac tagtgtcccg gacacctgag caggtgactg tgcgctgccc cagggcacct     2640

gcctcacctt cagcctcctc tgcaccctgg gctttgactc tgtcaggaag agaatgtctg     2700

tggttgtgag gcacccactg actggcgaga ttgttgtcta caccaagggt gcgactcggt     2760

catcatggac ctgctggaag acccagcctg cgtacctgac attaatatgg aaaagaagct     2820

gagaaaaatc cgagcccgga cccaaaagca tctagacttg tatgcaagag atggcctgcc     2880

acactatgca ttgccaagaa ggttgtaagc gaagaggact tccggagatg ggccagtttc     2940

cggcgtgagg ctgaggcatc cctcgacaac cgagatgagc ttctcatgga aactgcacag     3000

catctgagaa tcaactcacc ttacttggag ccactgggat cgaagaccgg ctgcaggaag     3060

gagttccaga tacgattgcc actctgcggg aggctgggat ccagctctgg gtcctgactg     3120

gagataagca ggaacagcgg tcaacattgc ccattcctgc agactgttaa atcagaccga     3180

cactgtttat accatcaata cagagaatca ggagacctgt gaatccatcc tcaattgtgc     3240

attggaagag ctaaagcaat tcgtgaacta cagaagccag accgcaagct ctttggattc     3300

cgcttacctt ccaagacacc atccatcacc tcagaagctg tggttccaga agctggattg     3360

gtcatcgatg ggaagacatt gaatgcctct tccagggaaa gctagagaag aagtttctgg     3420

aattgaccca gtattgtcgg tccgtcctgt gctgccgctc cacgccactc cagaagagta     3480

tgatagtcaa gctggtgcga gacaagttgc gcgtatgacc ctttccatag gtgatggagc     3540

aaatgatgta agcatgattc aagctgctga tattggaatt ggaatatctg gacaggaagg     3600

catgcaggct gtcatgtcca gcgactttgc catcacccgc ttaagcatct caagaagttg     3660

ctgctcgtgc atggccactg gtgttactcg cgcctggcca ggatggtggt gtactacctc     3720

tacaagaacg tgtgctacgt caacctgctc ttctggtatc agttcttcgt ggtttctcca     3780

gctccaccat gattgattac tggcagatga tattcttcaa tctcttcttt acctccttgc     3840

ctcctcttgt ctttggagtc cttgacaaag acatctctgc agaaacactc ctggcttgcc     3900

tgagctatac aagagtggcc agaactctga gtgctataac ctgtcgactt tctggatttc     3960

tatggtggat gcattctacc agagcctcat ctgtttcttt atcccttacc tggcctataa     4020

ggctctgata tagatgtctt tacctttggg acaccaatca acaccatctc cctcaccaca     4080

atccttttgc accaggcaat ggaaatgaag acatggacca ttttccacgg agtcgtgctc     4140

ctcggcagct cctgatgtac tttctggtat ccctcctgta caatgccacc tgcgtcatct     4200

gcaacagccc caccaatccc tattgggtga tggaaggcca gctctcaaac cccactttct     4260

acctcgtctg ctttctacac cagttgttgc tcttctccca agatactttt tcctgtctct     4320

gcaaggaact tgtgggaagt ctctaatctc aaaagctcag aaaattgaca aactcccccc     4380

agacaaaaga aacctggaaa tccgagttgg agaagcagac agaggcctgc ccctgtcccc     4440

gaagtggctc gaccaactca ccacccagtg tcatctatca caggacagga cttcagtgcc     4500

agcaccccaa agagctctaa ccctcccaag ggaagcatgt ggaagagtca gtactccacg     4560

aacagagatg tggcacggag tgcatgaggg atgactcatg ctcaggggac tcctcagctc     4620

aactctcatc cggggagcac ctgctgggac ctaacagata atggcctact caagaggaca     4680

gactgatatg tgccggtgct caaagaggag cagccatcgc cgatcccaga gttcactgac     4740

catatga                                                               4747


<210> 104
<211> 5297
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..5297
<223> /mol_type="DNA"
      /note="EBF1-ATP10B (preferred fusion gene)"
      /organism="artificial sequences"

<400> 104
atgtttggga ttcaggaaag catccaacgg agtggaagca gcatgaagga agagccgctg       60

ggcagcggca tgaacgcggt gcggacgtgg atgcagggcg ccggggtgct ggacgccaac      120

acggcgggca gagcggggtg ggtctggccc gggctcactt tgagaagcag ccgccttcca      180

atctgcggaa atccaacttc ttccacttcg tcctggccct ctacgacaga cagggccagc      240

ccgtggagat cgagggacag cgtttgtggg gttcgtggag aaggaaaaag aagccaacag      300

cgaaaagacc aataacggaa ttcactaccg gcttcagctt ctctacagca atgggataag      360

gacggagcag gatttctacg tcgcctcatt gactccatga caaaacaagc catagtgtat      420

gaaggccaag acaagaaccc agaaatgtgc cgagtcttgc tcacacatga gatcatgtgc      480

agccgctgtt gtgacaagaa aagctgtgca accgaaatga gactccctca gatccagtga      540

taattgacag gcttcctttg acaagagaat cagaaccgac tggtgacatt tgtttcaatg      600

aaagcaacag tgtgaagaaa cttgcaattt attctatcaa caatttgccc agctaaagta      660

ctacatctgc cccctcttcc tgtggctgta ggggcacagc aaaggtcact ggtctaacct      720

ccttaaaggg actccgctaa cagaaaccac caaatggagt gggaaaaaga aaagggatcc      780

cctatcccca actccagcca tctagattaa gaaaagccag ctgactggac agtagcacag      840

cccagtcacc tcatggacaa atttcctagg aaagaacctc tcccatctat ctacttatca      900

ctctcctttg aggttccggc cacagatctt cgcctgctgc tggaaatggc cctctcagtg      960

gactcatcgt ggcatcggtg gcagtggaga gtcagagatg gcttccccca ttgtcctcgg     1020

aaaccacacc gctgctctct ccagagaaag ggagacagag ctacaacttg acacagcagc     1080

gggtcgtgtt ccccaacaac agcatattcc atcaagattg ggaagaggtc tccaggagat     1140

acctggcaac agaacctgca caaccaaata caccctcttc accttcctgc cccggaatct     1200

ctttgagcaa tttcatagat gggctaacct ctatttcctg ttcctggtga ttttgaactg     1260

gatgccctcc tggaagtctt ccacagagaa atcaccatgt taccattggc cattgtcctg     1320

ttcgtcatca tgatcaagga tggcatggag gacttcaaga gacaccgctt tgataaagca     1380

ataaactgct ccaacatcga atttatgaaa gaaaagagca gacctatgtg cagaagtgct     1440

ggaaggatgt gcgcgtggga gacttcatcc aaatgaaatg caatgagatt gtcccagcag     1500

acatactcct ccttttttcc tctgccccaa tgggatatgc catctggaaa ctgccagctt     1560

ggatggagag acaaacctca agcaaagatg tgtcgtgaag ggcttctcac agcaggaggt     1620

acagttcgaa ccagagcttt tccacaatac ctcgtgtgtg agaaacccaa caaccacctc     1680

aacaaattta agggttatat ggagcatcct gaccagacca ggactggctt tggctgtgag     1740

agtcttctgc ttcgaggctg caccatcaga aacaccgaat ggctgttggc attgtcatct     1800

atgcaggcca tgagacgaaa gccatgctga acaacagtgg cccccggtac aaacgcagca     1860

agattgagcg gcgcatgaat atagacatct tcttctgcat tgggacctca tcctcatgtg     1920

ccttattgga gctgtaggtc acagcatctg gaatgggacc tttgaagaac accctccctt     1980

cgatgtgcca gatgccaatg gcagcttcct tcccagtgcc cttgggggct tcacatgttc     2040

ctcacaatga tcatcctgct ccaggtgctg atccccatct ctttgtatgt ctccattgag     2100

ctggtgaagc tcgggcaagt gttcttcttg agcaatgacc ttgacctgta tgatgaagaa     2160

ccgatttatc cattcaatgt cgagccctca acatcgcaga ggacttgggc cagatccagt     2220

acatcttctc cgataagacg gggaccctga cagagaacaa gatggtgttc cgacgttgca     2280

ccatcagggc agcgagtatt ctcaccaaga aaatgctaag cgactggaga ccccaaagga     2340

gctggactca gatggtgaag agtggaccca ataccaatgc ctgtccttct cggctagatg     2400

ggcccaggat ccacaactat gagaagccaa aaaggtgctc agcctctgag gaggagccag     2460

agtgcccggg tgcccatcca gggccactac cggcaaaggt ctatggggca ccgtgaaagc     2520

tcacagcctc ctgtggcctt agcagctcca tagaaaaaga tgtaactcca gataaaaacc     2580

tactgaccaa ggttcgagat gctgccctgt ggttggagac cttgtcagac agcagacctg     2640

ccaaggcttc cctctccacc acctcctcat tgctgatttc ttccttgcct taaccatctg     2700

caactctgtc atggtgtcca caaccaccga gcccaggcag agggtcacca tcaaaccctc     2760

aagcaaggct ctggggacgt ccctggagaa gattagcagc tcttccagaa gttgaagcta     2820

ttgagcctca gccagtcatt ctcatccact gcaccctctg acacagacct cggggagagc     2880

ttaggggcca acgtggccac cacagactcg gatgagagag agatgcatct gtgtgcagtg     2940

gaggtgactc cactgatgac ggtggctaca ggagcagcat gtgggaccag ggcgacatcc     3000

tggagtctgg gtcaggcact tccttggagg aggcattgga ggccccagca cagacctggc     3060

caggcctgag ttctgttacg aggctgagag ccctgatgag gccgccctgg tgcacgctgc     3120

ccatgcctac agcttcacac tagtgtcccg gacacctgag caggtgactg tgcgctgccc     3180

cagggcacct gcctcacctt cagcctcctc tgcaccctgg gctttgactc tgtcaggaag     3240

agaatgtctg tggttgtgag gcacccactg actggcgaga ttgttgtcta caccaagggt     3300

gcgactcggt catcatggac ctgctggaag acccagcctg cgtacctgac attaatatgg     3360

aaaagaagct gagaaaaatc cgagcccgga cccaaaagca tctagacttg tatgcaagag     3420

atggcctgcc acactatgca ttgccaagaa ggttgtaagc gaagaggact tccggagatg     3480

ggccagtttc cggcgtgagg ctgaggcatc cctcgacaac cgagatgagc ttctcatgga     3540

aactgcacag catctgagaa tcaactcacc ttacttggag ccactgggat cgaagaccgg     3600

ctgcaggaag gagttccaga tacgattgcc actctgcggg aggctgggat ccagctctgg     3660

gtcctgactg gagataagca ggaacagcgg tcaacattgc ccattcctgc agactgttaa     3720

atcagaccga cactgtttat accatcaata cagagaatca ggagacctgt gaatccatcc     3780

tcaattgtgc attggaagag ctaaagcaat tcgtgaacta cagaagccag accgcaagct     3840

ctttggattc cgcttacctt ccaagacacc atccatcacc tcagaagctg tggttccaga     3900

agctggattg gtcatcgatg ggaagacatt gaatgcctct tccagggaaa gctagagaag     3960

aagtttctgg aattgaccca gtattgtcgg tccgtcctgt gctgccgctc cacgccactc     4020

cagaagagta tgatagtcaa gctggtgcga gacaagttgc gcgtatgacc ctttccatag     4080

gtgatggagc aaatgatgta agcatgattc aagctgctga tattggaatt ggaatatctg     4140

gacaggaagg catgcaggct gtcatgtcca gcgactttgc catcacccgc ttaagcatct     4200

caagaagttg ctgctcgtgc atggccactg gtgttactcg cgcctggcca ggatggtggt     4260

gtactacctc tacaagaacg tgtgctacgt caacctgctc ttctggtatc agttcttcgt     4320

ggtttctcca gctccaccat gattgattac tggcagatga tattcttcaa tctcttcttt     4380

acctccttgc ctcctcttgt ctttggagtc cttgacaaag acatctctgc agaaacactc     4440

ctggcttgcc tgagctatac aagagtggcc agaactctga gtgctataac ctgtcgactt     4500

tctggatttc tatggtggat gcattctacc agagcctcat ctgtttcttt atcccttacc     4560

tggcctataa ggctctgata tagatgtctt tacctttggg acaccaatca acaccatctc     4620

cctcaccaca atccttttgc accaggcaat ggaaatgaag acatggacca ttttccacgg     4680

agtcgtgctc ctcggcagct cctgatgtac tttctggtat ccctcctgta caatgccacc     4740

tgcgtcatct gcaacagccc caccaatccc tattgggtga tggaaggcca gctctcaaac     4800

cccactttct acctcgtctg ctttctacac cagttgttgc tcttctccca agatactttt     4860

tcctgtctct gcaaggaact tgtgggaagt ctctaatctc aaaagctcag aaaattgaca     4920

aactcccccc agacaaaaga aacctggaaa tccgagttgg agaagcagac agaggcctgc     4980

ccctgtcccc gaagtggctc gaccaactca ccacccagtg tcatctatca caggacagga     5040

cttcagtgcc agcaccccaa agagctctaa ccctcccaag ggaagcatgt ggaagagtca     5100

gtactccacg aacagagatg tggcacggag tgcatgaggg atgactcatg ctcaggggac     5160

tcctcagctc aactctcatc cggggagcac ctgctgggac ctaacagata atggcctact     5220

caagaggaca gactgatatg tgccggtgct caaagaggag cagccatcgc cgatcccaga     5280

gttcactgac catatga                                                    5297


<210> 105
<211> 231
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..231
<223> /mol_type="protein"
      /note="EBF1-ATP10B (preferred fusion protein)"
      /organism="artificial sequences"

<400> 105
Met Phe Gly Ile Gln Glu Ser Ile Gln Arg Ser Gly Ser Ser Met Lys 
1               5                   10                   15    
Glu Glu Pro Leu Gly Ser Gly Met Asn Ala Val Arg Thr Trp Met Gln 
            20                   25                  30        
Gly Ala Gly Val Leu Asp Ala Asn Thr Ala Ala Gln Ser Gly Val Gly 
        35                   40                  45            
Leu Ala Arg Ala His Phe Glu Lys Gln Pro Pro Ser Asn Leu Arg Lys 
    50                   55                  60                
Ser Asn Phe Phe His Phe Val Leu Ala Leu Tyr Asp Arg Gln Gly Gln 
65                   70                  75                  80
Pro Val Glu Ile Glu Arg Thr Ala Phe Val Gly Phe Val Glu Lys Glu 
                85                   90                  95    
Lys Glu Ala Asn Ser Glu Lys Thr Asn Asn Gly Ile His Tyr Arg Leu 
            100                  105                110        
Gln Leu Leu Tyr Ser Asn Gly Ile Arg Thr Glu Gln Asp Phe Tyr Val 
        115                  120                125            
Arg Leu Ile Asp Ser Met Thr Lys Gln Ala Ile Val Tyr Glu Gly Gln 
    130                  135                140                
Asp Lys Asn Pro Glu Met Cys Arg Val Leu Leu Thr His Glu Ile Met 
145                  150                155                  160
Cys Ser Arg Cys Cys Asp Lys Lys Ser Cys Gly Asn Arg Asn Glu Thr 
                165                  170                175    
Pro Ser Asp Pro Val Ile Ile Asp Arg Leu Pro Leu Thr Arg Glu Ser 
            180                  185                190        
Glu Pro Thr Gly Asp Ile Cys Phe Asn Glu Ser Asn Ser Val Lys Lys 
        195                  200                205            
Leu Ala Ile Tyr Ser Asp Gln Gln Phe Ala Gln Leu Lys Tyr Tyr Ile 
    210                  215                220                
Cys Pro Leu Phe Leu Trp Leu 
225                  230    

<210> 106
<211> 810
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..810
<223> /mol_type="DNA"
      /note="PIGF (CCDS nucleotide sequence of PIGF (Gene ID: 5281)
      including 150 nucleotides of the 5’UTR)"
      /organism="Homo sapiens"

<400> 106
ttttgcccgc ccctcccttt ctgcgactgg ctgttaggcg tgggtctccg ccccacagcc      60

ctgcgcctgc gcgtgggcct gggccgcccg tcgtacctga tggtaggagg cagtagttcc     120

ccgcttccct tccgcgggag ggagagttag atgaaagata acgatatcaa gagactactg     180

tatacccatc ttttatgcat attttcaatt atcctaagtg tcttcattcc atcactcttc     240

ttggagaact tctcaatatt ggaaacacac ttgacatggt tgtgcatctg ttctggtttt     300

gtaactgctg tcaatctagt actatattta gtagtgaaac caaatacatc ctctaaaaga     360

agttcattat cacacaaggt aactggattt ttgaaatgct gtatctactt tcttatgtct     420

tgtttctcct ttcatgtaat ttttgttctg tatggagcac cactgataga gttggcattg     480

gaaacatttt tatttgcagt tattttgtct acttttacta ctgtgccttg cttatgtttg     540

ttaggaccaa acctcaaagc atggctaaga gtgttcagta gaaatggagt tacatccata     600

tgggagaata gtctccagat cactacaatt tctagctttg taggagcatg gcttggagca     660

cttcctattc cactggattg ggaaagacca tggcaggtat ggcccatctc ctgtacgctt     720

ggagcgacct ttggctacgt ggctggcctt gttatttcac cactctggat atactggaat     780

agaaagcaac ttacatacaa gaacaattaa                                      810


<210> 107
<211> 219
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..219
<223> /mol_type="protein"
      /note="PIGF (full-length protein)"
      /organism="Homo sapiens"

<400> 107
Met Lys Asp Asn Asp Ile Lys Arg Leu Leu Tyr Thr His Leu Leu Cys 
1               5                   10                   15    
Ile Phe Ser Ile Ile Leu Ser Val Phe Ile Pro Ser Leu Phe Leu Glu 
            20                   25                  30        
Asn Phe Ser Ile Leu Glu Thr His Leu Thr Trp Leu Cys Ile Cys Ser 
        35                   40                  45            
Gly Phe Val Thr Ala Val Asn Leu Val Leu Tyr Leu Val Val Lys Pro 
    50                   55                  60                
Asn Thr Ser Ser Lys Arg Ser Ser Leu Ser His Lys Val Thr Gly Phe 
65                   70                  75                  80
Leu Lys Cys Cys Ile Tyr Phe Leu Met Ser Cys Phe Ser Phe His Val 
                85                   90                  95    
Ile Phe Val Leu Tyr Gly Ala Pro Leu Ile Glu Leu Ala Leu Glu Thr 
            100                  105                110        
Phe Leu Phe Ala Val Ile Leu Ser Thr Phe Thr Thr Val Pro Cys Leu 
        115                  120                125            
Cys Leu Leu Gly Pro Asn Leu Lys Ala Trp Leu Arg Val Phe Ser Arg 
    130                  135                140                
Asn Gly Val Thr Ser Ile Trp Glu Asn Ser Leu Gln Ile Thr Thr Ile 
145                  150                155                  160
Ser Ser Phe Val Gly Ala Trp Leu Gly Ala Leu Pro Ile Pro Leu Asp 
                165                  170                175    
Trp Glu Arg Pro Trp Gln Val Trp Pro Ile Ser Cys Thr Leu Gly Ala 
            180                  185                190        
Thr Phe Gly Tyr Val Ala Gly Leu Val Ile Ser Pro Leu Trp Ile Tyr 
        195                  200                205            
Trp Asn Arg Lys Gln Leu Thr Tyr Lys Asn Asn 
    210                  215                

<210> 108
<211> 150
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..150
<223> /mol_type="DNA"
      /note="PIGF (preferred gene fragment (part of the 5’UTR))"
      /organism="artificial sequences"

<400> 108
ttttgcccgc ccctcccttt ctgcgactgg ctgttaggcg tgggtctccg ccccacagcc      60

ctgcgcctgc gcgtgggcct gggccgcccg tcgtacctga tggtaggagg cagtagttcc     120

ccgcttccct tccgcgggag ggagagttag                                      150


<210> 109
<400> 109
000

<210> 110
<211> 669
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..669
<223> /mol_type="DNA"
      /note="CHMP3 (CHMP3 (Gene ID: 51652))"
      /organism="Homo sapiens"

<400> 110
atggggctgt ttggaaagac ccaggagaag ccgcccaaag aactggtcaa tgagtggtca      60

ttgaagataa gaaaggaaat gagagttgtt gacaggcaaa taagggatat ccaaagagaa     120

gaagaaaaag tgaaacgatc tgtgaaagat gctgccaaga agggccagaa ggatgtctgc     180

atagttctgg ccaaggagat gatcaggtca aggaaggctg tgagcaagct gtatgcatcc     240

aaagcacaca tgaactcagt gctcatgggg atgaagaacc agctcgcggt cttgcgagtg     300

gctggttccc tgcagaagag cacagaagtg atgaaggcca tgcaaagtct tgtgaagatt     360

ccagagattc aggccaccat gagggagttg tccaaagaaa tgatgaaggc tgggatcata     420

gaggagatgt tagaggacac ttttgaaagc atggacgatc aggaagaaat ggaggaagaa     480

gcagaaatgg aaattgacag aattctcttt gaaattacag caggggcctt gggcaaagca     540

cccagtaaag tgactgatgc ccttccagag ccagaacctc caggagcgat ggctgcctca     600

gaggatgagg aggaggagga agaggctctg gaggccatgc agtcccggct ggccacactc     660

cgcagctag                                                             669


<210> 111
<211> 222
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..222
<223> /mol_type="protein"
      /note="CHMP3 (full-length protein)"
      /organism="Homo sapiens"

<400> 111
Met Gly Leu Phe Gly Lys Thr Gln Glu Lys Pro Pro Lys Glu Leu Val 
1               5                   10                   15    
Asn Glu Trp Ser Leu Lys Ile Arg Lys Glu Met Arg Val Val Asp Arg 
            20                   25                  30        
Gln Ile Arg Asp Ile Gln Arg Glu Glu Glu Lys Val Lys Arg Ser Val 
        35                   40                  45            
Lys Asp Ala Ala Lys Lys Gly Gln Lys Asp Val Cys Ile Val Leu Ala 
    50                   55                  60                
Lys Glu Met Ile Arg Ser Arg Lys Ala Val Ser Lys Leu Tyr Ala Ser 
65                   70                  75                  80
Lys Ala His Met Asn Ser Val Leu Met Gly Met Lys Asn Gln Leu Ala 
                85                   90                  95    
Val Leu Arg Val Ala Gly Ser Leu Gln Lys Ser Thr Glu Val Met Lys 
            100                  105                110        
Ala Met Gln Ser Leu Val Lys Ile Pro Glu Ile Gln Ala Thr Met Arg 
        115                  120                125            
Glu Leu Ser Lys Glu Met Met Lys Ala Gly Ile Ile Glu Glu Met Leu 
    130                  135                140                
Glu Asp Thr Phe Glu Ser Met Asp Asp Gln Glu Glu Met Glu Glu Glu 
145                  150                155                  160
Ala Glu Met Glu Ile Asp Arg Ile Leu Phe Glu Ile Thr Ala Gly Ala 
                165                  170                175    
Leu Gly Lys Ala Pro Ser Lys Val Thr Asp Ala Leu Pro Glu Pro Glu 
            180                  185                190        
Pro Pro Gly Ala Met Ala Ala Ser Glu Asp Glu Glu Glu Glu Glu Glu 
        195                  200                205            
Ala Leu Glu Ala Met Gln Ser Arg Leu Ala Thr Leu Arg Ser 
    210                  215                220        

<210> 112
<211> 563
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..563
<223> /mol_type="DNA"
      /note="CHMP3 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 112
atatccaaag agaagaagaa aaagtgaaac gatctgtgaa agatgctgcc aagaagggcc      60

agaaggatgt ctgcatagtt ctggccaagg agatgatcag gtcaaggaag gctgtgagca     120

agctgtatgc atccaaagca cacatgaact cagtgctcat ggggatgaag aaccagctcg     180

cggtcttgcg agtggctggt tccctgcaga agagcacaga agtgatgaag gccatgcaaa     240

gtcttgtgaa gattccagag attcaggcca ccatgaggga gttgtccaaa gaaatgatga     300

aggctgggat catagaggag atgttagagg acacttttga aagcatggac gatcaggaag     360

aaatggagga agaagcagaa atggaaattg acagaattct ctttgaaatt acagcagggg     420

ccttgggcaa agcacccagt aaagtgactg atgcccttcc agagccagaa cctccaggag     480

cgatggctgc ctcagaggat gaggaggagg aggaagaggc tctggaggcc atgcagtccc     540

ggctggccac actccgcagc tag                                             563


<210> 113
<211> 713
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..713
<223> /mol_type="DNA"
      /note="PIGF-CHMP3 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 113
ttttgcccgc ccctcccttt ctgcgactgg ctgttaggcg tgggtctccg ccccacagcc      60

ctgcgcctgc gcgtgggcct gggccgcccg tcgtacctga tggtaggagg cagtagttcc     120

ccgcttccct tccgcgggag ggagagttag atatccaaag agaagaagaa aaagtgaaac     180

gatctgtgaa agatgctgcc aagaagggcc agaaggatgt ctgcatagtt ctggccaagg     240

agatgatcag gtcaaggaag gctgtgagca agctgtatgc atccaaagca cacatgaact     300

cagtgctcat ggggatgaag aaccagctcg cggtcttgcg agtggctggt tccctgcaga     360

agagcacaga agtgatgaag gccatgcaaa gtcttgtgaa gattccagag attcaggcca     420

ccatgaggga gttgtccaaa gaaatgatga aggctgggat catagaggag atgttagagg     480

acacttttga aagcatggac gatcaggaag aaatggagga agaagcagaa atggaaattg     540

acagaattct ctttgaaatt acagcagggg ccttgggcaa agcacccagt aaagtgactg     600

atgcccttcc agagccagaa cctccaggag cgatggctgc ctcagaggat gaggaggagg     660

aggaagaggc tctggaggcc atgcagtccc ggctggccac actccgcagc tag            713


<210> 114
<400> 114
000

<210> 115
<211> 4500
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4500
<223> /mol_type="DNA"
      /note="GRLF1 (GRLF1 (Gene ID: 2909))"
      /organism="Homo sapiens"

<400> 115
atgatgatgg caagaaagca agatgtccga attcccacct acaacatcag tgtggtggga       60

ttatctggga ccgagaagga aaagggccag tgtgggattg gaaagtcttg tttgtgcaac      120

cgcttcgtgc gcccgagtgc tgacgagttt cacttggacc atacctccgt cctcagcacc      180

agtgactttg gagggcgagt ggtcaataat gaccactttc tctactgggg agaagttagc      240

cgctccctgg aggattgtgt ggaatgtaag atgcacattg tggagcagac tgaatttatt      300

gatgatcaga cttttcaacc tcatcgaagc acggccctgc agccctatat caagagagct      360

gctgcgacca agcttgcatc agctgaaaaa ctcatgtact tttgcactga ccagctgggg      420

ctggagcagg actttgagca gaaacaaatg ccagacggaa agctgctggt tgatggtttt      480

cttcttggta ttgatgttag caggggcatg aataggaact ttgatgacca gctcaagttt      540

gtctccaatc tctacaatca gcttgcaaaa acaaaaaagc ccatagtggt ggtcctgact      600

aagtgtgacg aaggtgttga gcggtacatt agagatgcac atacttttgc cttaagcaaa      660

aagaacctcc aggttgtgga gacctcagcg agatccaatg taaacgtgga cttggctttc      720

agcaccttag tgcaactcat tgataaaagt cggggaaaga caaaaatcat tccttatttt      780

gaagctctca agcagcagag tcagcagata gctacagcaa aagacaagta tgagtggctg      840

gtgagtcgca ttgtgaaaaa ccacaatgag aactggctga gtgtcagccg aaagatgcag      900

gcctctccag aataccagga ctatgtctac ctggaaggga ctcagaaagc caagaagctg      960

tttctacagc acatccaccg cctcaagcat gagcatatcg agcgtaggag aaagctgtac     1020

ctggcagccc tgccattagc ttttgaagct cttataccta atctagatga aatagaccac     1080

ctaagctgca taaaagccaa aaagctctta gaaaccaagc cagaattctt gaagtggttt     1140

gttgtgcttg aagagacccc atgggatgcc accagtcaca ttgacaacat ggaaaacgaa     1200

cggattccct ttgatttaat ggataccgtc cctgcagagc agctatacga ggcccactta     1260

gagaagctga ggaacgaaag gaaaagagtt gagatgcgaa gggcgtttaa agaaaacctg     1320

gagacttctc ctttcataac tcccggaaag ccttgggaag aggcccgtag ttttattatg     1380

aatgaggatt tctaccagtg gctggaggaa tctgtataca tggatattta tggcaaacac     1440

caaaagcaaa ttatagataa agcaaaggaa gaatttcagg agttgctttt ggaatattca     1500

gaattgtttt atgaactgga gctggatgct aagcccagca aggagaagat gggtgttatt     1560

caggatgttc tgggagagga acagcgattt aaagcattac aaaagctcca agcagagcgt     1620

gatgccctta ttctgaaaca cattcatttt gtgtaccacc caacaaagga gacatgcccc     1680

agctgcccag cttgtgtgga cgctaagatt gagcacttga ttagttctcg gtttatccgg     1740

ccgtctgacc ggaatcagaa aaattcactc tctgacccta acattgatag aatcaacttg     1800

gttatattgg gcaaagacgg ccttgcccga gagttggcca atgagattcg agctctttgt     1860

acaaatgatg acaagtatgt gatagatggt aaaatgtatg agctttccct gaggccaata     1920

gaggggaatg tcaggcttcc tgtgaactct ttccagacgc caacatttca gccccacggc     1980

tgtctctgcc tttacaattc aaaggaatcg ctatcctatg tagtggaaag tatagagaag     2040

agtagagagt ccacgctggg ccggcgggat aatcatttag tccatctccc ccttacatta     2100

attttggtta acaagagagg agacaccagt ggagagactc tgcatagctt aatacagcaa     2160

ggtcaacaaa ttgctagcaa acttcagtgt gtctttctcg accctgcttc tgctggcatt     2220

ggttacggac gcaacattaa tgaaaagcaa atcagtcaag ttttgaaggg actcctggac     2280

tctaagcgta acttaaacct ggtcagttct actgctagca tcaaagattt ggctgatgtt     2340

gatctgcgaa ttgttatgtg tctgatgtgt ggagatcctt ttagtgcaga tgacatactt     2400

tttcctgtcc ttcagtccca aacctgtaaa tcttcccatt gtggaagcaa caactctgtt     2460

ttacttgaac taccaatcgg actgcacaag aagcggattg aactgtctgt tctttcatac     2520

cattcctcct ttagcatcag aaagagccgg ttggttcatg ggtacattgt tttttattca     2580

gccaaacgta aggcctcttt ggctatgtta cgtgcctttc tttgtgaagt gcaggatatt     2640

atccctattc agcttgtagc actcactgat ggcgctgtag atgtcctgga caatgactta     2700

agtagggaac agctaactga gggggaggag attgctcaag aaattgacgg aaggttcaca     2760

agcatcccct gtagccaacc ccagcataaa cttgagatct ttcacccatt ttttaaagat     2820

gtggtggaaa aaaagaacat aatcgaggct actcatatgt acgataatgc tgccgaggcc     2880

tgtagcacca ccgaagaggt gtttaactcc ccccgggcag gatcaccgct ctgcaactca     2940

aacctgcagg attcagaaga agatatcgag ccatcttaca gcctgtttcg agaagacaca     3000

tcactgcctt ctctgtccaa agaccattct aagctctcta tggaactgga gggaaatgat     3060

gggctgtctt tcattatgag caattttgag agtaaactga acaacaaagt acctccgcca     3120

gtcaaaccaa agcctcctgt ccattttgaa attacaaagg gggatctatc ttatttagac     3180

caaggccata gggatggaca gaggaagtct gtgtcttcta gcccctggct gcctcaggat     3240

gggtttgatc cttctgacta tgctgaaccc atggatgctg tggtgaagcc aaggaatgaa     3300

gaagaaaaca tatactccgt gccccatgac agcacccaag gcaaaatcat caccattcgg     3360

aatatcaaca aagcccagtc caacggcagc gggaatggtt ctgacagtga aatggacacc     3420

agctctctag agcgagggcg caaggtttcc atcgtgagca agccagtgct gtacaggacg     3480

agatgcaccc ggctggggcg gtttgctagt taccggacca gcttcagcgt ggggagtgat     3540

gatgagctgg ggcccatccg gaagaaagag gaggatcagg catcccaggg ttataaaggg     3600

gacaatgctg tcattccata cgaaacagac gaagacccgc ggaggaggaa tattcttcgc     3660

agcctaagga ggaacactaa gaaaccaaag cccaaacccc ggccatccat cacaaaggca     3720

acctgggaga gtaactattt tggggtgccc ttaacaactg tcgtgactcc agagaagccg     3780

atccccattt ttattgaaag atgtattgag tacattgaag ccacaggact gagcacggaa     3840

ggcatctacc gggtcagcgg gaacaagtct gagatggaga gtctgcagag acagtttgat     3900

caagaccaca acctggacct ggcagagaaa gactttacgg tgaataccgt ggctggtgcc     3960

atgaagagct ttttctcaga actgcctgac cccctggtcc cgtataacat gcagatcgac     4020

ttggtggaag cacacaaaat caacgaccgg gagcagaagt tgcatgccct taaggaggta     4080

ttaaagaaat ttccaaagga aaaccacgaa gtcttcaagt atgtcatctc tcacctaaac     4140

aaggtcagcc acaacaacaa ggtgaatctc atgaccagcg agaacctctc catctgcttc     4200

tggcccacct tgatgagacc tgatttcagc actatggacg ccctcacagc cacgcgcacc     4260

taccagacaa tcattgaact ctttatccag cagtgcccct tcttcttcta caatcggccc     4320

atcaccgagc cccccggcgc caggcccagc tccccctctg ccgtggcttc caccgtcccc     4380

ttcctcactt ccacgcctgt cacaagtcag ccgtcgcccc cacagtcgcc tccacccacc     4440

ccccagtccc caatgcagcc actgcttccc tcccagcttc aagccgaaca cacgctgtga     4500


<210> 116
<211> 1499
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1499
<223> /mol_type="protein"
      /note="GRLF1 (full-length protein)"
      /organism="Homo sapiens"

<400> 116
Met Met Met Ala Arg Lys Gln Asp Val Arg Ile Pro Thr Tyr Asn Ile 
1               5                   10                   15    
Ser Val Val Gly Leu Ser Gly Thr Glu Lys Glu Lys Gly Gln Cys Gly 
            20                   25                  30        
Ile Gly Lys Ser Cys Leu Cys Asn Arg Phe Val Arg Pro Ser Ala Asp 
        35                   40                  45            
Glu Phe His Leu Asp His Thr Ser Val Leu Ser Thr Ser Asp Phe Gly 
    50                   55                  60                
Gly Arg Val Val Asn Asn Asp His Phe Leu Tyr Trp Gly Glu Val Ser 
65                   70                  75                  80
Arg Ser Leu Glu Asp Cys Val Glu Cys Lys Met His Ile Val Glu Gln 
                85                   90                  95    
Thr Glu Phe Ile Asp Asp Gln Thr Phe Gln Pro His Arg Ser Thr Ala 
            100                  105                110        
Leu Gln Pro Tyr Ile Lys Arg Ala Ala Ala Thr Lys Leu Ala Ser Ala 
        115                  120                125            
Glu Lys Leu Met Tyr Phe Cys Thr Asp Gln Leu Gly Leu Glu Gln Asp 
    130                  135                140                
Phe Glu Gln Lys Gln Met Pro Asp Gly Lys Leu Leu Val Asp Gly Phe 
145                  150                155                  160
Leu Leu Gly Ile Asp Val Ser Arg Gly Met Asn Arg Asn Phe Asp Asp 
                165                  170                175    
Gln Leu Lys Phe Val Ser Asn Leu Tyr Asn Gln Leu Ala Lys Thr Lys 
            180                  185                190        
Lys Pro Ile Val Val Val Leu Thr Lys Cys Asp Glu Gly Val Glu Arg 
        195                  200                205            
Tyr Ile Arg Asp Ala His Thr Phe Ala Leu Ser Lys Lys Asn Leu Gln 
    210                  215                220                
Val Val Glu Thr Ser Ala Arg Ser Asn Val Asn Val Asp Leu Ala Phe 
225                  230                235                  240
Ser Thr Leu Val Gln Leu Ile Asp Lys Ser Arg Gly Lys Thr Lys Ile 
                245                  250                255    
Ile Pro Tyr Phe Glu Ala Leu Lys Gln Gln Ser Gln Gln Ile Ala Thr 
            260                  265                270        
Ala Lys Asp Lys Tyr Glu Trp Leu Val Ser Arg Ile Val Lys Asn His 
        275                  280                285            
Asn Glu Asn Trp Leu Ser Val Ser Arg Lys Met Gln Ala Ser Pro Glu 
    290                  295                300                
Tyr Gln Asp Tyr Val Tyr Leu Glu Gly Thr Gln Lys Ala Lys Lys Leu 
305                  310                315                  320
Phe Leu Gln His Ile His Arg Leu Lys His Glu His Ile Glu Arg Arg 
                325                  330                335    
Arg Lys Leu Tyr Leu Ala Ala Leu Pro Leu Ala Phe Glu Ala Leu Ile 
            340                  345                350        
Pro Asn Leu Asp Glu Ile Asp His Leu Ser Cys Ile Lys Ala Lys Lys 
        355                  360                365            
Leu Leu Glu Thr Lys Pro Glu Phe Leu Lys Trp Phe Val Val Leu Glu 
    370                  375                380                
Glu Thr Pro Trp Asp Ala Thr Ser His Ile Asp Asn Met Glu Asn Glu 
385                  390                395                  400
Arg Ile Pro Phe Asp Leu Met Asp Thr Val Pro Ala Glu Gln Leu Tyr 
                405                  410                415    
Glu Ala His Leu Glu Lys Leu Arg Asn Glu Arg Lys Arg Val Glu Met 
            420                  425                430        
Arg Arg Ala Phe Lys Glu Asn Leu Glu Thr Ser Pro Phe Ile Thr Pro 
        435                  440                445            
Gly Lys Pro Trp Glu Glu Ala Arg Ser Phe Ile Met Asn Glu Asp Phe 
    450                  455                460                
Tyr Gln Trp Leu Glu Glu Ser Val Tyr Met Asp Ile Tyr Gly Lys His 
465                  470                475                  480
Gln Lys Gln Ile Ile Asp Lys Ala Lys Glu Glu Phe Gln Glu Leu Leu 
                485                  490                495    
Leu Glu Tyr Ser Glu Leu Phe Tyr Glu Leu Glu Leu Asp Ala Lys Pro 
            500                  505                510        
Ser Lys Glu Lys Met Gly Val Ile Gln Asp Val Leu Gly Glu Glu Gln 
        515                  520                525            
Arg Phe Lys Ala Leu Gln Lys Leu Gln Ala Glu Arg Asp Ala Leu Ile 
    530                  535                540                
Leu Lys His Ile His Phe Val Tyr His Pro Thr Lys Glu Thr Cys Pro 
545                  550                555                  560
Ser Cys Pro Ala Cys Val Asp Ala Lys Ile Glu His Leu Ile Ser Ser 
                565                  570                575    
Arg Phe Ile Arg Pro Ser Asp Arg Asn Gln Lys Asn Ser Leu Ser Asp 
            580                  585                590        
Pro Asn Ile Asp Arg Ile Asn Leu Val Ile Leu Gly Lys Asp Gly Leu 
        595                  600                605            
Ala Arg Glu Leu Ala Asn Glu Ile Arg Ala Leu Cys Thr Asn Asp Asp 
    610                  615                620                
Lys Tyr Val Ile Asp Gly Lys Met Tyr Glu Leu Ser Leu Arg Pro Ile 
625                  630                635                  640
Glu Gly Asn Val Arg Leu Pro Val Asn Ser Phe Gln Thr Pro Thr Phe 
                645                  650                655    
Gln Pro His Gly Cys Leu Cys Leu Tyr Asn Ser Lys Glu Ser Leu Ser 
            660                  665                670        
Tyr Val Val Glu Ser Ile Glu Lys Ser Arg Glu Ser Thr Leu Gly Arg 
        675                  680                685            
Arg Asp Asn His Leu Val His Leu Pro Leu Thr Leu Ile Leu Val Asn 
    690                  695                700                
Lys Arg Gly Asp Thr Ser Gly Glu Thr Leu His Ser Leu Ile Gln Gln 
705                  710                715                  720
Gly Gln Gln Ile Ala Ser Lys Leu Gln Cys Val Phe Leu Asp Pro Ala 
                725                  730                735    
Ser Ala Gly Ile Gly Tyr Gly Arg Asn Ile Asn Glu Lys Gln Ile Ser 
            740                  745                750        
Gln Val Leu Lys Gly Leu Leu Asp Ser Lys Arg Asn Leu Asn Leu Val 
        755                  760                765            
Ser Ser Thr Ala Ser Ile Lys Asp Leu Ala Asp Val Asp Leu Arg Ile 
    770                  775                780                
Val Met Cys Leu Met Cys Gly Asp Pro Phe Ser Ala Asp Asp Ile Leu 
785                  790                795                  800
Phe Pro Val Leu Gln Ser Gln Thr Cys Lys Ser Ser His Cys Gly Ser 
                805                  810                815    
Asn Asn Ser Val Leu Leu Glu Leu Pro Ile Gly Leu His Lys Lys Arg 
            820                  825                830        
Ile Glu Leu Ser Val Leu Ser Tyr His Ser Ser Phe Ser Ile Arg Lys 
        835                  840                845            
Ser Arg Leu Val His Gly Tyr Ile Val Phe Tyr Ser Ala Lys Arg Lys 
    850                  855                860                
Ala Ser Leu Ala Met Leu Arg Ala Phe Leu Cys Glu Val Gln Asp Ile 
865                  870                875                  880
Ile Pro Ile Gln Leu Val Ala Leu Thr Asp Gly Ala Val Asp Val Leu 
                885                  890                895    
Asp Asn Asp Leu Ser Arg Glu Gln Leu Thr Glu Gly Glu Glu Ile Ala 
            900                  905                910        
Gln Glu Ile Asp Gly Arg Phe Thr Ser Ile Pro Cys Ser Gln Pro Gln 
        915                  920                925            
His Lys Leu Glu Ile Phe His Pro Phe Phe Lys Asp Val Val Glu Lys 
    930                  935                940                
Lys Asn Ile Ile Glu Ala Thr His Met Tyr Asp Asn Ala Ala Glu Ala 
945                  950                955                  960
Cys Ser Thr Thr Glu Glu Val Phe Asn Ser Pro Arg Ala Gly Ser Pro 
                965                  970                975    
Leu Cys Asn Ser Asn Leu Gln Asp Ser Glu Glu Asp Ile Glu Pro Ser 
            980                  985                990        
Tyr Ser Leu Phe Arg Glu Asp Thr Ser Leu Pro Ser Leu Ser Lys Asp 
        995                  1000                1005            
His Ser Lys Leu Ser Met Glu Leu Glu Gly Asn Asp Gly Leu Ser Phe 
    1010                1015                1020                
Ile Met Ser Asn Phe Glu Ser Lys Leu Asn Asn Lys Val Pro Pro Pro 
1025                1030                1035                1040
Val Lys Pro Lys Pro Pro Val His Phe Glu Ile Thr Lys Gly Asp Leu 
                1045                1050                1055    
Ser Tyr Leu Asp Gln Gly His Arg Asp Gly Gln Arg Lys Ser Val Ser 
            1060                1065                1070        
Ser Ser Pro Trp Leu Pro Gln Asp Gly Phe Asp Pro Ser Asp Tyr Ala 
        1075                1080                1085            
Glu Pro Met Asp Ala Val Val Lys Pro Arg Asn Glu Glu Glu Asn Ile 
    1090                1095                1100                
Tyr Ser Val Pro His Asp Ser Thr Gln Gly Lys Ile Ile Thr Ile Arg 
1105                1110                1115                1120
Asn Ile Asn Lys Ala Gln Ser Asn Gly Ser Gly Asn Gly Ser Asp Ser 
                1125                1130                1135    
Glu Met Asp Thr Ser Ser Leu Glu Arg Gly Arg Lys Val Ser Ile Val 
            1140                1145                1150        
Ser Lys Pro Val Leu Tyr Arg Thr Arg Cys Thr Arg Leu Gly Arg Phe 
        1155                1160                1165            
Ala Ser Tyr Arg Thr Ser Phe Ser Val Gly Ser Asp Asp Glu Leu Gly 
    1170                1175                1180                
Pro Ile Arg Lys Lys Glu Glu Asp Gln Ala Ser Gln Gly Tyr Lys Gly 
1185                1190                1195                1200
Asp Asn Ala Val Ile Pro Tyr Glu Thr Asp Glu Asp Pro Arg Arg Arg 
                1205                1210                1215    
Asn Ile Leu Arg Ser Leu Arg Arg Asn Thr Lys Lys Pro Lys Pro Lys 
            1220                1225                1230        
Pro Arg Pro Ser Ile Thr Lys Ala Thr Trp Glu Ser Asn Tyr Phe Gly 
        1235                1240                1245            
Val Pro Leu Thr Thr Val Val Thr Pro Glu Lys Pro Ile Pro Ile Phe 
    1250                1255                1260                
Ile Glu Arg Cys Ile Glu Tyr Ile Glu Ala Thr Gly Leu Ser Thr Glu 
1265                1270                1275                1280
Gly Ile Tyr Arg Val Ser Gly Asn Lys Ser Glu Met Glu Ser Leu Gln 
                1285                1290                1295    
Arg Gln Phe Asp Gln Asp His Asn Leu Asp Leu Ala Glu Lys Asp Phe 
            1300                1305                1310        
Thr Val Asn Thr Val Ala Gly Ala Met Lys Ser Phe Phe Ser Glu Leu 
        1315                1320                1325            
Pro Asp Pro Leu Val Pro Tyr Asn Met Gln Ile Asp Leu Val Glu Ala 
    1330                1335                1340                
His Lys Ile Asn Asp Arg Glu Gln Lys Leu His Ala Leu Lys Glu Val 
1345                1350                1355                1360
Leu Lys Lys Phe Pro Lys Glu Asn His Glu Val Phe Lys Tyr Val Ile 
                1365                1370                1375    
Ser His Leu Asn Lys Val Ser His Asn Asn Lys Val Asn Leu Met Thr 
            1380                1385                1390        
Ser Glu Asn Leu Ser Ile Cys Phe Trp Pro Thr Leu Met Arg Pro Asp 
        1395                1400                1405            
Phe Ser Thr Met Asp Ala Leu Thr Ala Thr Arg Thr Tyr Gln Thr Ile 
    1410                1415                1420                
Ile Glu Leu Phe Ile Gln Gln Cys Pro Phe Phe Phe Tyr Asn Arg Pro 
1425                1430                1435                1440
Ile Thr Glu Pro Pro Gly Ala Arg Pro Ser Ser Pro Ser Ala Val Ala 
                1445                1450                1455    
Ser Thr Val Pro Phe Leu Thr Ser Thr Pro Val Thr Ser Gln Pro Ser 
            1460                1465                1470        
Pro Pro Gln Ser Pro Pro Pro Thr Pro Gln Ser Pro Met Gln Pro Leu 
        1475                1480                1485            
Leu Pro Ser Gln Leu Gln Ala Glu His Thr Leu 
    1490                1495                

<210> 117
<211> 3826
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..3826
<223> /mol_type="DNA"
      /note="GRLF1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 117
atgatgatgg caagaaagca agatgtccga attcccacct acaacatcag tgtggtggga       60

ttatctggga ccgagaagga aaagggccag tgtgggattg gaaagtcttg tttgtgcaac      120

cgcttcgtgc gcccgagtgc tgacgagttt cacttggacc atacctccgt cctcagcacc      180

agtgactttg gagggcgagt ggtcaataat gaccactttc tctactgggg agaagttagc      240

cgctccctgg aggattgtgt ggaatgtaag atgcacattg tggagcagac tgaatttatt      300

gatgatcaga cttttcaacc tcatcgaagc acggccctgc agccctatat caagagagct      360

gctgcgacca agcttgcatc agctgaaaaa ctcatgtact tttgcactga ccagctgggg      420

ctggagcagg actttgagca gaaacaaatg ccagacggaa agctgctggt tgatggtttt      480

cttcttggta ttgatgttag caggggcatg aataggaact ttgatgacca gctcaagttt      540

gtctccaatc tctacaatca gcttgcaaaa acaaaaaagc ccatagtggt ggtcctgact      600

aagtgtgacg aaggtgttga gcggtacatt agagatgcac atacttttgc cttaagcaaa      660

aagaacctcc aggttgtgga gacctcagcg agatccaatg taaacgtgga cttggctttc      720

agcaccttag tgcaactcat tgataaaagt cggggaaaga caaaaatcat tccttatttt      780

gaagctctca agcagcagag tcagcagata gctacagcaa aagacaagta tgagtggctg      840

gtgagtcgca ttgtgaaaaa ccacaatgag aactggctga gtgtcagccg aaagatgcag      900

gcctctccag aataccagga ctatgtctac ctggaaggga ctcagaaagc caagaagctg      960

tttctacagc acatccaccg cctcaagcat gagcatatcg agcgtaggag aaagctgtac     1020

ctggcagccc tgccattagc ttttgaagct cttataccta atctagatga aatagaccac     1080

ctaagctgca taaaagccaa aaagctctta gaaaccaagc cagaattctt gaagtggttt     1140

gttgtgcttg aagagacccc atgggatgcc accagtcaca ttgacaacat ggaaaacgaa     1200

cggattccct ttgatttaat ggataccgtc cctgcagagc agctatacga ggcccactta     1260

gagaagctga ggaacgaaag gaaaagagtt gagatgcgaa gggcgtttaa agaaaacctg     1320

gagacttctc ctttcataac tcccggaaag ccttgggaag aggcccgtag ttttattatg     1380

aatgaggatt tctaccagtg gctggaggaa tctgtataca tggatattta tggcaaacac     1440

caaaagcaaa ttatagataa agcaaaggaa gaatttcagg agttgctttt ggaatattca     1500

gaattgtttt atgaactgga gctggatgct aagcccagca aggagaagat gggtgttatt     1560

caggatgttc tgggagagga acagcgattt aaagcattac aaaagctcca agcagagcgt     1620

gatgccctta ttctgaaaca cattcatttt gtgtaccacc caacaaagga gacatgcccc     1680

agctgcccag cttgtgtgga cgctaagatt gagcacttga ttagttctcg gtttatccgg     1740

ccgtctgacc ggaatcagaa aaattcactc tctgacccta acattgatag aatcaacttg     1800

gttatattgg gcaaagacgg ccttgcccga gagttggcca atgagattcg agctctttgt     1860

acaaatgatg acaagtatgt gatagatggt aaaatgtatg agctttccct gaggccaata     1920

gaggggaatg tcaggcttcc tgtgaactct ttccagacgc caacatttca gccccacggc     1980

tgtctctgcc tttacaattc aaaggaatcg ctatcctatg tagtggaaag tatagagaag     2040

agtagagagt ccacgctggg ccggcgggat aatcatttag tccatctccc ccttacatta     2100

attttggtta acaagagagg agacaccagt ggagagactc tgcatagctt aatacagcaa     2160

ggtcaacaaa ttgctagcaa acttcagtgt gtctttctcg accctgcttc tgctggcatt     2220

ggttacggac gcaacattaa tgaaaagcaa atcagtcaag ttttgaaggg actcctggac     2280

tctaagcgta acttaaacct ggtcagttct actgctagca tcaaagattt ggctgatgtt     2340

gatctgcgaa ttgttatgtg tctgatgtgt ggagatcctt ttagtgcaga tgacatactt     2400

tttcctgtcc ttcagtccca aacctgtaaa tcttcccatt gtggaagcaa caactctgtt     2460

ttacttgaac taccaatcgg actgcacaag aagcggattg aactgtctgt tctttcatac     2520

cattcctcct ttagcatcag aaagagccgg ttggttcatg ggtacattgt tttttattca     2580

gccaaacgta aggcctcttt ggctatgtta cgtgcctttc tttgtgaagt gcaggatatt     2640

atccctattc agcttgtagc actcactgat ggcgctgtag atgtcctgga caatgactta     2700

agtagggaac agctaactga gggggaggag attgctcaag aaattgacgg aaggttcaca     2760

agcatcccct gtagccaacc ccagcataaa cttgagatct ttcacccatt ttttaaagat     2820

gtggtggaaa aaaagaacat aatcgaggct actcatatgt acgataatgc tgccgaggcc     2880

tgtagcacca ccgaagaggt gtttaactcc ccccgggcag gatcaccgct ctgcaactca     2940

aacctgcagg attcagaaga agatatcgag ccatcttaca gcctgtttcg agaagacaca     3000

tcactgcctt ctctgtccaa agaccattct aagctctcta tggaactgga gggaaatgat     3060

gggctgtctt tcattatgag caattttgag agtaaactga acaacaaagt acctccgcca     3120

gtcaaaccaa agcctcctgt ccattttgaa attacaaagg gggatctatc ttatttagac     3180

caaggccata gggatggaca gaggaagtct gtgtcttcta gcccctggct gcctcaggat     3240

gggtttgatc cttctgacta tgctgaaccc atggatgctg tggtgaagcc aaggaatgaa     3300

gaagaaaaca tatactccgt gccccatgac agcacccaag gcaaaatcat caccattcgg     3360

aatatcaaca aagcccagtc caacggcagc gggaatggtt ctgacagtga aatggacacc     3420

agctctctag agcgagggcg caaggtttcc atcgtgagca agccagtgct gtacaggacg     3480

agatgcaccc ggctggggcg gtttgctagt taccggacca gcttcagcgt ggggagtgat     3540

gatgagctgg ggcccatccg gaagaaagag gaggatcagg catcccaggg ttataaaggg     3600

gacaatgctg tcattccata cgaaacagac gaagacccgc ggaggaggaa tattcttcgc     3660

agcctaagga ggaacactaa gaaaccaaag cccaaacccc ggccatccat cacaaaggca     3720

acctgggaga gtaactattt tggggtgccc ttaacaactg tcgtgactcc agagaagccg     3780

atccccattt ttattgaaag atgtattgag tacattgaag ccacag                    3826


<210> 118
<211> 1275
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..1275
<223> /mol_type="protein"
      /note="GRLF1 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 118
Met Met Met Ala Arg Lys Gln Asp Val Arg Ile Pro Thr Tyr Asn Ile 
1               5                   10                   15    
Ser Val Val Gly Leu Ser Gly Thr Glu Lys Glu Lys Gly Gln Cys Gly 
            20                   25                  30        
Ile Gly Lys Ser Cys Leu Cys Asn Arg Phe Val Arg Pro Ser Ala Asp 
        35                   40                  45            
Glu Phe His Leu Asp His Thr Ser Val Leu Ser Thr Ser Asp Phe Gly 
    50                   55                  60                
Gly Arg Val Val Asn Asn Asp His Phe Leu Tyr Trp Gly Glu Val Ser 
65                   70                  75                  80
Arg Ser Leu Glu Asp Cys Val Glu Cys Lys Met His Ile Val Glu Gln 
                85                   90                  95    
Thr Glu Phe Ile Asp Asp Gln Thr Phe Gln Pro His Arg Ser Thr Ala 
            100                  105                110        
Leu Gln Pro Tyr Ile Lys Arg Ala Ala Ala Thr Lys Leu Ala Ser Ala 
        115                  120                125            
Glu Lys Leu Met Tyr Phe Cys Thr Asp Gln Leu Gly Leu Glu Gln Asp 
    130                  135                140                
Phe Glu Gln Lys Gln Met Pro Asp Gly Lys Leu Leu Val Asp Gly Phe 
145                  150                155                  160
Leu Leu Gly Ile Asp Val Ser Arg Gly Met Asn Arg Asn Phe Asp Asp 
                165                  170                175    
Gln Leu Lys Phe Val Ser Asn Leu Tyr Asn Gln Leu Ala Lys Thr Lys 
            180                  185                190        
Lys Pro Ile Val Val Val Leu Thr Lys Cys Asp Glu Gly Val Glu Arg 
        195                  200                205            
Tyr Ile Arg Asp Ala His Thr Phe Ala Leu Ser Lys Lys Asn Leu Gln 
    210                  215                220                
Val Val Glu Thr Ser Ala Arg Ser Asn Val Asn Val Asp Leu Ala Phe 
225                  230                235                  240
Ser Thr Leu Val Gln Leu Ile Asp Lys Ser Arg Gly Lys Thr Lys Ile 
                245                  250                255    
Ile Pro Tyr Phe Glu Ala Leu Lys Gln Gln Ser Gln Gln Ile Ala Thr 
            260                  265                270        
Ala Lys Asp Lys Tyr Glu Trp Leu Val Ser Arg Ile Val Lys Asn His 
        275                  280                285            
Asn Glu Asn Trp Leu Ser Val Ser Arg Lys Met Gln Ala Ser Pro Glu 
    290                  295                300                
Tyr Gln Asp Tyr Val Tyr Leu Glu Gly Thr Gln Lys Ala Lys Lys Leu 
305                  310                315                  320
Phe Leu Gln His Ile His Arg Leu Lys His Glu His Ile Glu Arg Arg 
                325                  330                335    
Arg Lys Leu Tyr Leu Ala Ala Leu Pro Leu Ala Phe Glu Ala Leu Ile 
            340                  345                350        
Pro Asn Leu Asp Glu Ile Asp His Leu Ser Cys Ile Lys Ala Lys Lys 
        355                  360                365            
Leu Leu Glu Thr Lys Pro Glu Phe Leu Lys Trp Phe Val Val Leu Glu 
    370                  375                380                
Glu Thr Pro Trp Asp Ala Thr Ser His Ile Asp Asn Met Glu Asn Glu 
385                  390                395                  400
Arg Ile Pro Phe Asp Leu Met Asp Thr Val Pro Ala Glu Gln Leu Tyr 
                405                  410                415    
Glu Ala His Leu Glu Lys Leu Arg Asn Glu Arg Lys Arg Val Glu Met 
            420                  425                430        
Arg Arg Ala Phe Lys Glu Asn Leu Glu Thr Ser Pro Phe Ile Thr Pro 
        435                  440                445            
Gly Lys Pro Trp Glu Glu Ala Arg Ser Phe Ile Met Asn Glu Asp Phe 
    450                  455                460                
Tyr Gln Trp Leu Glu Glu Ser Val Tyr Met Asp Ile Tyr Gly Lys His 
465                  470                475                  480
Gln Lys Gln Ile Ile Asp Lys Ala Lys Glu Glu Phe Gln Glu Leu Leu 
                485                  490                495    
Leu Glu Tyr Ser Glu Leu Phe Tyr Glu Leu Glu Leu Asp Ala Lys Pro 
            500                  505                510        
Ser Lys Glu Lys Met Gly Val Ile Gln Asp Val Leu Gly Glu Glu Gln 
        515                  520                525            
Arg Phe Lys Ala Leu Gln Lys Leu Gln Ala Glu Arg Asp Ala Leu Ile 
    530                  535                540                
Leu Lys His Ile His Phe Val Tyr His Pro Thr Lys Glu Thr Cys Pro 
545                  550                555                  560
Ser Cys Pro Ala Cys Val Asp Ala Lys Ile Glu His Leu Ile Ser Ser 
                565                  570                575    
Arg Phe Ile Arg Pro Ser Asp Arg Asn Gln Lys Asn Ser Leu Ser Asp 
            580                  585                590        
Pro Asn Ile Asp Arg Ile Asn Leu Val Ile Leu Gly Lys Asp Gly Leu 
        595                  600                605            
Ala Arg Glu Leu Ala Asn Glu Ile Arg Ala Leu Cys Thr Asn Asp Asp 
    610                  615                620                
Lys Tyr Val Ile Asp Gly Lys Met Tyr Glu Leu Ser Leu Arg Pro Ile 
625                  630                635                  640
Glu Gly Asn Val Arg Leu Pro Val Asn Ser Phe Gln Thr Pro Thr Phe 
                645                  650                655    
Gln Pro His Gly Cys Leu Cys Leu Tyr Asn Ser Lys Glu Ser Leu Ser 
            660                  665                670        
Tyr Val Val Glu Ser Ile Glu Lys Ser Arg Glu Ser Thr Leu Gly Arg 
        675                  680                685            
Arg Asp Asn His Leu Val His Leu Pro Leu Thr Leu Ile Leu Val Asn 
    690                  695                700                
Lys Arg Gly Asp Thr Ser Gly Glu Thr Leu His Ser Leu Ile Gln Gln 
705                  710                715                  720
Gly Gln Gln Ile Ala Ser Lys Leu Gln Cys Val Phe Leu Asp Pro Ala 
                725                  730                735    
Ser Ala Gly Ile Gly Tyr Gly Arg Asn Ile Asn Glu Lys Gln Ile Ser 
            740                  745                750        
Gln Val Leu Lys Gly Leu Leu Asp Ser Lys Arg Asn Leu Asn Leu Val 
        755                  760                765            
Ser Ser Thr Ala Ser Ile Lys Asp Leu Ala Asp Val Asp Leu Arg Ile 
    770                  775                780                
Val Met Cys Leu Met Cys Gly Asp Pro Phe Ser Ala Asp Asp Ile Leu 
785                  790                795                  800
Phe Pro Val Leu Gln Ser Gln Thr Cys Lys Ser Ser His Cys Gly Ser 
                805                  810                815    
Asn Asn Ser Val Leu Leu Glu Leu Pro Ile Gly Leu His Lys Lys Arg 
            820                  825                830        
Ile Glu Leu Ser Val Leu Ser Tyr His Ser Ser Phe Ser Ile Arg Lys 
        835                  840                845            
Ser Arg Leu Val His Gly Tyr Ile Val Phe Tyr Ser Ala Lys Arg Lys 
    850                  855                860                
Ala Ser Leu Ala Met Leu Arg Ala Phe Leu Cys Glu Val Gln Asp Ile 
865                  870                875                  880
Ile Pro Ile Gln Leu Val Ala Leu Thr Asp Gly Ala Val Asp Val Leu 
                885                  890                895    
Asp Asn Asp Leu Ser Arg Glu Gln Leu Thr Glu Gly Glu Glu Ile Ala 
            900                  905                910        
Gln Glu Ile Asp Gly Arg Phe Thr Ser Ile Pro Cys Ser Gln Pro Gln 
        915                  920                925            
His Lys Leu Glu Ile Phe His Pro Phe Phe Lys Asp Val Val Glu Lys 
    930                  935                940                
Lys Asn Ile Ile Glu Ala Thr His Met Tyr Asp Asn Ala Ala Glu Ala 
945                  950                955                  960
Cys Ser Thr Thr Glu Glu Val Phe Asn Ser Pro Arg Ala Gly Ser Pro 
                965                  970                975    
Leu Cys Asn Ser Asn Leu Gln Asp Ser Glu Glu Asp Ile Glu Pro Ser 
            980                  985                990        
Tyr Ser Leu Phe Arg Glu Asp Thr Ser Leu Pro Ser Leu Ser Lys Asp 
        995                  1000                1005            
His Ser Lys Leu Ser Met Glu Leu Glu Gly Asn Asp Gly Leu Ser Phe 
    1010                1015                1020                
Ile Met Ser Asn Phe Glu Ser Lys Leu Asn Asn Lys Val Pro Pro Pro 
1025                1030                1035                1040
Val Lys Pro Lys Pro Pro Val His Phe Glu Ile Thr Lys Gly Asp Leu 
                1045                1050                1055    
Ser Tyr Leu Asp Gln Gly His Arg Asp Gly Gln Arg Lys Ser Val Ser 
            1060                1065                1070        
Ser Ser Pro Trp Leu Pro Gln Asp Gly Phe Asp Pro Ser Asp Tyr Ala 
        1075                1080                1085            
Glu Pro Met Asp Ala Val Val Lys Pro Arg Asn Glu Glu Glu Asn Ile 
    1090                1095                1100                
Tyr Ser Val Pro His Asp Ser Thr Gln Gly Lys Ile Ile Thr Ile Arg 
1105                1110                1115                1120
Asn Ile Asn Lys Ala Gln Ser Asn Gly Ser Gly Asn Gly Ser Asp Ser 
                1125                1130                1135    
Glu Met Asp Thr Ser Ser Leu Glu Arg Gly Arg Lys Val Ser Ile Val 
            1140                1145                1150        
Ser Lys Pro Val Leu Tyr Arg Thr Arg Cys Thr Arg Leu Gly Arg Phe 
        1155                1160                1165            
Ala Ser Tyr Arg Thr Ser Phe Ser Val Gly Ser Asp Asp Glu Leu Gly 
    1170                1175                1180                
Pro Ile Arg Lys Lys Glu Glu Asp Gln Ala Ser Gln Gly Tyr Lys Gly 
1185                1190                1195                1200
Asp Asn Ala Val Ile Pro Tyr Glu Thr Asp Glu Asp Pro Arg Arg Arg 
                1205                1210                1215    
Asn Ile Leu Arg Ser Leu Arg Arg Asn Thr Lys Lys Pro Lys Pro Lys 
            1220                1225                1230        
Pro Arg Pro Ser Ile Thr Lys Ala Thr Trp Glu Ser Asn Tyr Phe Gly 
        1235                1240                1245            
Val Pro Leu Thr Thr Val Val Thr Pro Glu Lys Pro Ile Pro Ile Phe 
    1250                1255                1260                
Ile Glu Arg Cys Ile Glu Tyr Ile Glu Ala Thr 
1265                1270                1275

<210> 119
<211> 2028
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2028
<223> /mol_type="DNA"
      /note="DLGAP1 (CCDS nucleotide sequence of DLGAP1 (Gene ID: 9229))"
      /organism="Homo sapiens"

<400> 119
atgaacttaa ttttccataa agacattctg tttggcattc cagctaataa ggttccacaa      60

gatgaatgga cagggtacac cccacgaggt aaagatgatg aaattccatg ccgaagaatg     120

cggagtggca gttatatcaa ggccatgggg gatgaagaca gtggagactc agacacgagt     180

cctaagcctt ctccaaaagt tgctgcgcgg agagaaagct atctcaaggc tactcagcca     240

tcccttacag aactcaccac actcaaaatc tccaatgaac actcacccaa actccagatc     300

cggagtcata gttacctgag ggcagtgagt gaagtctcca tcaaccggag cctggacagc     360

ctggaccctg caggcttgct cacatcacca aagttccgct ccaggaatga gagctacatg     420

cgagccatga gcaccatcag ccaggtgagc gagatggaag tgaacgggca gttcgagtcc     480

gtgtgcgagt ccgtgttcag cgagctggag tcgcaggccg tggaagcgct ggacctgccc     540

atgcccggct gcttccgcat gcggagccac agctatgtgc gggccattga gaaaggctgc     600

tcccaggacg acgagtgcgt gtccctgagg tcgtcctcgc cgccgcgcac caccaccacc     660

gttaggacca tccagagcag cacggtgtca tcttgcatta caacatataa gaagacacca     720

cctccagtcc cacccagaac taccacgaaa cctttcattt ctatcacagc ccagagtagc     780

acagagtcag cccaggatgc ctacatggac ggacagggcc agcgaggaga tattatcagc     840

cagtctggac tcagcaactc caccgagagc ctggacagta tgaaggctct gacagccgcc     900

atcgaagctg caaacgccca gatccatggc cctgccagtc aacacatggg caataacact     960

gccaccgtca ccaccacgac taccatagcc accgtcacca cggaggacag gaagaaggac    1020

cactttaaga aaaatcgatg cctgtctatc gggatacagg tggatgatgc tgaagaacct    1080

gacaaaacag gggagaataa agcacccagt aagttccagt ccgtgggagt gcaagtagaa    1140

gaagagaagt gcttccgcag gttcactcga tccaacagtg tgacgacagc agtacaggcc    1200

gacctggact tccatgataa tctggaaaat tctctggaat ctatagagga caattcgtgt    1260

cctggcccca tggccagaca gttctcccgc gatgccagca cctccacagt cagcattcag    1320

ggctcaggaa accattacca tgcctgtgcc gccgatgatg actttgacac ggattttgac    1380

ccctctattc tgcctcctcc ggacccctgg attgactcta tcactgaaga ccctctggag    1440

gccgtgcaaa ggtcagtgtg ccaccgggat ggccactggt tcctgaagct tctccaggca    1500

gagcgagacc gcatggaggg gtggtgtcaa cagatggagc gggaagaacg ggaaaacaac    1560

ctgcccgaag acattctagg aaaaatccga accgcagtgg gcagtgccca acttctcatg    1620

gcccagaaat tctaccagtt cagagaactg tgtgaagaaa acctgaatcc taatgctcat    1680

ccaagaccca cctcccagga tttggcgggg ttttgggaca tgctgcagtt gtccatagaa    1740

aatattagta tgaaatttga tgaacttcat cagttaaagg ccaataattg gaaacagatg    1800

gatcctcttg acaagaagga gagaagggcc cctcctccag tgccaaagaa gccggcgaag    1860

ggccccgcgc cgctgatccg ggagcgctcg ctggagagct cgcagcgcca ggaggcccgc    1920

aagcgcctga tggccgccaa gcgcgccgcg tccgtccgcc agaactcggc caccgagagc    1980

gccgagagca tcgagatcta catccccgag gcgcagaccc ggctctga                 2028


<210> 120
<211> 675
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..675
<223> /mol_type="protein"
      /note="DLGAP1 (full-length protein)"
      /organism="Homo sapiens"

<400> 120
Met Asn Leu Ile Phe His Lys Asp Ile Leu Phe Gly Ile Pro Ala Asn 
1               5                   10                   15    
Lys Val Pro Gln Asp Glu Trp Thr Gly Tyr Thr Pro Arg Gly Lys Asp 
            20                   25                  30        
Asp Glu Ile Pro Cys Arg Arg Met Arg Ser Gly Ser Tyr Ile Lys Ala 
        35                   40                  45            
Met Gly Asp Glu Asp Ser Gly Asp Ser Asp Thr Ser Pro Lys Pro Ser 
    50                   55                  60                
Pro Lys Val Ala Ala Arg Arg Glu Ser Tyr Leu Lys Ala Thr Gln Pro 
65                   70                  75                  80
Ser Leu Thr Glu Leu Thr Thr Leu Lys Ile Ser Asn Glu His Ser Pro 
                85                   90                  95    
Lys Leu Gln Ile Arg Ser His Ser Tyr Leu Arg Ala Val Ser Glu Val 
            100                  105                110        
Ser Ile Asn Arg Ser Leu Asp Ser Leu Asp Pro Ala Gly Leu Leu Thr 
        115                  120                125            
Ser Pro Lys Phe Arg Ser Arg Asn Glu Ser Tyr Met Arg Ala Met Ser 
    130                  135                140                
Thr Ile Ser Gln Val Ser Glu Met Glu Val Asn Gly Gln Phe Glu Ser 
145                  150                155                  160
Val Cys Glu Ser Val Phe Ser Glu Leu Glu Ser Gln Ala Val Glu Ala 
                165                  170                175    
Leu Asp Leu Pro Met Pro Gly Cys Phe Arg Met Arg Ser His Ser Tyr 
            180                  185                190        
Val Arg Ala Ile Glu Lys Gly Cys Ser Gln Asp Asp Glu Cys Val Ser 
        195                  200                205            
Leu Arg Ser Ser Ser Pro Pro Arg Thr Thr Thr Thr Val Arg Thr Ile 
    210                  215                220                
Gln Ser Ser Thr Val Ser Ser Cys Ile Thr Thr Tyr Lys Lys Thr Pro 
225                  230                235                  240
Pro Pro Val Pro Pro Arg Thr Thr Thr Lys Pro Phe Ile Ser Ile Thr 
                245                  250                255    
Ala Gln Ser Ser Thr Glu Ser Ala Gln Asp Ala Tyr Met Asp Gly Gln 
            260                  265                270        
Gly Gln Arg Gly Asp Ile Ile Ser Gln Ser Gly Leu Ser Asn Ser Thr 
        275                  280                285            
Glu Ser Leu Asp Ser Met Lys Ala Leu Thr Ala Ala Ile Glu Ala Ala 
    290                  295                300                
Asn Ala Gln Ile His Gly Pro Ala Ser Gln His Met Gly Asn Asn Thr 
305                  310                315                  320
Ala Thr Val Thr Thr Thr Thr Thr Ile Ala Thr Val Thr Thr Glu Asp 
                325                  330                335    
Arg Lys Lys Asp His Phe Lys Lys Asn Arg Cys Leu Ser Ile Gly Ile 
            340                  345                350        
Gln Val Asp Asp Ala Glu Glu Pro Asp Lys Thr Gly Glu Asn Lys Ala 
        355                  360                365            
Pro Ser Lys Phe Gln Ser Val Gly Val Gln Val Glu Glu Glu Lys Cys 
    370                  375                380                
Phe Arg Arg Phe Thr Arg Ser Asn Ser Val Thr Thr Ala Val Gln Ala 
385                  390                395                  400
Asp Leu Asp Phe His Asp Asn Leu Glu Asn Ser Leu Glu Ser Ile Glu 
                405                  410                415    
Asp Asn Ser Cys Pro Gly Pro Met Ala Arg Gln Phe Ser Arg Asp Ala 
            420                  425                430        
Ser Thr Ser Thr Val Ser Ile Gln Gly Ser Gly Asn His Tyr His Ala 
        435                  440                445            
Cys Ala Ala Asp Asp Asp Phe Asp Thr Asp Phe Asp Pro Ser Ile Leu 
    450                  455                460                
Pro Pro Pro Asp Pro Trp Ile Asp Ser Ile Thr Glu Asp Pro Leu Glu 
465                  470                475                  480
Ala Val Gln Arg Ser Val Cys His Arg Asp Gly His Trp Phe Leu Lys 
                485                  490                495    
Leu Leu Gln Ala Glu Arg Asp Arg Met Glu Gly Trp Cys Gln Gln Met 
            500                  505                510        
Glu Arg Glu Glu Arg Glu Asn Asn Leu Pro Glu Asp Ile Leu Gly Lys 
        515                  520                525            
Ile Arg Thr Ala Val Gly Ser Ala Gln Leu Leu Met Ala Gln Lys Phe 
    530                  535                540                
Tyr Gln Phe Arg Glu Leu Cys Glu Glu Asn Leu Asn Pro Asn Ala His 
545                  550                555                  560
Pro Arg Pro Thr Ser Gln Asp Leu Ala Gly Phe Trp Asp Met Leu Gln 
                565                  570                575    
Leu Ser Ile Glu Asn Ile Ser Met Lys Phe Asp Glu Leu His Gln Leu 
            580                  585                590        
Lys Ala Asn Asn Trp Lys Gln Met Asp Pro Leu Asp Lys Lys Glu Arg 
        595                  600                605            
Arg Ala Pro Pro Pro Val Pro Lys Lys Pro Ala Lys Gly Pro Ala Pro 
    610                  615                620                
Leu Ile Arg Glu Arg Ser Leu Glu Ser Ser Gln Arg Gln Glu Ala Arg 
625                  630                635                  640
Lys Arg Leu Met Ala Ala Lys Arg Ala Ala Ser Val Arg Gln Asn Ser 
                645                  650                655    
Ala Thr Glu Ser Ala Glu Ser Ile Glu Ile Tyr Ile Pro Glu Ala Gln 
            660                  665                670        
Thr Arg Leu 
        675

<210> 121
<211> 1977
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1977
<223> /mol_type="DNA"
      /note="DLGAP1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 121
gttccacaag atgaatggac agggtacacc ccacgaggta aagatgatga aattccatgc      60

cgaagaatgc ggagtggcag ttatatcaag gccatggggg atgaagacag tggagactca     120

gacacgagtc ctaagccttc tccaaaagtt gctgcgcgga gagaaagcta tctcaaggct     180

actcagccat cccttacaga actcaccaca ctcaaaatct ccaatgaaca ctcacccaaa     240

ctccagatcc ggagtcatag ttacctgagg gcagtgagtg aagtctccat caaccggagc     300

ctggacagcc tggaccctgc aggcttgctc acatcaccaa agttccgctc caggaatgag     360

agctacatgc gagccatgag caccatcagc caggtgagcg agatggaagt gaacgggcag     420

ttcgagtccg tgtgcgagtc cgtgttcagc gagctggagt cgcaggccgt ggaagcgctg     480

gacctgccca tgcccggctg cttccgcatg cggagccaca gctatgtgcg ggccattgag     540

aaaggctgct cccaggacga cgagtgcgtg tccctgaggt cgtcctcgcc gccgcgcacc     600

accaccaccg ttaggaccat ccagagcagc acggtgtcat cttgcattac aacatataag     660

aagacaccac ctccagtccc acccagaact accacgaaac ctttcatttc tatcacagcc     720

cagagtagca cagagtcagc ccaggatgcc tacatggacg gacagggcca gcgaggagat     780

attatcagcc agtctggact cagcaactcc accgagagcc tggacagtat gaaggctctg     840

acagccgcca tcgaagctgc aaacgcccag atccatggcc ctgccagtca acacatgggc     900

aataacactg ccaccgtcac caccacgact accatagcca ccgtcaccac ggaggacagg     960

aagaaggacc actttaagaa aaatcgatgc ctgtctatcg ggatacaggt ggatgatgct    1020

gaagaacctg acaaaacagg ggagaataaa gcacccagta agttccagtc cgtgggagtg    1080

caagtagaag aagagaagtg cttccgcagg ttcactcgat ccaacagtgt gacgacagca    1140

gtacaggccg acctggactt ccatgataat ctggaaaatt ctctggaatc tatagaggac    1200

aattcgtgtc ctggccccat ggccagacag ttctcccgcg atgccagcac ctccacagtc    1260

agcattcagg gctcaggaaa ccattaccat gcctgtgccg ccgatgatga ctttgacacg    1320

gattttgacc cctctattct gcctcctccg gacccctgga ttgactctat cactgaagac    1380

cctctggagg ccgtgcaaag gtcagtgtgc caccgggatg gccactggtt cctgaagctt    1440

ctccaggcag agcgagaccg catggagggg tggtgtcaac agatggagcg ggaagaacgg    1500

gaaaacaacc tgcccgaaga cattctagga aaaatccgaa ccgcagtggg cagtgcccaa    1560

cttctcatgg cccagaaatt ctaccagttc agagaactgt gtgaagaaaa cctgaatcct    1620

aatgctcatc caagacccac ctcccaggat ttggcggggt tttgggacat gctgcagttg    1680

tccatagaaa atattagtat gaaatttgat gaacttcatc agttaaaggc caataattgg    1740

aaacagatgg atcctcttga caagaaggag agaagggccc ctcctccagt gccaaagaag    1800

ccggcgaagg gccccgcgcc gctgatccgg gagcgctcgc tggagagctc gcagcgccag    1860

gaggcccgca agcgcctgat ggccgccaag cgcgccgcgt ccgtccgcca gaactcggcc    1920

accgagagcg ccgagagcat cgagatctac atccccgagg cgcagacccg gctctga       1977


<210> 122
<211> 5803
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..5803
<223> /mol_type="DNA"
      /note="GRLF1-DLGAP1 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 122
atgatgatgg caagaaagca agatgtccga attcccacct acaacatcag tgtggtggga       60

ttatctggga ccgagaagga aaagggccag tgtgggattg gaaagtcttg tttgtgcaac      120

cgcttcgtgc gcccgagtgc tgacgagttt cacttggacc atacctccgt cctcagcacc      180

agtgactttg gagggcgagt ggtcaataat gaccactttc tctactgggg agaagttagc      240

cgctccctgg aggattgtgt ggaatgtaag atgcacattg tggagcagac tgaatttatt      300

gatgatcaga cttttcaacc tcatcgaagc acggccctgc agccctatat caagagagct      360

gctgcgacca agcttgcatc agctgaaaaa ctcatgtact tttgcactga ccagctgggg      420

ctggagcagg actttgagca gaaacaaatg ccagacggaa agctgctggt tgatggtttt      480

cttcttggta ttgatgttag caggggcatg aataggaact ttgatgacca gctcaagttt      540

gtctccaatc tctacaatca gcttgcaaaa acaaaaaagc ccatagtggt ggtcctgact      600

aagtgtgacg aaggtgttga gcggtacatt agagatgcac atacttttgc cttaagcaaa      660

aagaacctcc aggttgtgga gacctcagcg agatccaatg taaacgtgga cttggctttc      720

agcaccttag tgcaactcat tgataaaagt cggggaaaga caaaaatcat tccttatttt      780

gaagctctca agcagcagag tcagcagata gctacagcaa aagacaagta tgagtggctg      840

gtgagtcgca ttgtgaaaaa ccacaatgag aactggctga gtgtcagccg aaagatgcag      900

gcctctccag aataccagga ctatgtctac ctggaaggga ctcagaaagc caagaagctg      960

tttctacagc acatccaccg cctcaagcat gagcatatcg agcgtaggag aaagctgtac     1020

ctggcagccc tgccattagc ttttgaagct cttataccta atctagatga aatagaccac     1080

ctaagctgca taaaagccaa aaagctctta gaaaccaagc cagaattctt gaagtggttt     1140

gttgtgcttg aagagacccc atgggatgcc accagtcaca ttgacaacat ggaaaacgaa     1200

cggattccct ttgatttaat ggataccgtc cctgcagagc agctatacga ggcccactta     1260

gagaagctga ggaacgaaag gaaaagagtt gagatgcgaa gggcgtttaa agaaaacctg     1320

gagacttctc ctttcataac tcccggaaag ccttgggaag aggcccgtag ttttattatg     1380

aatgaggatt tctaccagtg gctggaggaa tctgtataca tggatattta tggcaaacac     1440

caaaagcaaa ttatagataa agcaaaggaa gaatttcagg agttgctttt ggaatattca     1500

gaattgtttt atgaactgga gctggatgct aagcccagca aggagaagat gggtgttatt     1560

caggatgttc tgggagagga acagcgattt aaagcattac aaaagctcca agcagagcgt     1620

gatgccctta ttctgaaaca cattcatttt gtgtaccacc caacaaagga gacatgcccc     1680

agctgcccag cttgtgtgga cgctaagatt gagcacttga ttagttctcg gtttatccgg     1740

ccgtctgacc ggaatcagaa aaattcactc tctgacccta acattgatag aatcaacttg     1800

gttatattgg gcaaagacgg ccttgcccga gagttggcca atgagattcg agctctttgt     1860

acaaatgatg acaagtatgt gatagatggt aaaatgtatg agctttccct gaggccaata     1920

gaggggaatg tcaggcttcc tgtgaactct ttccagacgc caacatttca gccccacggc     1980

tgtctctgcc tttacaattc aaaggaatcg ctatcctatg tagtggaaag tatagagaag     2040

agtagagagt ccacgctggg ccggcgggat aatcatttag tccatctccc ccttacatta     2100

attttggtta acaagagagg agacaccagt ggagagactc tgcatagctt aatacagcaa     2160

ggtcaacaaa ttgctagcaa acttcagtgt gtctttctcg accctgcttc tgctggcatt     2220

ggttacggac gcaacattaa tgaaaagcaa atcagtcaag ttttgaaggg actcctggac     2280

tctaagcgta acttaaacct ggtcagttct actgctagca tcaaagattt ggctgatgtt     2340

gatctgcgaa ttgttatgtg tctgatgtgt ggagatcctt ttagtgcaga tgacatactt     2400

tttcctgtcc ttcagtccca aacctgtaaa tcttcccatt gtggaagcaa caactctgtt     2460

ttacttgaac taccaatcgg actgcacaag aagcggattg aactgtctgt tctttcatac     2520

cattcctcct ttagcatcag aaagagccgg ttggttcatg ggtacattgt tttttattca     2580

gccaaacgta aggcctcttt ggctatgtta cgtgcctttc tttgtgaagt gcaggatatt     2640

atccctattc agcttgtagc actcactgat ggcgctgtag atgtcctgga caatgactta     2700

agtagggaac agctaactga gggggaggag attgctcaag aaattgacgg aaggttcaca     2760

agcatcccct gtagccaacc ccagcataaa cttgagatct ttcacccatt ttttaaagat     2820

gtggtggaaa aaaagaacat aatcgaggct actcatatgt acgataatgc tgccgaggcc     2880

tgtagcacca ccgaagaggt gtttaactcc ccccgggcag gatcaccgct ctgcaactca     2940

aacctgcagg attcagaaga agatatcgag ccatcttaca gcctgtttcg agaagacaca     3000

tcactgcctt ctctgtccaa agaccattct aagctctcta tggaactgga gggaaatgat     3060

gggctgtctt tcattatgag caattttgag agtaaactga acaacaaagt acctccgcca     3120

gtcaaaccaa agcctcctgt ccattttgaa attacaaagg gggatctatc ttatttagac     3180

caaggccata gggatggaca gaggaagtct gtgtcttcta gcccctggct gcctcaggat     3240

gggtttgatc cttctgacta tgctgaaccc atggatgctg tggtgaagcc aaggaatgaa     3300

gaagaaaaca tatactccgt gccccatgac agcacccaag gcaaaatcat caccattcgg     3360

aatatcaaca aagcccagtc caacggcagc gggaatggtt ctgacagtga aatggacacc     3420

agctctctag agcgagggcg caaggtttcc atcgtgagca agccagtgct gtacaggacg     3480

agatgcaccc ggctggggcg gtttgctagt taccggacca gcttcagcgt ggggagtgat     3540

gatgagctgg ggcccatccg gaagaaagag gaggatcagg catcccaggg ttataaaggg     3600

gacaatgctg tcattccata cgaaacagac gaagacccgc ggaggaggaa tattcttcgc     3660

agcctaagga ggaacactaa gaaaccaaag cccaaacccc ggccatccat cacaaaggca     3720

acctgggaga gtaactattt tggggtgccc ttaacaactg tcgtgactcc agagaagccg     3780

atccccattt ttattgaaag atgtattgag tacattgaag ccacaggttc cacaagatga     3840

atggacaggg tacaccccac gaggtaaaga tgatgaaatt ccatgccgaa gaatgcggag     3900

tggcagttat atcaaggcca tgggggatga agacagtgga gactcagaca cgagtcctaa     3960

gccttctcca aaagttgctg cgcggagaga aagctatctc aaggctactc agccatccct     4020

tacagaactc accacactca aaatctccaa tgaacactca cccaaactcc agatccggag     4080

tcatagttac ctgagggcag tgagtgaagt ctccatcaac cggagcctgg acagcctgga     4140

ccctgcaggc ttgctcacat caccaaagtt ccgctccagg aatgagagct acatgcgagc     4200

catgagcacc atcagccagg tgagcgagat ggaagtgaac gggcagttcg agtccgtgtg     4260

cgagtccgtg ttcagcgagc tggagtcgca ggccgtggaa gcgctggacc tgcccatgcc     4320

cggctgcttc cgcatgcgga gccacagcta tgtgcgggcc attgagaaag gctgctccca     4380

ggacgacgag tgcgtgtccc tgaggtcgtc ctcgccgccg cgcaccacca ccaccgttag     4440

gaccatccag agcagcacgg tgtcatcttg cattacaaca tataagaaga caccacctcc     4500

agtcccaccc agaactacca cgaaaccttt catttctatc acagcccaga gtagcacaga     4560

gtcagcccag gatgcctaca tggacggaca gggccagcga ggagatatta tcagccagtc     4620

tggactcagc aactccaccg agagcctgga cagtatgaag gctctgacag ccgccatcga     4680

agctgcaaac gcccagatcc atggccctgc cagtcaacac atgggcaata acactgccac     4740

cgtcaccacc acgactacca tagccaccgt caccacggag gacaggaaga aggaccactt     4800

taagaaaaat cgatgcctgt ctatcgggat acaggtggat gatgctgaag aacctgacaa     4860

aacaggggag aataaagcac ccagtaagtt ccagtccgtg ggagtgcaag tagaagaaga     4920

gaagtgcttc cgcaggttca ctcgatccaa cagtgtgacg acagcagtac aggccgacct     4980

ggacttccat gataatctgg aaaattctct ggaatctata gaggacaatt cgtgtcctgg     5040

ccccatggcc agacagttct cccgcgatgc cagcacctcc acagtcagca ttcagggctc     5100

aggaaaccat taccatgcct gtgccgccga tgatgacttt gacacggatt ttgacccctc     5160

tattctgcct cctccggacc cctggattga ctctatcact gaagaccctc tggaggccgt     5220

gcaaaggtca gtgtgccacc gggatggcca ctggttcctg aagcttctcc aggcagagcg     5280

agaccgcatg gaggggtggt gtcaacagat ggagcgggaa gaacgggaaa acaacctgcc     5340

cgaagacatt ctaggaaaaa tccgaaccgc agtgggcagt gcccaacttc tcatggccca     5400

gaaattctac cagttcagag aactgtgtga agaaaacctg aatcctaatg ctcatccaag     5460

acccacctcc caggatttgg cggggttttg ggacatgctg cagttgtcca tagaaaatat     5520

tagtatgaaa tttgatgaac ttcatcagtt aaaggccaat aattggaaac agatggatcc     5580

tcttgacaag aaggagagaa gggcccctcc tccagtgcca aagaagccgg cgaagggccc     5640

cgcgccgctg atccgggagc gctcgctgga gagctcgcag cgccaggagg cccgcaagcg     5700

cctgatggcc gccaagcgcg ccgcgtccgt ccgccagaac tcggccaccg agagcgccga     5760

gagcatcgag atctacatcc ccgaggcgca gacccggctc tga                       5803


<210> 123
<211> 1279
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..1279
<223> /mol_type="protein"
      /note="GRLF1-DLGAP1 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 123
Met Met Met Ala Arg Lys Gln Asp Val Arg Ile Pro Thr Tyr Asn Ile 
1               5                   10                   15    
Ser Val Val Gly Leu Ser Gly Thr Glu Lys Glu Lys Gly Gln Cys Gly 
            20                   25                  30        
Ile Gly Lys Ser Cys Leu Cys Asn Arg Phe Val Arg Pro Ser Ala Asp 
        35                   40                  45            
Glu Phe His Leu Asp His Thr Ser Val Leu Ser Thr Ser Asp Phe Gly 
    50                   55                  60                
Gly Arg Val Val Asn Asn Asp His Phe Leu Tyr Trp Gly Glu Val Ser 
65                   70                  75                  80
Arg Ser Leu Glu Asp Cys Val Glu Cys Lys Met His Ile Val Glu Gln 
                85                   90                  95    
Thr Glu Phe Ile Asp Asp Gln Thr Phe Gln Pro His Arg Ser Thr Ala 
            100                  105                110        
Leu Gln Pro Tyr Ile Lys Arg Ala Ala Ala Thr Lys Leu Ala Ser Ala 
        115                  120                125            
Glu Lys Leu Met Tyr Phe Cys Thr Asp Gln Leu Gly Leu Glu Gln Asp 
    130                  135                140                
Phe Glu Gln Lys Gln Met Pro Asp Gly Lys Leu Leu Val Asp Gly Phe 
145                  150                155                  160
Leu Leu Gly Ile Asp Val Ser Arg Gly Met Asn Arg Asn Phe Asp Asp 
                165                  170                175    
Gln Leu Lys Phe Val Ser Asn Leu Tyr Asn Gln Leu Ala Lys Thr Lys 
            180                  185                190        
Lys Pro Ile Val Val Val Leu Thr Lys Cys Asp Glu Gly Val Glu Arg 
        195                  200                205            
Tyr Ile Arg Asp Ala His Thr Phe Ala Leu Ser Lys Lys Asn Leu Gln 
    210                  215                220                
Val Val Glu Thr Ser Ala Arg Ser Asn Val Asn Val Asp Leu Ala Phe 
225                  230                235                  240
Ser Thr Leu Val Gln Leu Ile Asp Lys Ser Arg Gly Lys Thr Lys Ile 
                245                  250                255    
Ile Pro Tyr Phe Glu Ala Leu Lys Gln Gln Ser Gln Gln Ile Ala Thr 
            260                  265                270        
Ala Lys Asp Lys Tyr Glu Trp Leu Val Ser Arg Ile Val Lys Asn His 
        275                  280                285            
Asn Glu Asn Trp Leu Ser Val Ser Arg Lys Met Gln Ala Ser Pro Glu 
    290                  295                300                
Tyr Gln Asp Tyr Val Tyr Leu Glu Gly Thr Gln Lys Ala Lys Lys Leu 
305                  310                315                  320
Phe Leu Gln His Ile His Arg Leu Lys His Glu His Ile Glu Arg Arg 
                325                  330                335    
Arg Lys Leu Tyr Leu Ala Ala Leu Pro Leu Ala Phe Glu Ala Leu Ile 
            340                  345                350        
Pro Asn Leu Asp Glu Ile Asp His Leu Ser Cys Ile Lys Ala Lys Lys 
        355                  360                365            
Leu Leu Glu Thr Lys Pro Glu Phe Leu Lys Trp Phe Val Val Leu Glu 
    370                  375                380                
Glu Thr Pro Trp Asp Ala Thr Ser His Ile Asp Asn Met Glu Asn Glu 
385                  390                395                  400
Arg Ile Pro Phe Asp Leu Met Asp Thr Val Pro Ala Glu Gln Leu Tyr 
                405                  410                415    
Glu Ala His Leu Glu Lys Leu Arg Asn Glu Arg Lys Arg Val Glu Met 
            420                  425                430        
Arg Arg Ala Phe Lys Glu Asn Leu Glu Thr Ser Pro Phe Ile Thr Pro 
        435                  440                445            
Gly Lys Pro Trp Glu Glu Ala Arg Ser Phe Ile Met Asn Glu Asp Phe 
    450                  455                460                
Tyr Gln Trp Leu Glu Glu Ser Val Tyr Met Asp Ile Tyr Gly Lys His 
465                  470                475                  480
Gln Lys Gln Ile Ile Asp Lys Ala Lys Glu Glu Phe Gln Glu Leu Leu 
                485                  490                495    
Leu Glu Tyr Ser Glu Leu Phe Tyr Glu Leu Glu Leu Asp Ala Lys Pro 
            500                  505                510        
Ser Lys Glu Lys Met Gly Val Ile Gln Asp Val Leu Gly Glu Glu Gln 
        515                  520                525            
Arg Phe Lys Ala Leu Gln Lys Leu Gln Ala Glu Arg Asp Ala Leu Ile 
    530                  535                540                
Leu Lys His Ile His Phe Val Tyr His Pro Thr Lys Glu Thr Cys Pro 
545                  550                555                  560
Ser Cys Pro Ala Cys Val Asp Ala Lys Ile Glu His Leu Ile Ser Ser 
                565                  570                575    
Arg Phe Ile Arg Pro Ser Asp Arg Asn Gln Lys Asn Ser Leu Ser Asp 
            580                  585                590        
Pro Asn Ile Asp Arg Ile Asn Leu Val Ile Leu Gly Lys Asp Gly Leu 
        595                  600                605            
Ala Arg Glu Leu Ala Asn Glu Ile Arg Ala Leu Cys Thr Asn Asp Asp 
    610                  615                620                
Lys Tyr Val Ile Asp Gly Lys Met Tyr Glu Leu Ser Leu Arg Pro Ile 
625                  630                635                  640
Glu Gly Asn Val Arg Leu Pro Val Asn Ser Phe Gln Thr Pro Thr Phe 
                645                  650                655    
Gln Pro His Gly Cys Leu Cys Leu Tyr Asn Ser Lys Glu Ser Leu Ser 
            660                  665                670        
Tyr Val Val Glu Ser Ile Glu Lys Ser Arg Glu Ser Thr Leu Gly Arg 
        675                  680                685            
Arg Asp Asn His Leu Val His Leu Pro Leu Thr Leu Ile Leu Val Asn 
    690                  695                700                
Lys Arg Gly Asp Thr Ser Gly Glu Thr Leu His Ser Leu Ile Gln Gln 
705                  710                715                  720
Gly Gln Gln Ile Ala Ser Lys Leu Gln Cys Val Phe Leu Asp Pro Ala 
                725                  730                735    
Ser Ala Gly Ile Gly Tyr Gly Arg Asn Ile Asn Glu Lys Gln Ile Ser 
            740                  745                750        
Gln Val Leu Lys Gly Leu Leu Asp Ser Lys Arg Asn Leu Asn Leu Val 
        755                  760                765            
Ser Ser Thr Ala Ser Ile Lys Asp Leu Ala Asp Val Asp Leu Arg Ile 
    770                  775                780                
Val Met Cys Leu Met Cys Gly Asp Pro Phe Ser Ala Asp Asp Ile Leu 
785                  790                795                  800
Phe Pro Val Leu Gln Ser Gln Thr Cys Lys Ser Ser His Cys Gly Ser 
                805                  810                815    
Asn Asn Ser Val Leu Leu Glu Leu Pro Ile Gly Leu His Lys Lys Arg 
            820                  825                830        
Ile Glu Leu Ser Val Leu Ser Tyr His Ser Ser Phe Ser Ile Arg Lys 
        835                  840                845            
Ser Arg Leu Val His Gly Tyr Ile Val Phe Tyr Ser Ala Lys Arg Lys 
    850                  855                860                
Ala Ser Leu Ala Met Leu Arg Ala Phe Leu Cys Glu Val Gln Asp Ile 
865                  870                875                  880
Ile Pro Ile Gln Leu Val Ala Leu Thr Asp Gly Ala Val Asp Val Leu 
                885                  890                895    
Asp Asn Asp Leu Ser Arg Glu Gln Leu Thr Glu Gly Glu Glu Ile Ala 
            900                  905                910        
Gln Glu Ile Asp Gly Arg Phe Thr Ser Ile Pro Cys Ser Gln Pro Gln 
        915                  920                925            
His Lys Leu Glu Ile Phe His Pro Phe Phe Lys Asp Val Val Glu Lys 
    930                  935                940                
Lys Asn Ile Ile Glu Ala Thr His Met Tyr Asp Asn Ala Ala Glu Ala 
945                  950                955                  960
Cys Ser Thr Thr Glu Glu Val Phe Asn Ser Pro Arg Ala Gly Ser Pro 
                965                  970                975    
Leu Cys Asn Ser Asn Leu Gln Asp Ser Glu Glu Asp Ile Glu Pro Ser 
            980                  985                990        
Tyr Ser Leu Phe Arg Glu Asp Thr Ser Leu Pro Ser Leu Ser Lys Asp 
        995                  1000                1005            
His Ser Lys Leu Ser Met Glu Leu Glu Gly Asn Asp Gly Leu Ser Phe 
    1010                1015                1020                
Ile Met Ser Asn Phe Glu Ser Lys Leu Asn Asn Lys Val Pro Pro Pro 
1025                1030                1035                1040
Val Lys Pro Lys Pro Pro Val His Phe Glu Ile Thr Lys Gly Asp Leu 
                1045                1050                1055    
Ser Tyr Leu Asp Gln Gly His Arg Asp Gly Gln Arg Lys Ser Val Ser 
            1060                1065                1070        
Ser Ser Pro Trp Leu Pro Gln Asp Gly Phe Asp Pro Ser Asp Tyr Ala 
        1075                1080                1085            
Glu Pro Met Asp Ala Val Val Lys Pro Arg Asn Glu Glu Glu Asn Ile 
    1090                1095                1100                
Tyr Ser Val Pro His Asp Ser Thr Gln Gly Lys Ile Ile Thr Ile Arg 
1105                1110                1115                1120
Asn Ile Asn Lys Ala Gln Ser Asn Gly Ser Gly Asn Gly Ser Asp Ser 
                1125                1130                1135    
Glu Met Asp Thr Ser Ser Leu Glu Arg Gly Arg Lys Val Ser Ile Val 
            1140                1145                1150        
Ser Lys Pro Val Leu Tyr Arg Thr Arg Cys Thr Arg Leu Gly Arg Phe 
        1155                1160                1165            
Ala Ser Tyr Arg Thr Ser Phe Ser Val Gly Ser Asp Asp Glu Leu Gly 
    1170                1175                1180                
Pro Ile Arg Lys Lys Glu Glu Asp Gln Ala Ser Gln Gly Tyr Lys Gly 
1185                1190                1195                1200
Asp Asn Ala Val Ile Pro Tyr Glu Thr Asp Glu Asp Pro Arg Arg Arg 
                1205                1210                1215    
Asn Ile Leu Arg Ser Leu Arg Arg Asn Thr Lys Lys Pro Lys Pro Lys 
            1220                1225                1230        
Pro Arg Pro Ser Ile Thr Lys Ala Thr Trp Glu Ser Asn Tyr Phe Gly 
        1235                1240                1245            
Val Pro Leu Thr Thr Val Val Thr Pro Glu Lys Pro Ile Pro Ile Phe 
    1250                1255                1260                
Ile Glu Arg Cys Ile Glu Tyr Ile Glu Ala Thr Gly Ser Thr Arg 
1265                1270                1275                

<210> 124
<211> 1398
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1398
<223> /mol_type="DNA"
      /note="E2F3 (CCDS nucleotide sequence of E2F3 (Gene ID: 1871))"
      /organism="Homo sapiens"

<400> 124
atgagaaagg gaatccagcc cgctctggag cagtacctgg tgaccgccgg gggtggggag      60

ggggcggctg tcgtcgccgc cgccgctgca gcctccatgg acaaaagggc actgctagcc     120

agccccggct tcgccgccgc cgccgccgct gccgccgccc cgggcgcgta catccagatc     180

ctcaccacga acacttccac cacctcctgt tcctcctccc tccaaagcgg cgccgtagcc     240

gccggccccc tcctccccag tgcccccggc gcggagcaga ccgccggcag cctcctctac     300

accacgccgc acggaccctc cagcagagcc gggctgctgc agcagccacc agcgctggga     360

cgcggcggca gcggcggcgg cggcggccct ccggcaaagc gaaggctgga gctaggagaa     420

agcggtcatc agtacctctc agatggttta aaaaccccca agggcaaagg aagagctgca     480

ctacgaagtc cagatagtcc aaaaactcca aaatctccct cagaaaaaac gcggtatgat     540

acgtctcttg gtctgctcac caagaagttc attcagctcc tgagccagtc acccgatggg     600

gtattggatt tgaacaaggc agcagaagtg ctaaaagtgc aaaagagaag gatttatgat     660

atcaccaacg ttctggaagg catccacctc attaagaaga agtctaaaaa caacgtccaa     720

tggatgggct gcagtctgtc tgaggatggg ggcatgctgg cccagtgtca aggcctgtca     780

aaagaagtga ccgagctcag tcaggaagag aagaaattag atgaactgat ccaaagctgc     840

accctggacc tcaaactgtt aaccgaggat tcagagaatc aaaggttagc ttatgttaca     900

tatcaagata ttcgaaaaat tagtggcctt aaagaccaaa ctgttatagt tgtgaaagcc     960

cctccagaaa caagacttga agtgcctgac tcaatagaga gcctacaaat acatttggca    1020

agtacccaag ggcccattga ggtttactta tgtccagaag agactgaaac acacagtcca    1080

atgaaaacaa acaaccaaga ccacaatggg aatatcccta aacccgcttc caaagacttg    1140

gcttcaacca actcaggaca tagcgattgc tcagtttcta tgggaaacct ttctcctctg    1200

gcctccccag ccaacctctt acagcagact gaggaccaaa ttccttccaa cctagaagga    1260

ccgtttgtga acttactgcc tcccctgctg caagaggact atctcctgag cctcggggag    1320

gaggaaggca tcagcgatct cttcgatgct tacgatttgg aaaagctccc actggtggaa    1380

gacttcatgt gtagttga                                                  1398


<210> 125
<211> 465
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..465
<223> /mol_type="protein"
      /note="E2F3 (full-length protein)"
      /organism="Homo sapiens"

<400> 125
Met Arg Lys Gly Ile Gln Pro Ala Leu Glu Gln Tyr Leu Val Thr Ala 
1               5                   10                   15    
Gly Gly Gly Glu Gly Ala Ala Val Val Ala Ala Ala Ala Ala Ala Ser 
            20                   25                  30        
Met Asp Lys Arg Ala Leu Leu Ala Ser Pro Gly Phe Ala Ala Ala Ala 
        35                   40                  45            
Ala Ala Ala Ala Ala Pro Gly Ala Tyr Ile Gln Ile Leu Thr Thr Asn 
    50                   55                  60                
Thr Ser Thr Thr Ser Cys Ser Ser Ser Leu Gln Ser Gly Ala Val Ala 
65                   70                  75                  80
Ala Gly Pro Leu Leu Pro Ser Ala Pro Gly Ala Glu Gln Thr Ala Gly 
                85                   90                  95    
Ser Leu Leu Tyr Thr Thr Pro His Gly Pro Ser Ser Arg Ala Gly Leu 
            100                  105                110        
Leu Gln Gln Pro Pro Ala Leu Gly Arg Gly Gly Ser Gly Gly Gly Gly 
        115                  120                125            
Gly Pro Pro Ala Lys Arg Arg Leu Glu Leu Gly Glu Ser Gly His Gln 
    130                  135                140                
Tyr Leu Ser Asp Gly Leu Lys Thr Pro Lys Gly Lys Gly Arg Ala Ala 
145                  150                155                  160
Leu Arg Ser Pro Asp Ser Pro Lys Thr Pro Lys Ser Pro Ser Glu Lys 
                165                  170                175    
Thr Arg Tyr Asp Thr Ser Leu Gly Leu Leu Thr Lys Lys Phe Ile Gln 
            180                  185                190        
Leu Leu Ser Gln Ser Pro Asp Gly Val Leu Asp Leu Asn Lys Ala Ala 
        195                  200                205            
Glu Val Leu Lys Val Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    210                  215                220                
Leu Glu Gly Ile His Leu Ile Lys Lys Lys Ser Lys Asn Asn Val Gln 
225                  230                235                  240
Trp Met Gly Cys Ser Leu Ser Glu Asp Gly Gly Met Leu Ala Gln Cys 
                245                  250                255    
Gln Gly Leu Ser Lys Glu Val Thr Glu Leu Ser Gln Glu Glu Lys Lys 
            260                  265                270        
Leu Asp Glu Leu Ile Gln Ser Cys Thr Leu Asp Leu Lys Leu Leu Thr 
        275                  280                285            
Glu Asp Ser Glu Asn Gln Arg Leu Ala Tyr Val Thr Tyr Gln Asp Ile 
    290                  295                300                
Arg Lys Ile Ser Gly Leu Lys Asp Gln Thr Val Ile Val Val Lys Ala 
305                  310                315                  320
Pro Pro Glu Thr Arg Leu Glu Val Pro Asp Ser Ile Glu Ser Leu Gln 
                325                  330                335    
Ile His Leu Ala Ser Thr Gln Gly Pro Ile Glu Val Tyr Leu Cys Pro 
            340                  345                350        
Glu Glu Thr Glu Thr His Ser Pro Met Lys Thr Asn Asn Gln Asp His 
        355                  360                365            
Asn Gly Asn Ile Pro Lys Pro Ala Ser Lys Asp Leu Ala Ser Thr Asn 
    370                  375                380                
Ser Gly His Ser Asp Cys Ser Val Ser Met Gly Asn Leu Ser Pro Leu 
385                  390                395                  400
Ala Ser Pro Ala Asn Leu Leu Gln Gln Thr Glu Asp Gln Ile Pro Ser 
                405                  410                415    
Asn Leu Glu Gly Pro Phe Val Asn Leu Leu Pro Pro Leu Leu Gln Glu 
            420                  425                430        
Asp Tyr Leu Leu Ser Leu Gly Glu Glu Glu Gly Ile Ser Asp Leu Phe 
        435                  440                445            
Asp Ala Tyr Asp Leu Glu Lys Leu Pro Leu Val Glu Asp Phe Met Cys 
    450                  455                460                
Ser 
465

<210> 126
<211> 1135
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1135
<223> /mol_type="DNA"
      /note="E2F3 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 126
atgagaaagg gaatccagcc cgctctggag cagtacctgg tgaccgccgg gggtggggag      60

ggggcggctg tcgtcgccgc cgccgctgca gcctccatgg acaaaagggc actgctagcc     120

agccccggct tcgccgccgc cgccgccgct gccgccgccc cgggcgcgta catccagatc     180

ctcaccacga acacttccac cacctcctgt tcctcctccc tccaaagcgg cgccgtagcc     240

gccggccccc tcctccccag tgcccccggc gcggagcaga ccgccggcag cctcctctac     300

accacgccgc acggaccctc cagcagagcc gggctgctgc agcagccacc agcgctggga     360

cgcggcggca gcggcggcgg cggcggccct ccggcaaagc gaaggctgga gctaggagaa     420

agcggtcatc agtacctctc agatggttta aaaaccccca agggcaaagg aagagctgca     480

ctacgaagtc cagatagtcc aaaaactcca aaatctccct cagaaaaaac gcggtatgat     540

acgtctcttg gtctgctcac caagaagttc attcagctcc tgagccagtc acccgatggg     600

gtattggatt tgaacaaggc agcagaagtg ctaaaagtgc aaaagagaag gatttatgat     660

atcaccaacg ttctggaagg catccacctc attaagaaga agtctaaaaa caacgtccaa     720

tggatgggct gcagtctgtc tgaggatggg ggcatgctgg cccagtgtca aggcctgtca     780

aaagaagtga ccgagctcag tcaggaagag aagaaattag atgaactgat ccaaagctgc     840

accctggacc tcaaactgtt aaccgaggat tcagagaatc aaaggttagc ttatgttaca     900

tatcaagata ttcgaaaaat tagtggcctt aaagaccaaa ctgttatagt tgtgaaagcc     960

cctccagaaa caagacttga agtgcctgac tcaatagaga gcctacaaat acatttggca    1020

agtacccaag ggcccattga ggtttactta tgtccagaag agactgaaac acacagtcca    1080

atgaaaacaa acaaccaaga ccacaatggg aatatcccta aacccgcttc caaag         1135


<210> 127
<211> 378
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..378
<223> /mol_type="protein"
      /note="E2F3 (preferred protein fragment)"
      /organism="artificial sequences"

<400> 127
Met Arg Lys Gly Ile Gln Pro Ala Leu Glu Gln Tyr Leu Val Thr Ala 
1               5                   10                   15    
Gly Gly Gly Glu Gly Ala Ala Val Val Ala Ala Ala Ala Ala Ala Ser 
            20                   25                  30        
Met Asp Lys Arg Ala Leu Leu Ala Ser Pro Gly Phe Ala Ala Ala Ala 
        35                   40                  45            
Ala Ala Ala Ala Ala Pro Gly Ala Tyr Ile Gln Ile Leu Thr Thr Asn 
    50                   55                  60                
Thr Ser Thr Thr Ser Cys Ser Ser Ser Leu Gln Ser Gly Ala Val Ala 
65                   70                  75                  80
Ala Gly Pro Leu Leu Pro Ser Ala Pro Gly Ala Glu Gln Thr Ala Gly 
                85                   90                  95    
Ser Leu Leu Tyr Thr Thr Pro His Gly Pro Ser Ser Arg Ala Gly Leu 
            100                  105                110        
Leu Gln Gln Pro Pro Ala Leu Gly Arg Gly Gly Ser Gly Gly Gly Gly 
        115                  120                125            
Gly Pro Pro Ala Lys Arg Arg Leu Glu Leu Gly Glu Ser Gly His Gln 
    130                  135                140                
Tyr Leu Ser Asp Gly Leu Lys Thr Pro Lys Gly Lys Gly Arg Ala Ala 
145                  150                155                  160
Leu Arg Ser Pro Asp Ser Pro Lys Thr Pro Lys Ser Pro Ser Glu Lys 
                165                  170                175    
Thr Arg Tyr Asp Thr Ser Leu Gly Leu Leu Thr Lys Lys Phe Ile Gln 
            180                  185                190        
Leu Leu Ser Gln Ser Pro Asp Gly Val Leu Asp Leu Asn Lys Ala Ala 
        195                  200                205            
Glu Val Leu Lys Val Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    210                  215                220                
Leu Glu Gly Ile His Leu Ile Lys Lys Lys Ser Lys Asn Asn Val Gln 
225                  230                235                  240
Trp Met Gly Cys Ser Leu Ser Glu Asp Gly Gly Met Leu Ala Gln Cys 
                245                  250                255    
Gln Gly Leu Ser Lys Glu Val Thr Glu Leu Ser Gln Glu Glu Lys Lys 
            260                  265                270        
Leu Asp Glu Leu Ile Gln Ser Cys Thr Leu Asp Leu Lys Leu Leu Thr 
        275                  280                285            
Glu Asp Ser Glu Asn Gln Arg Leu Ala Tyr Val Thr Tyr Gln Asp Ile 
    290                  295                300                
Arg Lys Ile Ser Gly Leu Lys Asp Gln Thr Val Ile Val Val Lys Ala 
305                  310                315                  320
Pro Pro Glu Thr Arg Leu Glu Val Pro Asp Ser Ile Glu Ser Leu Gln 
                325                  330                335    
Ile His Leu Ala Ser Thr Gln Gly Pro Ile Glu Val Tyr Leu Cys Pro 
            340                  345                350        
Glu Glu Thr Glu Thr His Ser Pro Met Lys Thr Asn Asn Gln Asp His 
        355                  360                365            
Asn Gly Asn Ile Pro Lys Pro Ala Ser Lys 
    370                  375            

<210> 128
<211> 1904
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1904
<223> /mol_type="DNA"
      /note="FLJ22536 (long intergenic non-protein coding RNA 340; CCDS
       nucleotide sequence of FLJ22536 (Gene ID: 401237))"
      /organism="Homo sapiens"

<400> 128
gtctgctccg ggacttggaa caaaaggggg aactctgatg aactctcttt cctcccctct      60

cccccggacg ccggggtatc tccctctcgc aactttgccg ccccgacttt ctctgctgtc     120

aggccgggaa aaagtgtccg aacgcctcgt ggactgcagc gggggaaatg tcccttaaaa     180

gtgcgacgaa gtggggaaga aggtgtaatt actattatca gcatctagaa agcatcatga     240

atttgctgga gtacttccta gcactgacct ccttcattct gcgttgttct tactggatct     300

ttccatcagc caacaatatg gaagtaccaa tacaaggtca aatcattcct ggattcatct     360

ggagttgctt aaaagttaaa tcattggaat ttttgatgat accttttcta tatggattac     420

aatttgatcg ctgggaattc tccaccttaa agaagtaccc tcaggtgact acagatgtgt     480

taacacccag catgttccgg taggagactt tctggatggg gaagatttcc aggaattggc     540

aacaagctca tttcactggt gggtttgctg aagcattatc acaagacagt cagaatgact     600

gatgagtgct cttcaggtgt gaatcatggc aatacagtga aagacagtga tttactgctt     660

ttgagggcgt gcatgtatat gattaacgga tggaagtgca ggactccaag atttacttcc     720

ttccctttcc agcagaatta cctgagacga gtaaaatcta ctggtggagt cactccatta     780

ttcttatctg tggagatcta gatcttgatt tgaaagtttc tgagaaaatc ttcagctcag     840

acttgagggt caactttacc agctgaagga tctgcattta ctgctcaacc acatctaatt     900

tgatgtcctc tgcagattta aaatgtgtgc cttctcttcc gtcaccaagt catccctggg     960

ttactactga acatccttct caattccccc cgacccatgg atggctgttc tccattgtct    1020

gtttcaccag atgtcctcaa aacaaacaga cagaagaagg aagtggctaa tggagctgtg    1080

gagtccaagt gtgactgcca agaggaatcc agcaaagcca aaaagcccaa gcatgtagcc    1140

ctgcccgaag cacgccacac gcatggaaaa cccagaggaa atgagtgagg atcaatggga    1200

agaagagagc cagccaggaa gttgaagatt tgtccaggag cagatagctg aagagagaga    1260

gagagaagag agaacggctt acagctcagg tcctctctcc atgcttagga accactacaa    1320

atgctactgc cttgagtctc attttgtttc cctctggaaa ccacatgtgt accttgtttg    1380

caacagtatg ggctcacagg cagaaggaat tttccttgtc ttggatgaga cttttgactt    1440

ggacttttgg gttaagttct ggagaccaga aggccaaaat caaaagtatg ggcaggcttg    1500

atttctttag aagactccag cggagaactg tgtctccttg cttctgattc tacatctcca    1560

tccatgggcc actgtttcag caacctcagc cagtgcaaca caacctcagc caagaagagt    1620

atgcagagaa aggagtcccc tacctgccac aaaactgttg tctgaaaact gtctcatatt    1680

gtctcaagtt gtcattcatt gtgaattaga cctgtttaac atgtaatctg caacatgctt    1740

cactgtctaa ttttccagag cccctcatat aaggaactgt attattggta taatcatcat    1800

ggtgaagaag ttggtatgtg ggggagagat gacagaaaca gagagtaagt cagagctggc    1860

tgcctgacag ataaaaagga aatgaccaaa aaaaaaaaaa aaaa                     1904


<210> 129
<211> 448
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..448
<223> /mol_type="DNA"
      /note="FLJ22536 (preferred fragment)"
      /organism="artificial sequences"

<400> 129
ttctggagac cagaaggcca aaatcaaaag tatgggcagg cttgatttct ttagaagact      60

ccagcggaga actgtgtctc cttgcttctg attctacatc tccatccatg ggccactgtt     120

tcagcaacct cagccagtgc aacacaacct cagccaagaa gagtatgcag agaaaggagt     180

cccctacctg ccacaaaact gttgtctgaa aactgtctca tattgtctca agttgtcatt     240

cattgtgaat tagacctgtt taacatgtaa tctgcaacat gcttcactgt ctaattttcc     300

agagcccctc atataaggaa ctgtattatt ggtataatca tcatggtgaa gaagttggta     360

tgtgggggag agatgacaga aacagagagt aagtcagagc tggctgcctg acagataaaa     420

aggaaatgac caaaaaaaaa aaaaaaaa                                        448


<210> 130
<211> 1583
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1583
<223> /mol_type="DNA"
      /note="E2F3-FLJ22536 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 130
atgagaaagg gaatccagcc cgctctggag cagtacctgg tgaccgccgg gggtggggag      60

ggggcggctg tcgtcgccgc cgccgctgca gcctccatgg acaaaagggc actgctagcc     120

agccccggct tcgccgccgc cgccgccgct gccgccgccc cgggcgcgta catccagatc     180

ctcaccacga acacttccac cacctcctgt tcctcctccc tccaaagcgg cgccgtagcc     240

gccggccccc tcctccccag tgcccccggc gcggagcaga ccgccggcag cctcctctac     300

accacgccgc acggaccctc cagcagagcc gggctgctgc agcagccacc agcgctggga     360

cgcggcggca gcggcggcgg cggcggccct ccggcaaagc gaaggctgga gctaggagaa     420

agcggtcatc agtacctctc agatggttta aaaaccccca agggcaaagg aagagctgca     480

ctacgaagtc cagatagtcc aaaaactcca aaatctccct cagaaaaaac gcggtatgat     540

acgtctcttg gtctgctcac caagaagttc attcagctcc tgagccagtc acccgatggg     600

gtattggatt tgaacaaggc agcagaagtg ctaaaagtgc aaaagagaag gatttatgat     660

atcaccaacg ttctggaagg catccacctc attaagaaga agtctaaaaa caacgtccaa     720

tggatgggct gcagtctgtc tgaggatggg ggcatgctgg cccagtgtca aggcctgtca     780

aaagaagtga ccgagctcag tcaggaagag aagaaattag atgaactgat ccaaagctgc     840

accctggacc tcaaactgtt aaccgaggat tcagagaatc aaaggttagc ttatgttaca     900

tatcaagata ttcgaaaaat tagtggcctt aaagaccaaa ctgttatagt tgtgaaagcc     960

cctccagaaa caagacttga agtgcctgac tcaatagaga gcctacaaat acatttggca    1020

agtacccaag ggcccattga ggtttactta tgtccagaag agactgaaac acacagtcca    1080

atgaaaacaa acaaccaaga ccacaatggg aatatcccta aacccgcttc caaagttctg    1140

gagaccagaa ggccaaaatc aaaagtatgg gcaggcttga tttctttaga agactccagc    1200

ggagaactgt gtctccttgc ttctgattct acatctccat ccatgggcca ctgtttcagc    1260

aacctcagcc agtgcaacac aacctcagcc aagaagagta tgcagagaaa ggagtcccct    1320

acctgccaca aaactgttgt ctgaaaactg tctcatattg tctcaagttg tcattcattg    1380

tgaattagac ctgtttaaca tgtaatctgc aacatgcttc actgtctaat tttccagagc    1440

ccctcatata aggaactgta ttattggtat aatcatcatg gtgaagaagt tggtatgtgg    1500

gggagagatg acagaaacag agagtaagtc agagctggct gcctgacaga taaaaaggaa    1560

atgaccaaaa aaaaaaaaaa aaa                                            1583


<210> 131
<211> 447
<212> PRT
<213> artificial sequences

<220> 
<221> SOURCE
<222> 1..447
<223> /mol_type="protein"
      /note="E2F3-FLJ22536 (preferred fusion protein)"
      /organism="artificial sequences"

<400> 131
Met Arg Lys Gly Ile Gln Pro Ala Leu Glu Gln Tyr Leu Val Thr Ala 
1               5                   10                   15    
Gly Gly Gly Glu Gly Ala Ala Val Val Ala Ala Ala Ala Ala Ala Ser 
            20                   25                  30        
Met Asp Lys Arg Ala Leu Leu Ala Ser Pro Gly Phe Ala Ala Ala Ala 
        35                   40                  45            
Ala Ala Ala Ala Ala Pro Gly Ala Tyr Ile Gln Ile Leu Thr Thr Asn 
    50                   55                  60                
Thr Ser Thr Thr Ser Cys Ser Ser Ser Leu Gln Ser Gly Ala Val Ala 
65                   70                  75                  80
Ala Gly Pro Leu Leu Pro Ser Ala Pro Gly Ala Glu Gln Thr Ala Gly 
                85                   90                  95    
Ser Leu Leu Tyr Thr Thr Pro His Gly Pro Ser Ser Arg Ala Gly Leu 
            100                  105                110        
Leu Gln Gln Pro Pro Ala Leu Gly Arg Gly Gly Ser Gly Gly Gly Gly 
        115                  120                125            
Gly Pro Pro Ala Lys Arg Arg Leu Glu Leu Gly Glu Ser Gly His Gln 
    130                  135                140                
Tyr Leu Ser Asp Gly Leu Lys Thr Pro Lys Gly Lys Gly Arg Ala Ala 
145                  150                155                  160
Leu Arg Ser Pro Asp Ser Pro Lys Thr Pro Lys Ser Pro Ser Glu Lys 
                165                  170                175    
Thr Arg Tyr Asp Thr Ser Leu Gly Leu Leu Thr Lys Lys Phe Ile Gln 
            180                  185                190        
Leu Leu Ser Gln Ser Pro Asp Gly Val Leu Asp Leu Asn Lys Ala Ala 
        195                  200                205            
Glu Val Leu Lys Val Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val 
    210                  215                220                
Leu Glu Gly Ile His Leu Ile Lys Lys Lys Ser Lys Asn Asn Val Gln 
225                  230                235                  240
Trp Met Gly Cys Ser Leu Ser Glu Asp Gly Gly Met Leu Ala Gln Cys 
                245                  250                255    
Gln Gly Leu Ser Lys Glu Val Thr Glu Leu Ser Gln Glu Glu Lys Lys 
            260                  265                270        
Leu Asp Glu Leu Ile Gln Ser Cys Thr Leu Asp Leu Lys Leu Leu Thr 
        275                  280                285            
Glu Asp Ser Glu Asn Gln Arg Leu Ala Tyr Val Thr Tyr Gln Asp Ile 
    290                  295                300                
Arg Lys Ile Ser Gly Leu Lys Asp Gln Thr Val Ile Val Val Lys Ala 
305                  310                315                  320
Pro Pro Glu Thr Arg Leu Glu Val Pro Asp Ser Ile Glu Ser Leu Gln 
                325                  330                335    
Ile His Leu Ala Ser Thr Gln Gly Pro Ile Glu Val Tyr Leu Cys Pro 
            340                  345                350        
Glu Glu Thr Glu Thr His Ser Pro Met Lys Thr Asn Asn Gln Asp His 
        355                  360                365            
Asn Gly Asn Ile Pro Lys Pro Ala Ser Lys Val Leu Glu Thr Arg Arg 
    370                  375                380                
Pro Lys Ser Lys Val Trp Ala Gly Leu Ile Ser Leu Glu Asp Ser Ser 
385                  390                395                  400
Gly Glu Leu Cys Leu Leu Ala Ser Asp Ser Thr Ser Pro Ser Met Gly 
                405                  410                415    
His Cys Phe Ser Asn Leu Ser Gln Cys Asn Thr Thr Ser Ala Lys Lys 
            420                  425                430        
Ser Met Gln Arg Lys Glu Ser Pro Thr Cys His Lys Thr Val Val 
        435                  440                445        

<210> 132
<211> 1433
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1433
<223> /mol_type="DNA"
      /note="GPR160 (CCDS nucleotide sequence of GPR160 (Gene ID: 26996)
       including 416 nucleotides of the 5’UTR)"
      /organism="Homo sapiens"

<400> 132
ccggcccttg tttcgttggg cccggacggg acgtgcgcgc tcaaaggttg cccgtctctg      60

acgcccgcat ttcctggtct ggagccggct gagccacagc agggtcgccg cggggtcccg     120

gggccgtgct cccctgcccc tcccgggagc gcgcggggcg gggcggggcg gggcgggacc     180

aggcgggcga gctgggccct cgcccctccc tcgggcggtc acctgggcac gggcgctgca     240

ggtgtcgggg cctcaacctt gcggagccga cagccatcga tcctcgggtg gcctcgaggt     300

ggtggcaggg ccgccccctg cagtccggag acgaacgcac ggaccgggcc tccggaggca     360

ggttcggctg gaaggaaccg ctctcgcttc gtcctacact tgcgcaaatg tctccgatga     420

ctgctctctc ttcagagaac tgctcttttc agtaccagtt acgtcaaaca aaccagcccc     480

tagatgttaa ctatctgcta ttcttgatca tacttgggaa aatattatta aatatcctta     540

cactaggaat gagaagaaaa aacacctgtc aaaattttat ggaatatttt tgcatttcac     600

tagcattcgt tgatctttta cttttggtaa acatttccat tatattgtat ttcagggatt     660

ttgtactttt aagcattagg ttcactaaat accacatctg cctatttact caaattattt     720

cctttactta tggctttttg cattatccag ttttcctgac agcttgtata gattattgcc     780

tgaatttctc taaaacaacc aagctttcat ttaagtgtca aaaattattt tatttcttta     840

cagtaatttt aatttggatt tcagtccttg cttatgtttt gggagaccca gccatctacc     900

aaagcctgaa ggcacagaat gcttattctc gtcactgtcc tttctatgtc agcattcaga     960

gttactggct gtcatttttc atggtgatga ttttatttgt agctttcata acctgttggg    1020

aagaagttac tactttggta caggctatca ggataacttc ctatatgaat gaaactatct    1080

tatattttcc tttttcatcc cactccagtt atactgtgag atctaaaaaa atattcttat    1140

ccaagctcat tgtctgtttt ctcagtacct ggttaccatt tgtactactt caggtaatca    1200

ttgttttact taaagttcag attccagcat atattgagat gaatattccc tggttatact    1260

ttgtcaatag ttttctcatt gctacagtgt attggtttaa ttgtcacaag cttaatttaa    1320

aagacattgg attacctttg gatccatttg tcaactggaa gtgctgcttc attccactta    1380

caattcctaa tcttgagcaa attgaaaagc ctatatcaat aatgatttgt taa           1433


<210> 133
<211> 338
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..338
<223> /mol_type="protein"
      /note="GPR160 (full-length protein)"
      /organism="Homo sapiens"

<400> 133
Met Thr Ala Leu Ser Ser Glu Asn Cys Ser Phe Gln Tyr Gln Leu Arg 
1               5                   10                   15    
Gln Thr Asn Gln Pro Leu Asp Val Asn Tyr Leu Leu Phe Leu Ile Ile 
            20                   25                  30        
Leu Gly Lys Ile Leu Leu Asn Ile Leu Thr Leu Gly Met Arg Arg Lys 
        35                   40                  45            
Asn Thr Cys Gln Asn Phe Met Glu Tyr Phe Cys Ile Ser Leu Ala Phe 
    50                   55                  60                
Val Asp Leu Leu Leu Leu Val Asn Ile Ser Ile Ile Leu Tyr Phe Arg 
65                   70                  75                  80
Asp Phe Val Leu Leu Ser Ile Arg Phe Thr Lys Tyr His Ile Cys Leu 
                85                   90                  95    
Phe Thr Gln Ile Ile Ser Phe Thr Tyr Gly Phe Leu His Tyr Pro Val 
            100                  105                110        
Phe Leu Thr Ala Cys Ile Asp Tyr Cys Leu Asn Phe Ser Lys Thr Thr 
        115                  120                125            
Lys Leu Ser Phe Lys Cys Gln Lys Leu Phe Tyr Phe Phe Thr Val Ile 
    130                  135                140                
Leu Ile Trp Ile Ser Val Leu Ala Tyr Val Leu Gly Asp Pro Ala Ile 
145                  150                155                  160
Tyr Gln Ser Leu Lys Ala Gln Asn Ala Tyr Ser Arg His Cys Pro Phe 
                165                  170                175    
Tyr Val Ser Ile Gln Ser Tyr Trp Leu Ser Phe Phe Met Val Met Ile 
            180                  185                190        
Leu Phe Val Ala Phe Ile Thr Cys Trp Glu Glu Val Thr Thr Leu Val 
        195                  200                205            
Gln Ala Ile Arg Ile Thr Ser Tyr Met Asn Glu Thr Ile Leu Tyr Phe 
    210                  215                220                
Pro Phe Ser Ser His Ser Ser Tyr Thr Val Arg Ser Lys Lys Ile Phe 
225                  230                235                  240
Leu Ser Lys Leu Ile Val Cys Phe Leu Ser Thr Trp Leu Pro Phe Val 
                245                  250                255    
Leu Leu Gln Val Ile Ile Val Leu Leu Lys Val Gln Ile Pro Ala Tyr 
            260                  265                270        
Ile Glu Met Asn Ile Pro Trp Leu Tyr Phe Val Asn Ser Phe Leu Ile 
        275                  280                285            
Ala Thr Val Tyr Trp Phe Asn Cys His Lys Leu Asn Leu Lys Asp Ile 
    290                  295                300                
Gly Leu Pro Leu Asp Pro Phe Val Asn Trp Lys Cys Cys Phe Ile Pro 
305                  310                315                  320
Leu Thr Ile Pro Asn Leu Glu Gln Ile Glu Lys Pro Ile Ser Ile Met 
                325                  330                335    
Ile Cys 
        

<210> 134
<211> 416
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..416
<223> /mol_type="DNA"
      /note="GPR160 (preferred fragment (part of the 5’UTR))"
      /organism="artificial sequences"

<400> 134
ccggcccttg tttcgttggg cccggacggg acgtgcgcgc tcaaaggttg cccgtctctg      60

acgcccgcat ttcctggtct ggagccggct gagccacagc agggtcgccg cggggtcccg     120

gggccgtgct cccctgcccc tcccgggagc gcgcggggcg gggcggggcg gggcgggacc     180

aggcgggcga gctgggccct cgcccctccc tcgggcggtc acctgggcac gggcgctgca     240

ggtgtcgggg cctcaacctt gcggagccga cagccatcga tcctcgggtg gcctcgaggt     300

ggtggcaggg ccgccccctg cagtccggag acgaacgcac ggaccgggcc tccggaggca     360

ggttcggctg gaaggaaccg ctctcgcttc gtcctacact tgcgcaaatg tctccg         416


<210> 135
<400> 135
000

<210> 136
<211> 1347
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1347
<223> /mol_type="DNA"
      /note="NCEH1 (CCDS nucleotide sequence of NCEH1 (Gene ID: 57552))
      "
      /organism="Homo sapiens"

<400> 136
atgagcagct gccgcgggca gaaagttgcc ggaggtctcc gggtggtatc gccctttcct      60

ctttgccagc ccgctggcga gccgagccag ggcaagatga ggtcgtcctg tgtcctgctc     120

accgccctgg tggcgctggc cgcctattac gtctacatcc cgctgcctgg ctccgtgtcc     180

gacccctgga agctgatgct gctggacgcc actttccggg gtgcacagca agtgagtaac     240

ctgatccact acctgggact gagccatcac ctgctggcac tgaattttat cattgtttct     300

tttggcaaaa aaagcgcgtg gtcttctgcc caagtgaagg tgaccgacac agactttgat     360

ggtgtggaag tcagagtgtt tgaaggccct ccgaagcccg aagagccact gaaacgcagc     420

gtcgtttata tccacggagg aggctgggcc ttggcaagtg caagtgcgtc ctggtcacct     480

tcagatgaaa tcaggtatta tgatgagctg tgtacagcaa tggctgagga attgaatgct     540

gtcattgttt ccattgaata caggctagtt ccaaaggttt attttcctga gcaaattcat     600

gatgttgtac gggccacaaa gtatttcctg aagccagaag tcttacagaa gtatatggtt     660

gatccaggca gaatttgcat ttctggtgac agtgctggtg gaaatctggc tgctgccctt     720

ggacaacagt ttactcaaga tgccagccta aaaaataagc tcaaactaca agctttaatt     780

tatccagttc ttcaagcttt agattttaac acaccatctt atcagcaaaa tgtgaacacc     840

ccaatcctgc cccgctatgt catggtgaag tattgggtgg actacttcaa aggcaactat     900

gactttgtgc aggcaatgat cgttaacaat cacacttcac ttgatgtgga agaggctgct     960

gctgtcaggg cccgtctaaa ctggacatcc ctcttgcctg catccttcac aaagaactac    1020

aagcctgttg tacagaccac aggcaatgcc aggattgtcc aggagcttcc tcagttgctg    1080

gatgcccgct ccgccccact cattgcagac caggcagtgc tgcagctcct cccaaagacc    1140

tacattctga cgtgtgagca tgatgtcctc agagacgatg gcatcatgta tgccaagcgt    1200

ttggagagtg ccggtgtgga ggtgaccctg gatcactttg aggatggctt tcacggatgt    1260

atgattttca ctagctggcc caccaacttc tcagtgggaa tccggactag gaatagttac    1320

atcaagtggc tagatcaaaa cctgtaa                                        1347


<210> 137
<211> 448
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..448
<223> /mol_type="protein"
      /note="NCEH1 (full-length protein)"
      /organism="Homo sapiens"

<400> 137
Met Ser Ser Cys Arg Gly Gln Lys Val Ala Gly Gly Leu Arg Val Val 
1               5                   10                   15    
Ser Pro Phe Pro Leu Cys Gln Pro Ala Gly Glu Pro Ser Gln Gly Lys 
            20                   25                  30        
Met Arg Ser Ser Cys Val Leu Leu Thr Ala Leu Val Ala Leu Ala Ala 
        35                   40                  45            
Tyr Tyr Val Tyr Ile Pro Leu Pro Gly Ser Val Ser Asp Pro Trp Lys 
    50                   55                  60                
Leu Met Leu Leu Asp Ala Thr Phe Arg Gly Ala Gln Gln Val Ser Asn 
65                   70                  75                  80
Leu Ile His Tyr Leu Gly Leu Ser His His Leu Leu Ala Leu Asn Phe 
                85                   90                  95    
Ile Ile Val Ser Phe Gly Lys Lys Ser Ala Trp Ser Ser Ala Gln Val 
            100                  105                110        
Lys Val Thr Asp Thr Asp Phe Asp Gly Val Glu Val Arg Val Phe Glu 
        115                  120                125            
Gly Pro Pro Lys Pro Glu Glu Pro Leu Lys Arg Ser Val Val Tyr Ile 
    130                  135                140                
His Gly Gly Gly Trp Ala Leu Ala Ser Ala Ser Ala Ser Trp Ser Pro 
145                  150                155                  160
Ser Asp Glu Ile Arg Tyr Tyr Asp Glu Leu Cys Thr Ala Met Ala Glu 
                165                  170                175    
Glu Leu Asn Ala Val Ile Val Ser Ile Glu Tyr Arg Leu Val Pro Lys 
            180                  185                190        
Val Tyr Phe Pro Glu Gln Ile His Asp Val Val Arg Ala Thr Lys Tyr 
        195                  200                205            
Phe Leu Lys Pro Glu Val Leu Gln Lys Tyr Met Val Asp Pro Gly Arg 
    210                  215                220                
Ile Cys Ile Ser Gly Asp Ser Ala Gly Gly Asn Leu Ala Ala Ala Leu 
225                  230                235                  240
Gly Gln Gln Phe Thr Gln Asp Ala Ser Leu Lys Asn Lys Leu Lys Leu 
                245                  250                255    
Gln Ala Leu Ile Tyr Pro Val Leu Gln Ala Leu Asp Phe Asn Thr Pro 
            260                  265                270        
Ser Tyr Gln Gln Asn Val Asn Thr Pro Ile Leu Pro Arg Tyr Val Met 
        275                  280                285            
Val Lys Tyr Trp Val Asp Tyr Phe Lys Gly Asn Tyr Asp Phe Val Gln 
    290                  295                300                
Ala Met Ile Val Asn Asn His Thr Ser Leu Asp Val Glu Glu Ala Ala 
305                  310                315                  320
Ala Val Arg Ala Arg Leu Asn Trp Thr Ser Leu Leu Pro Ala Ser Phe 
                325                  330                335    
Thr Lys Asn Tyr Lys Pro Val Val Gln Thr Thr Gly Asn Ala Arg Ile 
            340                  345                350        
Val Gln Glu Leu Pro Gln Leu Leu Asp Ala Arg Ser Ala Pro Leu Ile 
        355                  360                365            
Ala Asp Gln Ala Val Leu Gln Leu Leu Pro Lys Thr Tyr Ile Leu Thr 
    370                  375                380                
Cys Glu His Asp Val Leu Arg Asp Asp Gly Ile Met Tyr Ala Lys Arg 
385                  390                395                  400
Leu Glu Ser Ala Gly Val Glu Val Thr Leu Asp His Phe Glu Asp Gly 
                405                  410                415    
Phe His Gly Cys Met Ile Phe Thr Ser Trp Pro Thr Asn Phe Ser Val 
            420                  425                430        
Gly Ile Arg Thr Arg Asn Ser Tyr Ile Lys Trp Leu Asp Gln Asn Leu 
        435                  440                445            

<210> 138
<211> 1113
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1113
<223> /mol_type="DNA"
      /note="NCEH1 (preferred gene fragment)"
      /organism="artificial sequences"

<400> 138
agtaacctga tccactacct gggactgagc catcacctgc tggcactgaa ttttatcatt      60

gtttcttttg gcaaaaaaag cgcgtggtct tctgcccaag tgaaggtgac cgacacagac     120

tttgatggtg tggaagtcag agtgtttgaa ggccctccga agcccgaaga gccactgaaa     180

cgcagcgtcg tttatatcca cggaggaggc tgggccttgg caagtgcaag tgcgtcctgg     240

tcaccttcag atgaaatcag gtattatgat gagctgtgta cagcaatggc tgaggaattg     300

aatgctgtca ttgtttccat tgaatacagg ctagttccaa aggtttattt tcctgagcaa     360

attcatgatg ttgtacgggc cacaaagtat ttcctgaagc cagaagtctt acagaagtat     420

atggttgatc caggcagaat ttgcatttct ggtgacagtg ctggtggaaa tctggctgct     480

gcccttggac aacagtttac tcaagatgcc agcctaaaaa ataagctcaa actacaagct     540

ttaatttatc cagttcttca agctttagat tttaacacac catcttatca gcaaaatgtg     600

aacaccccaa tcctgccccg ctatgtcatg gtgaagtatt gggtggacta cttcaaaggc     660

aactatgact ttgtgcaggc aatgatcgtt aacaatcaca cttcacttga tgtggaagag     720

gctgctgctg tcagggcccg tctaaactgg acatccctct tgcctgcatc cttcacaaag     780

aactacaagc ctgttgtaca gaccacaggc aatgccagga ttgtccagga gcttcctcag     840

ttgctggatg cccgctccgc cccactcatt gcagaccagg cagtgctgca gctcctccca     900

aagacctaca ttctgacgtg tgagcatgat gtcctcagag acgatggcat catgtatgcc     960

aagcgtttgg agagtgccgg tgtggaggtg accctggatc actttgagga tggctttcac    1020

ggatgtatga ttttcactag ctggcccacc aacttctcag tgggaatccg gactaggaat    1080

agttacatca agtggctaga tcaaaacctg taa                                 1113


<210> 139
<211> 1529
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..1529
<223> /mol_type="DNA"
      /note="GPR160-NCEH1 (preferred fusion gene)"
      /organism="artificial sequences"

<400> 139
ccggcccttg tttcgttggg cccggacggg acgtgcgcgc tcaaaggttg cccgtctctg      60

acgcccgcat ttcctggtct ggagccggct gagccacagc agggtcgccg cggggtcccg     120

gggccgtgct cccctgcccc tcccgggagc gcgcggggcg gggcggggcg gggcgggacc     180

aggcgggcga gctgggccct cgcccctccc tcgggcggtc acctgggcac gggcgctgca     240

ggtgtcgggg cctcaacctt gcggagccga cagccatcga tcctcgggtg gcctcgaggt     300

ggtggcaggg ccgccccctg cagtccggag acgaacgcac ggaccgggcc tccggaggca     360

ggttcggctg gaaggaaccg ctctcgcttc gtcctacact tgcgcaaatg tctccgagta     420

acctgatcca ctacctggga ctgagccatc acctgctggc actgaatttt atcattgttt     480

cttttggcaa aaaaagcgcg tggtcttctg cccaagtgaa ggtgaccgac acagactttg     540

atggtgtgga agtcagagtg tttgaaggcc ctccgaagcc cgaagagcca ctgaaacgca     600

gcgtcgttta tatccacgga ggaggctggg ccttggcaag tgcaagtgcg tcctggtcac     660

cttcagatga aatcaggtat tatgatgagc tgtgtacagc aatggctgag gaattgaatg     720

ctgtcattgt ttccattgaa tacaggctag ttccaaaggt ttattttcct gagcaaattc     780

atgatgttgt acgggccaca aagtatttcc tgaagccaga agtcttacag aagtatatgg     840

ttgatccagg cagaatttgc atttctggtg acagtgctgg tggaaatctg gctgctgccc     900

ttggacaaca gtttactcaa gatgccagcc taaaaaataa gctcaaacta caagctttaa     960

tttatccagt tcttcaagct ttagatttta acacaccatc ttatcagcaa aatgtgaaca    1020

ccccaatcct gccccgctat gtcatggtga agtattgggt ggactacttc aaaggcaact    1080

atgactttgt gcaggcaatg atcgttaaca atcacacttc acttgatgtg gaagaggctg    1140

ctgctgtcag ggcccgtcta aactggacat ccctcttgcc tgcatccttc acaaagaact    1200

acaagcctgt tgtacagacc acaggcaatg ccaggattgt ccaggagctt cctcagttgc    1260

tggatgcccg ctccgcccca ctcattgcag accaggcagt gctgcagctc ctcccaaaga    1320

cctacattct gacgtgtgag catgatgtcc tcagagacga tggcatcatg tatgccaagc    1380

gtttggagag tgccggtgtg gaggtgaccc tggatcactt tgaggatggc tttcacggat    1440

gtatgatttt cactagctgg cccaccaact tctcagtggg aatccggact aggaatagtt    1500

acatcaagtg gctagatcaa aacctgtaa                                      1529


<210> 140
<400> 140
000

<210> 141
<211> 1119
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1119
<223> /mol_type="protein"
      /note="ADCY1 (full-length protein of ADCY1 (Gene ID: 107))"
      /organism="Homo sapiens"

<400> 141
Met Ala Gly Ala Pro Arg Gly Gly Gly Gly Gly Gly Gly Gly Ala Gly 
1               5                   10                   15    
Glu Pro Gly Gly Ala Glu Arg Ala Ala Gly Thr Ser Arg Arg Arg Gly 
            20                   25                  30        
Leu Arg Ala Cys Asp Glu Glu Phe Ala Cys Pro Glu Leu Glu Ala Leu 
        35                   40                  45            
Phe Arg Gly Tyr Thr Leu Arg Leu Glu Gln Ala Ala Thr Leu Lys Ala 
    50                   55                  60                
Leu Ala Val Leu Ser Leu Leu Ala Gly Ala Leu Ala Leu Ala Glu Leu 
65                   70                  75                  80
Leu Gly Ala Pro Gly Pro Ala Pro Gly Leu Ala Lys Gly Ser His Pro 
                85                   90                  95    
Val His Cys Val Leu Phe Leu Ala Leu Leu Val Val Thr Asn Val Arg 
            100                  105                110        
Ser Leu Gln Val Pro Gln Leu Gln Gln Val Gly Gln Leu Ala Leu Leu 
        115                  120                125            
Phe Ser Leu Thr Phe Ala Leu Leu Cys Cys Pro Phe Ala Leu Gly Gly 
    130                  135                140                
Pro Ala Arg Gly Ser Ala Gly Ala Ala Gly Gly Pro Ala Thr Ala Glu 
145                  150                155                  160
Gln Gly Val Trp Gln Leu Leu Leu Val Thr Phe Val Ser Tyr Ala Leu 
                165                  170                175    
Leu Pro Val Arg Ser Leu Leu Ala Ile Gly Phe Gly Leu Val Val Ala 
            180                  185                190        
Ala Ser His Leu Leu Val Thr Ala Thr Leu Val Pro Ala Lys Arg Pro 
        195                  200                205            
Arg Leu Trp Arg Thr Leu Gly Ala Asn Ala Leu Leu Phe Val Gly Val 
    210                  215                220                
Asn Met Tyr Gly Val Phe Val Arg Ile Leu Thr Glu Arg Ser Gln Arg 
225                  230                235                  240
Lys Ala Phe Leu Gln Ala Arg Ser Cys Ile Glu Asp Arg Leu Arg Leu 
                245                  250                255    
Glu Asp Glu Asn Glu Lys Gln Glu Arg Leu Leu Met Ser Leu Leu Pro 
            260                  265                270        
Arg Asn Val Ala Met Glu Met Lys Glu Asp Phe Leu Lys Pro Pro Glu 
        275                  280                285            
Arg Ile Phe His Lys Ile Tyr Ile Gln Arg His Asp Asn Val Ser Ile 
    290                  295                300                
Leu Phe Ala Asp Ile Val Gly Phe Thr Gly Leu Ala Ser Gln Cys Thr 
305                  310                315                  320
Ala Gln Glu Leu Val Lys Leu Leu Asn Glu Leu Phe Gly Lys Phe Asp 
                325                  330                335    
Glu Leu Ala Thr Glu Asn His Cys Arg Arg Ile Lys Ile Leu Gly Asp 
            340                  345                350        
Cys Tyr Tyr Cys Val Ser Gly Leu Thr Gln Pro Lys Thr Asp His Ala 
        355                  360                365            
His Cys Cys Val Glu Met Gly Leu Asp Met Ile Asp Thr Ile Thr Ser 
    370                  375                380                
Val Ala Glu Ala Thr Glu Val Asp Leu Asn Met Arg Val Gly Leu His 
385                  390                395                  400
Thr Gly Arg Val Leu Cys Gly Val Leu Gly Leu Arg Lys Trp Gln Tyr 
                405                  410                415    
Asp Val Trp Ser Asn Asp Val Thr Leu Ala Asn Val Met Glu Ala Ala 
            420                  425                430        
Gly Leu Pro Gly Lys Val His Ile Thr Lys Thr Thr Leu Ala Cys Leu 
        435                  440                445            
Asn Gly Asp Tyr Glu Val Glu Pro Gly Tyr Gly His Glu Arg Asn Ser 
    450                  455                460                
Phe Leu Lys Thr His Asn Ile Glu Thr Phe Phe Ile Val Pro Ser His 
465                  470                475                  480
Arg Arg Lys Ile Phe Pro Gly Leu Ile Leu Ser Asp Ile Lys Pro Ala 
                485                  490                495    
Lys Arg Met Lys Phe Lys Thr Val Cys Tyr Leu Leu Val Gln Leu Met 
            500                  505                510        
His Cys Arg Lys Met Phe Lys Ala Glu Ile Pro Phe Ser Asn Val Met 
        515                  520                525            
Thr Cys Glu Asp Asp Asp Lys Arg Arg Ala Leu Arg Thr Ala Ser Glu 
    530                  535                540                
Lys Leu Arg Asn Arg Ser Ser Phe Ser Thr Asn Val Val Tyr Thr Thr 
545                  550                555                  560
Pro Gly Thr Arg Val Asn Arg Tyr Ile Ser Arg Leu Leu Glu Ala Arg 
                565                  570                575    
Gln Thr Glu Leu Glu Met Ala Asp Leu Asn Phe Phe Thr Leu Lys Tyr 
            580                  585                590        
Lys His Val Glu Arg Glu Gln Lys Tyr His Gln Leu Gln Asp Glu Tyr 
        595                  600                605            
Phe Thr Ser Ala Val Val Leu Thr Leu Ile Leu Ala Ala Leu Phe Gly 
    610                  615                620                
Leu Val Tyr Leu Leu Ile Phe Pro Gln Ser Val Val Val Leu Leu Leu 
625                  630                635                  640
Leu Val Phe Cys Ile Cys Phe Leu Val Ala Cys Val Leu Tyr Leu His 
                645                  650                655    
Ile Thr Arg Val Gln Cys Phe Pro Gly Cys Leu Thr Ile Gln Ile Arg 
            660                  665                670        
Thr Val Leu Cys Ile Phe Ile Val Val Leu Ile Tyr Ser Val Ala Gln 
        675                  680                685            
Gly Cys Val Val Gly Cys Leu Pro Trp Ala Trp Ser Ser Lys Pro Asn 
    690                  695                700                
Ser Ser Leu Val Val Leu Ser Ser Gly Gly Gln Arg Thr Ala Leu Pro 
705                  710                715                  720
Thr Leu Pro Cys Glu Ser Thr His His Ala Leu Leu Cys Cys Leu Val 
                725                  730                735    
Gly Thr Leu Pro Leu Ala Ile Phe Phe Arg Val Ser Ser Leu Pro Lys 
            740                  745                750        
Met Ile Leu Leu Ser Gly Leu Thr Thr Ser Tyr Ile Leu Val Leu Glu 
        755                  760                765            
Leu Ser Gly Tyr Thr Arg Thr Gly Gly Gly Ala Val Ser Gly Arg Ser 
    770                  775                780                
Tyr Glu Pro Ile Val Ala Ile Leu Leu Phe Ser Cys Ala Leu Ala Leu 
785                  790                795                  800
His Ala Arg Gln Val Asp Ile Arg Leu Arg Leu Asp Tyr Leu Trp Ala 
                805                  810                815    
Ala Gln Ala Glu Glu Glu Arg Glu Asp Met Glu Lys Val Lys Leu Asp 
            820                  825                830        
Asn Arg Arg Ile Leu Phe Asn Leu Leu Pro Ala His Val Ala Gln His 
        835                  840                845            
Phe Leu Met Ser Asn Pro Arg Asn Met Asp Leu Tyr Tyr Gln Ser Tyr 
    850                  855                860                
Ser Gln Val Gly Val Met Phe Ala Ser Ile Pro Asn Phe Asn Asp Phe 
865                  870                875                  880
Tyr Ile Glu Leu Asp Gly Asn Asn Met Gly Val Glu Cys Leu Arg Leu 
                885                  890                895    
Leu Asn Glu Ile Ile Ala Asp Phe Asp Glu Leu Met Glu Lys Asp Phe 
            900                  905                910        
Tyr Lys Asp Ile Glu Lys Ile Lys Thr Ile Gly Ser Thr Tyr Met Ala 
        915                  920                925            
Ala Val Gly Leu Ala Pro Thr Ser Gly Thr Lys Ala Lys Lys Ser Ile 
    930                  935                940                
Ser Ser His Leu Ser Thr Leu Ala Asp Phe Ala Ile Glu Met Phe Asp 
945                  950                955                  960
Val Leu Asp Glu Ile Asn Tyr Gln Ser Tyr Asn Asp Phe Val Leu Arg 
                965                  970                975    
Val Gly Ile Asn Val Gly Pro Val Val Ala Gly Val Ile Gly Ala Arg 
            980                  985                990        
Arg Pro Gln Tyr Asp Ile Trp Gly Asn Thr Val Asn Val Ala Ser Arg 
        995                  1000                1005            
Met Asp Ser Thr Gly Val Gln Gly Arg Ile Gln Val Thr Glu Glu Val 
    1010                1015                1020                
His Arg Leu Leu Arg Arg Cys Pro Tyr His Phe Val Cys Arg Gly Lys 
1025                1030                1035                1040
Val Ser Val Lys Gly Lys Gly Glu Met Leu Thr Tyr Phe Leu Glu Gly 
                1045                1050                1055    
Arg Thr Asp Gly Asn Gly Ser Gln Ile Arg Ser Leu Gly Leu Asp Arg 
            1060                1065                1070        
Lys Met Cys Pro Phe Gly Arg Ala Gly Leu Gln Gly Arg Arg Pro Pro 
        1075                1080                1085            
Val Cys Pro Met Pro Gly Val Ser Val Arg Ala Gly Leu Pro Pro His 
    1090                1095                1100                
Ser Pro Gly Gln Tyr Leu Pro Ser Ala Ala Ala Gly Lys Glu Ala 
1105                1110                1115                

<210> 142
<211> 1091
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1091
<223> /mol_type="protein"
      /note="ADCY2 (full-length protein of ADCY2 (Gene ID: 108))"
      /organism="Homo sapiens"

<400> 142
Met Trp Gln Glu Ala Met Arg Arg Arg Arg Tyr Leu Arg Asp Arg Ser 
1               5                   10                   15    
Glu Glu Ala Ala Gly Gly Gly Asp Gly Leu Pro Arg Ser Arg Asp Trp 
            20                   25                  30        
Leu Tyr Glu Ser Tyr Tyr Cys Met Ser Gln Gln His Pro Leu Ile Val 
        35                   40                  45            
Phe Leu Leu Leu Ile Val Met Gly Ser Cys Leu Ala Leu Leu Ala Val 
    50                   55                  60                
Phe Phe Ala Leu Gly Leu Glu Val Glu Asp His Val Ala Phe Leu Ile 
65                   70                  75                  80
Thr Val Pro Thr Ala Leu Ala Ile Phe Phe Ala Ile Phe Ile Leu Val 
                85                   90                  95    
Cys Ile Glu Ser Val Phe Lys Lys Leu Leu Arg Leu Phe Ser Leu Val 
            100                  105                110        
Ile Trp Ile Cys Leu Val Ala Met Gly Tyr Leu Phe Met Cys Phe Gly 
        115                  120                125            
Gly Thr Val Ser Pro Trp Asp Gln Val Ser Phe Phe Leu Phe Ile Ile 
    130                  135                140                
Phe Val Val Tyr Thr Met Leu Pro Phe Asn Met Arg Asp Ala Ile Ile 
145                  150                155                  160
Ala Ser Val Leu Thr Ser Ser Ser His Thr Ile Val Leu Ser Val Cys 
                165                  170                175    
Leu Ser Ala Thr Pro Gly Gly Lys Glu His Leu Val Trp Gln Ile Leu 
            180                  185                190        
Ala Asn Val Ile Ile Phe Ile Cys Gly Asn Leu Ala Gly Ala Tyr His 
        195                  200                205            
Lys His Leu Met Glu Leu Ala Leu Gln Gln Thr Tyr Gln Asp Thr Cys 
    210                  215                220                
Asn Cys Ile Lys Ser Arg Ile Lys Leu Glu Phe Glu Lys Arg Gln Gln 
225                  230                235                  240
Glu Arg Leu Leu Leu Ser Leu Leu Pro Ala His Ile Ala Met Glu Met 
                245                  250                255    
Lys Ala Glu Ile Ile Gln Arg Leu Gln Gly Pro Lys Ala Gly Gln Met 
            260                  265                270        
Glu Asn Thr Asn Asn Phe His Asn Leu Tyr Val Lys Arg His Thr Asn 
        275                  280                285            
Val Ser Ile Leu Tyr Ala Asp Ile Val Gly Phe Thr Arg Leu Ala Ser 
    290                  295                300                
Asp Cys Ser Pro Gly Glu Leu Val His Met Leu Asn Glu Leu Phe Gly 
305                  310                315                  320
Lys Phe Asp Gln Ile Ala Lys Glu Asn Glu Cys Met Arg Ile Lys Ile 
                325                  330                335    
Leu Gly Asp Cys Tyr Tyr Cys Val Ser Gly Leu Pro Ile Ser Leu Pro 
            340                  345                350        
Asn His Ala Lys Asn Cys Val Lys Met Gly Leu Asp Met Cys Glu Ala 
        355                  360                365            
Ile Lys Lys Val Arg Asp Ala Thr Gly Val Asp Ile Asn Met Arg Val 
    370                  375                380                
Gly Val His Ser Gly Asn Val Leu Cys Gly Val Ile Gly Leu Gln Lys 
385                  390                395                  400
Trp Gln Tyr Asp Val Trp Ser His Asp Val Thr Leu Ala Asn His Met 
                405                  410                415    
Glu Ala Gly Gly Val Pro Gly Arg Val His Ile Ser Ser Val Thr Leu 
            420                  425                430        
Glu His Leu Asn Gly Ala Tyr Lys Val Glu Glu Gly Asp Gly Asp Ile 
        435                  440                445            
Arg Asp Pro Tyr Leu Lys Gln His Leu Val Lys Thr Tyr Phe Val Ile 
    450                  455                460                
Asn Pro Lys Gly Glu Arg Arg Ser Pro Gln His Leu Phe Arg Pro Arg 
465                  470                475                  480
His Thr Leu Asp Gly Ala Lys Met Arg Ala Ser Val Arg Met Thr Arg 
                485                  490                495    
Tyr Leu Glu Ser Trp Gly Ala Ala Lys Pro Phe Ala His Leu His His 
            500                  505                510        
Arg Asp Ser Met Thr Thr Glu Asn Gly Lys Ile Ser Thr Thr Asp Val 
        515                  520                525            
Pro Met Gly Gln His Asn Phe Gln Asn Arg Thr Leu Arg Thr Lys Ser 
    530                  535                540                
Gln Lys Lys Arg Phe Glu Glu Glu Leu Asn Glu Arg Met Ile Gln Ala 
545                  550                555                  560
Ile Asp Gly Ile Asn Ala Gln Lys Gln Trp Leu Lys Ser Glu Asp Ile 
                565                  570                575    
Gln Arg Ile Ser Leu Leu Phe Tyr Asn Lys Val Leu Glu Lys Glu Tyr 
            580                  585                590        
Arg Ala Thr Ala Leu Pro Ala Phe Lys Tyr Tyr Val Thr Cys Ala Cys 
        595                  600                605            
Leu Ile Phe Phe Cys Ile Phe Ile Val Gln Ile Leu Val Leu Pro Lys 
    610                  615                620                
Thr Ser Val Leu Gly Ile Ser Phe Gly Ala Ala Phe Leu Leu Leu Ala 
625                  630                635                  640
Phe Ile Leu Phe Val Cys Phe Ala Gly Gln Leu Leu Gln Cys Ser Lys 
                645                  650                655    
Lys Ala Ser Pro Leu Leu Met Trp Leu Leu Lys Ser Ser Gly Ile Ile 
            660                  665                670        
Ala Asn Arg Pro Trp Pro Arg Ile Ser Leu Thr Ile Ile Thr Thr Ala 
        675                  680                685            
Ile Ile Leu Met Met Ala Val Phe Asn Met Phe Phe Leu Ser Asp Ser 
    690                  695                700                
Glu Glu Thr Ile Pro Pro Thr Ala Asn Thr Thr Asn Thr Ser Phe Ser 
705                  710                715                  720
Ala Ser Asn Asn Gln Val Ala Ile Leu Arg Ala Gln Asn Leu Phe Phe 
                725                  730                735    
Leu Pro Tyr Phe Ile Tyr Ser Cys Ile Leu Gly Leu Ile Ser Cys Ser 
            740                  745                750        
Val Phe Leu Arg Val Asn Tyr Glu Leu Lys Met Leu Ile Met Met Val 
        755                  760                765            
Ala Leu Val Gly Tyr Asn Thr Ile Leu Leu His Thr His Ala His Val 
    770                  775                780                
Leu Gly Asp Tyr Ser Gln Val Leu Phe Glu Arg Pro Gly Ile Trp Lys 
785                  790                795                  800
Asp Leu Lys Thr Met Gly Ser Val Ser Leu Ser Ile Phe Phe Ile Thr 
                805                  810                815    
Leu Leu Val Leu Gly Arg Gln Asn Glu Tyr Tyr Cys Arg Leu Asp Phe 
            820                  825                830        
Leu Trp Lys Asn Lys Phe Lys Lys Glu Arg Glu Glu Ile Glu Thr Met 
        835                  840                845            
Glu Asn Leu Asn Arg Val Leu Leu Glu Asn Val Leu Pro Ala His Val 
    850                  855                860                
Ala Glu His Phe Leu Ala Arg Ser Leu Lys Asn Glu Glu Leu Tyr His 
865                  870                875                  880
Gln Ser Tyr Asp Cys Val Cys Val Met Phe Ala Ser Ile Pro Asp Phe 
                885                  890                895    
Lys Glu Phe Tyr Thr Glu Ser Asp Val Asn Lys Glu Gly Leu Glu Cys 
            900                  905                910        
Leu Arg Leu Leu Asn Glu Ile Ile Ala Asp Phe Asp Asp Leu Leu Ser 
        915                  920                925            
Lys Pro Lys Phe Ser Gly Val Glu Lys Ile Lys Thr Ile Gly Ser Thr 
    930                  935                940                
Tyr Met Ala Ala Thr Gly Leu Ser Ala Val Pro Ser Gln Glu His Ser 
945                  950                955                  960
Gln Glu Pro Glu Arg Gln Tyr Met His Ile Gly Thr Met Val Glu Phe 
                965                  970                975    
Ala Phe Ala Leu Val Gly Lys Leu Asp Ala Ile Asn Lys His Ser Phe 
            980                  985                990        
Asn Asp Phe Lys Leu Arg Val Gly Ile Asn His Gly Pro Val Ile Ala 
        995                  1000                1005            
Gly Val Ile Gly Ala Gln Lys Pro Gln Tyr Asp Ile Trp Gly Asn Thr 
    1010                1015                1020                
Val Asn Val Ala Ser Arg Met Asp Ser Thr Gly Val Leu Asp Lys Ile 
1025                1030                1035                1040
Gln Val Thr Glu Glu Thr Ser Leu Val Leu Gln Thr Leu Gly Tyr Thr 
                1045                1050                1055    
Cys Thr Cys Arg Gly Ile Ile Asn Val Lys Gly Lys Gly Asp Leu Lys 
            1060                1065                1070        
Thr Tyr Phe Val Asn Thr Glu Met Ser Arg Ser Leu Ser Gln Ser Asn 
        1075                1080                1085            
Val Ala Ser 
    1090    

<210> 143
<211> 1077
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1077
<223> /mol_type="protein"
      /note="ADCY4 (full length protein of ADCY4 (Gene ID: 196883))"
      /organism="Homo sapiens"

<400> 143
Met Ala Arg Leu Phe Ser Pro Arg Pro Pro Pro Ser Glu Asp Leu Phe 
1               5                   10                   15    
Tyr Glu Thr Tyr Tyr Ser Leu Ser Gln Gln Tyr Pro Leu Leu Leu Leu 
            20                   25                  30        
Leu Leu Gly Ile Val Leu Cys Ala Leu Ala Ala Leu Leu Ala Val Ala 
        35                   40                  45            
Trp Ala Ser Gly Arg Glu Leu Thr Ser Asp Pro Ser Phe Leu Thr Thr 
    50                   55                  60                
Val Leu Cys Ala Leu Gly Gly Phe Ser Leu Leu Leu Gly Leu Ala Ser 
65                   70                  75                  80
Arg Glu Gln Arg Leu Gln Arg Trp Thr Arg Pro Leu Ser Gly Leu Val 
                85                   90                  95    
Trp Val Ala Leu Leu Ala Leu Gly His Ala Phe Leu Phe Thr Gly Gly 
            100                  105                110        
Val Val Ser Ala Trp Asp Gln Val Ser Tyr Phe Leu Phe Val Ile Phe 
        115                  120                125            
Thr Ala Tyr Ala Met Leu Pro Leu Gly Met Arg Asp Ala Ala Val Ala 
    130                  135                140                
Gly Leu Ala Ser Ser Leu Ser His Leu Leu Val Leu Gly Leu Tyr Leu 
145                  150                155                  160
Gly Pro Gln Pro Asp Ser Arg Pro Ala Leu Leu Pro Gln Leu Ala Ala 
                165                  170                175    
Asn Ala Val Leu Phe Leu Cys Gly Asn Val Ala Gly Val Tyr His Lys 
            180                  185                190        
Ala Leu Met Glu Arg Ala Leu Arg Ala Thr Phe Arg Glu Ala Leu Ser 
        195                  200                205            
Ser Leu His Ser Arg Arg Arg Leu Asp Thr Glu Lys Lys His Gln Glu 
    210                  215                220                
His Leu Leu Leu Ser Ile Leu Pro Ala Tyr Leu Ala Arg Glu Met Lys 
225                  230                235                  240
Ala Glu Ile Met Ala Arg Leu Gln Ala Gly Gln Gly Ser Arg Pro Glu 
                245                  250                255    
Ser Thr Asn Asn Phe His Ser Leu Tyr Val Lys Arg His Gln Gly Val 
            260                  265                270        
Ser Val Leu Tyr Ala Asp Ile Val Gly Phe Thr Arg Leu Ala Ser Glu 
        275                  280                285            
Cys Ser Pro Lys Glu Leu Val Leu Met Leu Asn Glu Leu Phe Gly Lys 
    290                  295                300                
Phe Asp Gln Ile Ala Lys Glu His Glu Cys Met Arg Ile Lys Ile Leu 
305                  310                315                  320
Gly Asp Cys Tyr Tyr Cys Val Ser Gly Leu Pro Leu Ser Leu Pro Asp 
                325                  330                335    
His Ala Ile Asn Cys Val Arg Met Gly Leu Asp Met Cys Arg Ala Ile 
            340                  345                350        
Arg Lys Leu Arg Ala Ala Thr Gly Val Asp Ile Asn Met Arg Val Gly 
        355                  360                365            
Val His Ser Gly Ser Val Leu Cys Gly Val Ile Gly Leu Gln Lys Trp 
    370                  375                380                
Gln Tyr Asp Val Trp Ser His Asp Val Thr Leu Ala Asn His Met Glu 
385                  390                395                  400
Ala Gly Gly Val Pro Gly Arg Val His Ile Thr Gly Ala Thr Leu Ala 
                405                  410                415    
Leu Leu Ala Gly Ala Tyr Ala Val Glu Asp Ala Gly Met Glu His Arg 
            420                  425                430        
Asp Pro Tyr Leu Arg Glu Leu Gly Glu Pro Thr Tyr Leu Val Ile Asp 
        435                  440                445            
Pro Arg Ala Glu Glu Glu Asp Glu Lys Gly Thr Ala Gly Gly Leu Leu 
    450                  455                460                
Ser Ser Leu Glu Gly Leu Lys Met Arg Pro Ser Leu Leu Met Thr Arg 
465                  470                475                  480
Tyr Leu Glu Ser Trp Gly Ala Ala Lys Pro Phe Ala His Leu Ser His 
                485                  490                495    
Gly Asp Ser Pro Val Ser Thr Ser Thr Pro Leu Pro Glu Lys Thr Leu 
            500                  505                510        
Ala Ser Phe Ser Thr Gln Trp Ser Leu Asp Arg Ser Arg Thr Pro Arg 
        515                  520                525            
Gly Leu Asp Asp Glu Leu Asp Thr Gly Asp Ala Lys Phe Phe Gln Val 
    530                  535                540                
Ile Glu Gln Leu Asn Ser Gln Lys Gln Trp Lys Gln Ser Lys Asp Phe 
545                  550                555                  560
Asn Pro Leu Thr Leu Tyr Phe Arg Glu Lys Glu Met Glu Lys Glu Tyr 
                565                  570                575    
Arg Leu Ser Ala Ile Pro Ala Phe Lys Tyr Tyr Glu Ala Cys Thr Phe 
            580                  585                590        
Leu Val Phe Leu Ser Asn Phe Ile Ile Gln Met Leu Val Thr Asn Arg 
        595                  600                605            
Pro Pro Ala Leu Ala Ile Thr Tyr Ser Ile Thr Phe Leu Leu Phe Leu 
    610                  615                620                
Leu Ile Leu Phe Val Cys Phe Ser Glu Asp Leu Met Arg Cys Val Leu 
625                  630                635                  640
Lys Gly Pro Lys Met Leu His Trp Leu Pro Ala Leu Ser Gly Leu Val 
                645                  650                655    
Ala Thr Arg Pro Gly Leu Arg Ile Ala Leu Gly Thr Ala Thr Ile Leu 
            660                  665                670        
Leu Val Phe Ala Met Ala Ile Thr Ser Leu Phe Phe Phe Pro Thr Ser 
        675                  680                685            
Ser Asp Cys Pro Phe Gln Ala Pro Asn Val Ser Ser Met Ile Ser Asn 
    690                  695                700                
Leu Ser Trp Glu Leu Pro Gly Ser Leu Pro Leu Ile Ser Val Pro Tyr 
705                  710                715                  720
Ser Met His Cys Cys Thr Leu Gly Phe Leu Ser Cys Ser Leu Phe Leu 
                725                  730                735    
His Met Ser Phe Glu Leu Lys Leu Leu Leu Leu Leu Leu Trp Leu Ala 
            740                  745                750        
Ala Ser Cys Ser Leu Phe Leu His Ser His Ala Trp Leu Ser Glu Cys 
        755                  760                765            
Leu Ile Val Arg Leu Tyr Leu Gly Pro Leu Asp Ser Arg Pro Gly Val 
    770                  775                780                
Leu Lys Glu Pro Lys Leu Met Gly Ala Ile Ser Phe Phe Ile Phe Phe 
785                  790                795                  800
Phe Thr Leu Leu Val Leu Ala Arg Gln Asn Glu Tyr Tyr Cys Arg Leu 
                805                  810                815    
Asp Phe Leu Trp Lys Lys Lys Leu Arg Gln Glu Arg Glu Glu Thr Glu 
            820                  825                830        
Thr Met Glu Asn Leu Thr Arg Leu Leu Leu Glu Asn Val Leu Pro Ala 
        835                  840                845            
His Val Ala Pro Gln Phe Ile Gly Gln Asn Arg Arg Asn Glu Asp Leu 
    850                  855                860                
Tyr His Gln Ser Tyr Glu Cys Val Cys Val Leu Phe Ala Ser Val Pro 
865                  870                875                  880
Asp Phe Lys Glu Phe Tyr Ser Glu Ser Asn Ile Asn His Glu Gly Leu 
                885                  890                895    
Glu Cys Leu Arg Leu Leu Asn Glu Ile Ile Ala Asp Phe Asp Glu Leu 
            900                  905                910        
Leu Ser Lys Pro Lys Phe Ser Gly Val Glu Lys Ile Lys Thr Ile Gly 
        915                  920                925            
Ser Thr Tyr Met Ala Ala Thr Gly Leu Asn Ala Thr Ser Gly Gln Asp 
    930                  935                940                
Ala Gln Gln Asp Ala Glu Arg Ser Cys Ser His Leu Gly Thr Met Val 
945                  950                955                  960
Glu Phe Ala Val Ala Leu Gly Ser Lys Leu Asp Val Ile Asn Lys His 
                965                  970                975    
Ser Phe Asn Asn Phe Arg Leu Arg Val Gly Leu Asn His Gly Pro Val 
            980                  985                990        
Val Ala Gly Val Ile Gly Ala Gln Lys Pro Gln Tyr Asp Ile Trp Gly 
        995                  1000                1005            
Asn Thr Val Asn Val Ala Ser Arg Met Glu Ser Thr Gly Val Leu Gly 
    1010                1015                1020                
Lys Ile Gln Val Thr Glu Glu Thr Ala Trp Ala Leu Gln Ser Leu Gly 
1025                1030                1035                1040
Tyr Thr Cys Tyr Ser Arg Gly Val Ile Lys Val Lys Gly Lys Gly Gln 
                1045                1050                1055    
Leu Cys Thr Tyr Phe Leu Asn Thr Asp Leu Thr Arg Thr Gly Pro Pro 
            1060                1065                1070        
Ser Ala Thr Leu Gly 
        1075        

<210> 144
<211> 911
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..911
<223> /mol_type="protein"
      /note="ADCY5 (full length protein of ADCY5 (Gene ID: 111))"
      /organism="Homo sapiens"

<400> 144
Met Lys Ser Gln Lys Glu Gly Cys Cys Ser Arg Gly Asp Leu Ser Ile 
1               5                   10                   15    
Gln Thr Gly Pro Gly Gly Glu Trp Ala Pro Arg Arg Leu Val Ser Asn 
            20                   25                  30        
Val Leu Ile Phe Ser Cys Thr Asn Ile Val Gly Val Cys Thr His Tyr 
        35                   40                  45            
Pro Ala Glu Val Ser Gln Arg Gln Ala Phe Gln Glu Thr Arg Glu Cys 
    50                   55                  60                
Ile Gln Ala Arg Leu His Ser Gln Arg Glu Asn Gln Gln Gln Glu Arg 
65                   70                  75                  80
Leu Leu Leu Ser Val Leu Pro Arg His Val Ala Met Glu Met Lys Ala 
                85                   90                  95    
Asp Ile Asn Ala Lys Gln Glu Asp Met Met Phe His Lys Ile Tyr Ile 
            100                  105                110        
Gln Lys His Asp Asn Val Ser Ile Leu Phe Ala Asp Ile Glu Gly Phe 
        115                  120                125            
Thr Ser Leu Ala Ser Gln Cys Thr Ala Gln Glu Leu Val Met Thr Leu 
    130                  135                140                
Asn Glu Leu Phe Ala Arg Phe Asp Lys Leu Ala Ala Glu Asn His Cys 
145                  150                155                  160
Leu Arg Ile Lys Ile Leu Gly Asp Cys Tyr Tyr Cys Val Ser Gly Leu 
                165                  170                175    
Pro Glu Ala Arg Ala Asp His Ala His Cys Cys Val Glu Met Gly Met 
            180                  185                190        
Asp Met Ile Glu Ala Ile Ser Leu Val Arg Glu Val Thr Gly Val Asn 
        195                  200                205            
Val Asn Met Arg Val Gly Ile His Ser Gly Arg Val His Cys Gly Val 
    210                  215                220                
Leu Gly Leu Arg Lys Trp Gln Phe Asp Val Trp Ser Asn Asp Val Thr 
225                  230                235                  240
Leu Ala Asn His Met Glu Ala Gly Gly Lys Ala Gly Arg Ile His Ile 
                245                  250                255    
Thr Lys Ala Thr Leu Asn Tyr Leu Asn Gly Asp Tyr Glu Val Glu Pro 
            260                  265                270        
Gly Cys Gly Gly Glu Arg Asn Ala Tyr Leu Lys Glu His Ser Ile Glu 
        275                  280                285            
Thr Phe Leu Ile Leu Arg Cys Thr Gln Lys Arg Lys Glu Glu Lys Ala 
    290                  295                300                
Met Ile Ala Lys Met Asn Arg Gln Arg Thr Asn Ser Ile Gly His Asn 
305                  310                315                  320
Pro Pro His Trp Gly Ala Glu Arg Pro Phe Tyr Asn His Leu Gly Gly 
                325                  330                335    
Asn Gln Val Ser Lys Glu Met Lys Arg Met Gly Phe Glu Asp Pro Lys 
            340                  345                350        
Asp Lys Asn Ala Gln Glu Ser Ala Asn Pro Glu Asp Glu Val Asp Glu 
        355                  360                365            
Phe Leu Gly Arg Ala Ile Asp Ala Arg Ser Ile Asp Arg Leu Arg Ser 
    370                  375                380                
Glu His Val Arg Lys Phe Leu Leu Thr Phe Arg Glu Pro Asp Leu Glu 
385                  390                395                  400
Lys Lys Tyr Ser Lys Gln Val Asp Asp Arg Phe Gly Ala Tyr Val Ala 
                405                  410                415    
Cys Ala Ser Leu Val Phe Leu Phe Ile Cys Phe Val Gln Ile Thr Ile 
            420                  425                430        
Val Pro His Ser Ile Phe Met Leu Ser Phe Tyr Leu Thr Cys Ser Leu 
        435                  440                445            
Leu Leu Thr Leu Val Val Phe Val Ser Val Ile Tyr Ser Cys Val Lys 
    450                  455                460                
Leu Phe Pro Ser Pro Leu Gln Thr Leu Ser Arg Lys Ile Val Arg Ser 
465                  470                475                  480
Lys Met Asn Ser Thr Leu Val Gly Val Phe Thr Ile Thr Leu Val Phe 
                485                  490                495    
Leu Ala Ala Phe Val Asn Met Phe Thr Cys Asn Ser Arg Asp Leu Leu 
            500                  505                510        
Gly Cys Leu Ala Gln Glu His Asn Ile Ser Ala Ser Gln Val Asn Ala 
        515                  520                525            
Cys His Val Ala Glu Ser Ala Val Asn Tyr Ser Leu Gly Asp Glu Gln 
    530                  535                540                
Gly Phe Cys Gly Ser Pro Trp Pro Asn Cys Asn Phe Pro Glu Tyr Phe 
545                  550                555                  560
Thr Tyr Ser Val Leu Leu Ser Leu Leu Ala Cys Ser Val Phe Leu Gln 
                565                  570                575    
Ile Ser Cys Ile Gly Lys Leu Val Leu Met Leu Ala Ile Glu Leu Ile 
            580                  585                590        
Tyr Val Leu Ile Val Glu Val Pro Gly Val Thr Leu Phe Asp Asn Ala 
        595                  600                605            
Asp Leu Leu Val Thr Ala Asn Ala Ile Asp Phe Phe Asn Asn Gly Thr 
    610                  615                620                
Ser Gln Cys Pro Glu His Ala Thr Lys Val Ala Leu Lys Val Val Thr 
625                  630                635                  640
Pro Ile Ile Ile Ser Val Phe Val Leu Ala Leu Tyr Leu His Ala Gln 
                645                  650                655    
Gln Val Glu Ser Thr Ala Arg Leu Asp Phe Leu Trp Lys Leu Gln Ala 
            660                  665                670        
Thr Glu Glu Lys Glu Glu Met Glu Glu Leu Gln Ala Tyr Asn Arg Arg 
        675                  680                685            
Leu Leu His Asn Ile Leu Pro Lys Asp Val Ala Ala His Phe Leu Ala 
    690                  695                700                
Arg Glu Arg Arg Asn Asp Glu Leu Tyr Tyr Gln Ser Cys Glu Cys Val 
705                  710                715                  720
Ala Val Met Phe Ala Ser Ile Ala Asn Phe Ser Glu Phe Tyr Val Glu 
                725                  730                735    
Leu Glu Ala Asn Asn Glu Gly Val Glu Cys Leu Arg Leu Leu Asn Glu 
            740                  745                750        
Ile Ile Ala Asp Phe Asp Glu Ile Ile Ser Glu Asp Arg Phe Arg Gln 
        755                  760                765            
Leu Glu Lys Ile Lys Thr Ile Gly Ser Thr Tyr Met Ala Ala Ser Gly 
    770                  775                780                
Leu Asn Asp Ser Thr Tyr Asp Lys Val Gly Lys Thr His Ile Lys Ala 
785                  790                795                  800
Leu Ala Asp Phe Ala Met Lys Leu Met Asp Gln Met Lys Tyr Ile Asn 
                805                  810                815    
Glu His Ser Phe Asn Asn Phe Gln Met Lys Ile Gly Leu Asn Ile Gly 
            820                  825                830        
Pro Val Val Ala Gly Val Ile Gly Ala Arg Lys Pro Gln Tyr Asp Ile 
        835                  840                845            
Trp Gly Asn Thr Val Asn Val Ala Ser Arg Met Asp Ser Thr Gly Val 
    850                  855                860                
Pro Asp Arg Ile Gln Val Thr Thr Asp Met Tyr Gln Val Leu Ala Ala 
865                  870                875                  880
Asn Thr Tyr Gln Leu Glu Cys Arg Gly Val Val Lys Val Lys Gly Lys 
                885                  890                895    
Gly Glu Met Met Thr Tyr Phe Leu Asn Gly Gly Pro Pro Leu Ser 
            900                  905                910    

<210> 145
<211> 1168
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1168
<223> /mol_type="protein"
      /note="ADCY6 (full length protein of ADCY6 (Gene ID: 112))"
      /organism="Homo sapiens"

<400> 145
Met Ser Trp Phe Ser Gly Leu Leu Val Pro Lys Val Asp Glu Arg Lys 
1               5                   10                   15    
Thr Ala Trp Gly Glu Arg Asn Gly Gln Lys Arg Ser Arg Arg Arg Gly 
            20                   25                  30        
Thr Arg Ala Gly Gly Phe Cys Thr Pro Arg Tyr Met Ser Cys Leu Arg 
        35                   40                  45            
Asp Ala Glu Pro Pro Ser Pro Thr Pro Ala Gly Pro Pro Arg Cys Pro 
    50                   55                  60                
Trp Gln Asp Asp Ala Phe Ile Arg Arg Gly Gly Pro Gly Lys Gly Lys 
65                   70                  75                  80
Glu Leu Gly Leu Arg Ala Val Ala Leu Gly Phe Glu Asp Thr Glu Val 
                85                   90                  95    
Thr Thr Thr Ala Gly Gly Thr Ala Glu Val Ala Pro Asp Ala Val Pro 
            100                  105                110        
Arg Ser Gly Arg Ser Cys Trp Arg Arg Leu Val Gln Val Phe Gln Ser 
        115                  120                125            
Lys Gln Phe Arg Ser Ala Lys Leu Glu Arg Leu Tyr Gln Arg Tyr Phe 
    130                  135                140                
Phe Gln Met Asn Gln Ser Ser Leu Thr Leu Leu Met Ala Val Leu Val 
145                  150                155                  160
Leu Leu Thr Ala Val Leu Leu Ala Phe His Ala Ala Pro Ala Arg Pro 
                165                  170                175    
Gln Pro Ala Tyr Val Ala Leu Leu Ala Cys Ala Ala Ala Leu Phe Val 
            180                  185                190        
Gly Leu Met Val Val Cys Asn Arg His Ser Phe Arg Gln Asp Ser Met 
        195                  200                205            
Trp Val Val Ser Tyr Val Val Leu Gly Ile Leu Ala Ala Val Gln Val 
    210                  215                220                
Gly Gly Ala Leu Ala Ala Asp Pro Arg Ser Pro Ser Ala Gly Leu Trp 
225                  230                235                  240
Cys Pro Val Phe Phe Val Tyr Ile Ala Tyr Thr Leu Leu Pro Ile Arg 
                245                  250                255    
Met Arg Ala Ala Val Leu Ser Gly Leu Gly Leu Ser Thr Leu His Leu 
            260                  265                270        
Ile Leu Ala Trp Gln Leu Asn Arg Gly Asp Ala Phe Leu Trp Lys Gln 
        275                  280                285            
Leu Gly Ala Asn Val Leu Leu Phe Leu Cys Thr Asn Val Ile Gly Ile 
    290                  295                300                
Cys Thr His Tyr Pro Ala Glu Val Ser Gln Arg Gln Ala Phe Gln Glu 
305                  310                315                  320
Thr Arg Gly Tyr Ile Gln Ala Arg Leu His Leu Gln His Glu Asn Arg 
                325                  330                335    
Gln Gln Glu Arg Leu Leu Leu Ser Val Leu Pro Gln His Val Ala Met 
            340                  345                350        
Glu Met Lys Glu Asp Ile Asn Thr Lys Lys Glu Asp Met Met Phe His 
        355                  360                365            
Lys Ile Tyr Ile Gln Lys His Asp Asn Val Ser Ile Leu Phe Ala Asp 
    370                  375                380                
Ile Glu Gly Phe Thr Ser Leu Ala Ser Gln Cys Thr Ala Gln Glu Leu 
385                  390                395                  400
Val Met Thr Leu Asn Glu Leu Phe Ala Arg Phe Asp Lys Leu Ala Ala 
                405                  410                415    
Glu Asn His Cys Leu Arg Ile Lys Ile Leu Gly Asp Cys Tyr Tyr Cys 
            420                  425                430        
Val Ser Gly Leu Pro Glu Ala Arg Ala Asp His Ala His Cys Cys Val 
        435                  440                445            
Glu Met Gly Val Asp Met Ile Glu Ala Ile Ser Leu Val Arg Glu Val 
    450                  455                460                
Thr Gly Val Asn Val Asn Met Arg Val Gly Ile His Ser Gly Arg Val 
465                  470                475                  480
His Cys Gly Val Leu Gly Leu Arg Lys Trp Gln Phe Asp Val Trp Ser 
                485                  490                495    
Asn Asp Val Thr Leu Ala Asn His Met Glu Ala Gly Gly Arg Ala Gly 
            500                  505                510        
Arg Ile His Ile Thr Arg Ala Thr Leu Gln Tyr Leu Asn Gly Asp Tyr 
        515                  520                525            
Glu Val Glu Pro Gly Arg Gly Gly Glu Arg Asn Ala Tyr Leu Lys Glu 
    530                  535                540                
Gln His Ile Glu Thr Phe Leu Ile Leu Gly Ala Ser Gln Lys Arg Lys 
545                  550                555                  560
Glu Glu Lys Ala Met Leu Ala Lys Leu Gln Arg Thr Arg Ala Asn Ser 
                565                  570                575    
Met Glu Gly Leu Met Pro Arg Trp Val Pro Asp Arg Ala Phe Ser Arg 
            580                  585                590        
Thr Lys Asp Ser Lys Ala Phe Arg Gln Met Gly Ile Asp Asp Ser Ser 
        595                  600                605            
Lys Asp Asn Arg Gly Thr Gln Asp Ala Leu Asn Pro Glu Asp Glu Val 
    610                  615                620                
Asp Glu Phe Leu Ser Arg Ala Ile Asp Ala Arg Ser Ile Asp Gln Leu 
625                  630                635                  640
Arg Lys Asp His Val Arg Arg Phe Leu Leu Thr Phe Gln Arg Glu Asp 
                645                  650                655    
Leu Glu Lys Lys Tyr Ser Arg Lys Val Asp Pro Arg Phe Gly Ala Tyr 
            660                  665                670        
Val Ala Cys Ala Leu Leu Val Phe Cys Phe Ile Cys Phe Ile Gln Leu 
        675                  680                685            
Leu Ile Phe Pro His Ser Thr Leu Met Leu Gly Ile Tyr Ala Ser Ile 
    690                  695                700                
Phe Leu Leu Leu Leu Ile Thr Val Leu Ile Cys Ala Val Tyr Ser Cys 
705                  710                715                  720
Gly Ser Leu Phe Pro Lys Ala Leu Gln Arg Leu Ser Arg Ser Ile Val 
                725                  730                735    
Arg Ser Arg Ala His Ser Thr Ala Val Gly Ile Phe Ser Val Leu Leu 
            740                  745                750        
Val Phe Thr Ser Ala Ile Ala Asn Met Phe Thr Cys Asn His Thr Pro 
        755                  760                765            
Ile Arg Ser Cys Ala Ala Arg Met Leu Asn Leu Thr Pro Ala Asp Ile 
    770                  775                780                
Thr Ala Cys His Leu Gln Gln Leu Asn Tyr Ser Leu Gly Leu Asp Ala 
785                  790                795                  800
Pro Leu Cys Glu Gly Thr Met Pro Thr Cys Ser Phe Pro Glu Tyr Phe 
                805                  810                815    
Ile Gly Asn Met Leu Leu Ser Leu Leu Ala Ser Ser Val Phe Leu His 
            820                  825                830        
Ile Ser Ser Ile Gly Lys Leu Ala Met Ile Phe Val Leu Gly Leu Ile 
        835                  840                845            
Tyr Leu Val Leu Leu Leu Leu Gly Pro Pro Ala Thr Ile Phe Asp Asn 
    850                  855                860                
Tyr Asp Leu Leu Leu Gly Val His Gly Leu Ala Ser Ser Asn Glu Thr 
865                  870                875                  880
Phe Asp Gly Leu Asp Cys Pro Ala Ala Gly Arg Val Ala Leu Lys Tyr 
                885                  890                895    
Met Thr Pro Val Ile Leu Leu Val Phe Ala Leu Ala Leu Tyr Leu His 
            900                  905                910        
Ala Gln Gln Val Glu Ser Thr Ala Arg Leu Asp Phe Leu Trp Lys Leu 
        915                  920                925            
Gln Ala Thr Gly Glu Lys Glu Glu Met Glu Glu Leu Gln Ala Tyr Asn 
    930                  935                940                
Arg Arg Leu Leu His Asn Ile Leu Pro Lys Asp Val Ala Ala His Phe 
945                  950                955                  960
Leu Ala Arg Glu Arg Arg Asn Asp Glu Leu Tyr Tyr Gln Ser Cys Glu 
                965                  970                975    
Cys Val Ala Val Met Phe Ala Ser Ile Ala Asn Phe Ser Glu Phe Tyr 
            980                  985                990        
Val Glu Leu Glu Ala Asn Asn Glu Gly Val Glu Cys Leu Arg Leu Leu 
        995                  1000                1005            
Asn Glu Ile Ile Ala Asp Phe Asp Glu Ile Ile Ser Glu Glu Arg Phe 
    1010                1015                1020                
Arg Gln Leu Glu Lys Ile Lys Thr Ile Gly Ser Thr Tyr Met Ala Ala 
1025                1030                1035                1040
Ser Gly Leu Asn Ala Ser Thr Tyr Asp Gln Val Gly Arg Ser His Ile 
                1045                1050                1055    
Thr Ala Leu Ala Asp Tyr Ala Met Arg Leu Met Glu Gln Met Lys His 
            1060                1065                1070        
Ile Asn Glu His Ser Phe Asn Asn Phe Gln Met Lys Ile Gly Leu Asn 
        1075                1080                1085            
Met Gly Pro Val Val Ala Gly Val Ile Gly Ala Arg Lys Pro Gln Tyr 
    1090                1095                1100                
Asp Ile Trp Gly Asn Thr Val Asn Val Ser Ser Arg Met Asp Ser Thr 
1105                1110                1115                1120
Gly Val Pro Asp Arg Ile Gln Val Thr Thr Asp Leu Tyr Gln Val Leu 
                1125                1130                1135    
Ala Ala Lys Gly Tyr Gln Leu Glu Cys Arg Gly Val Val Lys Val Lys 
            1140                1145                1150        
Gly Lys Gly Glu Met Thr Thr Tyr Phe Leu Asn Gly Gly Pro Ser Ser 
        1155                1160                1165            

<210> 146
<211> 1080
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1080
<223> /mol_type="protein"
      /note="ADCY7 (full length protein of ADCY7 (Gene ID: 113))"
      /organism="Homo sapiens"

<400> 146
Met Pro Ala Lys Gly Arg Tyr Phe Leu Asn Glu Gly Glu Glu Gly Pro 
1               5                   10                   15    
Asp Gln Asp Ala Leu Tyr Glu Lys Tyr Gln Leu Thr Ser Gln His Gly 
            20                   25                  30        
Pro Leu Leu Leu Thr Leu Leu Leu Val Ala Ala Thr Ala Cys Val Ala 
        35                   40                  45            
Leu Ile Ile Ile Ala Phe Ser Gln Gly Asp Pro Ser Arg His Gln Ala 
    50                   55                  60                
Ile Leu Gly Met Ala Phe Leu Val Leu Ala Val Phe Ala Ala Leu Ser 
65                   70                  75                  80
Val Leu Met Tyr Val Glu Cys Leu Leu Arg Arg Trp Leu Arg Ala Leu 
                85                   90                  95    
Ala Leu Leu Thr Trp Ala Cys Leu Val Ala Leu Gly Tyr Val Leu Val 
            100                  105                110        
Phe Asp Ala Trp Thr Lys Ala Ala Cys Ala Trp Glu Gln Val Pro Phe 
        115                  120                125            
Phe Leu Phe Ile Val Phe Val Val Tyr Thr Leu Leu Pro Phe Ser Met 
    130                  135                140                
Arg Gly Ala Val Ala Val Gly Ala Val Ser Thr Ala Ser His Leu Leu 
145                  150                155                  160
Val Leu Gly Ser Leu Met Gly Gly Phe Thr Thr Pro Ser Val Arg Val 
                165                  170                175    
Gly Leu Gln Leu Leu Ala Asn Ala Val Ile Phe Leu Cys Gly Asn Leu 
            180                  185                190        
Thr Gly Ala Phe His Lys His Gln Met Gln Asp Ala Ser Arg Asp Leu 
        195                  200                205            
Phe Thr Tyr Thr Val Lys Cys Ile Gln Ile Arg Arg Lys Leu Arg Ile 
    210                  215                220                
Glu Lys Arg Gln Gln Glu Asn Leu Leu Leu Ser Val Leu Pro Ala His 
225                  230                235                  240
Ile Ser Met Gly Met Lys Leu Ala Ile Ile Glu Arg Leu Lys Glu His 
                245                  250                255    
Gly Asp Arg Arg Cys Met Pro Asp Asn Asn Phe His Ser Leu Tyr Val 
            260                  265                270        
Lys Arg His Gln Asn Val Ser Ile Leu Tyr Ala Asp Ile Val Gly Phe 
        275                  280                285            
Thr Gln Leu Ala Ser Asp Cys Ser Pro Lys Glu Leu Val Val Val Leu 
    290                  295                300                
Asn Glu Leu Phe Gly Lys Phe Asp Gln Ile Ala Lys Ala Asn Glu Cys 
305                  310                315                  320
Met Arg Ile Lys Ile Leu Gly Asp Cys Tyr Tyr Cys Val Ser Gly Leu 
                325                  330                335    
Pro Val Ser Leu Pro Thr His Ala Arg Asn Cys Val Lys Met Gly Leu 
            340                  345                350        
Asp Met Cys Gln Ala Ile Lys Gln Val Arg Glu Ala Thr Gly Val Asp 
        355                  360                365            
Ile Asn Met Arg Val Gly Ile His Ser Gly Asn Val Leu Cys Gly Val 
    370                  375                380                
Ile Gly Leu Arg Lys Trp Gln Tyr Asp Val Trp Ser His Asp Val Ser 
385                  390                395                  400
Leu Ala Asn Arg Met Glu Ala Ala Gly Val Pro Gly Arg Val His Ile 
                405                  410                415    
Thr Glu Ala Thr Leu Lys His Leu Asp Lys Ala Tyr Glu Val Glu Asp 
            420                  425                430        
Gly His Gly Gln Gln Arg Asp Pro Tyr Leu Lys Glu Met Asn Ile Arg 
        435                  440                445            
Thr Tyr Leu Val Ile Asp Pro Arg Ser Gln Gln Pro Pro Pro Pro Ser 
    450                  455                460                
Gln His Leu Pro Arg Pro Lys Gly Asp Ala Ala Leu Lys Met Arg Ala 
465                  470                475                  480
Ser Val Arg Met Thr Arg Tyr Leu Glu Ser Trp Gly Ala Ala Arg Pro 
                485                  490                495    
Phe Ala His Leu Asn His Arg Glu Ser Val Ser Ser Gly Glu Thr His 
            500                  505                510        
Val Pro Asn Gly Arg Arg Pro Lys Ser Val Pro Gln Arg His Arg Arg 
        515                  520                525            
Thr Pro Asp Arg Ser Met Ser Pro Lys Gly Arg Ser Glu Asp Asp Ser 
    530                  535                540                
Tyr Asp Asp Glu Met Leu Ser Ala Ile Glu Gly Leu Ser Ser Thr Arg 
545                  550                555                  560
Pro Cys Cys Ser Lys Ser Asp Asp Phe Tyr Thr Phe Gly Ser Ile Phe 
                565                  570                575    
Leu Glu Lys Gly Phe Glu Arg Glu Tyr Arg Leu Ala Pro Ile Pro Arg 
            580                  585                590        
Ala Arg His Asp Phe Ala Cys Ala Ser Leu Ile Phe Val Cys Ile Leu 
        595                  600                605            
Leu Val His Val Leu Leu Met Pro Arg Thr Ala Ala Leu Gly Val Ser 
    610                  615                620                
Phe Gly Leu Val Ala Cys Val Leu Gly Leu Val Leu Gly Leu Cys Phe 
625                  630                635                  640
Ala Thr Lys Phe Ser Arg Cys Cys Pro Ala Arg Gly Thr Leu Cys Thr 
                645                  650                655    
Ile Ser Glu Arg Val Glu Thr Gln Pro Leu Leu Arg Leu Thr Leu Ala 
            660                  665                670        
Val Leu Thr Ile Gly Ser Leu Leu Thr Val Ala Ile Ile Asn Leu Pro 
        675                  680                685            
Leu Met Pro Phe Gln Val Pro Glu Leu Pro Val Gly Asn Glu Thr Gly 
    690                  695                700                
Leu Leu Ala Ala Ser Ser Lys Thr Arg Ala Leu Cys Glu Pro Leu Pro 
705                  710                715                  720
Tyr Tyr Thr Cys Ser Cys Val Leu Gly Phe Ile Ala Cys Ser Val Phe 
                725                  730                735    
Leu Arg Met Ser Leu Glu Pro Lys Val Val Leu Leu Thr Val Ala Leu 
            740                  745                750        
Val Ala Tyr Leu Val Leu Phe Asn Leu Ser Pro Cys Trp Gln Trp Asp 
        755                  760                765            
Cys Cys Gly Gln Gly Leu Gly Asn Leu Thr Lys Pro Asn Gly Thr Thr 
    770                  775                780                
Ser Gly Thr Pro Ser Cys Ser Trp Lys Asp Leu Lys Thr Met Thr Asn 
785                  790                795                  800
Phe Tyr Leu Val Leu Phe Tyr Ile Thr Leu Leu Thr Leu Ser Arg Gln 
                805                  810                815    
Ile Asp Tyr Tyr Cys Arg Leu Asp Cys Leu Trp Lys Lys Lys Phe Lys 
            820                  825                830        
Lys Glu His Glu Glu Phe Glu Thr Met Glu Asn Val Asn Arg Leu Leu 
        835                  840                845            
Leu Glu Asn Val Leu Pro Ala His Val Ala Ala His Phe Ile Gly Asp 
    850                  855                860                
Lys Leu Asn Glu Asp Trp Tyr His Gln Ser Tyr Asp Cys Val Cys Val 
865                  870                875                  880
Met Phe Ala Ser Val Pro Asp Phe Lys Val Phe Tyr Thr Glu Cys Asp 
                885                  890                895    
Val Asn Lys Glu Gly Leu Glu Cys Leu Arg Leu Leu Asn Glu Ile Ile 
            900                  905                910        
Ala Asp Phe Asp Glu Leu Leu Leu Lys Pro Lys Phe Ser Gly Val Glu 
        915                  920                925            
Lys Ile Lys Thr Ile Gly Ser Thr Tyr Met Ala Ala Ala Gly Leu Ser 
    930                  935                940                
Val Ala Ser Gly His Glu Asn Gln Glu Leu Glu Arg Gln His Ala His 
945                  950                955                  960
Ile Gly Val Met Val Glu Phe Ser Ile Ala Leu Met Ser Lys Leu Asp 
                965                  970                975    
Gly Ile Asn Arg His Ser Phe Asn Ser Phe Arg Leu Arg Val Gly Ile 
            980                  985                990        
Asn His Gly Pro Val Ile Ala Gly Val Ile Gly Ala Arg Lys Pro Gln 
        995                  1000                1005            
Tyr Asp Ile Trp Gly Asn Thr Val Asn Val Ala Ser Arg Met Glu Ser 
    1010                1015                1020                
Thr Gly Glu Leu Gly Lys Ile Gln Val Thr Glu Glu Thr Cys Thr Ile 
1025                1030                1035                1040
Leu Gln Gly Leu Gly Tyr Ser Cys Glu Cys Arg Gly Leu Ile Asn Val 
                1045                1050                1055    
Lys Gly Lys Gly Glu Leu Arg Thr Tyr Phe Val Cys Thr Asp Thr Ala 
            1060                1065                1070        
Lys Phe Gln Gly Leu Gly Leu Asn 
        1075                1080

<210> 147
<211> 1251
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1251
<223> /mol_type="protein"
      /note="ADCY8 (full length protein of ADCY8 (Gene ID: 114))"
      /organism="Homo sapiens"

<400> 147
Met Glu Leu Ser Asp Val Arg Cys Leu Thr Gly Ser Glu Glu Leu Tyr 
1               5                   10                   15    
Thr Ile His Pro Thr Pro Pro Ala Gly Asp Gly Arg Ser Ala Ser Arg 
            20                   25                  30        
Pro Gln Arg Leu Leu Trp Gln Thr Ala Val Arg His Ile Thr Glu Gln 
        35                   40                  45            
Arg Phe Ile His Gly His Arg Gly Gly Ser Gly Ser Gly Ser Gly Gly 
    50                   55                  60                
Ser Gly Lys Ala Ser Asp Pro Ala Gly Gly Gly Pro Asn His His Ala 
65                   70                  75                  80
Pro Gln Leu Ser Gly Asp Ser Ala Leu Pro Leu Tyr Ser Leu Gly Pro 
                85                   90                  95    
Gly Glu Arg Ala His Ser Thr Cys Gly Thr Lys Val Phe Pro Glu Arg 
            100                  105                110        
Ser Gly Ser Gly Ser Ala Ser Gly Ser Gly Gly Gly Gly Asp Leu Gly 
        115                  120                125            
Phe Leu His Leu Asp Cys Ala Pro Ser Asn Ser Asp Phe Phe Leu Asn 
    130                  135                140                
Gly Gly Tyr Ser Tyr Arg Gly Val Ile Phe Pro Thr Leu Arg Asn Ser 
145                  150                155                  160
Phe Lys Ser Arg Asp Leu Glu Arg Leu Tyr Gln Arg Tyr Phe Leu Gly 
                165                  170                175    
Gln Arg Arg Lys Ser Glu Val Val Met Asn Val Leu Asp Val Leu Thr 
            180                  185                190        
Lys Leu Thr Leu Leu Val Leu His Leu Ser Leu Ala Ser Ala Pro Met 
        195                  200                205            
Asp Pro Leu Lys Gly Ile Leu Leu Gly Phe Phe Thr Gly Ile Glu Val 
    210                  215                220                
Val Ile Cys Ala Leu Val Val Val Arg Lys Asp Thr Thr Ser His Thr 
225                  230                235                  240
Tyr Leu Gln Tyr Ser Gly Val Val Thr Trp Val Ala Met Thr Thr Gln 
                245                  250                255    
Ile Leu Ala Ala Gly Leu Gly Tyr Gly Leu Leu Gly Asp Gly Ile Gly 
            260                  265                270        
Tyr Val Leu Phe Thr Leu Phe Ala Thr Tyr Ser Met Leu Pro Leu Pro 
        275                  280                285            
Leu Thr Trp Ala Ile Leu Ala Gly Leu Gly Thr Ser Leu Leu Gln Val 
    290                  295                300                
Ile Leu Gln Val Val Ile Pro Arg Leu Ala Val Ile Ser Ile Asn Gln 
305                  310                315                  320
Val Val Ala Gln Ala Val Leu Phe Met Cys Met Asn Thr Ala Gly Ile 
                325                  330                335    
Phe Ile Ser Tyr Leu Ser Asp Arg Ala Gln Arg Gln Ala Phe Leu Glu 
            340                  345                350        
Thr Arg Arg Cys Val Glu Ala Arg Leu Arg Leu Glu Thr Glu Asn Gln 
        355                  360                365            
Arg Gln Glu Arg Leu Val Leu Ser Val Leu Pro Arg Phe Val Val Leu 
    370                  375                380                
Glu Met Ile Asn Asp Met Thr Asn Val Glu Asp Glu His Leu Gln His 
385                  390                395                  400
Gln Phe His Arg Ile Tyr Ile His Arg Tyr Glu Asn Val Ser Ile Leu 
                405                  410                415    
Phe Ala Asp Val Lys Gly Phe Thr Asn Leu Ser Thr Thr Leu Ser Ala 
            420                  425                430        
Gln Glu Leu Val Arg Met Leu Asn Glu Leu Phe Ala Arg Phe Asp Arg 
        435                  440                445            
Leu Ala His Glu His His Cys Leu Arg Ile Lys Ile Leu Gly Asp Cys 
    450                  455                460                
Tyr Tyr Cys Val Ser Gly Leu Pro Glu Pro Arg Gln Asp His Ala His 
465                  470                475                  480
Cys Cys Val Glu Met Gly Leu Ser Met Ile Lys Thr Ile Arg Tyr Val 
                485                  490                495    
Arg Ser Arg Thr Lys His Asp Val Asp Met Arg Ile Gly Ile His Ser 
            500                  505                510        
Gly Ser Val Leu Cys Gly Val Leu Gly Leu Arg Lys Trp Gln Phe Asp 
        515                  520                525            
Val Trp Ser Trp Asp Val Asp Ile Ala Asn Lys Leu Glu Ser Gly Gly 
    530                  535                540                
Ile Pro Gly Arg Ile His Ile Ser Lys Ala Thr Leu Asp Cys Leu Asn 
545                  550                555                  560
Gly Asp Tyr Asn Val Glu Glu Gly His Gly Lys Glu Arg Asn Glu Phe 
                565                  570                575    
Leu Arg Lys His Asn Ile Glu Thr Tyr Leu Ile Lys Gln Pro Glu Asp 
            580                  585                590        
Ser Leu Leu Ser Leu Pro Glu Asp Ile Val Lys Glu Ser Val Ser Ser 
        595                  600                605            
Ser Asp Arg Arg Asn Ser Gly Ala Thr Phe Thr Glu Gly Ser Trp Ser 
    610                  615                620                
Pro Glu Leu Pro Phe Asp Asn Ile Val Gly Lys Gln Asn Thr Leu Ala 
625                  630                635                  640
Ala Leu Thr Arg Asn Ser Ile Asn Leu Leu Pro Asn His Leu Ala Gln 
                645                  650                655    
Ala Leu His Val Gln Ser Gly Pro Glu Glu Ile Asn Lys Arg Ile Glu 
            660                  665                670        
His Thr Ile Asp Leu Arg Ser Gly Asp Lys Leu Arg Arg Glu His Ile 
        675                  680                685            
Lys Pro Phe Ser Leu Met Phe Lys Asp Ser Ser Leu Glu His Lys Tyr 
    690                  695                700                
Ser Gln Met Arg Asp Glu Val Phe Lys Ser Asn Leu Val Cys Ala Phe 
705                  710                715                  720
Ile Val Leu Leu Phe Ile Thr Ala Ile Gln Ser Leu Leu Pro Ser Ser 
                725                  730                735    
Arg Val Met Pro Met Thr Ile Gln Phe Ser Ile Leu Ile Met Leu His 
            740                  745                750        
Ser Ala Leu Val Leu Ile Thr Thr Ala Glu Asp Tyr Lys Cys Leu Pro 
        755                  760                765            
Leu Ile Leu Arg Lys Thr Cys Cys Trp Ile Asn Glu Thr Tyr Leu Ala 
    770                  775                780                
Arg Asn Val Ile Ile Phe Ala Ser Ile Leu Ile Asn Phe Leu Gly Ala 
785                  790                795                  800
Ile Leu Asn Ile Leu Trp Cys Asp Phe Asp Lys Ser Ile Pro Leu Lys 
                805                  810                815    
Asn Leu Thr Phe Asn Ser Ser Ala Val Phe Thr Asp Ile Cys Ser Tyr 
            820                  825                830        
Pro Glu Tyr Phe Val Phe Thr Gly Val Leu Ala Met Val Thr Cys Ala 
        835                  840                845            
Val Phe Leu Arg Leu Asn Ser Val Leu Lys Leu Ala Val Leu Leu Ile 
    850                  855                860                
Met Ile Ala Ile Tyr Ala Leu Leu Thr Glu Thr Val Tyr Ala Gly Leu 
865                  870                875                  880
Phe Leu Arg Tyr Asp Asn Leu Asn His Ser Gly Glu Asp Phe Leu Gly 
                885                  890                895    
Thr Lys Glu Val Ser Leu Leu Leu Met Ala Met Phe Leu Leu Ala Val 
            900                  905                910        
Phe Tyr His Gly Gln Gln Leu Glu Tyr Thr Ala Arg Leu Asp Phe Leu 
        915                  920                925            
Trp Arg Val Gln Ala Lys Glu Glu Ile Asn Glu Met Lys Glu Leu Arg 
    930                  935                940                
Glu His Asn Glu Asn Met Leu Arg Asn Ile Leu Pro Ser His Val Ala 
945                  950                955                  960
Arg His Phe Leu Glu Lys Asp Arg Asp Asn Glu Glu Leu Tyr Ser Gln 
                965                  970                975    
Ser Tyr Asp Ala Val Gly Val Met Phe Ala Ser Ile Pro Gly Phe Ala 
            980                  985                990        
Asp Phe Tyr Ser Gln Thr Glu Met Asn Asn Gln Gly Val Glu Cys Leu 
        995                  1000                1005            
Arg Leu Leu Asn Glu Ile Ile Ala Asp Phe Asp Glu Leu Leu Gly Glu 
    1010                1015                1020                
Asp Arg Phe Gln Asp Ile Glu Lys Ile Lys Thr Ile Gly Ser Thr Tyr 
1025                1030                1035                1040
Met Ala Val Ser Gly Leu Ser Pro Glu Lys Gln Gln Cys Glu Asp Lys 
                1045                1050                1055    
Trp Gly His Leu Cys Ala Leu Ala Asp Phe Ser Leu Ala Leu Thr Glu 
            1060                1065                1070        
Ser Ile Gln Glu Ile Asn Lys His Ser Phe Asn Asn Phe Glu Leu Arg 
        1075                1080                1085            
Ile Gly Ile Ser His Gly Ser Val Val Ala Gly Val Ile Gly Ala Lys 
    1090                1095                1100                
Lys Pro Gln Tyr Asp Ile Trp Gly Lys Thr Val Asn Leu Ala Ser Arg 
1105                1110                1115                1120
Met Asp Ser Thr Gly Val Ser Gly Arg Ile Gln Val Pro Glu Glu Thr 
                1125                1130                1135    
Tyr Leu Ile Leu Lys Asp Gln Gly Phe Ala Phe Asp Tyr Arg Gly Glu 
            1140                1145                1150        
Ile Tyr Val Lys Gly Ile Ser Glu Gln Glu Gly Lys Ile Lys Thr Tyr 
        1155                1160                1165            
Phe Leu Leu Gly Arg Val Gln Pro Asn Pro Phe Ile Leu Pro Pro Arg 
    1170                1175                1180                
Arg Leu Pro Gly Gln Tyr Ser Leu Ala Ala Val Val Leu Gly Leu Val 
1185                1190                1195                1200
Gln Ser Leu Asn Arg Gln Arg Gln Lys Gln Leu Leu Asn Glu Asn Asn 
                1205                1210                1215    
Asn Thr Gly Ile Ile Lys Gly His Tyr Asn Arg Arg Thr Leu Leu Ser 
            1220                1225                1230        
Pro Ser Gly Thr Glu Pro Gly Ala Gln Ala Glu Gly Thr Asp Lys Ser 
        1235                1240                1245            
Asp Leu Pro 
    1250    

<210> 148
<211> 1353
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1353
<223> /mol_type="protein"
      /note="ADCY9 (full length protein of ADCY9 (Gene ID: 115))"
      /organism="Homo sapiens"

<400> 148
Met Ala Ser Pro Pro His Gln Gln Leu Leu His His His Ser Thr Glu 
1               5                   10                   15    
Val Ser Cys Asp Ser Ser Gly Asp Ser Asn Ser Val Arg Val Lys Ile 
            20                   25                  30        
Asn Pro Lys Gln Leu Ser Ser Asn Ser His Pro Lys His Cys Lys Tyr 
        35                   40                  45            
Ser Ile Ser Ser Ser Cys Ser Ser Ser Gly Asp Ser Gly Gly Val Pro 
    50                   55                  60                
Arg Arg Val Gly Gly Gly Gly Arg Leu Arg Arg Gln Lys Lys Leu Pro 
65                   70                  75                  80
Gln Leu Phe Glu Arg Ala Ser Ser Arg Trp Trp Asp Pro Lys Phe Asp 
                85                   90                  95    
Ser Val Asn Leu Glu Glu Ala Cys Leu Glu Arg Cys Phe Pro Gln Thr 
            100                  105                110        
Gln Arg Arg Phe Arg Tyr Ala Leu Phe Tyr Ile Gly Phe Ala Cys Leu 
        115                  120                125            
Leu Trp Ser Ile Tyr Phe Ala Val His Met Arg Ser Arg Leu Ile Val 
    130                  135                140                
Met Val Ala Pro Ala Leu Cys Phe Leu Leu Val Cys Val Gly Phe Phe 
145                  150                155                  160
Leu Phe Thr Phe Thr Lys Leu Tyr Ala Arg His Tyr Ala Trp Thr Ser 
                165                  170                175    
Leu Ala Leu Thr Leu Leu Val Phe Ala Leu Thr Leu Ala Ala Gln Phe 
            180                  185                190        
Gln Val Leu Thr Pro Val Ser Gly Arg Gly Asp Ser Ser Asn Leu Thr 
        195                  200                205            
Ala Thr Ala Arg Pro Thr Asp Thr Cys Leu Ser Gln Val Gly Ser Phe 
    210                  215                220                
Ser Met Cys Ile Glu Val Leu Phe Leu Leu Tyr Thr Val Met His Leu 
225                  230                235                  240
Pro Leu Tyr Leu Ser Leu Cys Leu Gly Val Ala Tyr Ser Val Leu Phe 
                245                  250                255    
Glu Thr Phe Gly Tyr His Phe Arg Asp Glu Ala Cys Phe Pro Ser Pro 
            260                  265                270        
Gly Ala Gly Ala Leu His Trp Glu Leu Leu Ser Arg Gly Leu Leu His 
        275                  280                285            
Gly Cys Ile His Ala Ile Gly Val His Leu Phe Val Met Ser Gln Val 
    290                  295                300                
Arg Ser Arg Ser Thr Phe Leu Lys Val Gly Gln Ser Ile Met His Gly 
305                  310                315                  320
Lys Asp Leu Glu Val Glu Lys Ala Leu Lys Glu Arg Met Ile His Ser 
                325                  330                335    
Val Met Pro Arg Ile Ile Ala Asp Asp Leu Met Lys Gln Gly Asp Glu 
            340                  345                350        
Glu Ser Glu Asn Ser Val Lys Arg His Ala Thr Ser Ser Pro Lys Asn 
        355                  360                365            
Arg Lys Lys Lys Ser Ser Ile Gln Lys Ala Pro Ile Ala Phe Arg Pro 
    370                  375                380                
Phe Lys Met Gln Gln Ile Glu Glu Val Ser Ile Leu Phe Ala Asp Ile 
385                  390                395                  400
Val Gly Phe Thr Lys Met Ser Ala Asn Lys Ser Ala His Ala Leu Val 
                405                  410                415    
Gly Leu Leu Asn Asp Leu Phe Gly Arg Phe Asp Arg Leu Cys Glu Glu 
            420                  425                430        
Thr Lys Cys Glu Lys Ile Ser Thr Leu Gly Asp Cys Tyr Tyr Cys Val 
        435                  440                445            
Ala Gly Cys Pro Glu Pro Arg Ala Asp His Ala Tyr Cys Cys Ile Glu 
    450                  455                460                
Met Gly Leu Gly Met Ile Lys Ala Ile Glu Gln Phe Cys Gln Glu Lys 
465                  470                475                  480
Lys Glu Met Val Asn Met Arg Val Gly Val His Thr Gly Thr Val Leu 
                485                  490                495    
Cys Gly Ile Leu Gly Met Arg Arg Phe Lys Phe Asp Val Trp Ser Asn 
            500                  505                510        
Asp Val Asn Leu Ala Asn Leu Met Glu Gln Leu Gly Val Ala Gly Lys 
        515                  520                525            
Val His Ile Ser Glu Ala Thr Ala Lys Tyr Leu Asp Asp Arg Tyr Glu 
    530                  535                540                
Met Glu Asp Gly Lys Val Ile Glu Arg Leu Gly Gln Ser Val Val Ala 
545                  550                555                  560
Asp Gln Leu Lys Gly Leu Lys Thr Tyr Leu Ile Ser Gly Gln Arg Ala 
                565                  570                575    
Lys Glu Ser Arg Cys Ser Cys Ala Glu Ala Leu Leu Ser Gly Phe Glu 
            580                  585                590        
Val Ile Asp Gly Ser Gln Val Ser Ser Gly Pro Arg Gly Gln Gly Thr 
        595                  600                605            
Ala Ser Ser Gly Asn Val Ser Asp Leu Ala Gln Thr Val Lys Thr Phe 
    610                  615                620                
Asp Asn Leu Lys Thr Cys Pro Ser Cys Gly Ile Thr Phe Ala Pro Lys 
625                  630                635                  640
Ser Glu Ala Gly Ala Glu Gly Gly Ala Pro Gln Asn Gly Cys Gln Asp 
                645                  650                655    
Glu His Lys Asn Ser Thr Lys Ala Ser Gly Gly Pro Asn Pro Lys Thr 
            660                  665                670        
Gln Asn Gly Leu Leu Ser Pro Pro Gln Glu Glu Lys Leu Thr Asn Ser 
        675                  680                685            
Gln Thr Ser Leu Cys Glu Ile Leu Gln Glu Lys Gly Arg Trp Ala Gly 
    690                  695                700                
Val Ser Leu Asp Gln Ser Ala Leu Leu Pro Leu Arg Phe Lys Asn Ile 
705                  710                715                  720
Arg Glu Lys Thr Asp Ala His Phe Val Asp Val Ile Lys Glu Asp Ser 
                725                  730                735    
Leu Met Lys Asp Tyr Phe Phe Lys Pro Pro Ile Asn Gln Phe Ser Leu 
            740                  745                750        
Asn Phe Leu Asp Gln Glu Leu Glu Arg Ser Tyr Arg Thr Ser Tyr Gln 
        755                  760                765            
Glu Glu Val Ile Lys Asn Ser Pro Val Lys Thr Phe Ala Ser Pro Thr 
    770                  775                780                
Phe Ser Ser Leu Leu Asp Val Phe Leu Ser Thr Thr Val Phe Leu Thr 
785                  790                795                  800
Leu Ser Thr Thr Cys Phe Leu Lys Tyr Glu Ala Ala Thr Val Pro Pro 
                805                  810                815    
Pro Pro Ala Ala Leu Ala Val Phe Ser Ala Ala Leu Leu Leu Glu Val 
            820                  825                830        
Leu Ser Leu Ala Val Ser Ile Arg Met Val Phe Phe Leu Glu Asp Val 
        835                  840                845            
Met Ala Cys Thr Lys Arg Leu Leu Glu Trp Ile Ala Gly Trp Leu Pro 
    850                  855                860                
Arg His Cys Ile Gly Ala Ile Leu Val Ser Leu Pro Ala Leu Ala Val 
865                  870                875                  880
Tyr Ser His Val Thr Ser Glu Tyr Glu Thr Asn Ile His Phe Pro Val 
                885                  890                895    
Phe Thr Gly Ser Ala Ala Leu Ile Ala Val Val His Tyr Cys Asn Phe 
            900                  905                910        
Cys Gln Leu Ser Ser Trp Met Arg Ser Ser Leu Ala Thr Val Val Gly 
        915                  920                925            
Ala Gly Pro Leu Leu Leu Leu Tyr Val Ser Leu Cys Pro Asp Ser Ser 
    930                  935                940                
Val Leu Thr Ser Pro Leu Asp Ala Val Gln Asn Phe Ser Ser Glu Arg 
945                  950                955                  960
Asn Pro Cys Asn Ser Ser Val Pro Arg Asp Leu Arg Arg Pro Ala Ser 
                965                  970                975    
Leu Ile Gly Gln Glu Val Val Leu Val Phe Phe Leu Leu Leu Leu Leu 
            980                  985                990        
Val Trp Phe Leu Asn Arg Glu Phe Glu Val Ser Tyr Arg Leu His Tyr 
        995                  1000                1005            
His Gly Asp Val Glu Ala Asp Leu His Arg Thr Lys Ile Gln Ser Met 
    1010                1015                1020                
Arg Asp Gln Ala Asp Trp Leu Leu Arg Asn Ile Ile Pro Tyr His Val 
1025                1030                1035                1040
Ala Glu Gln Leu Lys Val Ser Gln Thr Tyr Ser Lys Asn His Asp Ser 
                1045                1050                1055    
Gly Gly Val Ile Phe Ala Ser Ile Val Asn Phe Ser Glu Phe Tyr Glu 
            1060                1065                1070        
Glu Asn Tyr Glu Gly Gly Lys Glu Cys Tyr Arg Val Leu Asn Glu Leu 
        1075                1080                1085            
Ile Gly Asp Phe Asp Glu Leu Leu Ser Lys Pro Asp Tyr Ser Ser Ile 
    1090                1095                1100                
Glu Lys Ile Lys Thr Ile Gly Ala Thr Tyr Met Ala Ala Ser Gly Leu 
1105                1110                1115                1120
Asn Thr Ala Gln Ala Gln Asp Gly Ser His Pro Gln Glu His Leu Gln 
                1125                1130                1135    
Ile Leu Phe Glu Phe Ala Lys Glu Met Met Arg Val Val Asp Asp Phe 
            1140                1145                1150        
Asn Asn Asn Met Leu Trp Phe Asn Phe Lys Leu Arg Val Gly Phe Asn 
        1155                1160                1165            
His Gly Pro Leu Thr Ala Gly Val Ile Gly Thr Thr Lys Leu Leu Tyr 
    1170                1175                1180                
Asp Ile Trp Gly Asp Thr Val Asn Ile Ala Ser Arg Met Asp Thr Thr 
1185                1190                1195                1200
Gly Val Glu Cys Arg Ile Gln Val Ser Glu Glu Ser Tyr Arg Val Leu 
                1205                1210                1215    
Ser Lys Met Gly Tyr Asp Phe Asp Tyr Arg Gly Thr Val Asn Val Lys 
            1220                1225                1230        
Gly Lys Gly Gln Met Lys Thr Tyr Leu Tyr Pro Lys Cys Thr Asp His 
        1235                1240                1245            
Arg Val Ile Pro Gln His Gln Leu Ser Ile Ser Pro Asp Ile Arg Val 
    1250                1255                1260                
Gln Val Asp Gly Ser Ile Gly Arg Ser Pro Thr Asp Glu Ile Ala Asn 
1265                1270                1275                1280
Leu Val Pro Ser Val Gln Tyr Val Asp Lys Thr Ser Leu Gly Ser Asp 
                1285                1290                1295    
Ser Ser Thr Gln Ala Lys Asp Ala His Leu Ser Pro Lys Arg Pro Trp 
            1300                1305                1310        
Lys Glu Pro Val Lys Ala Glu Glu Arg Gly Arg Phe Gly Lys Ala Ile 
        1315                1320                1325            
Glu Lys Asp Asp Cys Asp Glu Thr Gly Ile Glu Glu Ala Asn Glu Leu 
    1330                1335                1340                
Thr Lys Leu Asn Val Ser Lys Ser Val 
1345                1350            

<210> 149
<211> 1457
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1457
<223> /mol_type="protein"
      /note="ADCY10 (full length protein of ADCY10 (Gene ID: 55811))"
      /organism="Homo sapiens"

<400> 149
Met Leu Val Phe Gly Asp Glu Thr His Ser His Phe Leu Val Ile Gly 
1               5                   10                   15    
Gln Ala Val Asp Asp Val Arg Leu Ala Gln Asn Met Ala Gln Met Asn 
            20                   25                  30        
Asp Val Ile Leu Ser Pro Asn Cys Trp Gln Leu Cys Asp Arg Ser Met 
        35                   40                  45            
Ile Glu Ile Glu Ser Val Pro Asp Gln Arg Ala Val Lys Val Asn Phe 
    50                   55                  60                
Leu Lys Pro Pro Pro Asn Phe Asn Phe Asp Glu Phe Phe Thr Lys Cys 
65                   70                  75                  80
Thr Thr Phe Met His Tyr Tyr Pro Ser Gly Glu His Lys Asn Leu Leu 
                85                   90                  95    
Arg Leu Ala Cys Thr Leu Lys Pro Asp Pro Glu Leu Glu Met Ser Leu 
            100                  105                110        
Gln Lys Tyr Val Met Glu Ser Ile Leu Lys Gln Ile Asp Asn Lys Gln 
        115                  120                125            
Leu Gln Gly Tyr Leu Ser Glu Leu Arg Pro Val Thr Ile Val Phe Val 
    130                  135                140                
Asn Leu Met Phe Glu Asp Gln Asp Lys Ala Glu Glu Ile Gly Pro Ala 
145                  150                155                  160
Ile Gln Asp Ala Tyr Met His Ile Thr Ser Val Leu Lys Ile Phe Gln 
                165                  170                175    
Gly Gln Ile Asn Lys Val Phe Met Phe Asp Lys Gly Cys Ser Phe Leu 
            180                  185                190        
Cys Val Phe Gly Phe Pro Gly Glu Lys Val Pro Asp Glu Leu Thr His 
        195                  200                205            
Ala Leu Glu Cys Ala Met Asp Ile Phe Asp Phe Cys Ser Gln Val His 
    210                  215                220                
Lys Ile Gln Thr Val Ser Ile Gly Val Ala Ser Gly Ile Val Phe Cys 
225                  230                235                  240
Gly Ile Val Gly His Thr Val Arg His Glu Tyr Thr Val Ile Gly Gln 
                245                  250                255    
Lys Val Asn Leu Ala Ala Arg Met Met Met Tyr Tyr Pro Gly Ile Val 
            260                  265                270        
Thr Cys Asp Ser Val Thr Tyr Asn Gly Ser Asn Leu Pro Ala Tyr Phe 
        275                  280                285            
Phe Lys Glu Leu Pro Lys Lys Val Met Lys Gly Val Ala Asp Ser Gly 
    290                  295                300                
Pro Leu Tyr Gln Tyr Trp Gly Arg Thr Glu Lys Val Met Phe Gly Met 
305                  310                315                  320
Ala Cys Leu Ile Cys Asn Arg Lys Glu Asp Tyr Pro Leu Leu Gly Arg 
                325                  330                335    
Asn Lys Glu Ile Asn Tyr Phe Met Tyr Thr Met Lys Lys Phe Leu Ile 
            340                  345                350        
Ser Asn Ser Ser Gln Val Leu Met Tyr Glu Gly Leu Pro Gly Tyr Gly 
        355                  360                365            
Lys Ser Gln Ile Leu Met Lys Ile Glu Tyr Leu Ala Gln Gly Lys Asn 
    370                  375                380                
His Arg Ile Ile Ala Ile Ser Leu Asn Lys Ile Ser Phe His Gln Thr 
385                  390                395                  400
Phe Tyr Thr Ile Gln Met Phe Met Ala Asn Val Leu Gly Leu Asp Thr 
                405                  410                415    
Cys Lys His Tyr Lys Glu Arg Gln Thr Asn Leu Arg Asn Lys Val Met 
            420                  425                430        
Thr Leu Leu Asp Glu Lys Phe Tyr Cys Leu Leu Asn Asp Ile Phe His 
        435                  440                445            
Val Gln Phe Pro Ile Ser Arg Glu Ile Ser Arg Met Ser Thr Leu Lys 
    450                  455                460                
Lys Gln Lys Gln Leu Glu Ile Leu Phe Met Lys Ile Leu Lys Leu Ile 
465                  470                475                  480
Val Lys Glu Glu Arg Ile Ile Phe Ile Ile Asp Glu Ala Gln Phe Val 
                485                  490                495    
Asp Ser Thr Ser Trp Arg Phe Met Glu Lys Leu Ile Arg Thr Leu Pro 
            500                  505                510        
Ile Phe Ile Ile Met Ser Leu Cys Pro Phe Val Asn Ile Pro Cys Ala 
        515                  520                525            
Ala Ala Arg Ala Val Ile Lys Asn Arg Asn Thr Thr Tyr Ile Val Ile 
    530                  535                540                
Gly Ala Val Gln Pro Asn Asp Ile Ser Asn Lys Ile Cys Leu Asp Leu 
545                  550                555                  560
Asn Val Ser Cys Ile Ser Lys Glu Leu Asp Ser Tyr Leu Gly Glu Gly 
                565                  570                575    
Ser Cys Gly Ile Pro Phe Tyr Cys Glu Glu Leu Leu Lys Asn Leu Glu 
            580                  585                590        
His His Glu Val Leu Val Phe Gln Gln Thr Glu Ser Glu Glu Lys Thr 
        595                  600                605            
Asn Arg Thr Trp Asn Asn Leu Phe Lys Tyr Ser Ile Lys Leu Thr Glu 
    610                  615                620                
Lys Leu Asn Met Val Thr Leu His Ser Asp Lys Glu Ser Glu Glu Val 
625                  630                635                  640
Cys His Leu Thr Ser Gly Val Arg Leu Lys Asn Leu Ser Pro Pro Thr 
                645                  650                655    
Ser Leu Lys Glu Ile Ser Leu Ile Gln Leu Asp Ser Met Arg Leu Ser 
            660                  665                670        
His Gln Met Leu Val Arg Cys Ala Ala Ile Ile Gly Leu Thr Phe Thr 
        675                  680                685            
Thr Glu Leu Leu Phe Glu Ile Leu Pro Cys Trp Asn Met Lys Met Met 
    690                  695                700                
Ile Lys Thr Leu Ala Thr Leu Val Glu Ser Asn Ile Phe Tyr Cys Phe 
705                  710                715                  720
Arg Asn Gly Lys Glu Leu Gln Lys Ala Leu Lys Gln Asn Asp Pro Ser 
                725                  730                735    
Phe Glu Val His Tyr Arg Ser Leu Ser Leu Lys Pro Ser Glu Gly Met 
            740                  745                750        
Asp His Gly Glu Glu Glu Gln Leu Arg Glu Leu Glu Asn Glu Val Ile 
        755                  760                765            
Glu Cys His Arg Ile Arg Phe Cys Asn Pro Met Met Gln Lys Thr Ala 
    770                  775                780                
Tyr Glu Leu Trp Leu Lys Asp Gln Arg Lys Ala Met His Leu Lys Cys 
785                  790                795                  800
Ala Arg Phe Leu Glu Glu Asp Ala His Arg Cys Asp His Cys Arg Gly 
                805                  810                815    
Arg Asp Phe Ile Pro Tyr His His Phe Thr Val Asn Ile Arg Leu Asn 
            820                  825                830        
Ala Leu Asp Met Asp Ala Ile Lys Lys Met Ala Met Ser His Gly Phe 
        835                  840                845            
Lys Thr Glu Glu Lys Leu Ile Leu Ser Asn Ser Glu Ile Pro Glu Thr 
    850                  855                860                
Ser Ala Phe Phe Pro Glu Asn Arg Ser Pro Glu Glu Ile Arg Glu Lys 
865                  870                875                  880
Ile Leu Asn Phe Phe Asp His Val Leu Thr Lys Met Lys Thr Ser Asp 
                885                  890                895    
Glu Asp Ile Ile Pro Leu Glu Ser Cys Gln Cys Glu Glu Ile Leu Glu 
            900                  905                910        
Ile Val Ile Leu Pro Leu Ala His His Phe Leu Ala Leu Gly Glu Asn 
        915                  920                925            
Asp Lys Ala Leu Tyr Tyr Phe Leu Glu Ile Ala Ser Ala Tyr Leu Ile 
    930                  935                940                
Phe Cys Asp Asn Tyr Met Ala Tyr Met Tyr Leu Asn Glu Gly Gln Lys 
945                  950                955                  960
Leu Leu Lys Thr Leu Lys Lys Asp Lys Ser Trp Ser Gln Thr Phe Glu 
                965                  970                975    
Ser Ala Thr Phe Tyr Ser Leu Lys Gly Glu Val Cys Phe Asn Met Gly 
            980                  985                990        
Gln Ile Val Leu Ala Lys Lys Met Leu Arg Lys Ala Leu Lys Leu Leu 
        995                  1000                1005            
Asn Arg Ile Phe Pro Tyr Asn Leu Ile Ser Leu Phe Leu His Ile His 
    1010                1015                1020                
Val Glu Lys Asn Arg His Phe His Tyr Val Asn Arg Gln Ala Gln Glu 
1025                1030                1035                1040
Ser Pro Pro Pro Gly Lys Lys Arg Leu Ala Gln Leu Tyr Arg Gln Thr 
                1045                1050                1055    
Val Cys Leu Ser Leu Leu Trp Arg Ile Tyr Ser Tyr Ser Tyr Leu Phe 
            1060                1065                1070        
His Cys Lys Tyr Tyr Ala His Leu Ala Val Met Met Gln Met Asn Thr 
        1075                1080                1085            
Ala Leu Glu Thr Gln Asn Cys Phe Gln Ile Ile Lys Ala Tyr Leu Asp 
    1090                1095                1100                
Tyr Ser Leu Tyr His His Leu Ala Gly Tyr Lys Gly Val Trp Phe Lys 
1105                1110                1115                1120
Tyr Glu Val Met Ala Met Glu His Ile Phe Asn Leu Pro Leu Lys Gly 
                1125                1130                1135    
Glu Gly Ile Glu Ile Val Ala Tyr Val Ala Glu Thr Leu Val Phe Asn 
            1140                1145                1150        
Lys Leu Ile Met Gly His Leu Asp Leu Ala Ile Glu Leu Gly Ser Arg 
        1155                1160                1165            
Ala Leu Gln Met Trp Ala Leu Leu Gln Asn Pro Asn Arg His Tyr Gln 
    1170                1175                1180                
Ser Leu Cys Arg Leu Ser Arg Cys Leu Leu Leu Asn Ser Arg Tyr Pro 
1185                1190                1195                1200
Gln Leu Ile Gln Val Leu Gly Arg Leu Trp Glu Leu Ser Val Thr Gln 
                1205                1210                1215    
Glu His Ile Phe Ser Lys Ala Phe Phe Tyr Phe Val Cys Leu Asp Ile 
            1220                1225                1230        
Leu Leu Tyr Ser Gly Phe Val Tyr Arg Thr Phe Glu Glu Cys Leu Glu 
        1235                1240                1245            
Phe Ile His Gln Tyr Glu Asn Asn Arg Ile Leu Lys Phe His Ser Gly 
    1250                1255                1260                
Leu Leu Leu Gly Leu Tyr Ser Ser Val Ala Ile Trp Tyr Ala Arg Leu 
1265                1270                1275                1280
Gln Glu Trp Asp Asn Phe Tyr Lys Phe Ser Asn Arg Ala Lys Asn Leu 
                1285                1290                1295    
Leu Pro Arg Arg Thr Met Thr Leu Thr Tyr Tyr Asp Gly Ile Ser Arg 
            1300                1305                1310        
Tyr Met Glu Gly Gln Val Leu His Leu Gln Lys Gln Ile Lys Glu Gln 
        1315                1320                1325            
Ser Glu Asn Ala Gln Ala Ser Gly Glu Glu Leu Leu Lys Asn Leu Glu 
    1330                1335                1340                
Asn Leu Val Ala Gln Asn Thr Thr Gly Pro Val Phe Cys Pro Arg Leu 
1345                1350                1355                1360
Tyr His Leu Met Ala Tyr Val Cys Ile Leu Met Gly Asp Gly Gln Lys 
                1365                1370                1375    
Cys Gly Leu Phe Leu Asn Thr Ala Leu Arg Leu Ser Glu Thr Gln Gly 
            1380                1385                1390        
Asn Ile Leu Glu Lys Cys Trp Leu Asn Met Asn Lys Glu Ser Trp Tyr 
        1395                1400                1405            
Ser Thr Ser Glu Leu Lys Glu Asp Gln Trp Leu Gln Thr Ile Leu Ser 
    1410                1415                1420                
Leu Pro Ser Trp Glu Lys Ile Val Ala Gly Arg Val Asn Ile Gln Asp 
1425                1430                1435                1440
Leu Gln Lys Asn Lys Phe Leu Met Arg Ala Asn Thr Val Asp Asn His 
                1445                1450                1455    
Phe 
    

<210> 150
<211> 24
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..24
<223> /mol_type="DNA"
      /note="Primer SOS_5_amp"
      /organism="artificial sequences"

<400> 150
cagggatcca tgcaggcgca gcag                                           24


<210> 151
<211> 29
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..29
<223> /mol_type="DNA"
      /note="Primer ADCY3_3_amp"
      /organism="artificial sequences"

<400> 151
caggaattct caggagttgt ccaccacct                                      29


<210> 152
<211> 29
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..29
<223> /mol_type="DNA"
      /note="Primer ZNF142_5_amp"
      /organism="artificial sequences"

<400> 152
atgcgtcgac attccccaga acagcgata                                      29


<210> 153
<211> 29
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..29
<223> /mol_type="DNA"
      /note="Primer PTK6_3_amp"
      /organism="artificial sequences"

<400> 153
cgtagtcgac tgcacccatc acctcagta                                      29


<210> 154
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer AGAP1_5_amp"
      /organism="artificial sequences"

<400> 154
accatgaact accagcagca                                                20


<210> 155
<211> 18
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..18
<223> /mol_type="DNA"
      /note="Primer IGFBP2_3_amp"
      /organism="artificial sequences"

<400> 155
gctggctgcg gtctactg                                                  18


<210> 156
<211> 22
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..22
<223> /mol_type="DNA"
      /note="Primer FNDC3B_5_amp"
      /organism="artificial sequences"

<400> 156
atgaatgtac gtcacaatga tg                                             22


<210> 157
<211> 21
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..21
<223> /mol_type="DNA"
      /note="Primer TNIK_3_amp"
      /organism="artificial sequences"

<400> 157
gccatgaaga taagtgccaa g                                              21


<210> 158
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer C12orf11_5_amp"
      /organism="artificial sequences"

<400> 158
aggaccaggc acgaaagtta                                                20


<210> 159
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer RASSF8_3_amp"
      /organism="artificial sequences"

<400> 159
tctgttgggt ctcctcccta                                                20


<210> 160
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer POMC_5_amp (break-point)"
      /organism="artificial sequences"

<400> 160
ttcaaaaacg ccatcatcaa                                                20


<210> 161
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer ADD1_3_amp (break-point)"
      /organism="artificial sequences"

<400> 161
ggcaacaaaa cgacacagaa                                                20


<210> 162
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer E2F4_5_amp (break-point)"
      /organism="artificial sequences"

<400> 162
cctgggactg atagcaagga                                                20


<210> 163
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="Primer RPL14_3_amp (break-point)"
      /organism="artificial sequences"

<400> 163
tgacccttct gagcttttgg                                                20


<210> 164
<211> 18
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..18
<223> /mol_type="DNA"
      /note="POMC_f"
      /organism="artificial sequences"

<400> 164
ctggccttgc tgcttcag                                                  18


<210> 165
<211> 20
<212> DNA
<213> artificial sequences

<220> 
<221> source
<222> 1..20
<223> /mol_type="DNA"
      /note="POMC_r"
      /organism="artificial sequences"

<400> 165
gaagtggccc atgacgtact                                                20


