                         SEQUENCE LISTING

<110>  Chang, Chawnshang
 
<120>  PSA and KLK2 as therapeutic targets and molecules inhbiting PSA 
       and KLK2

<130>  24376.42.9001

<160>  8     

<170>  PatentIn version 3.5

<210>  1
<211>  614
<212>  PRT
<213>  Homo sapiens


<220>
<221>  ARA70 -Homo sapiens nuclear receptor coactivator 4 (NCOA4),
<222>  (1)..(614)

<400>  1

Met Asn Thr Phe Gln Asp Gln Ser Gly Ser Ser Ser Asn Arg Glu Pro 
1               5                   10                  15      


Leu Leu Arg Cys Ser Asp Ala Arg Arg Asp Leu Glu Leu Ala Ile Gly 
            20                  25                  30          


Gly Val Leu Arg Ala Glu Gln Gln Ile Lys Asp Asn Leu Arg Glu Val 
        35                  40                  45              


Lys Ala Gln Ile His Ser Cys Ile Ser Arg His Leu Glu Cys Leu Arg 
    50                  55                  60                  


Ser Arg Glu Val Trp Leu Tyr Glu Gln Val Asp Leu Ile Tyr Gln Leu 
65                  70                  75                  80  


Lys Glu Glu Thr Leu Gln Gln Gln Ala Gln Gln Leu Tyr Ser Leu Leu 
                85                  90                  95      


Gly Gln Phe Asn Cys Leu Thr His Gln Leu Glu Cys Thr Gln Asn Lys 
            100                 105                 110         


Asp Leu Ala Asn Gln Val Ser Val Cys Leu Glu Arg Leu Gly Ser Leu 
        115                 120                 125             


Thr Leu Lys Pro Glu Asp Ser Thr Val Leu Leu Phe Glu Ala Asp Thr 
    130                 135                 140                 


Ile Thr Leu Arg Gln Thr Ile Thr Thr Phe Gly Ser Leu Lys Thr Ile 
145                 150                 155                 160 


Gln Ile Pro Glu His Leu Met Ala His Ala Ser Ser Ala Asn Ile Gly 
                165                 170                 175     


Pro Phe Leu Glu Lys Arg Gly Cys Ile Ser Met Pro Glu Gln Lys Ser 
            180                 185                 190         


Ala Ser Gly Ile Val Ala Val Pro Phe Ser Glu Trp Leu Leu Gly Ser 
        195                 200                 205             


Lys Pro Ala Ser Gly Tyr Gln Ala Pro Tyr Ile Pro Ser Thr Asp Pro 
    210                 215                 220                 


Gln Asp Trp Leu Thr Gln Lys Gln Thr Leu Glu Asn Ser Gln Thr Ser 
225                 230                 235                 240 


Ser Arg Ala Cys Asn Phe Phe Asn Asn Val Gly Gly Asn Leu Lys Gly 
                245                 250                 255     


Leu Glu Asn Trp Leu Leu Lys Ser Glu Lys Ser Ser Tyr Gln Lys Cys 
            260                 265                 270         


Asn Ser His Ser Thr Thr Ser Ser Phe Ser Ile Glu Met Glu Lys Val 
        275                 280                 285             


Gly Asp Gln Glu Leu Pro Asp Gln Asp Glu Met Asp Leu Ser Asp Trp 
    290                 295                 300                 


Leu Val Thr Pro Gln Glu Ser His Lys Leu Arg Lys Pro Glu Asn Gly 
305                 310                 315                 320 


Ser Arg Glu Thr Ser Glu Lys Phe Lys Leu Leu Phe Gln Ser Tyr Asn 
                325                 330                 335     


Val Asn Asp Trp Leu Val Lys Thr Asp Ser Cys Thr Asn Cys Gln Gly 
            340                 345                 350         


Asn Gln Pro Lys Gly Val Glu Ile Glu Asn Leu Gly Asn Leu Lys Cys 
        355                 360                 365             


Leu Asn Asp His Leu Glu Ala Lys Lys Pro Leu Ser Thr Pro Ser Met 
    370                 375                 380                 


Val Thr Glu Asp Trp Leu Val Gln Asn His Gln Asp Pro Cys Lys Val 
385                 390                 395                 400 


Glu Glu Val Cys Arg Ala Asn Glu Pro Cys Thr Ser Phe Ala Glu Cys 
                405                 410                 415     


Val Cys Asp Glu Asn Cys Glu Lys Glu Ala Leu Tyr Lys Trp Leu Leu 
            420                 425                 430         


Lys Lys Glu Gly Lys Asp Lys Asn Gly Met Pro Val Glu Pro Lys Pro 
        435                 440                 445             


Glu Pro Glu Lys His Lys Asp Ser Leu Asn Met Trp Leu Cys Pro Arg 
    450                 455                 460                 


Lys Glu Val Ile Glu Gln Thr Lys Ala Pro Lys Ala Met Thr Pro Ser 
465                 470                 475                 480 


Arg Ile Ala Asp Ser Phe Gln Val Ile Lys Asn Ser Pro Leu Ser Glu 
                485                 490                 495     


Trp Leu Ile Arg Pro Pro Tyr Lys Glu Gly Ser Pro Lys Glu Val Pro 
            500                 505                 510         


Gly Thr Glu Asp Arg Ala Gly Lys Gln Lys Phe Lys Ser Pro Met Asn 
        515                 520                 525             


Thr Ser Trp Cys Ser Phe Asn Thr Ala Asp Trp Val Leu Pro Gly Lys 
    530                 535                 540                 


Lys Met Gly Asn Leu Ser Gln Leu Ser Ser Gly Glu Asp Lys Trp Leu 
545                 550                 555                 560 


Leu Arg Lys Lys Ala Gln Glu Val Leu Leu Asn Ser Pro Leu Gln Glu 
                565                 570                 575     


Glu His Asn Phe Pro Pro Asp His Tyr Gly Leu Pro Ala Val Cys Asp 
            580                 585                 590         


Leu Phe Ala Cys Met Gln Leu Lys Val Asp Lys Glu Lys Trp Leu Tyr 
        595                 600                 605             


Arg Thr Pro Leu Gln Met 
    610                 


<210>  2
<211>  3505
<212>  DNA
<213>  Homo sapiens


<220>
<221>  ARA70 -Homo sapiens nuclear receptor coactivator 4 (NCOA4)
<222>  (1)..(3505)

<400>  2
ctggagttgc cgtgtgacgc gtgggcggga cgaggcccgg gctcggggac ctttcgcact       60

cgggtcaggg gtaaagcagc ctgtcgcttg ccgggcagct ggtgagtcgg tgacctggcc      120

tgtgaggagc agtgaggaga atgaatacct tccaagacca gagtggcagc tccagtaata      180

gagaacccct tttgaggtgt agtgatgcac ggagggactt ggagcttgct attggtggag      240

ttctccgggc tgaacagcaa attaaagata acttgcgaga ggtcaaagct cagattcaca      300

gttgcataag ccgtcacctg gaatgtctta gaagccgtga ggtatggctg tatgaacagg      360

tggaccttat ttatcagctt aaagaggaga cacttcaaca gcaggctcag cagctctact      420

cgttattggg ccagttcaat tgtcttactc atcaactgga gtgtacccaa aacaaagatc      480

tagccaatca agtctctgtg tgcctggaga gactgggcag tttgaccctt aagcctgaag      540

attcaactgt cctgctcttt gaagctgaca caattactct gcgccagacc atcaccacat      600

ttgggtctct caaaaccatt caaattcctg agcacttgat ggctcatgct agttcagcaa      660

atattgggcc cttcctggag aagagaggct gtatctccat gccagagcag aagtcagcat      720

ccggtattgt agctgtccct ttcagcgaat ggctccttgg aagcaaacct gccagtggtt      780

atcaagctcc ttacataccc agcaccgacc cccaggactg gcttacccaa aagcagacct      840

tggagaacag tcagacttct tccagagcct gcaatttctt caataatgtc gggggaaacc      900

taaagggctt agaaaactgg ctcctcaaga gtgaaaaatc aagttatcaa aagtgtaaca      960

gccattccac tactagttct ttctccattg aaatggaaaa ggttggagat caagagcttc     1020

ctgatcaaga tgagatggac ctatcagatt ggctagtgac tccccaggaa tcccataagc     1080

tgcggaagcc tgagaatggc agtcgtgaaa ccagtgagaa gtttaagctc ttattccagt     1140

cctataatgt gaatgattgg cttgtcaaga ctgactcctg taccaactgt cagggaaacc     1200

agcccaaagg tgtggagatt gaaaacctgg gcaatctgaa gtgcctgaat gaccacttgg     1260

aggccaagaa accattgtcc acccccagca tggttacaga ggattggctt gtccagaacc     1320

atcaggaccc atgtaaggta gaggaggtgt gcagagccaa tgagccctgc acaagctttg     1380

cagagtgtgt gtgtgatgag aattgtgaga aggaggctct gtataagtgg cttctgaaga     1440

aagaaggaaa ggataaaaat gggatgcctg tggaacccaa acctgagcct gagaagcata     1500

aagattccct gaatatgtgg ctctgtccta gaaaagaagt aatagaacaa actaaagcac     1560

caaaggcaat gactccttct agaattgctg attccttcca agtcataaag aacagcccct     1620

tgtcggagtg gcttatcagg cccccataca aagaaggaag tcccaaggaa gtgcctggta     1680

ctgaagacag agctggcaaa cagaagttta aaagccccat gaatacttcc tggtgttcct     1740

ttaacacagc tgactgggtc ctgccaggaa agaagatggg caacctcagc cagttatctt     1800

ctggagaaga caagtggctg cttcgaaaga aggcccagga agtattactt aattcacctc     1860

tacaggagga acataacttc cccccagacc attatggcct ccctgcagtt tgtgatctct     1920

ttgcctgtat gcagcttaaa gttgataaag agaagtggtt atatcgaact cctctacaga     1980

tgtgaaggaa tggacaagag ttgagcagcc tttctgctga ttatcacaca tcatgagctg     2040

agtgactgca gcttgccaaa tctttgtgtt tctgggtctg accaattagc ttagttcttc     2100

tcctgcctaa ttttgaacta gtaaagcaaa gtgagtcatc agattatgag ttactgttta     2160

aaagaaaaat gctgtttatt catgctgagg tgattcagtt ccctccttct tacagaagta     2220

ttttaattca ccccacacta gaaatgcagc atctttgtgg acgtcttttt cacaagcctc     2280

caaggctcct tagattgggt cgttactaaa agtacattaa aacactcttg tttatcgaag     2340

tatattgatg tattctaaag ctagtaaact tccctaacgt ttaattgccc tacagatgct     2400

tctcttgctg tgggttttct tttgttagtg gtctgaaata attattttcc tgttctatta     2460

atacatagtg tattttgcac aaaaaaatta acctggtcaa tagtgattac caaaatatat     2520

attaataatc ttggcaattt ttgacattaa ttatgaaaca ttttagccca cgttagttct     2580

acattattct tcacttaaac tcagctactg caaattttgt ctttctgtaa atgttattaa     2640

aatatccagt gagctcttta gaaggactca gtattatttc aagactattt ttgaggtaat     2700

tctagccttt taaaatattc tacagaccta cggggcttaa aagaacccca gtaccgacta     2760

agcaaatagg caaaagacat gttggaaatg tagtatagta cttgaaacag tcactatcat     2820

agggataatt ggtgcatcct gtgtaaatgg aagctgagct tgacacctgg tgcttttaag     2880

tagggataaa gtcatcctct cactgcaagc acagcatacc tgtacctcca aaagtgacgt     2940

tttagtgaac aggccgtttt caacacttgt gccttggggt gttcattgaa gctttgtgaa     3000

aactactgat gttttctcag tctccttaaa gttacgtcca tgctttaaaa tgtctgtgta     3060

ggagagaagt ggggtttata atgttttctc taagatatct ttgctgcttt ccagactttg     3120

aaactattaa gcttcttaac tgcctcttac cggaaatact tctggggaaa cttcatggtc     3180

ccaaaatgtc attgccatac agcttcacta gagttctttg aaccacagct gaaaagagct     3240

ttgtattatt ttttaattcc ctccccagat atcatttagg agtattatat aaaggtggtg     3300

ggcaaaaaca atgtaaggag cctttccagt tatcttgagt tgcagctctg tagtttcttg     3360

aggccaaaca cactgtattt tacaagtcaa aatataattt acattaatca ctatgttaat     3420

gagtatgtaa aacattcttt tgcattgatg aattttgtat ctgcttccat taaaagcata     3480

acagccacaa aaaaaaaaaa aaaaa                                           3505


<210>  3
<211>  223
<212>  PRT
<213>  Homo sapiens


<220>
<221>  Protein Homo sapiens kallikrein-related peptidase 2 (KLK2)
<222>  (1)..(223)

<400>  3

Met Trp Asp Leu Val Leu Ser Ile Ala Leu Ser Val Gly Cys Thr Gly 
1               5                   10                  15      


Ala Val Pro Leu Ile Gln Ser Arg Ile Val Gly Gly Trp Glu Cys Glu 
            20                  25                  30          


Lys His Ser Gln Pro Trp Gln Val Ala Val Tyr Ser His Gly Trp Ala 
        35                  40                  45              


His Cys Gly Gly Val Leu Val His Pro Gln Trp Val Leu Thr Ala Ala 
    50                  55                  60                  


His Cys Leu Lys Lys Asn Ser Gln Val Trp Leu Gly Arg His Asn Leu 
65                  70                  75                  80  


Phe Glu Pro Glu Asp Thr Gly Gln Arg Val Pro Val Ser His Ser Phe 
                85                  90                  95      


Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gln Ser Leu Arg 
            100                 105                 110         


Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu 
        115                 120                 125             


Pro Ala Lys Ile Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gln 
    130                 135                 140                 


Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser Ile 
145                 150                 155                 160 


Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gln Cys Val Ser Leu 
                165                 170                 175     


His Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val 
            180                 185                 190         


Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr 
        195                 200                 205             


Cys Gly Val Ser His Pro Tyr Ser Gln His Leu Glu Gly Lys Gly 
    210                 215                 220             


<210>  4
<211>  2892
<212>  DNA
<213>  Homo sapiens


<220>
<221>  Homo sapiens kallikrein-related peptidase 2 (KLK2)
<222>  (1)..(2892)

<400>  4
agccccaaac tcaccacctg gccgtggaca cctgtgtcag catgtgggac ctggttctct       60

ccatcgcctt gtctgtgggg tgcactggtg ccgtgcccct catccagtct cggattgtgg      120

gaggctggga gtgtgagaag cattcccaac cctggcaggt ggctgtgtac agtcatggat      180

gggcacactg tgggggtgtc ctggtgcacc cccagtgggt gctcacagct gcccattgcc      240

taaagaagaa tagccaggtc tggctgggtc ggcacaacct gtttgagcct gaagacacag      300

gccagagggt ccctgtcagc cacagcttcc cacacccgct ctacaatatg agccttctga      360

agcatcaaag ccttagacca gatgaagact ccagccatga cctcatgctg ctccgcctgt      420

cagagcctgc caagatcaca gatgttgtga aggtcctggg cctgcccacc caggagccag      480

cactggggac cacctgctac gcctcaggct ggggcagcat cgaaccagag gagttcttgc      540

gccccaggag tcttcagtgt gtgagcctcc atctcctgtc caatgacatg tgtgctagag      600

cttactctga gaaggtgaca gagttcatgt tgtgtgctgg gctctggaca ggtggtaaag      660

acacttgtgg ggtgagtcat ccctactccc aacatctgga ggggaaaggg tgattctggg      720

ggtccacttg tctgtaatgg tgtgcttcaa ggtatcacat catggggccc tgagccatgt      780

gccctgcctg aaaagcctgc tgtgtacacc aaggtggtgc attaccggaa gtggatcaag      840

gacaccatcg cagccaaccc ctgagtgccc ctgtcccacc cctacctcta gtaaatttaa      900

gtccacctca cgttctggca tcacttggcc tttctggatg ctggacacct gaagcttgga      960

actcacctgg ccgaagctcg agcctcctga gtcctactga cctgtgcttt ctggtgtgga     1020

gtccagggct gctaggaaaa ggaatgggca gacacaggtg tatgccaatg tttctgaaat     1080

gggtataatt tcgtcctctc cttcggaaca ctggctgtct ctgaagactt ctcgctcagt     1140

ttcagtgagg acacacacaa agacgtgggt gaccatgttg tttgtggggt gcagagatgg     1200

gaggggtggg gcccaccctg gaagagtgga cagtgacaca aggtggacac tctctacaga     1260

tcactgagga taagctggag ccacaatgca tgaggcacac acacagcaag gatgacgctg     1320

taaacatagc ccacgctgtc ctgggggcac tgggaagcct agataaggcc gtgagcagaa     1380

agaaggggag gatcctccta tgttgttgaa ggagggacta gggggagaaa ctgaaagctg     1440

attaattaca ggaggtttgt tcaggtcccc caaaccaccg tcagatttga tgatttccta     1500

gcaggactta cagaaataaa gagctatcat gctgtggttt attatggttt gttacattga     1560

tgggatacat actgaaatca gcaaacaaaa cagatgtata gattagagtg tggagaaaac     1620

agaggaaaac ttgcagttac gaagactggc aacttggctt tactaagttt tcagactggc     1680

aggaagtcaa acctattagg ctgaggacct tgtggagtgt agctgatcca gctgatagag     1740

gaactagcca ggtgggggcc tttccctttg gatggggggc atatctgaca gttattctct     1800

ccaagtggag acttacggac agcatataat tctccctgca aggatgtatg ataatatgta     1860

caaagtaatt ccaactgagg aagctcacct gatccttagt gtccaaggtt tttactgggg     1920

gtctgtagga cgagtatgga gtacttgaat aattgacctg aagtcctcag acctgaggtt     1980

ccctagagtt caaacagata cagcatggtc cagagtccca gatgtacaaa aacagggatt     2040

catcacaaat cccatcttta gcatgaaggg tctggcatgg cccaaggccc caagtatatc     2100

aaggcacttg ggcagaacat gccaaggaat caaatgtcat ctcccaggag ttattcaagg     2160

gtgagccctt tacttgggat gtacaggctt tgagcagtgc agggctgctg agtcaacctt     2220

ttattgtaca ggggatgagg gaaagggaga ggatgaggaa gcccccctgg ggatttggtt     2280

tggtcttgtg atcaggtggt ctatggggct atccctacaa agaagaatcc agaaataggg     2340

gcacattgag gaatgatact gagcccaaag agcattcaat cattgtttta tttgccttct     2400

tttcacacca ttggtgaggg agggattacc accctggggt tatgaagatg gttgaacacc     2460

ccacacatag caccggagat atgagatcaa cagtttctta gccatagaga ttcacagccc     2520

agagcaggag gacgctgcac accatgcagg atgacatggg ggatgcgctc gggattggtg     2580

tgaagaagca aggactgtta gaggcaggct ttatagtaac aagacggtgg ggcaaactct     2640

gatttccgtg ggggaatgtc atggtcttgc tttactaagt tttgagactg gcaggtagtg     2700

aaactcatta ggctgagaac cttgtggaat gcagctgacc cagctgatag aggaagtagc     2760

caggtgggag cctttcccag tgggtgtggg acatatctgg caagattttg tggcactcct     2820

ggttacagat actggggcag caaataaaac tgaatcttgt tttcagacct taaaaaaaaa     2880

aaaaaaaaaa aa                                                         2892


<210>  5
<211>  238
<212>  PRT
<213>  Homo sapiens


<220>
<221>  Homo sapiens kallikrein-related peptidase 3 (KLK3), transcript 
       variant 3,
<222>  (1)..(238)

<400>  5

Met Trp Val Pro Val Val Phe Leu Thr Leu Ser Val Thr Trp Ile Gly 
1               5                   10                  15      


Ala Ala Pro Leu Ile Leu Ser Arg Ile Val Gly Gly Trp Glu Cys Glu 
            20                  25                  30          


Lys His Ser Gln Pro Trp Gln Val Leu Val Ala Ser Arg Gly Arg Ala 
        35                  40                  45              


Val Cys Gly Gly Val Leu Val His Pro Gln Trp Val Leu Thr Ala Ala 
    50                  55                  60                  


His Cys Ile Arg Asn Lys Ser Val Ile Leu Leu Gly Arg His Ser Leu 
65                  70                  75                  80  


Phe His Pro Glu Asp Thr Gly Gln Val Phe Gln Val Ser His Ser Phe 
                85                  90                  95      


Pro His Pro Leu Tyr Asp Met Ser Leu Leu Lys Asn Arg Phe Leu Arg 
            100                 105                 110         


Pro Gly Asp Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu 
        115                 120                 125             


Pro Ala Glu Leu Thr Asp Ala Val Lys Val Met Asp Leu Pro Thr Gln 
    130                 135                 140                 


Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser Ile 
145                 150                 155                 160 


Glu Pro Glu Glu Phe Leu Thr Pro Lys Lys Leu Gln Cys Val Asp Leu 
                165                 170                 175     


His Val Ile Ser Asn Asp Val Cys Ala Gln Val His Pro Gln Lys Val 
            180                 185                 190         


Thr Lys Phe Met Leu Cys Ala Gly Arg Trp Thr Gly Gly Lys Ser Thr 
        195                 200                 205             


Cys Ser Trp Val Ile Leu Ile Thr Glu Leu Thr Met Pro Ala Leu Pro 
    210                 215                 220                 


Met Val Leu His Gly Ser Leu Val Pro Trp Arg Gly Gly Val 
225                 230                 235             


<210>  6
<211>  1906
<212>  DNA
<213>  Homo sapiens


<220>
<221>  Homo sapiens kallikrein-related peptidase 3 (KLK3),
<222>  (1)..(1906)

<400>  6
agccccaagc ttaccacctg cacccggaga gctgtgtcac catgtgggtc ccggttgtct       60

tcctcaccct gtccgtgacg tggattggtg ctgcacccct catcctgtct cggattgtgg      120

gaggctggga gtgcgagaag cattcccaac cctggcaggt gcttgtggcc tctcgtggca      180

gggcagtctg cggcggtgtt ctggtgcacc cccagtgggt cctcacagct gcccactgca      240

tcaggaacaa aagcgtgatc ttgctgggtc ggcacagcct gtttcatcct gaagacacag      300

gccaggtatt tcaggtcagc cacagcttcc cacacccgct ctacgatatg agcctcctga      360

agaatcgatt cctcaggcca ggtgatgact ccagccacga cctcatgctg ctccgcctgt      420

cagagcctgc cgagctcacg gatgctgtga aggtcatgga cctgcccacc caggagccag      480

cactggggac cacctgctac gcctcaggct ggggcagcat tgaaccagag gagttcttga      540

ccccaaagaa acttcagtgt gtggacctcc atgttatttc caatgacgtg tgtgcgcaag      600

ttcaccctca gaaggtgacc aagttcatgc tgtgtgctgg acgctggaca gggggcaaaa      660

gcacctgctc gtgggtcatt ctgatcaccg aactgaccat gccagccctg ccgatggtcc      720

tccatggctc cctagtgccc tggagaggag gtgtctagtc agagagtagt cctggaaggt      780

ggcctctgtg aggagccacg gggacagcat cctgcagatg gtcctggccc ttgtcccacc      840

gacctgtcta caaggactgt cctcgtggac cctcccctct gcacaggagc tggaccctga      900

agtcccttcc ccaccggcca ggactggagc ccctacccct ctgttggaat ccctgcccac      960

cttcttctgg aagtcggctc tggagacatt tctctcttct tccaaagctg ggaactgcta     1020

tctgttatct gcctgtccag gtctgaaaga taggattgcc caggcagaaa ctgggactga     1080

cctatctcac tctctccctg cttttaccct tagggtgatt ctgggggccc acttgtctgt     1140

aatggtgtgc ttcaaggtat cacgtcatgg ggcagtgaac catgtgccct gcccgaaagg     1200

ccttccctgt acaccaaggt ggtgcattac cggaagtgga tcaaggacac catcgtggcc     1260

aacccctgag cacccctatc aaccccctat tgtagtaaac ttggaacctt ggaaatgacc     1320

aggccaagac tcaagcctcc ccagttctac tgacctttgt ccttaggtgt gaggtccagg     1380

gttgctagga aaagaaatca gcagacacag gtgtagacca gagtgtttct taaatggtgt     1440

aattttgtcc tctctgtgtc ctggggaata ctggccatgc ctggagacat atcactcaat     1500

ttctctgagg acacagatag gatggggtgt ctgtgttatt tgtggggtac agagatgaaa     1560

gaggggtggg atccacactg agagagtgga gagtgacatg tgctggacac tgtccatgaa     1620

gcactgagca gaagctggag gcacaacgca ccagacactc acagcaagga tggagctgaa     1680

aacataaccc actctgtcct ggaggcactg ggaagcctag agaaggctgt gagccaagga     1740

gggagggtct tcctttggca tgggatgggg atgaagtaag gagagggact ggaccccctg     1800

gaagctgatt cactatgggg ggaggtgtat tgaagtcctc cagacaaccc tcagatttga     1860

tgatttccta gtagaactca cagaaataaa gagctgttat actgtg                    1906


<210>  7
<211>  920
<212>  PRT
<213>  Homo sapiens


<220>
<221>  AR protein sequence
<222>  (1)..(920)

<400>  7

Met Glu Val Gln Leu Gly Leu Gly Arg Val Tyr Pro Arg Pro Pro Ser 
1               5                   10                  15      


Lys Thr Tyr Arg Gly Ala Phe Gln Asn Leu Phe Gln Ser Val Arg Glu 
            20                  25                  30          


Val Ile Gln Asn Pro Gly Pro Arg His Pro Glu Ala Ala Ser Ala Ala 
        35                  40                  45              


Pro Pro Gly Ala Ser Leu Leu Leu Leu Gln Gln Gln Gln Gln Gln Gln 
    50                  55                  60                  


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
65                  70                  75                  80  


Glu Thr Ser Pro Arg Gln Gln Gln Gln Gln Gln Gly Glu Asp Gly Ser 
                85                  90                  95      


Pro Gln Ala His Arg Arg Gly Pro Thr Gly Tyr Leu Val Leu Asp Glu 
            100                 105                 110         


Glu Gln Gln Pro Ser Gln Pro Gln Ser Ala Leu Glu Cys His Pro Glu 
        115                 120                 125             


Arg Gly Cys Val Pro Glu Pro Gly Ala Ala Val Ala Ala Ser Lys Gly 
    130                 135                 140                 


Leu Pro Gln Gln Leu Pro Ala Pro Pro Asp Glu Asp Asp Ser Ala Ala 
145                 150                 155                 160 


Pro Ser Thr Leu Ser Leu Leu Gly Pro Thr Phe Pro Gly Leu Ser Ser 
                165                 170                 175     


Cys Ser Ala Asp Leu Lys Asp Ile Leu Ser Glu Ala Ser Thr Met Gln 
            180                 185                 190         


Leu Leu Gln Gln Gln Gln Gln Glu Ala Val Ser Glu Gly Ser Ser Ser 
        195                 200                 205             


Gly Arg Ala Arg Glu Ala Ser Gly Ala Pro Thr Ser Ser Lys Asp Asn 
    210                 215                 220                 


Tyr Leu Gly Gly Thr Ser Thr Ile Ser Asp Asn Ala Lys Glu Leu Cys 
225                 230                 235                 240 


Lys Ala Val Ser Val Ser Met Gly Leu Gly Val Glu Ala Leu Glu His 
                245                 250                 255     


Leu Ser Pro Gly Glu Gln Leu Arg Gly Asp Cys Met Tyr Ala Pro Leu 
            260                 265                 270         


Leu Gly Val Pro Pro Ala Val Arg Pro Thr Pro Cys Ala Pro Leu Ala 
        275                 280                 285             


Glu Cys Lys Gly Ser Leu Leu Asp Asp Ser Ala Gly Lys Ser Thr Glu 
    290                 295                 300                 


Asp Thr Ala Glu Tyr Ser Pro Phe Lys Gly Gly Tyr Thr Lys Gly Leu 
305                 310                 315                 320 


Glu Gly Glu Ser Leu Gly Cys Ser Gly Ser Ala Ala Ala Gly Ser Ser 
                325                 330                 335     


Gly Thr Leu Glu Leu Pro Ser Thr Leu Ser Leu Tyr Lys Ser Gly Ala 
            340                 345                 350         


Leu Asp Glu Ala Ala Ala Tyr Gln Ser Arg Asp Tyr Tyr Asn Phe Pro 
        355                 360                 365             


Leu Ala Leu Ala Gly Pro Pro Pro Pro Pro Pro Pro Pro His Pro His 
    370                 375                 380                 


Ala Arg Ile Lys Leu Glu Asn Pro Leu Asp Tyr Gly Ser Ala Trp Ala 
385                 390                 395                 400 


Ala Ala Ala Ala Gln Cys Arg Tyr Gly Asp Leu Ala Ser Leu His Gly 
                405                 410                 415     


Ala Gly Ala Ala Gly Pro Gly Ser Gly Ser Pro Ser Ala Ala Ala Ser 
            420                 425                 430         


Ser Ser Trp His Thr Leu Phe Thr Ala Glu Glu Gly Gln Leu Tyr Gly 
        435                 440                 445             


Pro Cys Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
    450                 455                 460                 


Gly Gly Gly Gly Gly Gly Gly Gly Gly Glu Ala Gly Ala Val Ala Pro 
465                 470                 475                 480 


Tyr Gly Tyr Thr Arg Pro Pro Gln Gly Leu Ala Gly Gln Glu Ser Asp 
                485                 490                 495     


Phe Thr Ala Pro Asp Val Trp Tyr Pro Gly Gly Met Val Ser Arg Val 
            500                 505                 510         


Pro Tyr Pro Ser Pro Thr Cys Val Lys Ser Glu Met Gly Pro Trp Met 
        515                 520                 525             


Asp Ser Tyr Ser Gly Pro Tyr Gly Asp Met Arg Leu Glu Thr Ala Arg 
    530                 535                 540                 


Asp His Val Leu Pro Ile Asp Tyr Tyr Phe Pro Pro Gln Lys Thr Cys 
545                 550                 555                 560 


Leu Ile Cys Gly Asp Glu Ala Ser Gly Cys His Tyr Gly Ala Leu Thr 
                565                 570                 575     


Cys Gly Ser Cys Lys Val Phe Phe Lys Arg Ala Ala Glu Gly Lys Gln 
            580                 585                 590         


Lys Tyr Leu Cys Ala Ser Arg Asn Asp Cys Thr Ile Asp Lys Phe Arg 
        595                 600                 605             


Arg Lys Asn Cys Pro Ser Cys Arg Leu Arg Lys Cys Tyr Glu Ala Gly 
    610                 615                 620                 


Met Thr Leu Gly Ala Arg Lys Leu Lys Lys Leu Gly Asn Leu Lys Leu 
625                 630                 635                 640 


Gln Glu Glu Gly Glu Ala Ser Ser Thr Thr Ser Pro Thr Glu Glu Thr 
                645                 650                 655     


Thr Gln Lys Leu Thr Val Ser His Ile Glu Gly Tyr Glu Cys Gln Pro 
            660                 665                 670         


Ile Phe Leu Asn Val Leu Glu Ala Ile Glu Pro Gly Val Val Cys Ala 
        675                 680                 685             


Gly His Asp Asn Asn Gln Pro Asp Ser Phe Ala Ala Leu Leu Ser Ser 
    690                 695                 700                 


Leu Asn Glu Leu Gly Glu Arg Gln Leu Val His Val Val Lys Trp Ala 
705                 710                 715                 720 


Lys Ala Leu Pro Gly Phe Arg Asn Leu His Val Asp Asp Gln Met Ala 
                725                 730                 735     


Val Ile Gln Tyr Ser Trp Met Gly Leu Met Val Phe Ala Met Gly Trp 
            740                 745                 750         


Arg Ser Phe Thr Asn Val Asn Ser Arg Met Leu Tyr Phe Ala Pro Asp 
        755                 760                 765             


Leu Val Phe Asn Glu Tyr Arg Met His Lys Ser Arg Met Tyr Ser Gln 
    770                 775                 780                 


Cys Val Arg Met Arg His Leu Ser Gln Glu Phe Gly Trp Leu Gln Ile 
785                 790                 795                 800 


Thr Pro Gln Glu Phe Leu Cys Met Lys Ala Leu Leu Leu Phe Ser Ile 
                805                 810                 815     


Ile Pro Val Asp Gly Leu Lys Asn Gln Lys Phe Phe Asp Glu Leu Arg 
            820                 825                 830         


Met Asn Tyr Ile Lys Glu Leu Asp Arg Ile Ile Ala Cys Lys Arg Lys 
        835                 840                 845             


Asn Pro Thr Ser Cys Ser Arg Arg Phe Tyr Gln Leu Thr Lys Leu Leu 
    850                 855                 860                 


Asp Ser Val Gln Pro Ile Ala Arg Glu Leu His Gln Phe Thr Phe Asp 
865                 870                 875                 880 


Leu Leu Ile Lys Ser His Met Val Ser Val Asp Phe Pro Glu Met Met 
                885                 890                 895     


Ala Glu Ile Ile Ser Val Gln Val Pro Lys Ile Leu Ser Gly Lys Val 
            900                 905                 910         


Lys Pro Ile Tyr Phe His Thr Gln 
        915                 920 


<210>  8
<211>  4314
<212>  DNA
<213>  Homo sapiens


<220>
<221>  AR cDNA sequence
<222>  (1)..(4314)

<400>  8
cgagatcccg gggagccagc ttgctgggag agcgggacgg tccggagcaa gcccagaggc       60

agaggaggcg acagagggaa aaagggccga gctagccgct ccagtgctgt acaggagccg      120

aagggacgca ccacgccagc cccagcccgg ctccagcgac agccaacgcc tcttgcagcg      180

cggcggcttc gaagccgccg cccggagctg ccctttcctc ttcggtgaag tttttaaaag      240

ctgctaaaga ctcggaggaa gcaaggaaag tgcctggtag gactgacggc tgcctttgtc      300

ctcctcctct ccaccccgcc tccccccacc ctgccttccc cccctccccc gtcttctctc      360

ccgcagctgc ctcagtcggc tactctcagc caacccccct caccaccctt ctccccaccc      420

gcccccccgc ccccgtcggc ccagcgctgc cagcccgagt ttgcagagag gtaactccct      480

ttggctgcga gcgggcgagc tagctgcaca ttgcaaagaa ggctcttagg agccaggcga      540

ctggggagcg gcttcagcac tgcagccacg acccgcctgg ttaggctgca cgcggagaga      600

accctctgtt ttcccccact ctctctccac ctcctcctgc cttccccacc ccgagtgcgg      660

agccagagat caaaagatga aaaggcagtc aggtcttcag tagccaaaaa acaaaacaaa      720

caaaaacaaa aaagccgaaa taaaagaaaa agataataac tcagttctta tttgcaccta      780

cttcagtgga cactgaattt ggaaggtgga ggattttgtt tttttctttt aagatctggg      840

catcttttga atctaccctt caagtattaa gagacagact gtgagcctag cagggcagat      900

cttgtccacc gtgtgtcttc ttctgcacga gactttgagg ctgtcagagc gctttttgcg      960

tggttgctcc cgcaagtttc cttctctgga gcttcccgca ggtgggcagc tagctgcagc     1020

gactaccgca tcatcacagc ctgttgaact cttctgagca agagaagggg aggcggggta     1080

agggaagtag gtggaagatt cagccaagct caaggatgga agtgcagtta gggctgggaa     1140

gggtctaccc tcggccgccg tccaagacct accgaggagc tttccagaat ctgttccaga     1200

gcgtgcgcga agtgatccag aacccgggcc ccaggcaccc agaggccgcg agcgcagcac     1260

ctcccggcgc cagtttgctg ctgctgcagc agcagcagca gcagcagcag cagcagcagc     1320

agcagcagca gcagcagcag cagcagcagc agcaagagac tagccccagg cagcagcagc     1380

agcagcaggg tgaggatggt tctccccaag cccatcgtag aggccccaca ggctacctgg     1440

tcctggatga ggaacagcaa ccttcacagc cgcagtcggc cctggagtgc caccccgaga     1500

gaggttgcgt cccagagcct ggagccgccg tggccgccag caaggggctg ccgcagcagc     1560

tgccagcacc tccggacgag gatgactcag ctgccccatc cacgttgtcc ctgctgggcc     1620

ccactttccc cggcttaagc agctgctccg ctgaccttaa agacatcctg agcgaggcca     1680

gcaccatgca actccttcag caacagcagc aggaagcagt atccgaaggc agcagcagcg     1740

ggagagcgag ggaggcctcg ggggctccca cttcctccaa ggacaattac ttagggggca     1800

cttcgaccat ttctgacaac gccaaggagt tgtgtaaggc agtgtcggtg tccatgggcc     1860

tgggtgtgga ggcgttggag catctgagtc caggggaaca gcttcggggg gattgcatgt     1920

acgccccact tttgggagtt ccacccgctg tgcgtcccac tccttgtgcc ccattggccg     1980

aatgcaaagg ttctctgcta gacgacagcg caggcaagag cactgaagat actgctgagt     2040

attccccttt caagggaggt tacaccaaag ggctagaagg cgagagccta ggctgctctg     2100

gcagcgctgc agcagggagc tccgggacac ttgaactgcc gtctaccctg tctctctaca     2160

agtccggagc actggacgag gcagctgcgt accagagtcg cgactactac aactttccac     2220

tggctctggc cggaccgccg ccccctccgc cgcctcccca tccccacgct cgcatcaagc     2280

tggagaaccc gctggactac ggcagcgcct gggcggctgc ggcggcgcag tgccgctatg     2340

gggacctggc gagcctgcat ggcgcgggtg cagcgggacc cggttctggg tcaccctcag     2400

ccgccgcttc ctcatcctgg cacactctct tcacagccga agaaggccag ttgtatggac     2460

cgtgtggtgg tggtgggggt ggtggcggcg gcggcggcgg cggcggcggc ggcggcggcg     2520

gcggcggcgg cggcgaggcg ggagctgtag ccccctacgg ctacactcgg ccccctcagg     2580

ggctggcggg ccaggaaagc gacttcaccg cacctgatgt gtggtaccct ggcggcatgg     2640

tgagcagagt gccctatccc agtcccactt gtgtcaaaag cgaaatgggc ccctggatgg     2700

atagctactc cggaccttac ggggacatgc gtttggagac tgccagggac catgttttgc     2760

ccattgacta ttactttcca ccccagaaga cctgcctgat ctgtggagat gaagcttctg     2820

ggtgtcacta tggagctctc acatgtggaa gctgcaaggt cttcttcaaa agagccgctg     2880

aagggaaaca gaagtacctg tgcgccagca gaaatgattg cactattgat aaattccgaa     2940

ggaaaaattg tccatcttgt cgtcttcgga aatgttatga agcagggatg actctgggag     3000

cccggaagct gaagaaactt ggtaatctga aactacagga ggaaggagag gcttccagca     3060

ccaccagccc cactgaggag acaacccaga agctgacagt gtcacacatt gaaggctatg     3120

aatgtcagcc catctttctg aatgtcctgg aagccattga gccaggtgta gtgtgtgctg     3180

gacacgacaa caaccagccc gactcctttg cagccttgct ctctagcctc aatgaactgg     3240

gagagagaca gcttgtacac gtggtcaagt gggccaaggc cttgcctggc ttccgcaact     3300

tacacgtgga cgaccagatg gctgtcattc agtactcctg gatggggctc atggtgtttg     3360

ccatgggctg gcgatccttc accaatgtca actccaggat gctctacttc gcccctgatc     3420

tggttttcaa tgagtaccgc atgcacaagt cccggatgta cagccagtgt gtccgaatga     3480

ggcacctctc tcaagagttt ggatggctcc aaatcacccc ccaggaattc ctgtgcatga     3540

aagcactgct actcttcagc attattccag tggatgggct gaaaaatcaa aaattctttg     3600

atgaacttcg aatgaactac atcaaggaac tcgatcgtat cattgcatgc aaaagaaaaa     3660

atcccacatc ctgctcaaga cgcttctacc agctcaccaa gctcctggac tccgtgcagc     3720

ctattgcgag agagctgcat cagttcactt ttgacctgct aatcaagtca cacatggtga     3780

gcgtggactt tccggaaatg atggcagaga tcatctctgt gcaagtgccc aagatccttt     3840

ctgggaaagt caagcccatc tatttccaca cccagtgaag cattggaaac cctatttccc     3900

caccccagct catgccccct ttcagatgtc ttctgcctgt tataactctg cactactcct     3960

ctgcagtgcc ttggggaatt tcctctattg atgtacagtc tgtcatgaac atgttcctga     4020

attctatttg ctgggctttt tttttctctt tctctccttt ctttttcttc ttccctccct     4080

atctaaccct cccatggcac cttcagactt tgcttcccat tgtggctcct atctgtgttt     4140

tgaatggtgt tgtatgcctt taaatctgtg atgatcctca tatggcccag tgtcaagttg     4200

tgcttgttta cagcactact ctgtgccagc cacacaaacg tttacttatc ttatgccacg     4260

ggaagtttag agagctaaga ttatctgggg aaatcaaaac aaaaacaagc aaac           4314


