LOCUS HUMAF4Y 9390 bp mRNA PRI 31-DEC-1994 DEFINITION Human AF-4 mRNA, complete cds. ACCESSION L13773 NID g306446 VERSION L13773.1 GI:306446 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9390) AUTHORS Nakamura,T., Alder,H., Gu,Y., Prasad,R., Canaani,O., Kamada,N., Gale,R.P., Lange,B., Crist,W.M., Nowell,P.C., Croce,C.M. and Canaani,E. TITLE Genes on chromosomes 4, 9, and 19 involved in 11q23 abnormalities in acute leukemia share sequence homology and/or common motifs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (10), 4631-4635 (1993) MEDLINE 93281633 FEATURES Location/Qualifiers source 1. .9390 /organism="Homo sapiens" /db_xref="taxon:9606" /germline gene 421. .4053 /gene="AF-4" CDS 421. .4053 /gene="AF-4" /codon_start=1 /protein_id="AAA58360.1" /db_xref="PID:g306447" /db_xref="GI:306447" /translation="MAAQSSLYNDDRNLLRIREKERRNQEAHQEKEAFPEKIPLFGEP YKTAKGDELSSRIQNMLGNYEEVKEFLSTKSHTHRLDASENRLGKPKYPLIPDKGSSI PSSSFHTSVHHQSIHTPASGPLSVGNISHNPKMAQPRTEPMPSLHAKSCGPPDSQHLT QDRLGQEGFGSSHHKKGDRRADGDHCASVTDSAPERELSPLISLPSPVPPLSPIHSNQ QTLPRTQGSSKVHGSSNNSKGYCPAKSPKDLAVKVHDKETPQDSLVAPAQPPSQTFPP PSLPSKSVAMQQKPTAYVRPMDGQDQAPSESPELKPLPEDYRQQTFEKTDLKVPAKAK LTKLKMPSQSVEQTYSNEVHCVEEILKEMTHSWPPPLTAIHTPSTAEPSKFPFPTKDS QHVSSVTQNQKQYDTSSKTHSNSQQGTSSMLEDDLQLSDSEDSDSEQTPEKPPSSSAP PSAPQSLPEPVASAHSSSAESESTSDSDSSSDSESESSSSDSEENEPLETPAPEPEPP TTNKWQLDNWLTKVSQPAAPPEGPRSTEPPRRHPESKGSSDSATSQEHSESKDPPPKS SSKAPRAPPEAPHPGKRSCQKSPAQQEPPQRQTVGTKQPKKPVKASARAGSRTSLQGE REPGLLPYGSRDQTSKDKPKVKTKGRPRAAASNEPKPAVPPSSEKKKHKSSLPAPSKA LSGPEPAKDNVEDRTPEHFALVPLTESQGPPHSGSGSRTSGCRQAVVVQEDSRKDRLP LPLRDTKLLSPLRDTPPPQSLMVKITLDLLSRIPQPPGKGSRQRKAEDKQPPAGKKHS SEKRSSDSSSKLAKKRKGEAERDCDNKKIRLEKEIKSQSSSSSSSHKESSKTKPSRPS SQSSKKEMLPPPPVSSSSQKPAKPALKRSRREADTCGQDPPKSASSTKSNHKDSSIPK QRRVEGKGSRSSSEHKGSSGDTANPFPVPSLPNGNSKPGKPQVKFDKQQADLHMREAK 
KMKQKAELMTDRVGKAFKYLEAVLSFIECGIATESESQSSKSAYSVYSETVDLIKFIM SLKSFSDATAPTQEKIFAVLCMRCQSILNMAMFRCKKDIAIKYSRTLNKHFESSSKVA QAPSPCIASTGTPSPLSPMPSPASSVGSQSSAGSVGSSGVAATISTPVTIQNMTSSYV TITSHVLTAFDLWEQAEALTRKNKEFFARLSTNVCTLALNSSLVDLVHYTRQGFQQLQ ELTKTP" BASE COUNT 2609 a 2179 c 2063 g 2539 t ORIGIN 1 GGCAATTTCT TTTCCTTTCT AACTGTGGCC CGCGTTGTGC TGTTGCTGGG 51 CAGGCGTTGG GCGCCGGCGG TCTTCGAGCG TGGGGGCCCG CTGGCTTTCC 101 CTTCTCAGAA ACTGCGCCGG GGGCGCTCGC TTGCCCCGGA TTCGGACGCG 151 GCGCTCCCCG GGCTCGTCTG AAGTGCAGAT CGCCGCAGAG GCCCCAGTGC 201 CCGGATGTCC ATCAGGATTA GCGCGAGCCA ATACGGGCCG AGCCCGGGGC 251 TGCGCCGAGG ACGCCCGGGG CTCGAGAGCA GGTAGTCCCG TAACATCGGG 301 GCGCCGCGCC GGGACGCGTC CCCGCCCGGC TCCGCCAAAT GGTGAGCGCG 351 GCGCTGGCAG CAGGGCCCGC GGGGTGAAGG CGCTCATGGA CGGAAGACCC 401 CTGGCTCTAT AAGCTGAATT ATGGCAGCCC AGTCAAGTTT GTACAATGAC 451 GACAGAAACC TGCTTCGAAT TAGAGAGAAG GAAAGACGCA ACCAGGAAGC 501 CCACCAAGAG AAAGAGGCAT TTCCTGAAAA GATTCCCCTT TTTGGAGAGC 551 CCTACAAGAC AGCAAAAGGT GATGAGCTGT CTAGTCGAAT ACAGAACATG 601 TTGGGAAACT ACGAAGAAGT GAAGGAGTTC CTTAGTACTA AGTCTCACAC 651 TCATCGCCTG GATGCTTCTG AAAATAGGTT GGGAAAGCCG AAATATCCTT 701 TAATTCCTGA CAAAGGGAGC AGCATTCCAT CCAGCTCCTT CCACACTAGT 751 GTCCACCACC AGTCCATTCA CACTCCTGCG TCTGGACCAC TTTCTGTTGG 801 CAACATTAGC CACAATCCAA AGATGGCGCA GCCAAGAACT GAACCAATGC 851 CAAGTCTCCA TGCCAAAAGC TGCGGCCCAC CGGACAGCCA GCACCTGACC 901 CAGGATCGCC TTGGTCAGGA GGGGTTCGGC TCTAGTCATC ACAAGAAAGG 951 TGACCGAAGA GCTGACGGAG ACCACTGTGC TTCGGTGACA GATTCGGCTC 1001 CAGAGAGGGA GCTTTCTCCC TTAATCTCTT TGCCTTCCCC AGTTCCCCCT 1051 TTGTCACCTA TACATTCCAA CCAGCAAACT CTTCCCCGGA CGCAAGGAAG 1101 CAGCAAGGTT CATGGCAGCA GCAATAACAG TAAAGGCTAT TGCCCAGCCA 1151 AATCTCCCAA GGACCTAGCA GTGAAAGTCC ATGATAAAGA GACCCCTCAA 1201 GACAGTTTGG TGGCCCCTGC CCAGCCGCCT TCTCAGACAT TTCCACCTCC 1251 CTCCCTCCCC TCAAAAAGTG TTGCAATGCA GCAGAAGCCC ACGGCTTATG 1301 TCCGGCCCAT GGATGGTCAA GATCAGGCCC CTAGTGAATC CCCTGAACTG 1351 AAACCACTGC CGGAGGACTA TCGACAGCAG ACCTTTGAAA AAACAGACTT 1401 GAAAGTGCCT GCCAAAGCCA AGCTCACCAA ACTGAAGATG 
CCTTCTCAGT 1451 CAGTTGAGCA GACCTACTCC AATGAAGTCC ATTGTGTTGA AGAGATTCTG 1501 AAGGAAATGA CCCATTCATG GCCGCCTCCT TTGACAGCAA TACATACGCC 1551 TAGTACAGCT GAGCCATCCA AGTTTCCTTT CCCTACAAAG GACTCTCAGC 1601 ATGTCAGTTC TGTAACCCAA AACCAAAAAC AATATGATAC ATCTTCAAAA 1651 ACTCACTCAA ATTCTCAGCA AGGAACGTCA TCCATGCTCG AAGACGACCT 1701 TCAGCTCAGT GACAGTGAGG ACAGTGACAG TGAACAAACC CCAGAGAAGC 1751 CTCCCTCCTC ATCTGCACCT CCAAGTGCTC CACAGTCCCT TCCAGAACCA 1801 GTGGCATCAG CACATTCCAG CAGTGCAGAG TCAGAAAGCA CCAGTGACTC 1851 AGACAGTTCC TCAGACTCAG AGAGCGAGAG CAGTTCAAGT GACAGCGAAG 1901 AAAATGAGCC CCTAGAAACC CCAGCTCCGG AGCCTGAGCC TCCAACAACA 1951 AACAAATGGC AGCTGGACAA CTGGCTGACC AAAGTCAGCC AGCCAGCTGC 2001 GCCACCAGAG GGCCCCAGGA GCACAGAGCC CCCACGGCGG CACCCAGAGA 2051 GTAAGGGCAG CAGCGACAGT GCCACGAGTC AGGAGCATTC TGAATCCAAA 2101 GATCCTCCCC CTAAAAGCTC CAGCAAAGCC CCCCGGGCCC CACCCGAAGC 2151 CCCCCACCCC GGAAAGAGGA GCTGTCAGAA GTCTCCGGCA CAGCAGGAGC 2201 CCCCACAAAG GCAAACCGTT GGAACCAAAC AACCCAAAAA ACCTGTCAAG 2251 GCCTCTGCCC GGGCAGGTTC ACGGACCAGC CTGCAGGGGG AAAGGGAGCC 2301 AGGGCTTCTT CCCTATGGCT CCCGAGACCA GACTTCCAAA GACAAGCCCA 2351 AGGTGAAGAC GAAAGGACGG CCCCGGGCCG CAGCAAGCAA CGAACCCAAG 2401 CCAGCAGTGC CCCCCTCCAG TGAGAAGAAG AAGCACAAGA GCTCCCTCCC 2451 TGCCCCCTCT AAGGCTCTCT CAGGCCCAGA ACCCGCGAAG GACAATGTGG 2501 AGGACAGGAC CCCTGAGCAC TTTGCTCTTG TTCCCCTGAC TGAGAGCCAG 2551 GGCCCACCCC ACAGTGGCAG CGGCAGCAGG ACTAGTGGCT GCCGCCAAGC 2601 CGTGGTGGTC CAGGAGGACA GCCGCAAAGA CAGACTCCCA TTGCCTTTGA 2651 GAGACACCAA GCTGCTCTCA CCGCTCAGGG ACACTCCTCC CCCACAAAGC 2701 TTGATGGTGA AGATCACCCT AGACCTGCTC TCTCGGATAC CCCAGCCTCC 2751 CGGGAAGGGG AGCCGCCAGA GGAAAGCAGA AGATAAACAG CCGCCCGCAG 2801 GGAAGAAGCA CAGCTCTGAG AAGAGGAGCT CAGACAGCTC AAGCAAGTTG 2851 GCCAAAAAGA GAAAGGGTGA AGCAGAAAGA GACTGTGATA ACAAGAAAAT 2901 CAGACTGGAG AAGGAAATCA AATCACAGTC ATCTTCATCT TCATCCTCCC 2951 ACAAAGAATC TTCTAAAACA AAGCCCTCCA GGCCCTCCTC ACAGTCCTCA 3001 AAGAAGGAAA TGCTCCCCCC GCCACCCGTG TCCTCGTCCT CCCAGAAGCC 3051 AGCCAAGCCT GCACTTAAGA GGTCAAGGCG GGAAGCAGAC ACCTGTGGCC 3101 
AGGACCCTCC CAAAAGTGCC AGCAGTACCA AGAGCAACCA CAAAGACTCT 3151 TCCATTCCCA AGCAGAGAAG AGTAGAGGGG AAGGGCTCCA GAAGCTCCTC 3201 GGAGCACAAG GGTTCTTCCG GAGATACTGC AAATCCTTTT CCAGTGCCTT 3251 CTTTGCCAAA TGGTAACTCT AAACCAGGGA AGCCTCAAGT GAAGTTTGAC 3301 AAACAACAAG CAGACCTTCA CATGAGGGAG GCAAAAAAGA TGAAGCAGAA 3351 AGCAGAGTTA ATGACGGACA GGGTTGGAAA GGCTTTTAAG TACCTGGAAG 3401 CCGTCTTGTC CTTCATTGAG TGCGGAATTG CCACAGAGTC TGAAAGCCAG 3451 TCATCCAAGT CAGCTTACTC TGTCTACTCA GAAACTGTAG ATCTCATTAA 3501 ATTCATAATG TCATTAAAAT CCTTCTCAGA TGCCACAGCG CCAACACAAG 3551 AGAAAATATT TGCTGTTTTA TGCATGCGTT GCCAGTCCAT TTTGAACATG 3601 GCGATGTTTC GTTGTAAAAA AGACATAGCA ATAAAGTATT CTCGTACTCT 3651 TAATAAACAC TTCGAGAGTT CTTCCAAAGT CGCCCAGGCA CCTTCTCCAT 3701 GCATTGCAAG CACAGGCACA CCATCCCCTC TTTCCCCAAT GCCTTCTCCT 3751 GCCAGCTCCG TAGGGTCCCA GTCAAGTGCT GGCAGTGTGG GGAGCAGTGG 3801 GGTGGCTGCC ACTATCAGCA CCCCAGTCAC CATCCAGAAT ATGACATCTT 3851 CCTATGTCAC CATCACATCC CATGTTCTTA CCGCCTTTGA CCTTTGGGAA 3901 CAGGCCGAGG CCCTCACGAG GAAGAATAAA GAATTCTTTG CTCGGCTCAG 3951 CACAAATGTG TGCACCTTGG CCCTCAACAG CAGTTTGGTG GACCTGGTGC 4001 ACTATACACG ACAGGGTTTT CAGCAGCTAC AAGAATTAAC CAAAACACCT 4051 TAATGGAGCC CCAGGTTGAT TCAATGCCTT GGGAACTATT TTTGCACATT 4101 GGAAGCCTCA AAAACAGTCC AGACGTTTGT TTCATCAGGA CACCAAACTC 4151 TAAAAAAGAA GCACCACGAG ATGGCCAGGA CATTTGTCCA CTTAAACTCT 4201 CAACAACAGT GTGATCATTG GTTGGACACT GTGGTTATGC AGAAGCAGAG 4251 ATGAGGAGGC TGGCCCCAGA GATGATCTTG CCCTTCCTAA CTAAAGGACA 4301 GAAGTGCAAT TTAGCTTAAA TGGGTGTATG AATGGTCTAG AAACATTTCT 4351 ATTTTTTTTT TAAACCAGCA GGATACAAGT TGCAAATGAA ATGAGGAGAA 4401 ACAGTTTCAA CTCTGAAAGT GAATTTCACG TCATCTCAGT AGCCACGCTA 4451 GTCCATTCCC AGAAGGAAAT TTTTTTTTTT AACAATGACT TTTGGTAAAG 4501 GGTTTTGTGG ATGATTTTTT TTCTTTTGAG TTTTGGGAGA AATATTTGTT 4551 TAATAACTTC TAATGGCCAT CTGTAAACCA TAAGTAATGA AGGACTCCAC 4601 TGTGCCCCAC TTTCTGCCAA TGAACAGTGG CTTGATAATA CCAAGTATTG 4651 TTGTAATTTA TAAAATTGAA GGCAACCCCC GCTCCTGCCG CCCCCAATCT 4701 CCCCATTGCC TAGAGCGCTG CACATTGACC CCAGCTCTGA CTTCTCATTA 4751 CTGTGCTGAA 
AGTCAGCCCA CGTCGGAGCG GTGAGGAGGA GCCACAGCAC 4801 ATGGGGTGCC ACCTCGAGGT CTGCACAGGA GGACTTGGCG CTGCCATTTC 4851 CTACCCCTGC CATTTCCCAC CCCTGCTTCA GCGAAAGGGA CTCTCTAACA 4901 GGGCAGTCAC TGTTGACTCT ATTCTGAATT TCCTCCCTTG GGGAAGAAGG 4951 GAACCAACAT TTATACCTGA CCAGATGGCT AAAGTGCTTT TAAAGTTTTG 5001 TTTAAGTAGA GCTGGAATTT GAGGTGCTGA TCTGTGGTCT ACAGTTATGT 5051 GGTAACTCAT GTTGTCCAGC CAACTCAGAG TTTCGTCAGT GAACAAGAAA 5101 CATGAAATCT GCTTCTTAGA GAGGCTATAT TTTTCTGCTA CAAATATTTT 5151 ATATTTATAG CAAAACTAGA CTTTCAGAGT CCTTGATTGT CTAGGGGAAG 5201 TTAACTCCCT GAGAGGATGT AGAGATTTGG GGTGGTTGAT TAGACTTTTG 5251 AAAAACTCAT CACCACATGC CTTCACTCCA GAGTGTTCTC AGCTAGATTT 5301 GATTTGGTTG AGGAGGAACT GTGGCCCTCC GTAAGTTATT GCCATAGTGT 5351 ATGCATTAAA CCAAGTCCAT TTTGAATGAC CTAAAATGAA GTAACACAAT 5401 CAGAAATCCC ATGTGCCCAT AAGCACAGAT TTTTCTTTTT CATTGAAACT 5451 TTAAAGGTTA TTATTGGAAA CATTACTTTG AGTGCAGTGT TTTTAAAAGC 5501 CAATTCTTTT TTATCCCTTT TAGAAGTAGA ATTTGCACAC TTACTACAAT 5551 TGAGGAGTGT CATCTCTATA ACTTTTTCTC CGCCTTTGTC CCATTCTGCC 5601 CCTGGACATG TTTCCTACCA AGCATGTTTC ACATTTTCCT ATTAGTGGAG 5651 GAGGGAGAAC CATATTTATT TATAATGAAG ACATCTAAGA TCCCTATGAT 5701 GAATGCAGGA ACTCTCTTGG TAGTTTGTAA ATACACAAAG GGATGTGTCG 5751 AGGGATGGGA GCGATGCTTA TCTCTCACAG TGTGAGTGGT CTGTGTGAGG 5801 CTGTTCCTTC AGTTCTTCTC CAGACTGTTC TTTGGTTGTC ACTTAAGTCA 5851 GAGGTCTGGT CCCTCATGTT TAGGTGAAAG CCAGAGAATG ACAGCTGTAG 5901 TCATATCTGA GCATAAGACC TTGATGTGTG ATTCCTGATG ACCGGTTTCA 5951 TTTATTCATG TAATAAAGCA AAGGCCCTGG TCCTTTTTAA ACTACTAGTT 6001 TTAAAAACCT GTGTTAAATG AACAGTAATT GCCTGGTAGG TTTGGTGTGT 6051 GTGTAGCATT GTGTGTCCAT CTGTTATATG TAAAGGACAA GGCACCAGAA 6101 TCAGGCTTTA TTTCGATATT GAAGATGTTA TTTAACATCT TTCTTTTTTC 6151 CTTACTCCCT TAGCCATCCC CTCCCCTTTT GTCCTATCAT TCCCTAGAAC 6201 AAGCCACCTG TCAATTGTGA AGGGTTGTGT TCTTTATGGC AGGTTCTATG 6251 CAGATTGTGC CAGAGCATGT GCGTGTTCTG TTGGCAAGCC ACAGTGCTCC 6301 CTTGACTGAA GACATTTCCA GGTAGATTTC TCAGCCAGCT CTAAAACAGA 6351 TTGCTTTTTC AGTGGCCTTA CTCTTTGTGG GTTTTTTTTT TTCTCTGAAC 6401 TTGATATAAA GATTTTATTT 
GTCCCTTGAA AAAGTAACAA ATGTGCATAG 6451 ATCAATTTGT ACTACTTTGG TCATTGGATA TTTCTGATCC TTATTGCATT 6501 GTACCTAAAG GAGAGTAACT AATGGTAACC TTTTTAATAG AGTATGTGAA 6551 AGGTAGTGGC TGATGAATCC TTAACGTTCA TAGGGTCTTT TTGCTGTTAC 6601 GGTTGTATAT AGAGGTCTGA AGGATTTTTA AAATGATTTG CACTTTTTCA 6651 CTGCATGCTT ACAATTCCCA AAGGCAAAAT CTGTACTGAG GTAGATCATT 6701 TGAAAGGGCT AGATTATAAA ATTAAGCCTT AGAGTATGGA AAGTTCTTAT 6751 AACAATAATA GTACACACTT CAGAGTAAGA CAAATGCAAA GCATCTTAAG 6801 GAGTGAAAAT AGAGTCTAAA TCTTGCCTTT GGCACTACAA GGTGTGTGTG 6851 TGTGTGTGTG TTGTGTGTCT TTAGTAGGAA ATGGAAGAAC ACTGTTTTAT 6901 TTTTTAAAGT GTTTAATGTT TCTGTCCTTT CTGTGAATTA TTGAATTTAA 6951 GAGCCCTGCT AAATAATGAA AAAACACTTT ACTAAAATTT ATCAAATTAT 7001 ACTGGGTTCG GATTGTGAAA ACATTGGCCA CCTAGTAGCA GTGGTGAGGA 7051 GTGGGAGGGC CCAGCAAGCA TTTATCAGAA ATAGAATCAC AATAGGAGGA 7101 GAATTTGGCT GTCTGATATT ATGATTTGAT TACAATACTG AATGGGAAAA 7151 GTATCTAATA TTTTGTAACA AAAAGACCTT CATATTATCT GTTTTGACCA 7201 AAATATGTAG CTATTTCCCT TACACAGATT GGACCGCACT TATCTCCCTT 7251 GTCCTGTATC CTTTAATTTC AGGTCTCAGG ATGTTTAGAA AGCTAAAACC 7301 CCCTACCCCT TTCTGGCTGA AAACTTGCCT TATTTGGTAT CTTACACATT 7351 AATGTTACTA GCATCAGGAG CTTACTGTTT TATTATGATT CATCTTCAGT 7401 AATTTTTAGA AGCAAGAAGA AAGCCATTGT GTCCTCTACA ATTAACAAAA 7451 CTTATCTCTG ATATACAAAG GGATATAAAT ATATACACTT AAATAGAGAA 7501 AAAGAGGTTG ATTGAATTGT GCCTTTGAGT GAACCCAGTT TTTAAATACC 7551 GCTGTGTTTG TTTCGCCATG GCTTCAGGGA TGCTACATGG CTCTTGCACC 7601 TTTTACTCCT CTGCTTTATG AAGTTTGAGT TGTATTTGTG CATCTTAAAG 7651 TAGGTTGAGG CTTGAGGCTG GGCTTTCGGG TTTTTTTGTT TTTTGTTTTG 7701 TTTTGTTTTG TTTTGTTTTC TTGTACTTAA ACCTGCTTGC TTCCTACCAC 7751 AGATTCTTTA TTTTCCCAAA CACTACAAAA AAACTTTTAA AACTTTGCCA 7801 TTTCATCTGT TTACACTCTT TGCCACTGAT TAGCAGTATT TAAATCTTGC 7851 AAGAATATTT TGTGCTTTCT TTAGAAACAC AAGAGTATAG ATTTTTCTCA 7901 CTGAAAAGTG AGAGTTACGC ATTGCAGCCA TGAAGGGATG CTAGGATCAA 7951 TTATGGCAGT ACCTTTTTTC CCCTCCTGTT CTTGAGCCAG TTGTCTCTTT 8001 TGTTTTGGGT CCCACTTAGG ATTAACGGAT GTAAGGTATT TTCCTGTGCC 8051 TTTATTTTGT GTCATTCTAT TGGAAGGAGG 
TGTAACGGCA GAATAGCATC 8101 GTGTTGGGGG TTTTCCTTCA AACACTGCAA GTGATATTGC CACCATGTGA 8151 ACCTCAAATA TGCAATCCAG TTGTGTTGGT TTCTCGGTGA CTTGGAGTGT 8201 TCATCTCTTC ATGAATTGTG AGCACTGACC ATGTTCTTCA GTTCTTAATT 8251 ATGGTGAGTT GACAAATACC AACTACTGCT TTTCTTTAGG TGGCTATAAA 8301 TTTCTTACTG TCAGGAGGAA ATGACATTAT ATTCTGTTCC ACTGAACGTC 8351 AGAGATCAGC AGGCACTGTA CTGGGTAGAG AAGTGCCTAT ACTTCTCTAC 8401 CTAAGAGGGC AGGAGGGAAA CCCTACAGCT CCTTGTGAGC CTATATATTA 8451 GTATATCGGC CTGGAGAGGA CAAGGGAATA AGACCACTCA TAGTGAGGCT 8501 GGCCAAGCTG CACTGGTCGG ACCAGGCAGT GGCTGACCTA AGGAAGGCAA 8551 CTTGCTTTGC TTAAAAGTAG ATTTTTTAAG CAATGCTTAA CACAGGCAGC 8601 ATTCACCTTT GTTCAGGCCA TCGACATGTA TTGTTAAAAT TACTGCATAT 8651 CCCCCTCAGA TATCAAGTAT ACACTGTTCA TGTTGGGGTT GTGTGTGTGT 8701 ATGTGTGTAT GTACGCACGC ATGTGTCCCA AATCTTGTTT TAATTTTTTT 8751 TTTCTGAATG TGATCATGTT TTGGATAATA CCTGAGCAGG GTTGCCTTTT 8801 TTTTATTTAT TACCATTATA TATTATATTA TATTATATAT TTTTTGCTTT 8851 CTTATAACTT TGGAGGAAAG TCAAATCTTG GTATTATTAA AATTGTTTTA 8901 AAAAGGAGTA AATTTTCCAG TTGATAAATG AAAATCACTG GCCTATGTTT 8951 AATAAGTTTT TCTTTAATTA CTGTGGAATA ACGTGCCAGC TATCATCAAC 9001 ACAATGATTT TGTACATAGG GTAGGGAAGC AGTGATGCTC TCAATGGGAA 9051 GATGTGCAAC ACAAATTAAG GGGAACTCCA TGTATTTTAC CTACTTCAGC 9101 AATGGAACTG CAACTTGGGG CTTTGTGAAT AAAATTTAGC TGCCTTGTAT 9151 AGTCGTTTGA AAGAATATGT GATCTGTGAG AGAATTATAG TTTTTTTTTA 9201 GAAGAAAAAT CTGCAAAAGA TCTTTCCAAA GACAATGTGC CACAGATCTT 9251 TTGTTCTCTG TAATGAGGAT TAATTGCTGT TTAAACAAAA ATGTAATTGT 9301 TCATCTTTAA ATTCTTTCCT TTTCATAAGA GGATCAAGCT GTAAAAAAAC 9351 AAAAAAATTA ATAAAAATTT CGAGAAATCA AAAAAAAAAA // LOCUS AF008915 7383 bp mRNA PRI 12-MAY-1998 DEFINITION Homo sapiens EVI5 homolog mRNA, complete cds. ACCESSION AF008915 NID g3093475 VERSION AF008915.1 GI:3093475 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7383) AUTHORS Roberts,T.P. and Cowell,J.C. 
TITLE Human EVI5 gene complete cDNA sequence JOURNAL Hum. Mol. Genet. (1998) In press REFERENCE 2 (bases 1 to 7383) AUTHORS Roberts,T.P. and Cowell,J.C. TITLE Direct Submission JOURNAL Submitted (17-JUN-1997) Neurosciences NC3-150, Cleveland Clinic Foundation, 9500 Euclid Avenue, Cleveland, OH 44195, USA FEATURES Location/Qualifiers source 1. .7383 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p22" /tissue_type="brain;liver" CDS 12. .2444 /function="cell cycle regulator" /note="similar to tre-2 oncogene and TBC1" /codon_start=1 /product="EVI-5 homolog" /protein_id="AAC16031.1" /db_xref="PID:g3093476" /db_xref="GI:3093476" /translation="MVTNKMTAAFRNPSGKQVATDKVAEKLSSTLSWVKNTVSHTVSQ MASQVASPSTSLHTTSSSTTLSTPALSPSSPSQLSPDVLELLAKLEEQNILLETDSKS LRSVNGSRRNSGSSLVSSSSASSNLSHLEEDSWILWGRIVNEWEDVRKKKEKQVKELV HKGIPHHFRAIVWQLLCSAQSMPIKDQYSELLKMTSPCEKLIRRDIARTYPEHNFFKE KDSLGQEVLFNVMKAYSLVDREVGYCQGSAFIVGLLLMQMPEEEAFCVFVKLMQDYRL RELFKPSMAELGLCMYQFECMIQEHLPELFVHFQSQSFHTSMYASSWFLTIFLTTFPL PVATRIFDIFMSEGLEIVFRVGLALLQMNQAELMQLDMEGMLQHFQKVIPHQFDGVPD KLIQAAYQVQYNSKKMKKLEKEYTTIKTKEMEEQVEIKRLRTENRLLKQRIETLEKHK CSSNYNEDFVLQLEKELVQARLSEAESQCALKEMQDKVLDIEKRNNSLPDENNIARLQ EELIAVKLREAEAIMGLKELRQQVKDLEEHWQRHLARTTGRWKDPPKKNAMNELQDEL MTIRLREAETQAEIREIKQRMMEMETQNQINSNHLRRAEQEVISLQEKVQYLSAQNKG LLTQLSEAKRKQAEIECKNKEEVMAVRLREADSIAAVAELRQHIAELEIQKEEGKLQG QLNKSDSNQYIGELKDQIAELNHELRCLKGQKGFSGQPPFDGIHIVNHLIGDDESFHS SDEDFIDNSLQETGVGFPLHGKSGSMSLDPAVADGSESETEDSVLETRESNQVVQKER PPRRRESYSTTV" BASE COUNT 2330 a 1296 c 1448 g 2309 t ORIGIN 1 TTCTGCTTAT CATGGTTACC AACAAAATGA CTGCTGCCTT TAGAAACCCT 51 AGTGGGAAAC AGGTGGCGAC AGACAAAGTT GCAGAAAAGC TGAGCTCTAC 101 TCTCTCATGG GTGAAGAACA CAGTATCGCA TACAGTCAGT CAGATGGCCA 151 GTCAGGTGGC AAGTCCATCT ACTTCATTAC ATACCACATC CTCATCTACC 201 ACACTATCAA CACCAGCCCT TTCACCATCT TCCCCATCAC AGTTGAGTCC 251 AGACGTCTTA GAACTCCTGG CTAAACTGGA AGAACAGAAT ATATTGTTAG 301 AAACGGATAG TAAGTCTTTA AGATCTGTAA ATGGGTCAAG AAGAAACAGT 351 GGCTCTTCTC TTGTGTCGAG TTCATCAGCC TCTAGCAACC 
TCAGTCACCT 401 TGAAGAAGAT TCTTGGATTC TTTGGGGAAG AATTGTTAAT GAATGGGAAG 451 ATGTACGCAA AAAGAAGGAA AAGCAAGTTA AGGAACTTGT TCATAAAGGG 501 ATACCCCATC ACTTTAGAGC AATAGTTTGG CAACTTTTAT GCAGTGCACA 551 AAGTATGCCA ATTAAGGATC AGTATTCAGA ACTCCTGAAA ATGACCTCGC 601 CTTGTGAAAA ATTGATCCGA AGGGACATTG CTAGAACTTA CCCTGAACAC 651 AACTTTTTTA AGGAAAAAGA TAGCCTTGGA CAGGAGGTTT TATTTAATGT 701 AATGAAGGCT TACTCTTTAG TAGATCGTGA GGTTGGTTAC TGTCAAGGAA 751 GTGCTTTTAT AGTTGGATTG TTGCTTATGC AGATGCCAGA AGAAGAAGCT 801 TTCTGTGTAT TTGTTAAATT AATGCAAGAT TATAGACTTC GTGAACTTTT 851 TAAACCAAGT ATGGCAGAAT TGGGCCTTTG TATGTACCAG TTTGAATGTA 901 TGATACAGGA GCATCTTCCA GAGCTCTTTG TACATTTTCA ATCTCAGAGT 951 TTTCATACCT CAATGTATGC ATCATCCTGG TTTCTGACTA TCTTTCTTAC 1001 AACTTTTCCA CTACCAGTTG CAACAAGGAT ATTTGATATC TTTATGTCTG 1051 AGGGTTTAGA AATAGTGTTT CGTGTAGGAT TAGCACTTCT TCAGATGAAT 1101 CAGGCAGAAC TGATGCAACT TGACATGGAA GGGATGTTAC AGCACTTTCA 1151 AAAGGTCATT CCACATCAGT TTGATGGTGT CCCAGACAAG CTAATCCAAG 1201 CAGCTTACCA AGTCCAATAC AATTCAAAAA AAATGAAAAA GCTTGAAAAG 1251 GAATACACTA CAATAAAAAC GAAAGAAATG GAAGAGCAAG TTGAAATTAA 1301 AAGGTTACGC ACAGAAAATA GACTTTTAAA ACAGCGCATC GAGACATTAG 1351 AAAAACATAA ATGCAGTTCC AACTACAACG AAGATTTTGT GCTACAGCTA 1401 GAGAAGGAAT TGGTCCAAGC CCGACTGAGT GAAGCTGAGT CTCAGTGTGC 1451 ATTAAAAGAG ATGCAGGATA AAGTCTTGGA TATAGAGAAG AGGAATAACT 1501 CCCTTCCTGA TGAGAATAAT ATTGCAAGGC TTCAGGAAGA ACTCATTGCT 1551 GTGAAACTTA GAGAAGCAGA AGCCATTATG GGTTTGAAAG AACTTAGACA 1601 GCAAGTCAAG GATTTAGAGG AACACTGGCA GCGCCACTTA GCTCGTACTA 1651 CTGGGAGATG GAAAGACCCA CCCAAGAAAA ATGCTATGAA TGAGTTACAG 1701 GATGAACTGA TGACCATTCG ACTTAGAGAA GCTGAAACAC AAGCAGAAAT 1751 AAGAGAAATA AAACAAAGGA TGATGGAAAT GGAAACACAG AACCAGATCA 1801 ATAGTAACCA TCTTCGAAGA GCAGAACAAG AGGTGATTAG CCTACAGGAG 1851 AAAGTGCAGT ATCTTTCTGC ACAGAACAAA GGACTCCTTA CTCAATTAAG 1901 TGAAGCAAAG CGTAAACAAG CAGAGATTGA ATGCAAGAAT AAGGAAGAAG 1951 TGATGGCTGT GAGGCTTCGG GAAGCAGATA GCATAGCTGC TGTGGCTGAA 2001 CTACGACAAC ACATTGCTGA GCTTGAAATC CAGAAAGAAG AAGGAAAGCT 2051 TCAAGGACAG 
CTTAACAAGT CTGATTCTAA CCAGTATATT GGGGAACTGA 2101 AAGATCAGAT AGCAGAGCTG AATCATGAGC TCAGGTGCCT AAAAGGCCAG 2151 AAGGGCTTCT CAGGCCAACC TCCTTTTGAT GGAATCCACA TTGTCAACCA 2201 TTTAATAGGA GATGATGAAT CATTCCATTC CTCCGATGAA GATTTTATAG 2251 ATAATTCCTT ACAGGAAACT GGTGTTGGTT TTCCTTTGCA CGGAAAATCT 2301 GGCTCGATGT CTTTGGACCC CGCAGTGGCA GATGGTAGTG AGAGCGAAAC 2351 AGAAGACAGT GTGCTGGAGA CCAGAGAGAG CAACCAAGTG GTTCAAAAGG 2401 AGCGGCCCCC GAGAAGAAGA GAGTCGTATT CAACCACTGT CTGACCATCA 2451 CTGTGACCTA GACTATGGAT TTATTTAAGG GATCAGTTAT CATATGATTA 2501 GGGCTTTTTG GAAATAATTT GTTTTCATAT GTATATATAT ATATATATAT 2551 ATATATTTTA CAGTCTTATC TGCCTTTTTA TCTTTGCCAA ATCTTTACCA 2601 CTGATTTACC ATTGCCATAA AGTAGACTGG TATATGGTTA ATGTGTAAAA 2651 CAAGGTTCTA ACAGTATTAC TGAATATTAA TGTTTCTTGT TTAGAAAATG 2701 CAACTATATC TATATGTGGG AACCATTTCT GAAGTTCAAA ACTATGGAAA 2751 ATTGAATTTT CTTCAAACAA AACTGCTCTT GTGCAATATT TTATGCTCAG 2801 TGAGCAAATT GTGTATTTAT GTTTATCAAT TTGTTCCCCA GGTGGCTGTC 2851 TACAAACATT TGACCAAAAC ATACATTTGA ATTACCTTGG GTTTTTTTTT 2901 ATTGGTGTTT TTTTTTTAAG ACAGAGATTC AGTTTGTCAC CCAGGCTGGA 2951 GTGCTGTGGT GTGATTTTAG TTCACTGCAA CTTCCGCTTC CCAGGTTCAA 3001 GCAATTTTCC TGCTTCAGCT TCCCGAGTAC CTGGGACTAT ATGTGCGCAC 3051 CACCACGCCT GGCTAATTTT TTGTATTTTT AGTAGAGATG GGGTTTCACC 3101 ATGTTGGCTG GGCTGGTCTC GAACTCCTAA CCTCAAGTGA TCCACCCGCC 3151 TCAGCCTCCC AAAGTGCTGG GATTACAGGT GTGAGCCACT GCACCCAGCC 3201 TACCTTGTCT GTTTGTGTCT GGGAGGTTTT TTTTTTCTAT TTTTATTTTT 3251 TCATGAAAAT TATTGGTGGG CCACTGAAAG TCCCCACACA CAAAGCCTTT 3301 ATTCTATATA ATTTTATAAA CACAAATTCA TGATTATCTG TTTTGAGAGT 3351 TTTAAGTTTT GTTTTAATGT TTAACTTTTA TGTGCATATG ATGCTTTCCA 3401 TGTGTTGGTT ACTAGAGTAG TAGGTTAACT ACAGACATAC TGTTTTGTTT 3451 GTACATATTT ATAAATCTGT ACCACCTAAC ATTGAACATC ATTTTATATG 3501 AAGAACATAC ATGTTGCAAA ATGACTGCTT TCAGCATCTA ACAGGACTTC 3551 TGTAAATAAT AGGTTAAAAG TTAAAAATAA AACACTAATG TTTTAAGAGC 3601 TTTAGTATTT TGCTTAGCAT TCAATACTAG CAGATTCATA ACTGATGCCT 3651 GTTCAAAAAC ACTGTTGGTG GAATCTTTTA TTTCATGTGT ATGTAACTAC 3701 TAAATTTTTC ATGACTGAAG 
AGGTTATAGC ACATATTAGT TAGTGCTAAT 3751 ATTCTACTGA ATATAATTTA AGCAATGGGC TAAGGTCCAA GAAACATAGC 3801 AATTATATCT ACAAATAATC TATTAACATG ATCTTCAGAA TCTCTAGTTT 3851 TTGTTCTATA GATAAAGCCA TAAGAATCCT TTCCTACCAG TTTTTCCTAT 3901 ATCATTTAGA TATTATTTCC TAGAAAACAG GATGCCTGTC ACTGATAAGT 3951 TAGAGAAAAC ACCCAAATTA TACTTGCACT ATGGAAAGAC GACATGGATT 4001 TGTTGTCTGG GTACTTTCTA CTCTCATTTT TGTTAGCCTA TGAAAAAAAA 4051 AAACATTTTG TTAACCTATG GAAAAAGCCT TGGAAGAGAA AAGTAGAATT 4101 CAGTATGTAT GGAGGCACAG TTTTGCACAG TATGTTGCAT GCGGCAAAAA 4151 AAAAAATATG AAATAGAGTA GAAGAGATAA AAGTAGTTTT CTGTCAAAAA 4201 CTAGGAGTTA AGCCTTCAGG TACGTTGTGG ACTAGGCTTT CCAGCAAGGT 4251 TTTTGTAGCT ATTTGCTTGC CTGAGAAATG AAATTGAAAA CATACCCCTG 4301 CGTGTCTGTC CTTAACACCA GGCTAAATTC TGGTGGTTAA ATAACTTACC 4351 TGTGGTTACA AGCTAGTAAG TAGCAGAGCC AAGATTTTTT TTAATATAGA 4401 GACAGGTGTC TCACTGTGTT GTCCAGGCTG GTCTTGAACT CCTGGCCTCA 4451 AGTGGTCCTC CTGCCTCAGC CTCCCAAAGT GTGGGGATTA CAGGTGCGAG 4501 CTACTGCTCC CAGCAAGAGC CAAGATTTTA ACCTAGATCT CCCTAACACC 4551 AAAGCTTTTG AAAGGGTTTC AGCTGAAGAA TTTCTTCAAT TCTTCTGTTT 4601 TGTTTTTATT TTTGTTTTTT GAGATGAAGT CTCACTGCAC TCAGGCTGGA 4651 GTGCAGTGGT ACGATCTTGG CTCACTGCAA CTTCCACTTC CCGGGTTCAA 4701 GCGATTCTCC TGCCTCAGCT CCCAAGTAGC TGGGACTACA GGTGCACGCC 4751 ACCACAACTG GCTAATTTTT GTATTTTTAG TAGAGATGGG GTTTCGCTAT 4801 GTTGGCCAGG CTGATCTCAA ACTTCTGACC TCAGGTGATC CACCCACTTT 4851 GACCTCCCAA AGTGCTGGGA TTACTAGACG TAAGCCACCA CACCTGGCAA 4901 TTCCCTGCTT TTTTGACAAC ATAATTTAAA TGAATTTGAT TTAAATGTTT 4951 CCTGAATATA TATACAGATA GATAGATGAG TATCAACACT TACAAGCTCT 5001 ACACAGTCTG TGAGTGTAGA TACTACTTAC TTGTAGGATT GTCTTGTGCT 5051 TTGGGGATTT TTGTTCGGAG TTCTATACTA TCAGCCCAGA AAGCTATCAA 5101 TGTCCAAGTG AAGTGAGATG ATGCATGCCA CATTATTGCT TTGACCTTCA 5151 CATCAAGAAA TCTGAATGAT CTGGACCAGG GTCAGCAAAG TGCAGCCTAC 5201 AAGTTATTGC AGTCAGCTGC CAGTTTTTAC TTTTTTAAAT AGTTGGGAAA 5251 AAAATTCCAA AGACTGATAT TTTGTGTCAC ATGAAAATTA TATGAAATTC 5301 AAATTGCAGT GTCCATAAGT AAAGTTTTAT TGGGGCACAC CCATGCTCAC 5351 CCTTTTAAAT ATTGTGTATG GCCGCTTTGT 
GCTACAGTAG CAGAGTTGAG 5401 TCATTGCAAG GGAGACCATA TGGCCCAGGT AGCCTAAATA TTTATCTGAT 5451 CCTTTACAGA AAAAGTTTGC TAACCCTTGA TAACAGATAC TCTAAAATGC 5501 AGGTTTTTCT TCTTCAGTTT AGCTCAGTGG TAGGTCAATT GTTTGCTTTT 5551 GCTATTAAGC TTATGATTTG GAAACGTAAA ATTTTCATGG TGGATCTTAT 5601 AATCAGAACT ATTCAAAAGG GAAAACATGC CAGCCATTTT ATATGATGAA 5651 GTCTCTTATT CCATAGAAAA CTGTAGAAAG GTATCTGTAT CTTTGGAGAA 5701 GGAGAAAGAA AGGAAAGAAC TCCAGAATCC CATCACTGTT CTTTAAAGCA 5751 TGCATCACAG ACCTTTAAGA GTATGCTTTA TTGAGTAAAG GTTATAGTTT 5801 GACTTTAAGA TAAGAAATTT TCTAGAATCA TGTTTGAATT CTAAACGTTG 5851 ATATTCTTAA GTACCATTTC TGATTAGGAA TTAGGTGTTA AAAGAATCAT 5901 GGTAAGGAAA CCTTAATCCG TTGAAACTCT TGTAATTTGG ACAACAAAAA 5951 ACTAGAAATA ACACTCTTTT CTTTGAAGAT CTGATTCAAA TACACCTTAG 6001 AAGAGTTTAA AATCTCCCAG GCCTTTAGGA TGCAATAGGA TATTATGCCT 6051 TGGTTTGACT CCTTTTCATT CAAATTAATG TAAGTATGTT GTTCAAGACT 6101 TTGAAGCATT GAACAATGTT CTCGACGAAT TTTATTTAAT TACTGACATT 6151 TTGTGATATT TGAATGTTAT TAAGTTAAAC ACTTAGAGTT GAGTATATTC 6201 AGGGGAGAAA AGTGAATGAA ATGACTCATA ATAAGAAAGC TTGCTCCCTT 6251 GGGACAGTTA CCTTCCTGGT GTCATTGCCA TCCTGGGTTG ACAAGCTGAG 6301 AGTAGAGGAT ACAAGTGATG TGAATAAAAT GGAAATTGCA TTGAGTTAAT 6351 GAAACATACA CTTTTAGATC AAGATAACAT TCAACAGCCT TTTTAGAAAT 6401 TGCACACTCC TATAGTCCCA CTTACTCAAG GAGGTGGGGG CAGGGAGGAT 6451 CACTGGAGTA CAGGAGTTTG GGGCTGTAGT GTGTAATGAT CAACACCTGT 6501 GAAAGAATAA CCACTGCACT CCAACCTGGG CAACATGGCA AGACCCTGTC 6551 TCTTTAAAAA AAAAAAAAAA TCCTGGGAAA AGAAGTCAAA GATGTAATTT 6601 TACACAATTT CTGCCCCCAC CGTCTTTTTT GTTTGTTTGT TTGTTTGTTT 6651 GTTTTTTGGC TAGATACTAG GTATTCAGTG TTTGCTGAGA TGACAGGTTT 6701 CTACTAATCA GCCAGTTATT TGGGAAGGAG AAAAACTTTT TCTCAATTCT 6751 GAACATGATG CAGCTTGTTA ACTCTCCAGA TAGTGCCTTT GATTGAAAAT 6801 AAGGTCTACA CCTTTTTAAG TATATTATTT CTGACATCGT AAGAAATAAT 6851 TTAGTTCCTT ATTCTCATTT ATCTTTTATA TAATGTACTG TAATATTCAG 6901 GTTTAAAGTA AAATAAGTAT ATGGAATTAT ATTTTGGTAA AACTGTTTCA 6951 TGACCAGCTT GATTAATGTA TATTAACATA CCAGACATTG ACAGCGAACA 7001 TTTCATGAAA TCAGTTATTT ACTTACTAAA ACAATGAGTT 
TTAGCTTGAA 7051 AGTTACTGCT GCTCTATTAT TAACAAAACT AAGTTTTAAA AATTAATTTT 7101 AATTCTACTG AAGTTTAATT TCTGGTTAAA GGTATAGTGG ATGCTATTTA 7151 TGATCCAAGT TTTTACATCA CATCCATTGA TCTGCCTATC CCTTCTTTTT 7201 ACTCCAAAGA GGAGATGATA GAATAATACA TTCAGTATAC TACCAAGAAG 7251 CAAAGTTATA ATGTGTTTCA TTTTGCTATT GGAATTTTTT ATTGTAACAT 7301 TGTATGATAA CCTGAAATGT TTACTTGTGC CTTTTCTTTA AACAATTAAA 7351 GTTTTCCATT TTATAAGAAA CAAAAAAAAA AAA // LOCUS D50917 5544 bp mRNA PRI 10-FEB-1999 DEFINITION Human mRNA for KIAA0127 gene, complete cds. ACCESSION D50917 NID g1469176 VERSION D50917.1 GI:1469176 KEYWORDS KIAA0127. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5544) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5544) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5544 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..297 gene 298..1242 /gene="KIAA0127" CDS 298..1242 /gene="KIAA0127" /note="The KIAA0127 gene product is novel."
/citation=[3] /codon_start=1 /protein_id="BAA09476.1" /db_xref="PID:d1010118" /db_xref="PID:g1469177" /db_xref="GI:1469177" /translation="MLGKGGKRKFDEHEDGLEGKIVSPCDGPSKVSYTLQRQTIFNIS LMKLYNHRPLTEPSLQKTVLINNMLRRIQEELKQEGSLRPMFTPSSQPTTEPSDSYRE APPAFSHLASPSSHPCDLGSTTPLEACLTPASLLEDDDDTFCTSQAMQPTAPTKLSPP ALLPEKDSFSSALDEIEELCPTSTSTEAATAATDSVKGTSSEAGTQKLDGPQESRADD SKLMDSLPGNFEITTSTGFLTDLTLDDILFADIDTSMYDFDPCTSSSGTASKMAPVSA DDLLKTLAPYSSQPVTPSQPFKMDLTELDHIMEVLVGS" 3'UTR 1243. .5544 BASE COUNT 1504 a 1281 c 1157 g 1602 t ORIGIN 1 CTCCTGCACG GCGAGTGCTG GAGCACGAGC TACCGCTCGC TCGGTCAGGG 51 CGCCCCCTCC GCCCGCCTCC TGCTTCCTCC TCCGCTGCCT GCCGCCGCCG 101 CCTCCACCAT TGTATAATGC TCGGGGCGCG CAGGCAGAGA ACGGCGGAGT 151 CTTAGCTTCA GCCTCGCCTG CTGCCCGCTC CCCGGCGCCA CCCTCGGGCC 201 CCTGGAGCGG GGCACTCCGC ATGGAGCGGG AGTAGCTGAG GAGTGGGCGG 251 AAACCCCTCC TGATGCGTTA GTTCCCAGGT GGAGCTGCAT GTGATATATG 301 TTGGGTAAAG GAGGAAAACG GAAGTTTGAT GAGCATGAAG ATGGGCTGGA 351 AGGCAAAATC GTGTCTCCCT GTGACGGTCC ATCCAAGGTG TCTTACACCT 401 TACAGCGCCA GACTATCTTC AACATTTCCC TTATGAAACT CTATAACCAC 451 AGGCCCCTGA CAGAGCCCAG CTTGCAAAAG ACCGTTTTAA TTAACAACAT 501 GTTGAGGCGG ATCCAGGAGG AACTCAAACA GGAAGGCAGC CTGAGGCCCA 551 TGTTCACCCC CTCCTCCCAG CCCACCACCG AGCCCAGCGA CAGCTACCGA 601 GAGGCCCCGC CGGCCTTCAG CCACCTGGCG TCCCCGTCCT CCCACCCCTG 651 CGACCTCGGA AGCACTACGC CCCTGGAGGC CTGCCTCACC CCGGCCTCAC 701 TGCTCGAGGA CGACGATGAC ACGTTTTGCA CCTCCCAGGC CATGCAGCCC 751 ACGGCTCCCA CCAAACTGTC ACCTCCAGCC CTCTTGCCAG AAAAGGACAG 801 TTTCTCCTCT GCCTTGGACG AGATCGAGGA GCTCTGTCCC ACATCTACCT 851 CCACAGAGGC GGCCACGGCT GCGACTGACA GTGTGAAAGG GACCTCCAGC 901 GAGGCTGGCA CCCAGAAACT CGACGGTCCT CAAGAGAGCC GCGCAGATGA 951 CTCAAAACTG ATGGACTCTC TGCCTGGGAA TTTTGAAATA ACGACGTCCA 1001 CGGGTTTCCT GACAGACTTG ACCCTGGATG ACATCCTGTT TGCTGACATT 1051 GATACGTCCA TGTATGATTT TGACCCCTGC ACTTCCTCAT CAGGGACAGC 1101 CTCAAAAATG GCCCCTGTGT CTGCCGACGA CCTCCTCAAA ACTCTGGCTC 1151 CTTACAGCAG TCAGCCTGTC ACCCCAAGTC AGCCTTTCAA AATGGACCTC 1201 ACAGAGCTGG ACCACATCAT GGAGGTGCTT GTTGGGTCCT AAGACCCAGG 
1251 GACCCAGCGA CTATGCCCAC CCAGACCCCA GAGCGTTCCC ATAACCCTGA 1301 CAGTTCTCCA CACTGTGCAT GCACCCTTGC TTGCCTTTTT CAGAGAAAAA 1351 GAAAATTTTA CAACAGGATC ACACTAGTTT TTGCTTTGAG CAGAGTTGGA 1401 GTGCCTTCAT CCAAGTATGA CCACTTTTAA TACACTTTTT TGAGTGGTTC 1451 CTCAGAGACC TACTACCCTG GTATAGGAAA GAATCCATTT GAAGACAATG 1501 TTGCAATGTT GAATGACAAA AATAAACAGT TCAAGTGAAG CACAAGGATT 1551 AAGTTGGAAA AGCTGTAAAT TGCATGTGCA TATTTGTCTA TTTTTTCTAT 1601 AAGTTTTATT GCAAGAGGTA AAGAAGAAAA CTATATATAT ATATCTTATT 1651 TAGATAATCT CAGTACCTTT TCTGGCATTT TTGCCCTGTA TAGGTTGACT 1701 TGGCAATTCG GCCTTTTTAG AGGCATTAAC TACTCCTCGT AAGTGTTGCA 1751 TTTACATGGC TGTTTAGAAA ACTGCTGCCC AAATTTATTT TATATTTTTG 1801 TACAGATTCT GCAGTTTATG ATATTGTTTT TCTAAAAACA AATGCTGTTT 1851 ATACATATGA GATAGCTATT TTGATAGGAT TTGCTCACAT AGTTCCTGCA 1901 AACTTCAGAT GTACAAGTTG CACTTGTACT TTTATAGAGT TGTAATGTTT 1951 TATATGTGTA TGGTGCAAGA GAAAATTGGA TCAAATCAAT CTGCAGTTGA 2001 TGTCCCCAAA TGCAAACACA GGCACACACA TGCACACACC CATAAACACA 2051 CACACAGTGC TTTAAGAAAG GGCCAGGTGA TATCACACCC AAATTTCACA 2101 AGCACTGACC CCCTGGCACC AACACCCGCC AGTACTGTGA CTTCCAAAGC 2151 CAGAGCCACA TGTGCTCATC AAACTTGCAT TAAGCAGTTG GCGGGAGATG 2201 GCTGTGGAGC TGGGGGTTTA AGTGATGGTT CTCTTTTGCT CCCTCTTTTG 2251 AGGGTAAAGC TACTGTCTTT CTTAAGAGTG TATTTATGCC AAGTTTGCGC 2301 TTTTAATTGT TTTTATTTTG TTTTTTAATG AAAACCCAGA TCTTTCCTTT 2351 TTGGCATAAT TTTTATGATG ACCTGAAATT TTACATCCGA ACAAAATTTT 2401 ACATCCGAAA AGCAACCAAC TTCTTCATGG AACTCAGCCC TGTTGCAATG 2451 CTTAGGGCCC TTAAAGAAGA AAATCTCCCC AGAAGGCATC CATCATGTTG 2501 CTTAATTGTC TTCTGCAGCT TCCTTTCCCT AGAGCTTTCC CTGTGTTGCT 2551 AAGAGCTGAA AATGGCATCT TCGTGATCAC CACAGTGAGC TTGGCTCGCC 2601 TCGGCCGGCC CGGGATGCAC TCTTACAACA TGTGTGACTC TTGAACCTGG 2651 AGTTCATCAC ATTACGTCAC AGCTTCCCAT CTGGTTGCTT TCCTGAGTCA 2701 GCTACTTCAC ACTTGTCAAG GCTGTTTTAC CCCAAAACTC AGACAGGACT 2751 TTCTATGCAT GTTTTCCCTC CTCCCCCCAA TTCCCCCCCC ATCACCTTAT 2801 CTCCCAGGAC ACACTTGAGA AGTAGCTTTT TATTCCTAGT GGTGTACATT 2851 TAATTTTAAA AAGGTTGCAA TGTATCATGC TTGTTGCCGA AACTGTTTAT 2901 GGCCTTCTTG 
TTTCAGTTTT TTCTTTTCTT CCAATGGTAC TTTAGCTGTT 2951 GAGTGCAGGT TACAACCTAT ATTGTTATGC AGATGGCTTC TTTAGGAATA 3001 ACTTTTATAT TTATTTAAAA ATTTTTAAAT TATGGGATGT TTTGTTGTTG 3051 TTGTTGTCTT TGTTGTTGGT CATTTGTCAA TATTCAGTCA CCAATTCTGC 3101 TCACTTCTTG CCATGGATAA AATTGGGTCT TTCTGGCTAA TTAAAAAAGA 3151 CAACTTTATA AAATGGCACT TTAAGCAAGC CATAGTTAGT TTTATTTTTG 3201 TAATGCACAT GGCAAAGCAA AGACGTTTGT GATGAAGGAA CTGCTCATCT 3251 AAGCAAAAGA TTTGAGTATG ATATGATAAA GGCTTTCTAC ATTCTAATTT 3301 ACTTTTTCCC CCCACTTGAA TGTGTTTTAA AGGCTAATTA TCAGCTCAGT 3351 AGAGCAGTGA GAAACTGATC AAATTGCACT TGTTCTCCTA CAAGCAACCT 3401 CCACGCAGAC ACCTCGTACT GCTACAGGTG TGTCATTTCC TTTAATAGGA 3451 CCAGGGACCA TGTAACTGAG GTGAGGGTTG TAGTAGATGC TTCCAGTGTC 3501 AGTATGCCTG TTAATTTTAA GAGCTTCCCT TTCTTGCAGA GAACAAGTCT 3551 GCCCAGATTC CATGCTTTCT ATAACTGGAG GACCTGGCAA ACCTGCCGCA 3601 TGCTGCACAC ATCTACCTAC GTACACATAT ACAATAGTAT TGATGATTCT 3651 GAACAATAAC AGGGTAAAAC AGTTGGTTTG CCATTGTTAA AAACTGATTT 3701 ACAGTAACTT ACAACAACTG TACTTTTGTT GGATTAGCAA ATCATGTGTT 3751 TAAACAAATC CCATATGTTG GGCAACAGTT CAAATAAGCA CGGAGAAGTG 3801 TTGCCCAAAC TTGGTTCTCT GACTCTTATG TATTTGTAAG GCTGGGCTTC 3851 AAAATCAAAA CAAAAACCCC AAAAACAGCA GGCAAATGCT TTTTAACTCT 3901 GACACCGTTG CCATAAATCC CTGATACTCA AAGTCTAACA AGAAAGACAT 3951 GGAAAATTAG CAGCCCATTT TCAGAAAGAT CAAAATGATC TAGGGTTCTA 4001 ATTGCTTTTG CATCCTATTC TTACAAAGTG ATGTCCCAAC AGGGAACAGT 4051 AGGAGCTGGA GTGGGATCTC CAAGTCCCAG TTTGAGTGTG GGATGTGCTT 4101 CCAGCAGTGC CTTCCCTTTA TGAAAGACAT CACATGGCAT CCAGGGCCAG 4151 GCAGGCAGCT TGAGGTGCCT TTACGAGAAA ACCGAGCTGG GGCTGGGAGA 4201 GGACAGTTAT TGACACTGAT GTGCAATGAA GTGACAAGAT GAGAGCAGAA 4251 TCGTAAGAGC TTTGAATTTG AAGTGAGTTT TTTTCCCCCC ATAAGTTATT 4301 TATTCCTTTT TTCTGTGTAA ATATATTTAT TTTACTGTGG AGCGCTAACA 4351 TCTGGATCGT AACATGTGCA GAATGTATGG TAGGAATGTA TTCTCTTGTA 4401 GGAATGTAAA TCTGTATTAA AAGGGGGTCC AAGCCAGGCC CCCAGGTCTT 4451 CTCATTGTAT GCACAGTCCG CATTCATTTT TACTCTTCTC TAATATGGGT 4501 CTATTTGAAA TATGCAAAAG GTATGAGGAA TGTTTTAATA CCTCCAAATT 4551 TTTAAGAAAA GCATCAAAGG 
GTTGATATTT TTTAAAGTTT TTTTAGTAGC 4601 ACTTTCTCTG GATGACAGAA GGGGCAACCA CATGGGCACC CTTGTTCATA 4651 CCAAAGGGTG AGCAGTGGCC AGAGCCTCCT CTGCACCTCT CGAGTGTCTT 4701 TACCAATTGA GCTTTTTATC GCCATAGCCC CTTGGAGTGC CCCAGCTGCC 4751 CTGAGGTCAA TCAAGGAAAA TTTCTTAATG AAATAAGCTC CAAAGAGCCA 4801 AAGTATCAAC TTACAGATCG TTTTTAAAGC TTAAATTTAT GAACCACCTT 4851 TGTGGTAAAC AATGAATTAT GAATACCGCA GGGCAGCCTT CTTAAATGAC 4901 AAATGTAAAA AAAAAAAAAA AAGACTCTAC TTCGTGCAGC AATTGCTACT 4951 CTATACGAAT TGTCTTAATT TGAAAACCTT GCTGTTACAA ATTGGACCTT 5001 TATACATTTT CTGAAAACAA TGAAAAGAGT ATATTTAACC TTTTCTGGCT 5051 GTAAATGGTT ACCTTCCTGT AACTGCCCCG CACCTGGAGG CATGGAGTTG 5101 TGTGCATCCT GCTTATGTAC AATTGTTTTC AGTGTTTCTA AGAATGAGTC 5151 TGAATGGTTC TTGAAAATTA GCCAGGATCA AATGCTATTG CAGACAAAGC 5201 CAATAAAAAG TTGGACTTCT TTTGGGGATA ACAAGTTTTG GAAGAGAAAT 5251 GCAGGCCATA TGTGCGCATG ACCGAGATTT TGAAAAAAGA TGTACATAGT 5301 GACATGTTTG GTGCATGGTT TTTGAGGAGG GCTTTTGTCA AAAAGGAGGT 5351 ATAACCTTTC CCCCACAGAC CTGAGAGCTG TGCCTTTTCT ATGCAATATT 5401 ACAGACGTTA CATCGGAACC CAGATGGCTG TATTCACATG TAGGTTTGGG 5451 CTGTAATCTA AACAATTGGA CAGATTAAAT GTACATGGAA ATGAGCAGTC 5501 TTACTTTTGT AGTTTTATAT TATACAATAA ACAGTTAAAA GATG // LOCUS HUMKALL 6314 bp mRNA PRI 06-JAN-1995 DEFINITION Homo sapiens Kallmann syndrome (KAL) mRNA, complete cds. ACCESSION M97252 NID g307079 VERSION M97252.1 GI:307079 KEYWORDS Kallmann syndrome interval gene; adhesion molecule; antiprotease. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Franco,B., Guioli,S., Pragliola,A., Incerti,B., Bardoni,B., Tonlorenzi,R., Carrozzo,R., Maestrini,E., Pieretti,M., Taillon-Miller,P., Brown,C.J., Willard,F.H., Lawrence,C.B., Persico,G.M., Camerino,G. and Ballabio,A. 
TITLE A gene deleted in Kallmann's syndrome shares homology with neural cell adhesion and axonal path-finding molecules JOURNAL Nature 353 (6344), 529-536 (1991) MEDLINE 92018217 REFERENCE 2 (bases 1 to 6314) AUTHORS Hardelin,J.P., Levilliers,J., Blanchard,S., Carel,J.C., Leutenegger,M., Pinard-Bertelletto,J.P., Bouloux,P. and Petit,C. TITLE Heterogeneity in the mutations responsible for X chromosome-linked Kallmann syndrome JOURNAL Hum. Mol. Genet. 2 (4), 373-377 (1993) MEDLINE 93278384 FEATURES Location/Qualifiers source 1..6314 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp22.32" gene 1..6314 /gene="KAL" exon 1..357 /gene="KAL" /note="G00-120-116" /number=1 CDS 151..2193 /gene="KAL" /codon_start=1 /function="putative antiprotease; adhesion molecule" /db_xref="GDB:G00-120-116" /protein_id="AAA59202.1" /db_xref="PID:g307080" /db_xref="GI:307080" /translation="MVPGVPGAVLTLCLWLAASSGCLAAGPGAAAARRLDESLSAGSV QRAPCASRCLSLQITRISAFFQHFQNNGSLVWCQNHKQCSKCLEPCKESGDLRKHQCQ SFCEPLFPKKSYECLTSCEFLKYILLVKQGDCPAPEKASGFAAACVESCEVDNECSGV KKCCSNGCGHTCQVPKTLYKGVPLKPRKELRFTELQSGQLEVKWSSKFNISIEPVIYV VQRRWNYGIHPSEDDATHWQTVAQTTDERVQLTDIRPSRWYQFRVAAVNVHGTRGFTA PSKHFRSSKDPSAPPAPANLRLANSTVNSDGSVTVTIVWDLPEEPDIPVHHYKVFWSW MVSSKSLVPTKKKRRKTTDGFQNSVILEKLQPDCDYVVELQAITYWGQTRLKSAKVSL HFTSTHATNNKEQLVKTRKGGIQTQLPFQRRRPTRPLEVGAPFYQDGQLQVKVYWKKT EDPTVNRYHVRWFPEACAHNRTTGSEASSGMTHENYIILQDLSFSCKYKVTVQPIRPK SHSKAEAVFFTTPPCSALKGKSHKPIGCLGEAGHVLSKVLAKPENLSASFIVQDVNIT GHFSWKMAKANLYQPMTGFQVTWAEVTTESRQNSLPNSIISQSQILPSDHYVLTVPNL RPSTLYRLEVQVLTPGGEGPATIKTFRTPELPPSSAHRSHLKHRHPHHYKPSPERY" exon 358..405 /gene="KAL" /note="G00-120-116" /number=2 exon 406..468 /gene="KAL" /note="G00-120-116" /number=3 exon 469..691 /gene="KAL" /note="G00-120-116" /number=4 exon 692..876 /gene="KAL" /note="G00-120-116" /number=5 exon 877..1006 /gene="KAL" /note="G00-120-116" /number=6 exon 1007..1212 /gene="KAL" /note="G00-120-116" /number=7 exon 1213..1357 /gene="KAL" /note="G00-120-116" /number=8 exon 1358.
.1504 /gene="KAL" /note="G00-120-116" /number=9 exon 1505. .1599 /gene="KAL" /note="G00-120-116" /number=10 exon 1600. .1771 /gene="KAL" /note="G00-120-116" /number=11 exon 1772. .1992 /gene="KAL" /note="G00-120-116" /number=12 exon 1993. .2134 /gene="KAL" /note="G00-120-116" /number=13 exon 2135. .6314 /gene="KAL" /note="G00-120-116" /number=14 BASE COUNT 1884 a 1274 c 1246 g 1910 t ORIGIN 1 GTCGGCGAGG AGGGTCCGGC CGGAGTTGAA GGATTGAACT TTCCGGCTCA 51 GTCGCGGCGG CTGCCTGGTC CTCAGCAGTG CAGCCCCGGC GCGGAGCAGG 101 GAGCCTCGGC CCGCGCCCGG CGCCCTCGCC CTCGCCCTCG ACCCGCAGCC 151 ATGGTGCCCG GGGTGCCCGG CGCGGTCCTG ACCCTCTGCC TCTGGCTGGC 201 GGCCTCCAGC GGCTGCCTGG CGGCCGGCCC CGGCGCGGCT GCTGCGCGGC 251 GGCTGGACGA GTCGCTGTCT GCCGGGAGCG TCCAGCGCGC TCCGTGCGCC 301 TCCAGGTGCC TGAGCCTGCA GATCACTCGC ATCTCCGCCT TCTTCCAGCA 351 CTTCCAGAAC AATGGTTCCC TGGTTTGGTG CCAGAATCAC AAGCAATGTT 401 CTAAGTGCCT GGAGCCCTGC AAGGAATCAG GGGACCTGAG GAAACACCAG 451 TGCCAAAGCT TTTGTGAGCC TCTCTTCCCC AAGAAGAGCT ACGAATGCTT 501 GACCAGCTGT GAGTTCCTCA AATACATCCT GTTGGTGAAG CAGGGGGACT 551 GTCCGGCTCC TGAGAAAGCC AGTGGATTTG CGGCCGCCTG TGTTGAAAGC 601 TGCGAAGTTG ACAATGAGTG CTCTGGGGTG AAGAAATGTT GTTCGAATGG 651 GTGTGGACAC ACCTGTCAAG TACCCAAGAC TCTGTACAAA GGTGTCCCCC 701 TGAAGCCCAG AAAAGAGTTA CGATTTACAG AACTGCAGTC TGGACAGCTG 751 GAGGTTAAGT GGTCCTCGAA ATTCAATATT TCTATTGAGC CTGTGATCTA 801 TGTGGTACAA AGAAGATGGA ATTATGGAAT CCATCCTAGC GAAGATGACG 851 CCACTCACTG GCAGACAGTG GCCCAGACCA CAGACGAGCG AGTTCAACTG 901 ACTGACATAA GACCCAGCCG ATGGTACCAG TTTCGAGTGG CTGCTGTGAA 951 TGTGCATGGA ACTCGAGGCT TCACTGCCCC CAGCAAACAC TTCCGTTCTT 1001 CCAAAGATCC ATCTGCCCCA CCAGCACCGG CTAACCTCCG GCTGGCCAAC 1051 TCCACCGTCA ACAGTGATGG GAGTGTGACC GTCACTATAG TTTGGGATCT 1101 CCCCGAGGAG CCGGACATCC CTGTGCATCA TTACAAGGTC TTTTGGAGCT 1151 GGATGGTCAG CAGTAAGTCT CTTGTCCCAA CAAAGAAGAA GCGGAGAAAG 1201 ACTACGGATG GGTTTCAAAA TTCTGTGATC CTGGAGAAAC TCCAGCCAGA 1251 CTGTGACTAT GTTGTGGAAT TGCAAGCCAT AACGTACTGG GGACAGACAC 1301 GGCTGAAGAG TGCAAAGGTG TCCCTTCACT TCACATCGAC ACATGCAACC 1351 
AACAACAAAG AACAGCTTGT GAAAACTAGA AAAGGTGGAA TTCAAACACA 1401 ACTCCCTTTT CAAAGACGAC GACCCACTCG CCCGCTGGAA GTCGGAGCTC 1451 CCTTCTATCA GGATGGCCAA CTGCAAGTTA AAGTCTACTG GAAGAAGACA 1501 GAAGATCCCA CTGTCAACCG ATATCATGTG CGGTGGTTTC CTGAAGCGTG 1551 TGCCCACAAC AGAACAACCG GATCAGAGGC ATCATCTGGC ATGACCCACG 1601 AAAATTACAT AATTCTTCAA GATCTGTCAT TTTCCTGCAA GTATAAGGTG 1651 ACTGTCCAAC CAATACGGCC AAAAAGTCAC TCCAAGGCAG AAGCTGTTTT 1701 CTTCACTACT CCACCATGCT CTGCTCTTAA GGGGAAGAGC CACAAGCCTA 1751 TTGGCTGCCT GGGCGAAGCA GGTCATGTTC TTTCTAAGGT GCTAGCTAAG 1801 CCTGAGAACC TTTCTGCTTC ATTCATCGTC CAGGATGTGA ACATCACCGG 1851 TCACTTTTCT TGGAAGATGG CCAAGGCCAA TCTCTATCAG CCCATGACTG 1901 GGTTTCAAGT GACTTGGGCT GAGGTCACTA CGGAAAGCAG ACAGAACAGC 1951 CTACCCAACA GCATTATTTC ACAGTCCCAG ATTCTGCCTT CCGATCATTA 2001 TGTCCTAACA GTGCCCAATC TGAGACCATC TACTCTTTAC CGACTGGAAG 2051 TGCAAGTGCT GACCCCAGGA GGGGAGGGGC CGGCCACCAT CAAGACGTTC 2101 CGGACGCCGG AGCTCCCACC CTCTTCAGCA CACAGATCTC ATCTTAAGCA 2151 TCGTCATCCA CATCATTACA AGCCTTCTCC AGAAAGATAC TAAACTGTTC 2201 AAAAAGATTT TGTGAAATTG CACAGATGTG TAAGCTTGTT GAACTTCGGC 2251 CACGAGACAT GCACACTTCC AGAGGCAGTG GGAACTGCTC AGAGGCCCGG 2301 ACTCTCCTAT GTGACTTTAG TGCAGGAAGA ACTTCTGTCA ATCATGGACG 2351 CATCTGGAGA CAAGTGAGAA ACAGTAGATT GGTGAAGACA GACACCAGTT 2401 CCCTACAAGC ATGGAGAAAA TGAAGAATAG GCCTGTTTAA TGCTAAATTT 2451 TGTTTTCATG TATGGTGTCG CTCATTTCTA TTGAATTACA ACAGAACTCA 2501 GTTTTCCCTG AATTTGGAGC ACCAAACTCC GCCCCAAAAA GGAGAGTAAC 2551 AAATACACAA TTCACACATA ACACTAAGCG TAAATCTAAT CAATAAAATA 2601 TATTTTTGAC TAAATTATTG ATTCGATATG AAAAATCAAC TAAGATTACA 2651 CAGCTTTGTT TTTTTGAATC TTTCCTAAGA TCATTTTTAT CCTAGGTGAT 2701 TTTTAAATGA AAATGTGTAA TCTAAAATAT ACCAGCGAAT TTAAATCTAA 2751 AAATGCTCCT ACTTTAAGTA CCTTGTGCTG CTCTTTATGC AAAGGTAAAT 2801 CAAAGTTCCC TCTATAAATT ATGATTTACA AAAGACACCC AAGCCAGAGG 2851 AACTCAATGA AATAAGCTGC TAATCAGATT TTACCTTGGA GAAATGAAAA 2901 TTATTTCTTG GGGATGCCTT TTAATATTTG ATCCTATTAT GTGAGAGATT 2951 TTCCTGATAT GTTATCTTAT TTATATTTTC CCTTATTTTC CTCAATGCAG 3001 ATAATAGCTT 
TTGGTGCACT TTTGTTTCAC CATCTGAAAA TTCACAAAAC 3051 TTCTTGCTTC AAATGAAAAA ATCCCAACTA TTGAGCATGT TTAAATCTTT 3101 GCAGAGATTT GCCTTTTCTT AATCAAAGAA AGGTCTTTGT GTGCTAGAAT 3151 ATTATTGGTA ATGTTTTAAA AATTCCTTTG ATTGATAGAG AAGGACAGTT 3201 ATTTGCATTT AATTCACCCA TATGCTTTCA AATCTAGTAT ATCTTACTTT 3251 TTGGAAATGT TTTATGCTAC AAATTAGTGC CTTGTAGCAT GAACTTAAGT 3301 CAAAACGTGT TATCAATATA GAGTGTTGCA GTGTATATTG TAACAACCTA 3351 AAACGCAGAG AAGTTTAATT TAATACTGTT TTTTTTCTTG AAGGAATACT 3401 CACATACATG GTTTGAAATG TGCATAGATA TGCATGTCTA TATAATTATA 3451 AATGCATGTG TATATATATG CAAATATATG TACATATACA TGTATATACA 3501 CACAGACACA TGCATATACA TGAATATACC TTGAGCATGA ATCCCTGGAG 3551 AAATCGTTTT CGTAGGCTCA CCAATGGTGA GTAAAGATAC AGCTCTTTTA 3601 AAGGTCATAA GGATAATATA TTTTCCCCAT CAATGCTGAT TCTGAGAAAA 3651 GAGCAATTTA TCAAAATTAA ACACTGTAAA AGAAAGGTGT CCATATGTCT 3701 TTACCTACCT AAGTAAAACA GGAAGAAAAT CAGTAACATT ATCCTTAGGT 3751 TTTGACAATG GTACTTGCTT CTTGTTGTTT TATTGTTTCC TGAATTCATG 3801 CAGATGCCTG GCCATTCCTG GGAAGAGTGG ATAACTCAGA AGTCACTGTA 3851 CTCCACAGAG CCTCACTGCA GTGTCTAAAG GTAGATGCAA ATTAAAATGC 3901 AGGGAAAATA ACTTTTCTGA TGTTGATGCA TGTCTTTGGG AAACACATTT 3951 ATAAACATGG ATACCTGATA ATAGATATTG AAACCCATTT CCTGTGTGTT 4001 AAAATATTTA AAAAGTGGAT ATTCCAGGAA TGTTTTGCAG CTTTGTACAA 4051 GTAACATAAA TTGGACACCT CAGAATGAAA GTTCATGTTG GTTCTGAATG 4101 GTTCACTGCA GCTCCTGTCA CAAGCTGGGA TGGATTTATC ACATTGAGTT 4151 ATGAAATTAC CTGGTTCTAA GAATTTTTGA GTGGCAAAAA TAGAAAACAA 4201 TCTTCATTTG AAAACATCCC TAAGCTTGAA TAAATGGATA CCATAGATAG 4251 CTTCTCTTTT TTATTCTGGT GTCATTACCA GCATCTGAAT TTCAAGTTCT 4301 TAAAATTTCA AAAATTAAAA TTTTTCATTA TTAGCTATCC ATTTATCTTT 4351 TACATGAACT TGTCATGAAC AAATTCAAAT GTTTATGCCA GCAAATTTTT 4401 GTACTGTTGC ATAGTTAAAA ATGCTGGGAG TCTCTGCATA GATACAAAAT 4451 ATTATTAAAT TATTACATAA ATTTAATTTT ATAAAATTTA ATCATGCTTC 4501 TTTTGTCTGG TAATAGACAT TGGACAGATA TTTTTAGTTC AGATGGTGAT 4551 TCTGAAGCTT ACATCTCCCT TAAAAAAATC TAAAGCAGCT CTTATGGGCT 4601 TCTAATTTTA ATATAAATAA ATAATTTAAA TTTTATTGGT GTTATTGGAA 4651 GAAAAATGCT ATTAATGGGC 
TAATAAAAAA CATGTGTTTC TCTTATGGAT 4701 TTTAATAAGC TCCAGTATTA TTCAAATGAT CAAAAATATA GTTATAATTT 4751 TTTGAATTTT AAAAATGTGA TTGCTCTAAT AAAGAATAAA ATCTATGCTT 4801 TTTAACAAAC ATAGTTTTGG TGCCTAATTC TGTAATATGT TTTATTGAAA 4851 TTAGATTCAT TTCTCTAATG TGAGAAAAAT ATATCCAGTA ATAGTATTGA 4901 CTGTTTAAAA AATTGAGCTC ATCAAAAATA TTGTCATCAA ATACAGGTGG 4951 TTAATCTGAC ATACATTGCA GTTACATGCA TTATTTTTAT TTACAACATT 5001 TGCTCCTTAA TGATGAATTT ATCTGTGTTA CCCTGTTTTT CTACCTGGAA 5051 CTCCATAGAA TGATGTTTGC AAACCAACAT GTGCTCTTTT CAGTCATTCA 5101 CTGTTTTAAT ATGACATGGT AGAGAAGATA AGGTTTATGG CAGGTAATTT 5151 TTTGTAATGT GTATTAAACG AAGTTCAAAG ATTAGAAATA CATCTGTGTC 5201 CTGAAAACCT TAGATACATA GCCGACTGTA TACAGAGGTT CATCTCAACC 5251 TCAACACTAT TGACTTTTGG GGCTGGATAG TTCTCTGTTG TGGGGGTTTG 5301 TCTTGTGCAC TGTAGGTTTT TAGTAGCATC CACACTTTCT CCTCACCAGA 5351 TGCCAGTTGC ACCCTCCCCC AAGTTGAGAC AACCAAAAAT GTCTCCAGAT 5401 ATTGCCAGCT ACCCCTTGAG GGATGGTACC TCTGGTTGAG AACCATTGCT 5451 AGAGAATGAT CTTTACTGAA TTTGCCCTTT ATAAGAAACC CAGTGAATTT 5501 CTAGAGCAAG TCCCAAAAAC TAAGGGACAG CTAAGAAGTT ATTATGGTTG 5551 ACTTCAAAGG CCTAAACTGT GTTTTTTATG TCCACTAAAC AACTTGATTA 5601 AAAGACGGAA TTTTGACTCG TGTCTGTATC ATACAAGTAC AAATACTAAT 5651 TTTGCCCTAT GTATCCGTAA ATGTCATTTG TGATTTTGAC TTATTTATTT 5701 AATGCCCTTT CTTATGCCGT GGGTTTTCAA GTTTACTCAT TTCTATGGTT 5751 GCAAATAACT CTAAAACTTA TTATATAAAC TTTCATATTA TAGGCAGAAC 5801 ACAATGGCTA AATATCTGTT GCATGTACTT TAAAGTTTAT TATAAAATAT 5851 AAACAGATAT ATAAAGATGT TGACTCTTAC CTGTGATTTT GCATGGTCAG 5901 ACTCGGTGTC AGGTACGGAG AGGATTCTCA TGACTGTCTT ACCTCTACTG 5951 AATATTCTAG TGAGTTATAT GATTTACGGA GTGATTAACA GAGGTCTATA 6001 TAAAGTTACT TTTCCCCTTT ACTTAATTAT ATTGTAGTGT GCAGATAACA 6051 AAACTGCTAC CTTCTCATCC AAGTGGTCTG TAGAATTCAT GTCCCTTACA 6101 GTGGTCATTT AAAGTCAATA TTTATTTATG TATGTAATAA AAAAAGTTGG 6151 ATTTTTGTGT ATGTCTGTCA CATTATTTAG AGAGAAGTAA TCTTGTAAAA 6201 ATGTTTTGTA AAAAACAAAA AAGTATTGTA AATAGTCTTG ATATTCTGTG 6251 ACTCATTATT TTCATGTTAG AGTTTGTACA TACTGGTTCA ATAATAAAGT 6301 ATCCTTAAAC CAGA // LOCUS AB014528 5871 bp 
mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0628 protein, complete cds. ACCESSION AB014528 NID g3327069 VERSION AB014528.1 GI:3327069 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH01433. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5871) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1..5871 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH01433" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 214..1824 /gene="KIAA0628" CDS 214.
.1824 /gene="KIAA0628" /codon_start=1 /product="KIAA0628 protein" /protein_id="BAA31603.1" /db_xref="PID:d1032564" /db_xref="PID:g3327070" /db_xref="GI:3327070" /translation="MILLSFVSDSNVGTGEKKVTEAWISEDENSHRTTSDRLTVMELP SPESEEVHEPRLGELLGNPEGQSLGSSPSQDRGCKQVTVTHWKIQTGETAQVCTKSGR NHILNSDLLLLQRELIEGEANPCDICGKTFTFNSDLVRHRISHAGEKPYTCDQCGKGF GQSSHLMEHQRIHTGERLYVCNVCGKDFIHYSGLIEHQRVHSGEKPFKCAQCGKAFCH SSDLIRHQRVHTRERPFECKECGKGFSQSSLLIRHQRIHTGERPYECNECGKSFIRSS SLIRHYQIHTEVKQYECKECGKAFRHRSDLIEHQRIHTGERPFECNECGKAFIRSSKL IQHQRIHTGERPYVCNECGKRFSQTSNFTQHQRIHTGEKLYECNECGKAFFLSSYLIR HQKIHTGERVYECKECGKAFLQKAHLTEHQKIHSGDRPFECKDCGKAFIQSSKLLLHQ IIHTGEKPYVCSYCGKGFIQRSNFLQHQKIHTEEKLYECSQYGRDFNSTTNVKNNQRV HQEGLSLSKAPIHLGERSVDKGEHTGNL" BASE COUNT 1533 a 1243 c 1338 g 1757 t ORIGIN 1 GCACCACAGA CCAGTCTGTC CTGGTGTTGG ATGTTAAAAG TGGAGCCACA 51 AAGAAGCCAT CTTATGTGGG ACAGGCTTGG GTGTGAGACC AGAGCTGTGG 101 GTAGCAGCAT GTTGGGTCAG GATGAGGGGC AGAGGAAACA CTTTCCACCT 151 CTTGGATATG GATTTTATCC TGCCCTGATT CAGGAGATCC CCATCTATTA 201 AGTAAGGGGA AAGATGATTT TGTTGTCTTT TGTTTCAGAT TCTAATGTAG 251 GAACTGGTGA GAAGAAGGTG ACTGAAGCCT GGATTTCTGA GGATGAAAAC 301 TCACATAGGA CGACGTCAGA CAGACTCACG GTGATGGAGC TCCCCTCTCC 351 CGAGTCTGAG GAAGTCCACG AGCCCAGATT AGGGGAGCTC TTGGGAAATC 401 CAGAAGGTCA GAGCCTGGGG AGTTCCCCCT CTCAGGACAG GGGCTGCAAG 451 CAGGTGACAG TGACCCATTG GAAGATCCAG ACAGGAGAGA CAGCTCAAGT 501 GTGCACCAAG TCAGGAAGAA ACCATATTCT GAACTCAGAC CTTCTTCTGC 551 TTCAGAGAGA GCTCATAGAG GGGGAAGCCA ATCCTTGCGA TATCTGTGGC 601 AAAACCTTCA CGTTTAATTC GGACCTAGTT AGGCATCGGA TTTCGCATGC 651 TGGGGAGAAA CCTTACACGT GCGATCAGTG TGGGAAAGGC TTTGGCCAGA 701 GCTCACACCT TATGGAGCAT CAGAGAATTC ACACTGGAGA GAGACTCTAC 751 GTCTGTAATG TGTGTGGGAA AGACTTCATT CACTATTCAG GTCTCATTGA 801 GCATCAGCGC GTTCATTCAG GAGAAAAGCC CTTCAAATGT GCGCAGTGTG 851 GGAAGGCGTT TTGTCACAGT TCAGACCTGA TTAGGCACCA GAGAGTTCAC 901 ACCAGAGAGA GACCTTTTGA ATGCAAAGAG TGTGGGAAAG GCTTCAGTCA 951 GAGCTCCTTA CTTATTCGCC ATCAGAGGAT TCACACGGGA GAAAGGCCCT 1001 ATGAGTGCAA TGAATGTGGG AAATCCTTCA TAAGGAGCTC 
GAGCCTCATT 1051 CGCCATTATC AGATCCACAC AGAAGTGAAA CAGTATGAAT GCAAAGAATG 1101 TGGGAAGGCA TTCCGTCATC GCTCAGACCT TATTGAACAC CAGAGAATTC 1151 ACACCGGAGA GAGACCCTTT GAATGCAATG AGTGTGGGAA AGCCTTTATT 1201 CGGAGTTCAA AGCTCATTCA GCATCAGAGG ATCCATACTG GGGAGAGGCC 1251 TTACGTATGC AATGAGTGTG GGAAGCGCTT CAGCCAGACG TCAAACTTCA 1301 CCCAGCATCA GAGAATTCAC ACTGGAGAGA AACTCTATGA ATGTAACGAG 1351 TGTGGGAAAG CTTTCTTTCT GAGTTCATAC CTTATTCGAC ACCAGAAAAT 1401 CCACACTGGA GAGAGAGTGT ATGAATGTAA GGAATGTGGG AAAGCGTTTC 1451 TCCAGAAAGC CCATCTCACT GAGCACCAGA AGATCCACTC TGGGGACAGG 1501 CCCTTCGAAT GTAAAGACTG TGGGAAAGCC TTCATCCAGA GCTCCAAGCT 1551 GCTTCTGCAC CAGATTATTC ACACTGGAGA AAAGCCCTAT GTGTGCAGTT 1601 ATTGTGGGAA AGGCTTTATT CAGAGGTCAA ACTTCCTTCA ACACCAGAAA 1651 ATTCATACTG AAGAGAAGCT CTATGAATGT AGTCAGTATG GGAGAGATTT 1701 TAACTCAACT ACAAACGTTA AAAATAATCA AAGGGTTCAC CAAGAGGGAC 1751 TCTCCTTGAG TAAGGCCCCC ATACATTTGG GTGAGAGGTC TGTAGATAAG 1801 GGGGAACACA CAGGTAACTT ATAAAATAAT TACTTTCCCG CCCAGTGAGT 1851 GATGTTTGGA AATGCGTGGA ATTAGGATTC ATGTGGTTTC TAAGATTTGG 1901 ACATGTCAGA ATTTTGTGAG TCATGGATGG GGCTGCTTTT GCAGTGGGTG 1951 CCACCTGCCA CTGTGCAGCC CTACTCGGCT CAGCCCTTCT CCTCAGCTGT 2001 GAGCACTGTC CTCAGGAGAG TCACAGGGCT TGACACCTGA CTCTGAGCTG 2051 GAACAGTAGG GGCAGGGAGA AGACAGGTCT CAAGAAAAGG TTTTTAAGAA 2101 GTTTCATCCC CAGTTAAGCA GAGTCCATCC TTGACTTAAA TCCCTTATTA 2151 CAGCACAACT GTGTATCTAA TCTTACGATT TAGGAGAATG TTACCTAGGA 2201 CATTTTGATG TGTTAAGTTG AAGAAAGGTA ACTCGTGTAT GAACCCCGAG 2251 CCATTTCCCT GTTGTCCTGA GGAGGAACTC CAGGCCTCCC ATCGTGTGCC 2301 CTAAGGCCTC CTGCGTCCTG GAGCCCTGCC TCCCACTGCC TGACTTCCTG 2351 CCACACGGTT AATGCTGCAG CAACACCGAC TGCTTCATCT TCCCTGTGCT 2401 CCACGTGGCT TCCTACCTCT CTCGCCTTTG TTCTTGTTGA AGGGTCTCTT 2451 CTCAGCTAAT TAACTCTGAA TCATGGTTCA AGACAAGCCT CAGGCATCAT 2501 GTCAATGGGT GTTTCCCTCA AGCTTAGTTG GCAGCACTCT CCACACTTCT 2551 GTGGCTCAGT GATTACTGCT ATTACTATAT TTACTTGCAT ATGTCAGAAT 2601 GATGTGATAG ACTATCTCTG TCACTATGCT GTTGGGTTCC TGAGGACAGT 2651 GATCATATCT GATTGATTTC CATGTGTCCA CTGTCTAGCA CAGGGCAATA 2701 
AAAAATACAC CCCTAAATCT ATGTGTAATT GGCATCTCTC TTGCTTTGTC 2751 CTTTCTATAC TGCCATTCTA AAAATTTTCA GCTGTTGGCT GTTTTTTTTT 2801 GTTGTTTGTT CATTTTGTAT CAGTATAATC TACGATTCTG TTAGAAGTAT 2851 CTTCTCAGCC CTGCTACTGT CTGCTGCTCC TACTTAGAAG TTGCAGGCAA 2901 TATCCTGTGC ACATCCACGC CCTCCTGTGT GCCACCTTAC AGCTCATGCA 2951 GAACTGTCTA CTTGATCTGG AGGGCATTGA CACCTGGCTC AGCTGGAACA 3001 GTGGGGACAG GAGGAGGCAG GTCTCAAGAG AAGGTCTTTA CTTACTTGTC 3051 TGTATCTTTC TGTCTCACTC GAAAGGCACA GTCTCAATGC GCTACGTAAC 3101 TCTTCTGTTG TAGCAGTTCT GAGGTTGAAC TGAGCTCTTG TCATTTTCCA 3151 TGCTCTCTGG CTCATTGTGT CATTTCAAAA TGTATTGATA TCTATAAGGA 3201 ATAGTCCTCA ATGTTGGGAA AACAGATGAG CAACCCACTA CCCTGAAGTC 3251 ATCCACCGTG TAGTGGGGAA GACGGAGATC ACAGTAGGGT GAGTTAAGTA 3301 TGTGCCCTCA GGTGTGAAGG AGCACAGGTG AGGGGGTGCC TGCTGTGGCA 3351 CAGAGCTGGG AAGGCCTTAC AGAAGAGGTG ACATTTCAGC TGGGCCAGAG 3401 ATCAGTAGAG AGCAGCTGAA GAAGAAAATA GGGTCCAGGA GAGGAATCTG 3451 CAGAAACATG GAGCCATGAC AGATGGCTTT TTGGCTAGGA CTGTTCGAAA 3501 GCAGGCGAGG TAAATGGACA GTGGACAGCC GCCTCGCCAA ACTGTTAGTC 3551 TTTACCTCTG AATTTCATCT GGACATGAAC CCCAAAGTCC AGTTTTGTAA 3601 CGATATAATT TTTTTCTCAT GAGGCCATAT TTTGAGTTCT TAAATACTAC 3651 CAACCCTGAA CGTCTCAGGA CAAATAATTC AAAAAAGAGA TCTACATTTT 3701 CTGGAAAATC AGCATTACTA GAGATGTTTC AAATGTAGAT TTTACCAGAC 3751 CATTTTAATC AACTTTATTG AGATTCAATT TTCATATAGT AAAACGCCTA 3801 TTTAAGTGTA CAAGGTTTTG AAAAATCTAT ACACCTGTGT AACCACCACC 3851 ACCTGTGTAA CCACCACCGC CAGAACCAAA AACGTTTCTG TCACCCAAAG 3901 AAGTTCCCTT TTGCCTTTTC CTAGTTAAGC CCACTCCAGC CCCAGATAAC 3951 CACTTTCTAT CCTCATAGAT TAGTGTTGCT TGGTTTAGAG CCTCATACAA 4001 ATGGATTCAT TTACTATGTG TTCTTTTGCG TCTGATGTCT CACTTTGGCA 4051 TGTTTTTAGA CTCATCCAAG CTGTTTTGTG TTTCAGTAGT TCTCCTTGCT 4101 GAATAGTATT CCATTGTGTG GATAAGCCAT CATTTGTTTA TTCATCCACG 4151 TGTTAATGGG TTTCCAGTCT CGGCTCTTAA AACTATGAGC ATTTGTGTAA 4201 AAGTATGTAT TTGCACATGT GTTTTTATTT CTCTTGGGTA AGTACCTAGG 4251 ACTTTAATTG CTGTGTTGTA AGTGTCTTAG TTCATTCAGG CTGCCATAAC 4301 AAAATACCAT CAACTGGGAG GCTTATAAAC AACAGAAATT TATGTTTTAC 4351 AGTTCTGGAG 
GCTGGGAAGT CCAAGATGAG GGTGTCAACA TTTGGTGTCT 4401 AGTGAGGGCC TGCTTCCTTT TAAGTAGATG TCTTTTTACT GTGACTGCAC 4451 GTGGTGGAAA GGGACTAGCA AGCTTACCCC ATTCATGAGG GCTCTCCCCT 4501 TGTGATTCTG TCCCCTCCCC AAGACCCCAC CTCATATCAT CTTATGGGTT 4551 AGGAATTTGA CATGAATTGT GCAAGGATGT AAACATTCAG TCCATTGCAA 4601 TGCGTGTTTC TAACTTCCTA AGAAATTGTC ACACAGCCCT CTGTAGTGGT 4651 TGAATCATTT TACACTATTA CTACATTATG GGAAGATTCC AGTTGCTCCA 4701 TATCTTTGCT AACACTTGGC ATTGTCAGTC TCTTAAAATT TAATCCACTC 4751 TAGCCTCCTT ATGATTTTAA TTTGTATTTG CCCACTGAAT AATGTATGTC 4801 TATGATTTCA TCTGCTTATT GCTTTTCCGT GTCGTCTTTT GTGATGTGTC 4851 TGTTCGTGCC TTTTGCCTGT TTTTAAGTTG GGATGCTTAT TGTTGACTCA 4901 AGTTCTTTCC ATATTCTGCA TGAAAAACAT ACATGTAATA TATAGAATTC 4951 TTTGTCTCCA TTTGTGACAA ATATTGTGAA TATTTTCATG TTTGGTTTGC 5001 CTGTTCATTT TCTTAATGGT GTCTTTTGAA TATTTTTTAT TAAGCCACCA 5051 CTCCTGGCCA ACTGTAGTTG AACTCAGGTA TTGGAAGGGC TTCCGTTTTG 5101 TTCTTTTCCC ATTTTTTTTT TTTGACTTCT AGATCCTTTG CATTTTCGTA 5151 TAAAATTTGA AGTCAACTTC TATCAACTTC AGGCCAGGCC CGTGGCTCAT 5201 GCCTGTATAA TCCCAGCACT TTGGGAGGCC GAGGCGGGCG GATCGCTTGA 5251 GGTCAGGAGT TCAAGACCAG CCTGGCCGGG GTGGTGAAAC CCCATCTCTA 5301 CTAAAAATGC GGAAGAGTTA GCCAGGCGTG GTGGCAGGCG CCTGTAGAAT 5351 CCCAGCTGCT CGGGAGGCTG AGGCAGGAGA ATTGCTTGAG CCTGGGAGGT 5401 GGAGGTTTCA GTGAGCTGGG ATCGTGGCAT TGCACTCTAG CCTGGGCAAC 5451 CAAGAGTGAA ACTGTCTCAA AAAACAACTT TTATCAATGT CTGCAAAAAG 5501 AAAGTCTTCT GGGATTTATA GATCAATTTA GGGAGAAATG ACATTTTAAC 5551 AATTCTGAGT TTTCCAATTG TTGAACATGG TGTACTGCCC CATTTATTTA 5601 GATCTGTTAA TTTCTCTCAG TTTGCAGCTC TCACATTTTG TTAAATTCAT 5651 GTATTTAATA TTTCTGCATG CTATTGCAAG TGGTAAGGTT TTCAAAAAGC 5701 TGTTTTCTAG TTATTGCTAG TATATAGAAA TGCATTAGAC TTGTACATTG 5751 ATCTTGTATC AAGCAACTTA GATCAGTTAA CTTATTCTAG TAGCTTTTTT 5801 CTAGATTCTT TAGCATTTTC TATGTAGATA ATCATGTCAT CTGTGAATAA 5851 AGTATTTTAC TTTTCCAATT T // LOCUS AB023153 6014 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0936 protein, complete cds. ACCESSION AB023153 NID g4589515 VERSION AB023153.1 GI:4589515 KEYWORDS . 
SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hh04647. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 6014) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6014 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hh04647" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 267..2165 /gene="KIAA0936" CDS 267.
.2165 /gene="KIAA0936" /codon_start=1 /product="KIAA0936 protein" /protein_id="BAA76780.1" /db_xref="PID:d1040533" /db_xref="PID:g4589516" /db_xref="GI:4589516" /translation="MNRYTTIRQLGDGTYGSVLLGRSIESGELIAIKKMKRKFYSWEE CMNLREVKSLKKLNHANVVKLKEVIRENDHLYFIFEYMKENLYQLIKERNKLFPESAI RNIMYQILQGLAFIHKHGFFHRDLKPENLLCMGPELVKIADFGLAREIRSKPPYTDYV STRWYRAPEVLLRSTNYSSPIDVWAVGCIMAEVYTLRPLFPGASEIDTIFKICQVLGT PKKTDWPEGYQLSSAMNFRWPQCVPNNLKTLIPNASSEAVQLLRDMLQWDPKKRPTAS QALRYPYFQVGHPLGSTTQNLQDSEKPQKGILEKAGPPPYIKPVPPAQPPAKPHTRIS SRQHQASQPPLHLTYPYKAEVSRTDHPSHLQEDKPSPLLFPSLHNKHPQSKITAGLEH KNGEIKPKSRRRWGLISRSTKDSDDWADLDDLDFSPSLSRIDLKNKKRQSDDTLCRFE SVLDLKPSEPVGTGNSAPTQTSYQRRDTPTLRSAAKQHYLKHSRYLPGISIRNGILSN PGKEFIPPNPWSSSGLSGKSSGTMSVISKVNSVGSSSTSSSGLTGNYVPSFLKKEIGS AMQRVHLAPIPDPSPGYSSLKAMRPHPGRPFFHTQPRSTPGLIPRPPAAQPVHGRTDW ASKYASRR" BASE COUNT 1831 a 1252 c 1183 g 1748 t ORIGIN 1 CCCATTTCCA GCTCCGGAGC GGGCGGCTGC GCCCCGCTCG TCGAGGAGCT 51 GCGCTCACCT CAGGGGCGGG CCCCCGCCTG CGTTCGCGGC GCCAGCAGAA 101 GACTGATTTT TGGAAATATG TATTTGGGAG ACAGTCACGT CCTATTGAAT 151 ACCTTGTGCT GGTGCTGCCA TCGAAAAATC TGGTTACACT CTGGGGAGGA 201 CTGCTACCAC TGCAGAACTG AACCACTTCG GCCGTGAGAT GAGTGTCCGG 251 CCTGAGCAGG CACACCATGA ATAGATACAC AACAATCAGG CAGCTCGGGG 301 ATGGAACCTA CGGTTCCGTC CTGCTGGGAA GAAGCATTGA GTCTGGGGAG 351 CTGATCGCTA TTAAAAAAAT GAAAAGAAAA TTTTATTCCT GGGAGGAATG 401 CATGAACCTT CGGGAGGTTA AGTCTTTAAA GAAGCTCAAC CATGCCAATG 451 TAGTCAAATT AAAAGAAGTT ATCAGGGAAA ATGATCATCT TTATTTTATC 501 TTCGAGTACA TGAAGGAAAA TCTTTACCAG CTCATTAAAG AGAGAAATAA 551 GTTGTTTCCT GAGTCTGCTA TAAGGAATAT CATGTATCAG ATATTACAAG 601 GACTCGCATT TATTCACAAA CACGGCTTCT TTCATCGAGA CTTAAAGCCT 651 GAGAACCTCC TCTGCATGGG ACCAGAACTT GTGAAAATTG CAGACTTTGG 701 TTTGGCCCGA GAAATACGAT CAAAACCTCC ATATACAGAT TATGTATCTA 751 CCAGATGGTA CAGGGCTCCA GAAGTACTCC TGAGGTCTAC CAACTACAGC 801 TCCCCCATTG ACGTCTGGGC GGTGGGCTGC ATCATGGCAG AAGTTTACAC 851 CCTCAGGCCA CTCTTCCCTG GAGCCAGTGA AATTGACACA ATATTCAAAA 901 TTTGCCAAGT GCTGGGGACA CCAAAAAAGA CTGACTGGCC TGAAGGCTAT 951 CAACTTTCAA 
GTGCAATGAA CTTCCGTTGG CCACAGTGTG TACCCAATAA 1001 CTTAAAGACC TTGATTCCCA ATGCTAGCAG TGAAGCAGTC CAGCTCCTGA 1051 GAGACATGCT TCAGTGGGAT CCCAAGAAAC GACCAACAGC TAGTCAGGCA 1101 CTTCGATATC CTTACTTCCA AGTTGGACAC CCACTAGGCA GCACCACACA 1151 AAACCTTCAG GATTCAGAAA AACCACAGAA AGGCATCCTG GAAAAGGCAG 1201 GCCCACCTCC TTATATTAAG CCAGTCCCAC CTGCCCAGCC ACCAGCCAAG 1251 CCACACACAC GAATTTCTTC ACGACAGCAT CAAGCCAGCC AGCCCCCTCT 1301 GCATCTCACG TACCCCTACA AAGCAGAGGT CTCCAGGACA GATCACCCAA 1351 GCCATCTCCA GGAGGACAAG CCAAGCCCGT TGCTTTTCCC ATCCCTCCAC 1401 AACAAGCATC CACAGTCGAA AATCACAGCT GGCCTGGAGC ACAAAAATGG 1451 TGAGATAAAG CCAAAGAGTA GGAGAAGGTG GGGTCTTATT TCCAGGTCAA 1501 CAAAGGATTC AGATGATTGG GCTGACTTGG ATGACTTGGA TTTCAGTCCA 1551 TCCCTCAGCA GGATTGACCT GAAAAACAAG AAAAGACAGA GTGATGACAC 1601 TCTCTGCAGG TTTGAGAGTG TTTTGGACCT GAAGCCCTCT GAGCCTGTGG 1651 GCACAGGAAA CAGTGCCCCC ACCCAGACGT CATATCAGCG GCGAGACACG 1701 CCCACCCTGA GATCTGCAGC CAAGCAGCAC TATTTGAAGC ACTCTCGATA 1751 CTTGCCTGGG ATCAGTATAA GAAATGGCAT ACTCTCGAAT CCAGGCAAGG 1801 AATTTATTCC ACCTAATCCA TGGTCTAGTT CTGGCTTGTC TGGAAAATCT 1851 TCAGGGACAA TGTCAGTAAT CAGCAAAGTA AATTCAGTTG GTTCCAGCTC 1901 TACAAGTTCT AGTGGACTGA CTGGAAACTA TGTCCCTTCC TTTCTGAAAA 1951 AAGAAATCGG TTCTGCTATG CAGAGGGTAC ACCTAGCACC TATTCCAGAC 2001 CCTTCCCCTG GTTATTCCTC CCTGAAGGCC ATGAGACCTC ATCCTGGGCG 2051 ACCATTCTTC CACACCCAGC CTAGAAGCAC TCCTGGGTTG ATACCACGGC 2101 CTCCAGCCGC CCAGCCAGTG CATGGCCGGA CAGACTGGGC TTCCAAGTAC 2151 GCATCTCGGC GATGACTGTC TGCCTTGGTG ATGAATCTCT TCCTAGGGAG 2201 AAGCAGGATA CTTTCCCTCA GCTGACTGGT GTTCTACCTG CAAGATGTGC 2251 AGAGGGCATA AAAGCAAATC AACACTTTAT AGTTATTCTT CTGAACTAAG 2301 ACATGTCAAT ATTCTTTTTT AAAGTTTTTT TTTAAAATAT TGATTTGAAT 2351 GCAGTAGGCT TTTTTGTATA AAATTATTTT ATTCTAAAAC TGGGTCCCAT 2401 TATTTTCTTA AACAACAGCA TTTTGTATAT ATGGATTATG TTTTAGCATT 2451 TTATACAGTC AACTTTGTAA TGAACTTTTT AAAAATTAAT TGATTTTCCT 2501 TTGGGGTTCC AGATAATATT TTCTACAGAT TTTGAAAAAT GTAATAATAT 2551 TAATGCAGTA TTGCAACAGG GGTGCAATTT AAGGCTATGT GATAGAGGGT 2601 TATTTACTCA GTGTGTGCAG 
ATATTTATGA AGTGGTGAAA TTTCAAGTGT 2651 GGCTCACTAG GTACTTCAGG CCTTCTTGGA CTGTTGTTAG AAAAGTGATC 2701 CTCTGCTTTT CTTAGTAGGT CATTGGTTTG ATTTTTGGAT ACCACTCTGC 2751 TGTTCTAAAA GGACTATTAT ATTATATAAT TCACTTTGTT TTACTTTTGT 2801 TCCCCAGATG AAAGAACTCT AAGTAAATAC ATTTTAAAAA ATTTTTCTGA 2851 CACCCTTTAA TGTGGTTGCA GATCTCAGAT GAAACCAAGC TTAATTATAC 2901 TATGCCATTA TATTCTAATT TATTCCATTT TTGAAATCAA GTTGTATGTG 2951 TACCAATAAA AGAGATTTCT GCTTCAAAAG GCTCTCAACA TGAAGGTTAA 3001 CACAGTCAAT CAAACTTACA TTCCTGCCAA GATGCATGGC CAAAAAACTA 3051 AGTATCAAAG CAGCAGAAGG TTTTTGATTA TAGTAACTGA GATGGAATTT 3101 TGTGCCTAGC TCAGTTCTCC AGATCTGGCT AGGAGCAGTC AATGACTAAT 3151 GTTCTGTCCT AGCCAAATTC TCAGGACAAT TTGGGGAGCA GAAAGAGTTA 3201 TGGCAGAGGT TCCACTCATC TACAAAGTCA CAGTCACATG CCACATTTGA 3251 TCTCCTAACC CTGGTGTAGT TTCTTTCAAG AGTGAGAACT TTATTTGTTG 3301 GGCAGAGGCT GTTCCATTGA GAGGAATGTT TACAGCAGTT TCAAAAATGA 3351 CAAAGTCAGT TTGGAGACAG AAAAAGACAA AAGGTCCAGT CTCATCCATC 3401 TCTATATGGT ACATTTGCCT CACTTATGGT TGCCTTAAAG GCAAGAGGGA 3451 AGGTCACCAT CAGTGAACGC AATGCAATCT CAACAGTGTA TTGATTCATA 3501 TTCTCCTAGG GCTCAAACTA CTCTCTATTG GTTCCAGGAT AATGACAAAT 3551 TGAACCATAT GTAAGTAATC TTTTATTTTT TATTTTTTTT TTGAGACAGA 3601 GTCTCACTCT GTCACCCAGG CTGGAGTGCA GTGGCGCGAT CTTAGCTCTC 3651 TGCAACCTCT GCCTCCCAGG TTCAAGCCTC CTGAGTAACT GGGACTACAG 3701 GCGCCCGCCA CCACGCCCAG CTAATTTTTT GTATTTTTAG TAGAGACGGG 3751 GTTTCACTGT GTTAGCCAGG ACGGCCTCGA TCTCCTGACC TCGTGATCCA 3801 CCCTCCTCCA CCTCCCAAAG TACTGGGATT ACAGGCATGA GCCACTGCAC 3851 CCAGCCAAGT GATCATTTTT ATAGGTTAAA ATGATAGGTG AAATGAATAT 3901 AGACACTTTC ATATGGTTCA ACCTAATGAC TTGGTAAATT ATTGCCTTGG 3951 TGTATTAATA ATATGTTGCA TTCTGAACAA ATAACCATGG CTTCCAAAGG 4001 GCCCTAACCT AAAATCGGAG AGTAATTTAT GCTTTGGAGA ATTTGACTCA 4051 AATATATACT TGACCAAGCA CCATGATCCC TAGGGGCATG AGAAAAGCAC 4101 ATAATGGATG TGGATGTGAT AGGTGGTCTT TTCCTGTTAA CAAGCTGGCA 4151 GCAAAGCTTC AGAAAATATA TATGCAAGCA CAACTTGAAG CTGAATTCAT 4201 TTCTGTATTA TATTCTCAAC TCGTTATCTA AAGCATCAGA ACATGTGTTT 4251 TCAGAGATGA GTCCTTTACT ATAAGGTTAA 
TATTTATTTT CATTTTCTGT 4301 ATTATATATG AAAAGTAAAT TAATGTGAAA CCTGGCCCAG CTTGCTGGAA 4351 AGCAGGTTTT AAATTGTAAA TATTCCTTAG AGGAGCAAAT GGATTGTTTA 4401 ATACCATAGT CTCAGTAATC TAGCTTATAT AAGGTCATTA CATTTTTTAA 4451 CTGAAAAACC TAGTTACCTG ATTATTGCAC ATTATAAAAT TGTTTTTCTA 4501 ATACTTTATA GGGCCCAACT TCAGAAAATA CTTCGCTTTT TTCTTTTTAT 4551 GCTTTCGTTT GTTTACCAGC AAGCAACTTC CCTGGGGAAG CCAAACACAT 4601 ATTCATAAAA AAAATCAAGT AGCTGATGTG CAGTTGAGAA AACTAGAGGA 4651 CTGAAAAAAC AAATTTTAAC TAGCAAATGC TGTGAATTAC TCTTCCTCCC 4701 CTTCTCTGAA ATGGGTAAAG GACAAATTGT GTAAAAAAAC CTATGCACTA 4751 TAGAAGGGAA TAGTAACCAT TTCTTTTGTC TCTCTGTTTC TGTTCTGACT 4801 GAGAACCTGC AGCCATTTCT TGTTACATGA AAACAAAATG CTACTTGTTA 4851 CCTCTATTTT TTGTTACTAT ACAATTATGA AATGTAATGT AAGACACCAA 4901 CAGAAATGAT ATACCTGTAA CTGTACCTAT CAGGACTATA CCTCATTTAC 4951 AGTCAGAAAG CTTACTGGGA TGTCAGGAAA TGATACAGGG TTGGTTCTCA 5001 TTTCGTGCCG AAATGAGACA GAAATTCAGT GACGAAGGTG CGTTGTAGGG 5051 GTATTGATGT GCCCCAGGTA GTGCCAGCAG AGTAGGGAAA ACTGCATTTG 5101 CATAAAAACT ACTCTTGACA TGATTGTTCA TTTTACAAAA AAATTCCATT 5151 AATTACCAAG CCCTCACCCA GCCCATGTGT GATAGGATTT ATGTAGGAAG 5201 AAACTTGATT TTCAAATAAT TTTTTAAATG TATCTCTTGC CTAAAGGACT 5251 ATATACATCT AATAAAGTAA CACTGTGTCA TCTTCTGGAG TTATCAAAAA 5301 TTGTATACAA TCAAGACAAC ACAAGAATTA TTTTATTTTT GAGTGCAAAT 5351 ACAGGTACTG TTGGAGTTGA TGGGCACCAT GCTTTCTCAT GAAGTAGCAT 5401 TTCCCTACCA TCAAGCCATT GTTTTGTGCC ATTCAGGAGA GGAAAAAAAG 5451 GAATTTATGC TGTACATTTC AGTTCAGTGT ATGACCAAAA GCAATATGTT 5501 TATAAGAAGA TGTTTGACAT ACTAATTATT TTATATCATT TAAACCATAC 5551 TGTAGCAACA TAATATATGG AGCTAATTTG TAGAATTATT TTTACGATTT 5601 CCAAACAAAT GTACTGTACT GTTATATAAT TTATTGTGAG GACCTTCTCA 5651 TGGAAGCCAT TAGGAAAACA AACTAGAGGT AAATATCACA TTAATCTGTA 5701 TTATCAATTT CTCATAGACA CTGTGCTAAT GTGAATTTTA AATGACCTGC 5751 ATCAAGTCTT CTGATCTCAG ATAACTCAGT ACAGATAGCA ATTAGTCAGC 5801 TGATTTGATT ACAATGGAGT AACCGACAAT ATATTTATTT ATAAAGCACA 5851 TATTCATAAT AACGAGAAGA ATTCAGAAAA CCACTTAAGC AAGACCCTTC 5901 TGAAATAAAA AATGTTGCTT TTTAAATAGT TTGTCCTAAG 
GTGTTTAAAA 5951 CATGTCAACC TTATGTAAGG AAAAATTTCC TGGTCCAAAT AAAGTTGAAG 6001 TTTAAGAAAA ATTG // LOCUS AB002329 6474 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0331 gene, complete cds. ACCESSION AB002329 NID g2224602 VERSION AB002329.1 GI:2224602 KEYWORDS KIAA0331. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0928. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6474) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6474 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0928" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 467..2794 /gene="KIAA0331" CDS 467.
.2794 /gene="KIAA0331" /codon_start=1 /protein_id="BAA20789.1" /db_xref="PID:d1021627" /db_xref="PID:g2224603" /db_xref="GI:2224603" /translation="MASAGHIITLLLWGYLLELWTGGHTADTTHPRLRLSHKELLNLN RTSIFHSPFGFLDLHTMLLDEYQERLFVGGRDLVYSLSLERISDGYKEIHWPSTALKM EECIMKGKDAGECANYVRVLHHYNRTHLLTCGTGAFDPVCAFIRVGYHLEDPLFHLES PRSERGRGRCPFDPSSSFISTLIGSELFAGLYSDYWSRDAAIFRSMGRLAHIRTEHDD ERLLKEPKFVGSYMIPDNEDRDDNKVYFFFTEKALEAENNAHAIYTRVGRLCVNDVGG QRILVNKWSTFLKARLVCSVPGMNGIDTYFDELEDVFLLPTRDHKNPVIFGLFNTTSN IFRGHAICVYHMSSIRAAFNGPYAHKEGPEYHWSVYEGKVPYPRPGSCASKVNGGRYG TTKDYPDDAIRFARSHPLMYQAIKPAHKKPILVKTDGKYNLKQIAVDRVEAEDGQYDV LFIGTDNGIVLKVITIYNQEMESMEEVILEELQIFKDPVPIISMEISSKRQQLYIGSA SAVAQVRFHHCDMYGSACADCCLARDPYCAWDGISCSRYYPTGTHAKRRFRRQDVRHG NAAQQCFGQQFVGDALDKTEEHLAYGIENNSTLLECTPRSLQAKVIWFVQKGRETRKE EVKTDDRVVKMDLGLLFLRLHKSDAGTYFCQTVEHSFVHTVRKITLEVVEEEKVEDMF NKDDEEDRHHRMPCPAQSSISQGAKPWYKEFLQLIGYSNFQRVEEYCEKVWCTDRKRK KLKMSPSKWKYANPQEKKLRSKPEHYRLPRHTLDS" BASE COUNT 2107 a 1193 c 1294 g 1880 t ORIGIN 1 GTTTGGCAAG TCAGTGCAAG AGGCTGACTT CTGAGAGGCT TCCAGGAGCC 51 CGAAGAGAGG ACCTCCACGG GAGAAGGGAG TGCGTGTGCT CGGTTTTTTT 101 TTTTTCTCTC TTTTTTTTTT TTTTTTCTGA ATGAACAGCT TTGCCCAAGT 151 GACTGAAAAA TACAGCTTCT TCCTGAATCT ACCGGCGTAG TTGCTGAAGA 201 GCGCTCTAGA CAGGACATGG CTCTGAAGAC TCACTCTTTG GAATGTCCTC 251 TTGCTCCCGG CTTATAAACA ACTGTCCCGA GGAAAGAAAG GTTTTACATA 301 GCCAAATACA GCCTGACAAA TGGCACTTCG GAACTGTGCT TTCTGATGAC 351 AACGCGTTCG ATTTCTGACA AAGCCTCTCG CACGCTGCCC CTGGAGGGAA 401 GTCCTAAGTA AAACTCAGAC CCTCCTTAAA GTGAGGAGCG AGGGCTTGGA 451 CGGTGAACAC GGCAGCATGG CATCCGCGGG GCACATTATC ACCTTGCTCC 501 TGTGGGGTTA CTTACTGGAG CTTTGGACAG GAGGTCATAC AGCTGATACT 551 ACCCACCCCC GGTTACGCCT GTCACATAAA GAGCTCTTGA ATCTGAACAG 601 AACATCAATA TTTCATAGCC CTTTTGGATT TCTTGATCTC CATACAATGC 651 TGCTGGATGA ATATCAAGAG AGGCTCTTCG TGGGAGGCAG GGACCTTGTA 701 TATTCCCTCA GCTTGGAGAG AATCAGTGAC GGCTATAAAG AGATACACTG 751 GCCGAGTACA GCTCTAAAAA TGGAAGAATG CATAATGAAG GGAAAAGATG 801 CGGGTGAATG TGCAAATTAT GTTCGGGTTT TGCATCACTA TAACAGGACA 851 CACCTTCTGA 
CCTGTGGTAC TGGAGCTTTT GATCCAGTTT GTGCCTTCAT 901 CAGAGTTGGA TATCATTTGG AGGATCCTCT GTTTCACCTG GAATCACCCA 951 GATCTGAGAG AGGAAGGGGC AGATGTCCTT TTGACCCCAG CTCCTCCTTC 1001 ATCTCCACTT TAATTGGTAG TGAATTGTTT GCTGGACTCT ACAGTGACTA 1051 CTGGAGCAGA GACGCTGCGA TCTTCCGCAG CATGGGGCGA CTGGCCCATA 1101 TCCGCACTGA GCATGACGAT GAGCGTCTGT TGAAAGAACC AAAATTTGTA 1151 GGTTCATACA TGATTCCTGA CAATGAAGAC AGAGATGACA ACAAAGTATA 1201 TTTCTTTTTT ACTGAGAAGG CACTGGAGGC AGAAAACAAT GCTCACGCAA 1251 TTTACACCAG GGTCGGGCGA CTCTGTGTGA ATGATGTAGG AGGGCAGAGA 1301 ATACTGGTGA ATAAGTGGAG CACTTTCCTA AAAGCGAGAC TCGTTTGCTC 1351 AGTACCAGGA ATGAATGGAA TTGACACATA TTTTGATGAA TTAGAGGACG 1401 TTTTTTTGCT ACCTACCAGA GATCATAAGA ATCCAGTGAT ATTTGGACTC 1451 TTTAACACTA CCAGTAATAT TTTTCGAGGG CATGCTATAT GTGTCTATCA 1501 CATGTCTAGC ATTCGGGCAG CCTTCAACGG ACCATATGCA CATAAGGAAG 1551 GACCTGAATA CCACTGGTCA GTCTATGAAG GAAAAGTCCC TTATCCAAGG 1601 CCTGGTTCTT GTGCCAGCAA AGTAAATGGA GGGAGATACG GAACCACCAA 1651 GGACTATCCT GATGATGCCA TCCGATTTGC AAGAAGTCAT CCACTAATGT 1701 ACCAGGCCAT AAAACCTGCC CATAAAAAAC CAATATTGGT AAAAACAGAT 1751 GGAAAATATA ACCTGAAACA AATAGCAGTA GATCGAGTGG AAGCTGAGGA 1801 TGGCCAATAT GACGTCTTGT TTATTGGGAC AGATAATGGA ATTGTGCTGA 1851 AAGTAATCAC AATTTACAAC CAAGAAATGG AATCAATGGA AGAAGTAATT 1901 CTAGAAGAAC TTCAGATATT CAAGGATCCA GTTCCTATTA TTTCTATGGA 1951 GATTTCTTCA AAACGGCAAC AGCTGTATAT TGGATCTGCT TCTGCTGTGG 2001 CTCAAGTCAG ATTCCATCAC TGTGACATGT ATGGAAGTGC TTGTGCTGAC 2051 TGCTGCCTGG CTCGAGACCC TTACTGTGCC TGGGATGGCA TATCCTGCTC 2101 CCGGTATTAC CCAACAGGCA CACATGCAAA AAGGCGTTTC CGGAGACAAG 2151 ATGTTCGACA TGGAAATGCA GCTCAGCAGT GCTTTGGACA ACAGTTTGTT 2201 GGGGATGCTT TGGATAAGAC TGAAGAACAT CTGGCTTATG GCATAGAGAA 2251 CAACAGTACT TTGCTGGAAT GTACCCCACG ATCTTTACAA GCGAAAGTTA 2301 TCTGGTTTGT ACAGAAAGGA CGTGAGACAA GAAAAGAGGA GGTGAAGACA 2351 GATGACAGAG TGGTTAAGAT GGACCTTGGT TTACTCTTCC TAAGGTTACA 2401 CAAATCAGAT GCTGGGACCT ATTTTTGCCA GACAGTAGAG CATAGCTTTG 2451 TCCATACGGT CCGTAAAATC ACCTTGGAGG TAGTGGAAGA GGAGAAAGTC 2501 GAGGATATGT TTAACAAGGA CGATGAGGAG 
GACAGGCATC ACAGGATGCC 2551 TTGTCCTGCT CAGAGTAGCA TCTCGCAGGG AGCAAAACCA TGGTACAAGG 2601 AATTCTTGCA GCTGATCGGT TATAGCAACT TCCAGAGAGT GGAAGAATAC 2651 TGCGAGAAAG TATGGTGCAC AGATAGAAAG AGGAAAAAGC TTAAAATGTC 2701 ACCCTCCAAG TGGAAGTATG CCAACCCTCA GGAAAAGAAG CTCCGTTCCA 2751 AACCTGAGCA TTACCGCCTG CCCAGGCACA CGCTGGACTC CTGATGGGGT 2801 GAGACTATCT ACTGTCTTTT GAAGAATTTA TATTTGGAAA GTAAAAAAGT 2851 AAAAAAATAA ATCATCCAAC TTCTTTGCAT TACTTAAAAG AGATTTCTGT 2901 AATACAGGAA TGACTATGAA GGTGTTATAA TAAATTATTC TACATACTCA 2951 TTTGACTGGA TAAACTTTAC ATAAAATTAA CTAATTTTTT AAATAAATGC 3001 ATTGCTTAAT GGTTTCTCAT TATGTTTATC AAAAAACAAC TGTAGCTGTT 3051 ATTTTCAGTA CTTGGCTGCT TTTCTGTGAA AATTATTATT TTACTTTTGG 3101 AAGACAAGAT TATTAGAATA TTGAAGAAAA ATTGGAGACT TATAATCATG 3151 GTAAATATAA AACTAAATAT GTTTTAATAT TTCTGAATTT TTCTTTTCCA 3201 TCACAATGTA AGATATGCAG AATACAAGAT ACTTTGGCAT TCTCATGTGA 3251 ACTTTCTGTA CTCTTTAAGG ATTATTTTAT TAGTGTTGTT TAAGCCATGA 3301 GTGTTAAGTA GCAGGTGTGT TGTGAGTGCT GTAACCCATG AAAGGAAAAA 3351 TGTCATTCTG AGGCTTGTGC CCTTCGTAAA ATATTCATTA AAGTACATTC 3401 ACACTATTTT TGCTTTATAA CACAGTCTTT AATTTTCACT CACTGTGGAA 3451 ATAAAAACTA AGGTAACTTC TCAGAAAGAT ATCAAATCTC AGAAAGAATG 3501 TCAAATCAGA TGAAGTTATA GTTAGGATTC TAACTACTGT AAAAGATTTT 3551 TGCTTCCCTC TTGTGGTAAA AAAAATTATA TTCTCACACA TTTCTTTTTT 3601 CTCTACAGAC GGATATCTGT TTAGGAAAGA TTTGAAAGCA GATTATCAGT 3651 AGGTACATGG ATACATCAAG TTCATTTGCA GAAACAAATA ACTGAAATAA 3701 AAAACATGTT AATCCTTGTA TCATACTTTA ATATGAAAGT ATTGTTTATA 3751 GATAATTTAT CTCACAAGTC AAAAATGAAG ATTTTGCAGC ACTGAAAATC 3801 TATTAAAGCT CCAAATTTTA AGTTTCTAAA TAATCTTCGC TGAAATCTAA 3851 AATATACTAT AACAACCGTG TTTTATTTGT GAAAAAAATA TTAAAGTGAT 3901 TTGCTCTCAA ATATCAAATT TTCTTCTCTC TTTTATATTA AGAGACAGAA 3951 AATTGTTTCA TGAGTTCACT TAACTACTGA GATATTCAGA GCATTTTTAC 4001 CTCTCTCTTA AATGTTATAA AAAACAATTG TATTTTTAAG AATGTTTATT 4051 TATCAAAGTC TTTCCTTCTT CTATTAAATA TTTAGCAATT ACCTTTCTAA 4101 AATATGAAAT TTTGTAAGAT GTTTTCACCT AAATAAAAAT TGAAAGCAAG 4151 TGGATTACAC AGGAGAACCA TTATGAACAT TTATTTAGAT 
ATTAATCTTA 4201 AACAGTGTTT ATTTCAGTTT TCAAAGTTAG CTTATAGGTT ATACATTTAA 4251 GTTAAAGTGC TCATAATCAC TTGCAATTTC ATTGTAAAAT GAACAAATAC 4301 ATAAATATTT TAAGAAAAAT TTAAGTTTAT TCAGATAAGT CACCATGCTT 4351 CAAAAGATCT AAGAAATGCA AATATACTGA AAATTGACAT CCTCTGAAAA 4401 TTCCACTTGC TATTTACCCA AGAATCCACT GGAGGTCATT ACTGCCATTA 4451 AATAATAACT GAAAAGACTA TGTAGTGAAA TGTATTTTTA AAAACTATAT 4501 TCAGTAAAAG CCTGCTCAAT TTGGAGAAAT AGAACCACAA ACACAGATCA 4551 CAGGGGCCTT ACAAAGTTTA TGTCTGAACA AATAAGTCAA TTAAGTACAC 4601 TTTATTGAAA ATTGCCTTCC ATTAACACAC AAGAAAGAAA GCAGGATTTT 4651 CTCCTGTATC TGAATTTTAA AATTAAAAAG GCAGATAAGA CATAAATAGT 4701 TATCATTTTA ATTGCAATAA CACAGACAAG TAGTTAATGA TGATAACAAT 4751 GGTGTAACTT GTAAACTAAA TATTTGGTAA CTGAAGCAAT AGGCAGAGGA 4801 AAATAGCTTT TCTATGACAC AAGTCATAAG AAGTCCATAT ACTGAAGAGC 4851 GTTTGATTAA AATAAAGTGA CTATTAACCA GAAAAGAAAC ATTTTACATA 4901 AAATGCTAAA ATTTATTATA GGAAAATAAA TCAAACCCAA AGAAAGTTTA 4951 TTCAATGCTA ATTTGAAAGA AAATTGATAA GAAAACTTTG AGGGCCCAAG 5001 TCCACAATTT GGTGAGACCA CTAAATTTTA CATATAATTA TACACACACA 5051 TATGTACATA TATATGTATA TAATCTTGCT TCCCGCCTGT TTATGGCAGT 5101 ACTGAAGAGA AATGGGAAAG AAGAGGGAGG GAGAGAGAAA GACGAAGGGA 5151 GAGAGAAAGC AGTTTCCAAG GATATGTTTC ATGTCCCACC ATTTTCTCAG 5201 TTTCTCCCTC TCTCTCCCAA CACACACACA CACACACCCC TCACATACTA 5251 TAAAATAAAT CTTCACTGCC CTATCAAAAT ACAAATAAAT CAATCTATGC 5301 TGTTCTGTCC TTCTTGAGAA TCTAAAACAT ACCACAAAAA TACATCCCCA 5351 GTCTTTTGTT CTGTCTGAGG TTAGAATTAA TTCAAATTCA GAATCTGTTG 5401 TGAGAAATGC CCAGGCTTTA AAAATTAAAA ATGGATGGAT CTTCTCTGAA 5451 CTCAGGGAGG GCACATACTT AGATACCTAC AAGACTTGGA GGAATTAAGA 5501 GTTCACCCTT CATCTCACCA AATTTTCCCC ATTTTTCTCT TTCTTGTAGA 5551 AGGAGAGAAA CCATGCTCTC TAGCAACATT GAGCAAAAAT CATAACCACT 5601 CATCTAATTT CTAAGAGGCA CCTCCATCGA GGGCCGGTCT CCTGCTTCTT 5651 TAGACCTCTT CTATCTTTGT TACAGGAGAG GACCTGTGGA TAGACTTAGT 5701 TTTGACATAA AACAATGCCC ATTCACCTCC TCCTTCAGCA CAACGTCACC 5751 CATTGGGCAA GAGATCCAGA TTTGTTAACA AAAAAGATTT TACTTCGTGA 5801 TTCCACGTCT ATAATTCTAT ATTGCTAATT TTTTCTTTTG TGTGAATTAC 5851 
TGAATATTTC AGAGCAAAGC TATCAACTTG GAGAAACAGG GATTAAAAAT 5901 AAGGATAAAC ACTAATAAGA GCTCTAGAAA AAAGGGAACA GAAAGTCTGC 5951 CTGTTTAGTA AGTGGCAATT CCATACATAT TTTAGAGTTT TTTCTATCTA 6001 AAATTAGTTA AATACTTAGA ATGTTTGTAA TGAGTGTTCG ATATTTGCTA 6051 TAGGTTTTAG GGTTTTGTAA ATCTTCATAG TAATTATAAA CATTTGTAAA 6101 ATTTGTAAAA TACTATAAGT CATTTTGAGT GTTGGTGTTA AGCATGAAAC 6151 AAACAGCAGC TGTTGTCCTT AAAAATGAAT TGACCTGGCC GGGCGCGGTG 6201 GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGTGGATCAT 6251 GAGGTCAGGA GATGGAGACC ATCCTGGCTA ACAAGGTGAA ACCCCGTCTC 6301 TACTAAAAAT ACAAAAAATT AGCCGGGCGC GGTGGCGGGC GCCTGTAGTC 6351 CCAGCTACTT GGGAGGCTGA GGCAGGAGAA TGGCGTGAAC CCGGGAAGCG 6401 GAGCTTGCAG TGAGCCGAGA TTGCGCCACT GCAGTCCGCA GTCCGGCCTG 6451 GGCGACAGAG CGAGACTCCG TCTC // LOCUS D87445 6935 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0256 gene, complete cds. ACCESSION D87445 NID g1665778 VERSION D87445.1 GI:1665778 KEYWORDS KIAA0256. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA4798. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6935) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. 
TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1. .6935 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA4798" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 1425. .3332 /gene="KIAA0256" CDS 1425. .3332 /gene="KIAA0256" /codon_start=1 /protein_id="BAA13386.1" /db_xref="PID:d1014076" /db_xref="PID:g1665779" /db_xref="GI:1665779" /translation="MEQKKLQEALSKAAGKKNKTPVQLDLGDMLAALEKQQQAMKARQ ITNTRPLSYTVVTAASFHTKDSTNRKPLTKSQPCLTSFNSVDIASSKAKKGKEKEIAK LKRPTALKKVILKEREEKKGRLTVDHNLLGSEEPTEMHLDFIDDLPQEIVSQEDTGLS MPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSTITKIHSKRFREYCNQVLCK EIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPN CEKIQSKGGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGA ESLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISF CSVISEPISEVNEKEYETNWRNMVETSDGLEASENEKEVSCKHSTSEKPSKLPFDTPP IGKQPSLVATGSTTSATSAGKSTASDKEEVKPDDLEWASQQSTETGSLDGSCRDLLNS SITSTTSTLVPGMLEEEEDEDEEEEEDYTHEPISVEVQLNSRIESWVSETQRTMETLQ LGKTLNGSEEDNVEQSGEEEAEAPEVLEPGMDSEAWTADQQASPGQQKSSNCSSLNKE HSDSNYTTQTT" BASE COUNT 2199 a 1305 c 1429 g 2002 t ORIGIN 1 GCCGCCTCCT CGGCCAGTGG CGTAGCCGAA TCGGTGTCGC GGCCAGCCAG 51 ATAGGGGCGG AGGTCCGGAA CCCAGTCTGG ACCCGAGCGG GGGGCCATGG 101 AGAAAGCGGC CCGAGGCGCT GTTTACACCG ACTAGCGCGG GCCCGTTGCG 151 GCTGCAGGCA CCATGGACCG AGCCCCCACG GAGCAGAATG TCAAGCTGTC 201 AGCTGAGGTG GAGCCATTTA TTCCCCAGAA GAAGAGTCCT GATACATTTA 251 TGATCCCTAT GGCTCTCCCA AATGATAATG GAAGTGTTTC TGGTGTGGAA 301 CCAACTCCAA TTCCCAGCTA CCTGATTACT TGTTACCCAT TTGTGCAGGA 351 AAACCAGTCC AATAGACAGT TTCCTTTATA TAACAATGAT ATACGATGGC 401 AACAACCCAA TCCAAACCCT ACTGGACCAT ACTTTGCCTA TCCCATTATA 451 TCTGCTCAGC CGCCTGTTTC TACAGAGTAT ACATATTATC AGCTGATGCC 501 AGCACCATGT GCCCAGGTTA TGGGTTTCTA TCATCCTTTT 
CCTACACCTT 551 ACTCCAACAC CTTTCAGGCT GCAAATACTG TAAATGCTAT CACCACAGAA 601 TGCACTGAGC GTCCAAGTCA GCTTGGACAG GTCTTCCCAT TGTCCAGCCA 651 TCGAAGCAGA AACAGTAACA GAGGATCAGT GGTCCCAAAA CAACAGCTTT 701 TACAACAGCA CATAAAAAGC AAAAGGCCGC TGGTGAAAAA TGTAGCTACT 751 CAGAAAGAAA CAAATGCAGC AGGTCCTGAT AGTCGATCAA AAATTGTGCT 801 TCTGGTAGAT GCTTCACAGC AAACTGATTT CCCATCAGAT ATCGCTAACA 851 AGTCTCTCTC AGAGACCACT GCAACAATGC TCTGGAAGTC CAAGGGCAGG 901 AGAAGAAGAG CATCCCACCC TACTGCTGAA TCTTCTAGTG AGCAGGGGGC 951 TAGTGAAGCC GACATTGACA GTGATAGTGG TTACTGCAGT CCCAAACACA 1001 GCAACAACCA GCCTGCAGCA GGGGCTTTGA GAAATCCTGA TTCTGGGACC 1051 ATGAATCATG TGGAATCATC TATGTGTGCA GGTGGTGTAA ATTGGTCCAA 1101 TGTAACTTGC CAGGCAACTC AGAAAAAACC TTGGATGGAA AAAATCAGAC 1151 ATTTTCTAGA GGTGGAAGGC AAACTGAACA AAGAAATAAT TCACAGGATG 1201 AAGATGGGTT TCAAGAACTA AATGAGAATG GAAATGCTAA GGATGAGAAT 1251 ATTCAACAAA AACTTTCTTC TAAAGTATTG GATGATTTAC CTGAAAACTC 1301 ACCAATCAAT ATAGTTCAGA CTCCAATTCC TATTACCACC TCAGTTCCCA 1351 AACGTGCAAA AAGTCAGAAG AAGAAAGCTT TAGCAGCAGC CCTTGCCACA 1401 GCTCAAGAGT ATTCAGAAAT AAGTATGGAG CAAAAAAAAT TACAGGAAGC 1451 TTTATCAAAA GCAGCTGGAA AAAAGAATAA AACACCTGTG CAGCTAGATT 1501 TAGGGGACAT GTTAGCTGCT CTGGAAAAAC AACAGCAAGC AATGAAAGCA 1551 CGGCAAATTA CTAACACCAG ACCTCTGTCA TATACAGTGG TTACTGCAGC 1601 TTCTTTTCAC ACTAAAGACT CTACTAATAG AAAACCTTTA ACCAAAAGTC 1651 AGCCCTGTTT GACATCCTTT AATTCTGTGG ACATTGCTTC TTCTAAAGCA 1701 AAAAAAGGAA AAGAGAAGGA AATTGCAAAA CTAAAACGAC CCACAGCACT 1751 TAAAAAGGTT ATTTTAAAAG AAAGAGAGGA AAAGAAGGGG CGCTTAACTG 1801 TGGACCACAA TCTTTTGGGA TCCGAGGAAC CAACAGAAAT GCACTTAGAT 1851 TTTATTGATG ACTTGCCACA GGAGATTGTT TCCCAGGAAG ATACTGGACT 1901 AAGCATGCCC AGTGATACTT CACTCTCTCC AGCAAGTCAG AACTCTCCAT 1951 ACTGTATGAC ACCTGTGTCA CAAGGCTCTC CTGCTAGTTC TGGAATAGGC 2001 AGTCCAATGG CATCTTCAAC AATAACCAAA ATCCACAGCA AAAGATTTAG 2051 AGAGTATTGT AATCAGGTTC TTTGTAAAGA GATTGATGAA TGTGTGACTC 2101 TTCTTCTCCA AGAGCTTGTC AGTTTCCAGG AACGCATCTA CCAAAAAGAT 2151 CCTGTAAGAG CAAAAGCAAG GAGACGACTC GTTATGGGTC TAAGAGAAGT 2201 TACCAAACAT 
ATGAAGTTAA ACAAGATCAA GTGTGTTATA ATTTCTCCAA 2251 ACTGTGAAAA AATCCAGTCA AAAGGTGGTC TGGATGAGGC TCTCTATAAT 2301 GTTATAGCCA TGGCACGGGA ACAAGAAATT CCTTTTGTGT TTGCCCTTGG 2351 AAGGAAAGCT CTAGGACGCT GTGTGAACAA GCTGGTTCCT GTTAGCGTAG 2401 TGGGAATCTT CAACTACTTT GGTGCTGAGA GCCTGTTTAA TAAATTAGTA 2451 GAACTCACTG AGGAGGCCAG GAAAGCATAT AAAGATATGG TTGCAGCAAT 2501 GGAACAGGAG CAGGCTGAGG AAGCCTTAAA GAATGTGAAG AAGGTACCAC 2551 ACCACATGGG ACATTCTCGG AATCCCTCTG CAGCAAGTGC CATTTCTTTC 2601 TGCAGTGTTA TTTCTGAACC GATCTCTGAA GTAAATGAAA AGGAATATGA 2651 AACAAATTGG AGAAACATGG TGGAAACTTC AGATGGACTG GAAGCATCAG 2701 AAAATGAGAA AGAGGTATCC TGTAAGCACA GCACTTCTGA AAAACCCAGT 2751 AAACTTCCAT TTGACACACC CCCAATTGGT AAGCAGCCAT CATTAGTGGC 2801 TACAGGCAGT ACTACCTCAG CTACAAGTGC TGGGAAATCC ACAGCAAGTG 2851 ATAAAGAGGA AGTGAAGCCA GATGACCTGG AATGGGCCTC ACAGCAGAGT 2901 ACAGAGACTG GCTCTTTGGA TGGCAGTTGC CGAGATCTTT TGAATTCCTC 2951 CATCACCAGC ACCACCAGCA CTCTTGTACC TGGCATGCTT GAAGAAGAAG 3001 AAGATGAAGA TGAGGAGGAG GAGGAAGATT ATACTCATGA ACCCATATCT 3051 GTAGAAGTGC AGCTCAATAG TAGAATTGAG TCTTGGGTCT CAGAGACCCA 3101 GAGAACTATG GAAACCCTTC AGCTTGGAAA AACCCTTAAT GGTTCTGAGG 3151 AAGACAATGT AGAGCAAAGT GGAGAAGAGG AAGCAGAGGC GCCTGAGGTG 3201 CTGGAGCCAG GGATGGACAG TGAGGCATGG ACTGCTGACC AGCAGGCCAG 3251 TCCTGGGCAG CAGAAGTCCA GCAACTGCAG CTCGCTCAAC AAAGAGCACT 3301 CTGATTCTAA TTACACAACG CAAACTACGT AACTCAGGAA ATGTCGGCTC 3351 TCTATCTCCA GCTGTGGAAG GGTTGCAGCC ATTACCTTTT ATGCTTCATC 3401 TCAACATTTT GCACTGTCCA GTATTTAATA TACGTATTTA ATTCCCAACA 3451 AATATTTTTG TAGCTTTTAC TTGTTATGAT CTGTAGCTTA GCTTTTAATT 3501 AGTATCTAAG TGTCTTTCTA AGAACTGTGT GGAAAATTCA GATCTGTTTC 3551 AGCTTATTTT GTAATCAAAA ACAGTGATAA AAAGAAGACC AGATCTTAAA 3601 GAAAATAAAT TTCAAATGCT TACTTAAAAG ACATTTTGAA AGTTAAAGAA 3651 CAAGGTTCTA AGGATAGAAG CAGTTATCAG TGTTTGCTTC AGGACTCCAC 3701 CTCCTCTACT CTAATTTGAC CAAAAAATTG TTTGGGCTTC TTTAAAAAAG 3751 AACTGGGGGT GGAGTCAGAA AATTAAATGA AAGGCTGAGG GTAACTAAGT 3801 CCACCAGTGT TGTATGTTAA AAAATCAATG CAACTTTTAT GTGGTCCACA 3851 AATGTTTAGT CAGAAGTCAC 
TGATTATTGT AATTAATTAG TGTTGGGATG 3901 GGCTAAAACA GAGCCTTCAA AACTTCGGCT AGCAGTGGAG CCACCATCTT 3951 AGATTATAGC TAGCTAGCCT CATTTGTGGA AAGTGATAGA TGCTGTCTAT 4001 AATAGTGAAC AGTCACCCAT GATAGGACCT CCAGGTTCTG TCTCATATTT 4051 GCTTCTTACT TACCTCAGGA ATGCTCTTGT ACATAGACTT ATTTACAAAA 4101 AGCTAGGCAC ATGTTGACAG GTGAATAACT GTAACCGATT GTATGACTGC 4151 TGCACTTACA TGTAAACTCT TCAGAAACAG AGTCTTATAC TGGTGTGTTC 4201 TCTTGCATGC TTCTGGTTCA GGACTCTTGA TTTGAGATAT GGATTTGATT 4251 GAGTATCCAA ACTTGTCCTG AGTGCAAAAC TGTTTCACCT TTTAAAAAAT 4301 ACCTATTTTG CACCTAGCCT TGAGCACCTT CCACATAGCA ATGACCATAG 4351 TTACTGTCAG GAGGTCAAGG AAAGGAACTT TGCACAACTT GTGACATGTA 4401 TCCTGATAAT CAAGGCTTAG AGGAGGAAGT TTTAGAAGAT AAGAGAAAGT 4451 TGTTCTAATT GTGCTGAAAC TATTAGATGA TTTAGAGTAT ACAGATATGT 4501 AGGTATTAAT TCTCTATTCA CTATTATTTA TCTCTGCCCT TCTCTAGGAG 4551 TTTGTATACC TGCTTAGGAG ACAATAAATG AGCTAAATGT TTTATTTGCT 4601 AGTCAGTCAC CACCTGGACT TCAGTGACTT TACAAGTTTA TGTAATGGTG 4651 GAAGAATGAC AAACTATGTA ATTTTTTTGT CTTCCATCCA ACTCCCCACC 4701 ACCCCCAACT GTCCCCCCCA CCCCCCTCAC ACACATGCAC ACATCCGTAC 4751 GTGTGTGTGT TTTCCACTTA CAAGCTTCCA TAAGCAGGCA CAAAACTGAG 4801 AAGGAAGGGG TATTATCCCT GCCCTGATTA TCTGGGGCAG GGCTTTGCCT 4851 CACAGAGGCA GGAGAGAAGA ATTGGGCAGA TTCTTTACTG AACTCATTGG 4901 GACTACTGTG CTAGTTTTGA TGTTTTATAA TGCTGGCATT TAATTACTGG 4951 AGAGATTGGA TTCTTGGTTG ATGATTTAGT ATTTGTGAAT TGTGAAAGTT 5001 CAGGAGCTGT GTAGAAAATG TTAGTCAATC AACTTTATTA TTGTGCTAAA 5051 AGGGGACATT CTTATACTGT CCTGTCTAAA CTGTTCTCCA GTATAGACTT 5101 CCTAGGCACT AAATATCCAA TATTTAAAGG AACACAGCAG GTAAGGAATG 5151 AAGCCTCTGA AATAGTACTC ATGGATTTAT ACATGGCAGA TCTTACTGTC 5201 TCTACACATT TGGAAGTGTT CGTTGGTTTA AAGAAATGAT AGAGGTTTTG 5251 AACTACTGAC AGTCTTAAAA GTGAATTTAA AAACTGTTCA TACTTTTTAT 5301 GGTGTAAATT TCCTTTGCTC GATGTCAGTG ATTCAGATAA CTCTTGACCT 5351 TGAGATGATG GCTTTTCACA GGTTTCTTAT ATTTTATATC TCTTCTGAAC 5401 ATGAATTGTC ATTTTAGATT TTTGACATTT GTATCAAAAG AGAAGTTGAG 5451 GAAATCTTCA GAACACTGGT AACTTTTAGT TTTGCTATAG ACTTCAGAAG 5501 TGTTTATTTA TATGTTCGGT AAATGCTCTC 
GCATATGCAG TACCTCTTCT 5551 GCCAGCAAAT CCAAGGGACC ATAGCCTTTT TATGAGACAG GTCACCTCTA 5601 GAGGACACCC CAAGAATTAT TAAAGGAAAT GTTACCATTT TGAGAGCATG 5651 CTTAAATAAA TATTAATAAT GTCTTTATAA CTTGTTTCCT TTAAATTTTG 5701 GAATATTGAA TTACAGGCTT TGGAGGAGTT GTGAAAATTA GGAAAGTTTT 5751 TATATATTTT TTGAAGTGGG CATGGTTGGC TCTTTGAAGA CCTATAAAGA 5801 GATCCAGTGG GAAGAGTAAG GGTTGGTTCA TCATCACAAG AAATAAAAAA 5851 CATAGTGATT TTTTCTCTTA ATGTGTAGAG GTGGTTTTAC TGGCAATAAT 5901 TAATAATAGA TTTCTATTTC AGTATGTAAG CATATTAACT AAAATATGAA 5951 TTACACTTCC AAAGTTAGAT TTCTGCTTCA GTAGGTTTGT TTGCTGTGAA 6001 GATTACTTCT CAAAAGACAG ATGTTCATAT TAGCTTAATT TTCGGTTTAA 6051 ATATGTTTGT AAATGATGTA ATATATTTCT TTTGACTAAA TGTGGAAAAG 6101 TAATGTGTGT TATACATTGA GAAGTTTTTA CTGGCTTTGA CTGGAGGTTG 6151 TTTTTGCAGA GATGGTATTT TATATGATTC CAGTATTTGG AAAAGAATTA 6201 GTCAAAAGGA ATTCACATAG TTTAAATACT GAGAAATTAA TATCCAAATA 6251 TGTACTTGTC TGATTTCTAA ATAAGCTGGG GGAGGAGGGA GGGGTGGGAA 6301 TTGAAATGTG CAAATGAGTA GTGAATGCTA CACTCATTTT CAACTCTTTA 6351 ACATGAAACT GTTCAATCTT AACACATTGT TACTTTAATA TATGTATAAA 6401 GAAGTATTAC TGTTTGTAAA GCTGCTGTTT GCTTAAAAAA AAAAAACACC 6451 CTTGTCATGT ATTTTCTGTA TGTTGGGCCA ACAGGTTAGA ACATCAACTC 6501 ATTTAAAAAT TTTTATCTTT TTTTGATTTA AAAAAATTCT GTGAAATAAT 6551 TTATTTACAG ACATCTTCCT CCTCCCTCAT CCCTTCCAAC CTTTACATAC 6601 ATCACAGAAT CAACCAAACT GTTTGCCTAA TCTGAAATCT GAATCCTAAT 6651 GAGAAAAATT TAAATTTTGT TGGCACATCA CACCTTGAAA GTATTTGTAT 6701 TATTTTATAA TTTAATTTCT AAATATACCA CATAAGTTTA TAATTTAATG 6751 TCTTAATTGT AATGCTCTAA TAAAAAACTA GCAAAATTAG TGTGAGTTAT 6801 AACATGAAGG GATTTTCATC TTTTGCTGTA TGAAGGATAA TTGTTATATC 6851 ACATTTGGGG GGTAATAACA GCTTTTTTGC ACTATGTAAA TACTAGTGGG 6901 GATTCTTCTG TACTAATAAA ATGATTATTG AAATG // LOCUS AF056032 5000 bp mRNA PRI 04-OCT-1998 DEFINITION Homo sapiens kynurenine 3-hydroxylase mRNA, complete cds. ACCESSION AF056032 NID g3695026 VERSION AF056032.1 GI:3695026 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5000) AUTHORS Magagnin,S., Covini,N., Cini,M., Bormetti,R., Molinari,A., Speciale,C., Post,C. and Benatti,L. TITLE Direct Submission JOURNAL Submitted (30-MAR-1998) Central Nervous System, Pharmacia & Upjohn, Via Pasteur, 10, Nerviano, MI 20014, Italy FEATURES Location/Qualifiers source 1. .5000 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q42-q44" CDS 47. .1507 /function="monooxygenase" /note="hK3OH-2" /codon_start=1 /product="kynurenine 3-hydroxylase" /protein_id="AAC62615.1" /db_xref="PID:g3695027" /db_xref="GI:3695027" /translation="MDSSVIQRKKVAVIGGGLVGSLQACFLAKRNFQIDVYEAREDTR VATFTRGRSINLALSHRGRQALKAVGLEDQIVSQGIPMRARMIHSLSGKKSAIPYGTK SQYILSVSRENLNKDLLTAAEKYPNVKMHFNHRLLKCNPEEGMITVLGSDKVPKDVTC DLIVGCDGAYSTVRSHLMKKPRFDYSQQYIPHGYMELTIPPKNGDYAMEPNYLHIWPR NTFMMIALPNMNKSFTCTLFMPFEEFEKLLTSNDVVDFFQKYFPDAIPLIGEKLLVQD FFLLPAQPMISVKCSSFHFKSHCVLLGDAAHAIVPFFGQGMNAGFEDCLVFDELMDKF SNDLSLCLPVFSRLRIPDDHAISDLSMYNYIEMRAHVNSSWFIFQKNMERFLHAIMPS TFIPLYTMVTFSRIRYHEAVQRWHWQKKVINKGLFFLGSLIAISSTYLLIHYMSPRSF LCLRRPWNWIAHFRNTTCFPAKAVDSLEQISNLISR" BASE COUNT 1548 a 996 c 954 g 1502 t ORIGIN 1 GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG 51 ACTCATCTGT CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG 101 GTTGGCTCAT TACAAGCATG CTTTCTTGCA AAGAGGAATT TCCAGATTGA 151 TGTATATGAA GCTAGGGAAG ATACTCGAGT GGCTACCTTC ACACGTGGAA 201 GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC CTTGAAAGCT 251 GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 301 AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA 351 AGTCTCAGTA TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA 401 TTGACTGCTG CTGAGAAATA CCCCAATGTG AAAATGCACT TTAACCACAG 451 GCTGTTGAAA TGTAATCCAG AGGAAGGAAT GATCACAGTG CTTGGATCTG 501 ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG ATGTGATGGA 551 GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 601 CAGTCAGCAG TACATTCCTC 
ATGGGTACAT GGAGTTGACT ATTCCACCTA 651 AGAACGGAGA TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA 701 AATACCTTTA TGATGATTGC ACTTCCTAAC ATGAACAAAT CATTCACATG 751 TACTTTGTTC ATGCCCTTTG AAGAGTTTGA AAAACTTCTA ACCAGTAATG 801 ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT CCCTCTAATT 851 GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 901 GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC 951 TGGGAGATGC AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT 1001 GCGGGCTTTG AAGACTGCTT GGTATTTGAT GAGTTAATGG ATAAATTCAG 1051 TAACGACCTT AGTTTGTGTC TTCCTGTGTT CTCAAGATTG AGAATCCCAG 1101 ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT AGAGATGCGA 1151 GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1201 TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA 1251 CTTTTTCCAG AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA 1301 AAAAAGGTGA TAAACAAAGG ACTCTTTTTC TTGGGATCAC TGATAGCCAT 1351 CAGCAGTACC TACCTACTTA TACACTACAT GTCACCACGA TCTTTCCTCT 1401 GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA TACAACATGT 1451 TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1501 CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC 1551 AAAATTAAGC ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT 1601 GGAAGATAGT GTTCTGCTTA TAATTAAACT GAATGTAGAG TATCTCTGTA 1651 TGTTAATTGC AATTACTGGT TGGGGGGTGC ATTTTAAAAG ATGAAACATG 1701 CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT AACTATAAAA 1751 GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1801 TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC 1851 GCAACAGCAT TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC 1901 ATGTTAACTG CACTTTTCAT TACGTGAATG GAACTTACCT AACCACAGGG 1951 CTCAGACTTA CTAGATAAAA CCAGAAATGG AAATAAGGAA TTCAGGGGAG 2001 TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC CTTCAAATAT 2051 AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2101 CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA 2151 TCTATCTATC TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT 2201 TGACCAGGCT GGAGTGCAGT GGTGAGATCT GGGTTCACTG CAACCTCTGC 2251 CTCCTGGGCT CAAGCAATCC TCCCACTTCA GCCTCCCAAA 
TAGCTGGGGC 2301 TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC TGCCCAGGCC 2351 AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2401 TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT 2451 AAAGATAAGA AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG 2501 GGTTTTATTC ATAAATTAAA TGTCTACATG TGCCAGAATG GAAAGGAAAC 2551 AAGGGGAGAC AACTTTTATA GAAATACAAA GCCATTACTT TATTCAATTT 2601 CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA CATACTGCAG 2651 CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2701 TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT 2751 CAAACATGAT TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA 2801 AAATTGGGCA ATAAAATAAA ATGATTCAGT GTTTCTTTTC TATATTGTCA 2851 ATGAAAACCT TGAGTTCTAA TAATCCATGT TCAGTTTGTA GGGAAAGAAA 2901 AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG GGGCCCCTAT 2951 AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3001 TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG 3051 AGCTGGCGTC TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT 3101 AAATTTTTAG AGAAACAACA AAACAAAGAA AAAGAGCTTT GAGTCTGTAG 3151 GGGCAGCAAT TTGGGGGAAG CAAATATATG GGAGTTTGCC TTGTAGATTC 3201 CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG TCTCTGAAAC 3251 TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3301 GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA 3351 GGACATGACA CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT 3401 CTAGCTACTT CACACATGTG TACGCGACAG TTATTTTTAC AGTAAGGTAT 3451 TTTCGAGAAA AATGCATTAC GTGTTTTGGA AAATAGAGTA ATTTAAAAAA 3501 TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT GATGTTAGAT 3551 GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3601 ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA 3651 TGGGCAATTC GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT 3701 AGAAACGGGT ATTCCAGACA CTTCTTATGA TGAAAGTCCA AAAGTGGCAT 3751 CCAATTTAAG GCCCCATCTT TCGTTGCCAT TCTTCATTCC TACAAAGGAC 3801 GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC GCTGTCGTCA 3851 ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3901 GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA 3951 
TGGGTCTGAT CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC 4001 CTCTGGCACC TCAGCAGTCG GAGGCACAGA AGCTGCAAAA GGGATCTTCG 4051 AAACTGGGCA GAGAAAAAAT AAAGTGGAAT ATTAAGTAAA AGTTGGGCAC 4101 TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT TAAGTTGTTT 4151 TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4201 TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT 4251 TAACCGGAAT ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC 4301 AATCTGCACA CACTTTCTAT GAGGGCAGGT ACAACTATTA AGAGATTTTG 4351 AACATTAAGT TAGTCCACAA ATATTCAGTG GGCATCTACT AGGTGACAGC 4401 CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA AAACAGATAA 4451 GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4501 AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG 4551 AAAAAAATAC TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT 4601 TCTCCGTGTC CTGGGTACAA CATCGAATAA TATTTCTTGG CCTCCTTTCC 4651 GCTTCTCCTC TCTGCTGTTC CTCTCTACAA GAACCTGGGA GGCCAACGCC 4701 TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT AAATGACTGC 4751 ATAGGACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4801 GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT 4851 TGCTTGTTAT AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA 4901 GTTAATGTAA TTGTTATCCC ATTATTTAGA GCGAAATAAA TGTTGAATAT 4951 ATGGACTTTC TCAGATTAGG AAATACCAAT TAAAAATATA ATAAATAGCT // LOCUS HSEP3C 4575 bp mRNA PRI 31-MAY-1995 DEFINITION H.sapiens mRNA for prostaglandin E receptor (EP3c). ACCESSION X83860 NID g633213 VERSION X83860.1 GI:633213 KEYWORDS prostaglandin E2 receptor EP3 subtype. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4575) AUTHORS Schmid,A., Thierauch,K.H., Schleuning,W.D. and Dinter,H. TITLE Splice variants of the human EP3 receptor for prostaglandin E2 JOURNAL Eur. J. Biochem. 228 (1), 23-30 (1995) MEDLINE 95188908 REFERENCE 2 (bases 1 to 4575) AUTHORS Dinter,H. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) H. 
Dinter, Schering AG, IZMB S109/618, 13342 Berlin, FRG FEATURES Location/Qualifiers source 1. .4575 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" CDS 208. .1374 /codon_start=1 /product="prostaglandin E receptor, subtype EP3C" /protein_id="CAA58740.1" /db_xref="PID:g633214" /db_xref="GI:633214" /db_xref="SWISS-PROT:P43115" /translation="MKETRGYGGDAPFCTRLNHSYTGMWAPERSAEARGNLTRPPGSG EDCGSVSVAFPITMLLTGFVGNALAMLLVSRSYRRRESKRKKSFLLCIGWLALTDLVG QLLTTPVVIVVYLSKQRWEHIDPSGRLCTFFGLTMTVFGLSSLFIASAMAVERALAIR APHWYASHMKTRATRAVLLGVWLAVLAFALLPVLGVGQYTVQWPGTWCFISTGRGGNG TSSSHNWGNLFFASAFAFLGLLALTVTFSCNLATIKALVSRCRAKATASQSSAQWGRI TTETAIQLMGIMCVLSVCWSPLLIMMLKMIFNQTSVEHCKTHTEKQKECNFFLIAVRL ASLNQILDPWVYLLLRKILLRKFCQVANAVSSCSNDGQKGQPISLSNEIIQTEA" BASE COUNT 1285 a 959 c 955 g 1376 t ORIGIN 1 GAAGGCGTGG CTCCCTCCCG GGCCAGTGAG CCTGGCGCCG CCGCGGCCGC 51 GTCCCAGCAG CGGAGTAGGG CGGCGGCTGC GCCCCGCACC ATGGGGGCAG 101 CCCAGCCCCA GCCGCGGTAA ACGCCGACCT CCGCCGCCGC CCGCGCCCGT 151 CTGCCCCCTC CCGCTGCGGC TCTCTGGACG CCATCCCCTC CTCACCTCGA 201 AGCCAACATG AAGGAGACCC GGGGCTACGG AGGGGATGCC CCCTTCTGCA 251 CCCGCCTCAA CCACTCCTAC ACAGGCATGT GGGCGCCCGA GCGTTCCGCC 301 GAGGCGCGGG GCAACCTCAC GCGCCCTCCA GGGTCTGGCG AGGATTGCGG 351 ATCGGTGTCC GTGGCCTTCC CGATCACCAT GCTGCTCACT GGTTTCGTGG 401 GCAACGCACT GGCCATGCTG CTCGTGTCGC GCAGCTACCG GCGCCGGGAG 451 AGCAAGCGCA AGAAGTCCTT CCTGCTGTGC ATCGGCTGGC TGGCGCTCAC 501 CGACCTGGTC GGGCAGCTTC TCACCACCCC GGTCGTCATC GTCGTGTACC 551 TGTCCAAGCA GCGTTGGGAG CACATCGACC CGTCGGGGCG GCTCTGCACC 601 TTTTTCGGGC TGACCATGAC TGTTTTCGGG CTCTCCTCGT TGTTCATCGC 651 CAGCGCCATG GCCGTCGAGC GGGCGCTGGC CATCAGGGCG CCGCACTGGT 701 ATGCGAGCCA CATGAAGACG CGTGCCACCC GCGCTGTGCT GCTCGGCGTG 751 TGGCTGGCCG TGCTCGCCTT CGCCCTGCTG CCGGTGCTGG GCGTGGGCCA 801 GTACACCGTC CAGTGGCCCG GGACGTGGTG CTTCATCAGC ACCGGGCGAG 851 GGGGCAACGG GACTAGCTCT TCGCATAACT GGGGCAACCT TTTCTTCGCC 901 TCTGCCTTTG CCTTCCTGGG GCTCTTGGCG CTGACAGTCA CCTTTTCCTG 951 CAACCTGGCC ACCATTAAGG CCCTGGTGTC CCGCTGCCGG GCCAAGGCCA 1001 CGGCATCTCA 
GTCCAGTGCC CAGTGGGGCC GCATCACGAC CGAGACGGCC 1051 ATTCAGCTTA TGGGGATCAT GTGCGTGCTG TCGGTCTGCT GGTCTCCGCT 1101 CCTGATAATG ATGTTGAAAA TGATCTTCAA TCAGACATCA GTTGAGCACT 1151 GCAAGACACA CACGGAGAAG CAGAAAGAAT GCAACTTCTT CTTAATAGCT 1201 GTTCGCCTGG CTTCACTGAA CCAGATCTTG GATCCTTGGG TTTACCTGCT 1251 GTTAAGAAAG ATCCTTCTTC GAAAGTTTTG CCAGGTAGCA AATGCTGTCT 1301 CCAGCTGCTC TAATGATGGA CAGAAAGGGC AGCCTATCTC ATTATCTAAT 1351 GAAATAATAC AGACAGAAGC ATGAAAGAAA ACACTTAACT TGCATGTGCA 1401 CAGCTTTTGG TAACAAATAT CGCTAAACCT TACTGTGAAT TTAGGCATCT 1451 CTGGCATGCC ACTGTTTATG CATTGAAGTG GAATTTTTGG TATAAAGCTA 1501 AATGGTCTTA GAAGCATAGA AAATCCCTAT GTGCCAAAAG TAGTGAAACA 1551 CAAACAAAGG AAAATATATT AATAACAGTC TAGTGTTTTT GTTGAGTCTG 1601 CCATTCGTAG CTGAATATGT GATTAATTAT GTGATGAAAA CATTTTTTAT 1651 AAATGATCTT GGTCTATTGG GGAGCGGGGA TAGTTAATAT TCCAGTACAC 1701 TGAATACATG AGGAATTTAA CCACATACAT CATTGAAGAC AAGGGATAGC 1751 AGTTTGTTTT TATTCAAAGA CATTGCTGTG TTCTCTTTCA TTGCCTCTCT 1801 CGCTTTCTGT CACTTTTTTC CTCCTTACAT TAAAGAAAAG TTTAATTACA 1851 GTTAAAAATG TATAATGTAT TTATAATATT CATCGATACC ATTATTCAAA 1901 TATTGCTCAA TACAGCAAAT TAGCTCCTAA CCTAACAAAG TTTAAGTTTA 1951 CTTGGATTGA TAATTAGGTT TACTCTTTAT CTGAATAAGA ACCAATTCCA 2001 TTTGTTTGAA ATATGGAGTT TGTGACTACC CAAATTGCTA ATTATTCTTT 2051 CTTTTGAATA TATTTTACAT TTCTATGAGC CTAAGGAAGA TTCTAGAAAC 2101 TGACCTATGA GAGTCGTGAA GTCGTTTTTC AGAATGCTAT GTAAGGACCG 2151 ATTTGAGCAC TAACTATAGG TACTCTGAAT ATATATTTCC CTTGATTATT 2201 CACCAAAAGT GTTCCCCAGT CTTTGACTCT TTAAATTCCA ATACTGATTC 2251 CAAAACAAAT AAATATTTTG AAGACTCAAT GAATACTTTC CATATTTTGG 2301 CCTATTTATA TAAGAAAGTT AATAACATTG ACCCTTCACA GCTCTTCTGC 2351 CTGCTCCTCA AGAGGCTCTA TCTAATATTT ATTACTAAAA TGTTTTTTCT 2401 ACAGTCTACA TGAATACAAA CCTCAATAGC TAAGCTTGAC GTATTTGTGC 2451 ACAAGTAGAT CACTACATTA AGTTTTGGGA ATTGCACTTC TTAAAAATGT 2501 CTCCCCACCA AACATAGTAA TCCTGTAGTT ATGCCTACAC AAAGCTTGCC 2551 ATATTCTTTG GTCGATTCAT TTTGTAAACC CATTAACTTT TTATTGTGAA 2601 GATTTTCATT TGCAGTTTCT TGCACTGCTT TTCTAGTTTT TTAAAAGCTT 2651 GAGATTTATT TATACTTCTT 
GTAGTAACTG CATATTTCTG TGTGTGTTTA 2701 GTGGTAAAGA ATTAATTTTG ATAGGTACAA TATGTCTATC AGATTGATAT 2751 ATACACCAGC CTATGTCAAT TGGGGCTAAT TATTTTAAAT GACCATGTCA 2801 AATTGAATTT GGAGACAAAA TCTGTTGAGA GTGCTTATGT AATTAATGAT 2851 GGTTCTACTA ACTAAATTTT GGAAAAGGTG ATAAATAGAC TATACTAAAA 2901 TCTCTCTATG CCATAGAATT GGATTATCCT GTAGGTCATC TCATTGGGTC 2951 TAAGACAAAA CTACCTACTT TTTTTCAAAA GTGCACTGAA ATCACATAAT 3001 AAAGAGGCTT TACCTCTTGG TTGGTCCTGT GACCCTAAGT TCTAGTCAGA 3051 TAGACACAGA GGCAATGTGA ATTTGAGTGG CATGAGCATG ATTAGGTTAT 3101 TCCTTCCAGC ATCTAGTATA GCACCTGGAA TATAGAAACT GTCTAATACA 3151 TATTTATTCA GTGAATCAAT GAGCAGAAGT TTGCCAGGAC AGTACACATT 3201 GGCAAGGCAC ATACCATATG ATTGAAGTGC TTCATGCCAT TACAGTCCAT 3251 CAGGCTGATA AAGTGAATTA TTTCTGATTA TTTAATTACA GAAATATGAA 3301 TTTATCTTCA AGGGGTTAGT GTCATACTGC TGTACAACAC AGTGCTTTAT 3351 TTATACTAAT AATTTAGGAG ACTGATACTT CCAAATGATA GTGGACATTA 3401 CTATCACAAG AATATCACTT TTCATCAAAC TGCAAAAATA CAGAAAGGCA 3451 AAAAACCTGA CACTTATTCT TAACTGCAAA TTAAATTCCT GCCCAGGGGA 3501 TATATTTTAG GTGGGGATGA ATGGCAGCTT TTGTGTTTTT TTTAACAAGC 3551 TTGAAAGGGG GTGGAAAACA AAGAAATTAT GTAAATGGCA TATGAGTTTT 3601 TATTATCTAG GCATTCGTTA GTATGGGGAA ACCTGATAAG AAATGAAAAT 3651 CCCAAATGAT TTCAGCCTTT TCATGATGGT TGAGGTTAGG TTTCAGAGAT 3701 GTACAGAGAC TAGAGCGGTG GTTAGAAAGA GGATATATGT AGTCACAGCA 3751 GAAAGACGTG TCTAAGTTTA ATTTTATTGG CTTTCAAGTT CACTCATGTA 3801 TACTTAGTTT GTCCATACAT ATGTCTAATC AGGAAAAATG CATGTATAGA 3851 TTATGACAAT TCCTGAATTT TGAAGTATTG GTTAAAAGAC AATTAAAGGC 3901 CAAGAAAACC ATGGTGGAAG AAGTAAGCGA ATGAAATGGA GAAATATATG 3951 TAAAATTAGC AAGTGTCAAT TTTACCAAGT AGTGTTGATT TTCCAAACCC 4001 TGAATTTATA TACTATGCTG AGTCACAGAG AAGAATGATC ACATGTTACT 4051 TAATGAGAGC AGTTTACTTT TCAAATAAAA TAGGTATGAT GAATGTCTTA 4101 AAAATATCTT GAAGTTGAAG AAACACAAAT GAGTTATCTC AATATTTACC 4151 AAGTTAACCT AGTGCTGTAT ATATCCCAAG ATATTTTAGG TAAATGTAAG 4201 TGTTTAATCA TGCCAGATTT AAACTAGTCT GAAATATAGG GTATACATAT 4251 ATTTCTACTT ACATTTCTTT ATTTTATGAA ATATCCGACC ATGTTGCAGA 4301 AAATAATGCA AAACCTCATG TAAGTTAACT 
ATGAAAGATC CTGTGAGCAC 4351 ATTGGCATTG AGTGACAGAC AAACTAAAAA CTGGCAAACA GTATTTTAAT 4401 AAGGGGGTCA CTCTGTGGCA GTATTCTAAT ATTGGATTTT CAAGTAGATT 4451 AGGCTTTTTA TTTATTCAAC GCTTTTTATA ATTTTGTTCT TTTTGACTCC 4501 AAATTATTGG TCAGCTTTCA ACCTTCTCCA CATCAGCAAT CACTAATAGT 4551 TCTTTTGGTT GAGATCAACT CAGAA // LOCUS HUMORFR 4460 bp mRNA PRI 06-FEB-1999 DEFINITION Human mRNA for KIAA0040 gene, complete cds. ACCESSION D25539 NID g436219 VERSION D25539.1 GI:436219 KEYWORDS KIAA0040. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4460) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4460) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REMARK Erratum:[[published erratum appears in DNA Res 1995 Aug 31;2(4):211]] REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. 
The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1. .4460 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1. .815 gene 816. .1277 /gene="KIAA0040" CDS 816. .1277 /gene="KIAA0040" /citation=[3] /codon_start=1 /protein_id="BAA05022.1" /db_xref="PID:d1005563" /db_xref="PID:g436220" /db_xref="GI:436220" /translation="MHYVHVHRVTTQPRNKPQTKCPSGGQSQGPRGQFLDTVLAAMCP IAMLLTADPGMPPTCLWHTPHAKHKEHLSIHLNMVPKCVHMHVTHTHTNSGSRYVGKY ILLIKWSLAMYFVQGSTLSTVTKMSHGKALPDSDTYIQFPNQQGPHTPSIP" 3'UTR 1278. .4460 BASE COUNT 1245 a 1079 c 991 g 1145 t ORIGIN 1 CGGGGCAGCA ACCAGGAGAT TCCCTGGGCC TGCAGGAAGC CCTTCCGCGG 51 ACCGAAAGAT TGTTCCCCAT TTTGGAGATG AAGAAACTGA GACTCAAAGC 101 AGCTGAGTGA CCTTCCCAAG GACACACACT GAACTGGGCG GTGATCAGGA 151 TCTGAATGCA CAGGGCGGGT GTTCAGCGAT TGTTTACTAC GTTGAACGTG 201 ACCTCCAGGA AAGCAGTTCT GGCCGAGATC CCCTGACAAC GCAAAGCAAG 251 AAGTAACGTG GAAGGAGGCT CCCCAAGCTG GCTGGCCATT TTGCTGCTGT 301 GTGTGGAGGT GCTGTCAGTG GCATGCCCAA ACCCAAAGCT GGAAGAGGAA 351 TAAATTACAA GTGGTCAAGG TTGCATCCTT TTGAGCTCAG GACCTGCTTG 401 TAAGCCGAGA GGGTTCTCTG GCCCTAATCT AGCCAAGCAC CATGGAGAGA 451 ATCAGTGCCT TCTTCAGCTC TATCTGGGAC ACCATCTTGA CCAAACACCA 501 AGAAGGCATC TACAACACCA TCTGCCTGGG AGTCCTCCTG GGCCTGCCAC 551 TCTTGGTGAT CATCACACTC CTCTTCATCT GTTGCCATTG CTGCTGGAGC 601 CCACCAGGCA AGAGGGGCCA GCAGCCAGAG AAGAAAAAGA AGAAGAAGAA 651 GAAGAAGGAT GAAGAAGACC TCTGGATCTC TGCTCAACCC AAGCTTCTCC 701 AGATGGAGAA GAGACCATCA CTGCCTGTTT AGTTAGGCAG GAAGCAGAGG 751 TGTTTCCTTT CTGGGGCTAA GCCTCCTTCT GACCACACAC AGACATTTCA 801 GGAACCCCTG AAATAATGCA CTATGTCCAT GTCCACAGAG TAACTACTCA 851 ACCAAGGAAC AAACCTCAGA CTAAGTGTCC CAGTGGAGGG CAGTCCCAGG 901 GACCACGTGG ACAATTCTTG GATACTGTCT TGGCAGCTAT GTGTCCAATA 951 GCAATGCTCC TTACTGCAGA CCCAGGCATG CCTCCCACCT GTCTCTGGCA 1001 TACCCCACAT 
GCAAAGCACA AAGAACATTT ATCCATACAT CTCAATATGG 1051 TTCCCAAGTG TGTGCACATG CACGTAACAC ACACACACAC AAATTCAGGT 1101 AGCAGGTACG TGGGCAAGTA TATTCTGCTC ATCAAATGGT CATTGGCTAT 1151 GTACTTTGTG CAGGGAAGTA CATTATCTAC AGTCACAAAA ATGTCTCATG 1201 GGAAAGCCTT GCCAGATTCA GACACATATA TACAATTTCC TAACCAGCAA 1251 GGCCCCCATA CACCATCTAT TCCATAAACC ACTCAGGTTA CAGATGCATG 1301 CTTTCCTATT TCTAACTCTA CACATAAACT TTTACTGGAA GTACTCATAA 1351 TTGGACATTC CAGCAACCTG CTACAGTCCC CACCCTTGTG TGTCTTGATA 1401 CAGACACACC AAGTTTCTGT GCCTCTGACC CCTCACCTGT GCCAAGATGT 1451 TTAAAGTGTG ATGGTTCAAA ATTCATTGAA AGCTCTTTTC TTGTAACTCA 1501 TGACAAAGTC CGTCCTCATT GCCACTGAGA GGTGTTTAAT GTGATCCAAG 1551 ACCTCTCTGT GAAACATTAC CCCCGCAAAC CACTCAGCAA AGTGCCTTTC 1601 TCCAAGCAAG AACAAAGAGC TCTTGGTGGT GACTGCTAGA AAATTATGGA 1651 AGCCCACTCA TTTATGTCAG TGGACTGCAA CTGTGTACCT GTGCAATGTT 1701 TACAGATGGA AAGGGTGAGG AGATGCTACA CCTGAGCTAG GTATCTCCTA 1751 TATAACCAAA GTTTCCAGCA GGGAAGGAAC TAGACAATCA TCAGTGCAGT 1801 CTCACAGAAG GCAACACTGG AAGTGATGTC ATAAGGTTGT GATGTGTGCA 1851 CGGTACGGCA CAGGTGGGAT GCAGAGGTAA CAGAGTTTAA ATGAAAGTAG 1901 GATGAAGCTA TAAAGAGGTT TATTTATATT TATATTGAAG CTCAGGCAAG 1951 TGCCTTGCAC ACAGTAGGTA CTTATAACTA ACTGTGGTTA CTGTTGGATA 2001 TGTGATGTTG TTAAGGGTAA GCTTGTAATA CCTCACCAAT TCTCTGCGAG 2051 TGATCTTCTC TTCTAAGTGA GCCCACTAAT TGCTGCAATG GATGAAATTG 2101 GGTGTTTAAT GCTGGAGAGC ACATGTAGGT GACACATGTG CCTTGAGGTA 2151 TGTGAGGACA TGTAAATTAG ATCCACAGTG AGCTGAGGAG GGCTTTCCCC 2201 GCCAGAGTGA GGTTGGGAAG CAGAGTTAAT CCACTTATAG GATGAACTGC 2251 TTGGTATTTT TATTGTATTG TGACTGTATT ACAAAGATGG ACAATTCACT 2301 CCTTGGGAGC AAGTTATGCT CTAGAAGTTT ATTTACAAAT ATGCTGGGCA 2351 GCTCTCTTGA AATATTTTCC CAAGGAAGCT ATTCTACACA GTGGCAAAAT 2401 TGCTATCTAA TTAATAATGT AGCTAAACTA TGATATTTAT AGTAGCAAAA 2451 AACTAAATTC TATAAGATTG CATTAAAGGA AAGATATATT CTATTTGCTC 2501 ACTTGGGCTG CTTGGTACTC ACCTGCCCTC CAGGTGTACT TTAGGCCTGT 2551 GGAGGGTGGG CATTTAGTGG TGACCCTTGC ACCAGGGTTT TCTAACAGAT 2601 GACCCTGTGA ATCATAATTT AAACCTGCAT ATATTTTATA GCCAGTCACA 2651 TTTGCCCTCT CACCCTATAT 
GGCCATAAAC TGCCTAAGCA CTCAGGCCTC 2701 CCACTCATCA ACCCCTTTGA CCAGAGAAAG AAGCACTCTG GTTCTCTATC 2751 CCCTTGTCAC ATAGAGAGTT TGTCATGGGG CCTCTGGCTG TGCCCTTCAC 2801 ATAACAGAAT AACTTGCCAT CTGCCTGCAC CAAACCCAGG GATGTGGAAG 2851 ACATCTCCCC ACAACTGCCA CTGCTCACCA GGACAAGCTG CCCTTCCTGT 2901 CTCCACCTCT CAGTCCCCCT AGAATGGATG GCTGGGGAGA GGTGGAGGCT 2951 GACAGCTGAG ACGTAGTGTC AGATATGATC TAGGAGGGCG GATCACCGGG 3001 ATCCGGGACC ATACAAGTAA CATGGTTTCC ATGGCAACTG CTTGCTCGTT 3051 TGAATTAAGA CAGCAGTCAG TTGTCATTGC CATGACAAGG CCTCTATCTC 3101 CAGGCACAAT GTCCCTGCTG TCTCCTAATC CAATGGACTT GCTCTCACCC 3151 CAGGGATGAA ACACCCAGAA ACTCACTTCT CAGTCACTTC CACAGCCGAT 3201 GACTCAGAAG AGCCAAACCC AGAATGGGGC CTCTCTTTTC CCCATCACAG 3251 ACTCCCCTGA CAACCTTTCC TGGCGTAACT AGAGGAGTCC CAGTGCAGGA 3301 TAGGCCCTAA ACGTTTTGTT AAATAAACAG GTGCATGAAA GGAGCCTAAG 3351 GCCATTGTTG ATATCCACTC TCTTCTTTCC ACTTCCTTCT CATCTTTTTC 3401 TCCATGTTTT ATGCTTCTCT GATTCCCTCT TCTGCCTGCA CCAGACCAGC 3451 CCCAGCCCTT TATTCCTCTC CATTTTCACT CCTTCCAGCC TCTGTCCCTG 3501 AACTGCCACT GGCAACCCAT GGGACCTCAG GACCAGAGAC TGCTTGACTC 3551 ATCTGGGGAG GGTAAGTTCA CGGGGGACAA AAAAATGATT CCTAAAGAAG 3601 AGGCTTCCTA GACCAGCACA GGCTCCAGAA AGACATCCCC TAGGCCTGGA 3651 CTTCTGAGCA GCTTTAGCCA GGCTCCGGAC GGCAGCCAGA GGAGGCCTTT 3701 CCCCATTGCT CCTTTCCCCA TTGCTCAATG GATTCCATGT TTCTTTTTCT 3751 TGGGGGGAGC AGGGAGGGAG AAAGGTAGAA AAATGGCAGC CACCTTTCCA 3801 AGAAAAATAT AAAGGGTCCA AGCTGTATAG TATTTGTCAG TATTTTTTTC 3851 TGTAAAATTC GAACACACAC AAAAGAAAAA TTTATTTAAA TAAAATACTT 3901 TGAAAATGAA AAGTCTTGAT GTAGTCAGAT GGTTACTTTC TTAACATTAG 3951 GTATTACCCC CACTCAGACA TCACTCAGAA ATGATCAATG CAGGGACTCT 4001 TTCTGTGACA CAAATGTCCC AGCCCTCCCT GGTCACCGCC TTCGCCATGG 4051 TAGAGTCGTA GGTCTGAGGA TGAGGAATGT GGCTGTCTCA CCCTTGCTTG 4101 CAAAACAGAT GGCCTTGGAG ACCAGACTCC CTCAAAGGTG CCAGCTACAG 4151 GAAAAATACA CTGATGTTCC TTGGCAACAC TTACAGAACT TTCCATCAAT 4201 GAGGGTCCAT CAATGGCTTC TTAAAGGAAA AGGGGGGAAA TAGCAAAAAC 4251 CTAAGGAAGA ATGGACCTTT GAGTTAAATC CAGTGTTTGT TGGGAAAGGA 4301 GGGATCAAAA ACCTCTATAG TAGCCACTAG 
GGCAAAAACT GTGTGTATGT 4351 GTGTGTGTAT GTGTGTGTAC ACTGTTCAAT ATGGTTCAAT ATGGTACCAA 4401 TAGCCACATG TGACTATTTA AATTCATTGC AATGAAATAA AATTAAAGGT 4451 ATACTAGCTC // LOCUS HSZFX2 5527 bp mRNA PRI 14-JUN-1991 DEFINITION Human ZFX mRNA for put. transcription activator, isoform 2. ACCESSION X59739 X17312 NID g38021 VERSION X59739.1 GI:38021 KEYWORDS sex determination; transcription activator; ZFX gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5527) AUTHORS Schneider-Gadicke,A., Beer-Romero,P., Brown,L.G., Mardon,G., Luoh,S.W. and Page,D.C. TITLE Putative transcription activator with alternative isoforms encoded by human ZFX gene JOURNAL Nature 342 (6250), 708-711 (1989) MEDLINE 90081847 FEATURES Location/Qualifiers source 1. .5527 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1. .5527 /gene="ZFX" /note="alternatively spliced isoform 2" /evidence=experimental gene 1. .5527 /gene="ZFX" CDS 79. 
.2493 /gene="ZFX" /note="alternatively spliced isoform 2" /codon_start=1 /product="ZFX product, isoform 2" /protein_id="CAA42417.1" /db_xref="PID:g38022" /db_xref="GI:38022" /db_xref="SWISS-PROT:P17010" /translation="MDEDGLELQQEPNSFFDATGADGTHMDGDQIVVEVQETVFVSDV VDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDV TEEVSLAHCTVPDDVLASDITSASMSMPEHVLTGDSIHVSDVGHVGHVGHVEHVVHDS VVEAEIVTDPLTTDVVSEEVLVADCASEAVIDANGIPVDQQDDDKGNCEDYLMISLDD AGKIEHDGSSGMTMDTESEIDPCKVDGTCPEVIKVYIFKADPGEDDLGGTVDIVESEP ENDHGVELLDQNSSIRVPREKMVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAA AAGHAPVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHIDESAGLGRL AKQKPKKRRRPDSRQYQTAIIIGPDGHPLTVYPCMICGKKFKSRGFLKRHMKNHPEHL AKKKYRCTDCDYTTNKKISLHNHLESHKLTSKAEKAIECDECGKHFSHAGALFTHKMV HKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFPHICVECGKGFRHPSELKKHM RIHTGEKPYQCQYCEYRSADSSNLKTHVKTKHSKEMPFKCDICLLTFSDTKEVQQHAL IHQESKTHQCLHCDHKSSNSSDLKRHIISVHTKDYPHKCDMCDKGFHRPSELKKHVAA HKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKGFRQQSELKKHMKTH SGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRRPSEKNQHIMRHH KEVGLP" BASE COUNT 1671 a 1022 c 1141 g 1693 t ORIGIN 1 GCCGGCCTGC AGCACCCGCC ACCGTCGCGG CCGCCCGCAA CGTCCGTCCG 51 GAGCTGTGAC TGATGAGAAT TAAAGGCCAT GGATGAAGAT GGGCTTGAAT 101 TACAACAAGA GCCAAACTCA TTTTTTGATG CAACAGGAGC TGATGGTACA 151 CACATGGATG GTGATCAAAT TGTTGTGGAA GTACAAGAAA CTGTTTTTGT 201 TTCAGATGTT GTGGATTCAG ACATAACTGT GCATAACTTT GTTCCTGATG 251 ACCCAGATTC AGTTGTAATC CAAGATGTTA TTGAGGACGT TGTTATAGAA 301 GATGTTCAGT GCCCAGATAT CATGGAAGAA GCAGATGTGT CTGAAACGGT 351 CATCATTCCT GAGCAAGTGC TGGACTCAGA TGTAACTGAA GAAGTTTCTT 401 TAGCACATTG CACAGTCCCA GATGATGTTT TAGCTTCTGA CATTACTTCA 451 GCCTCAATGT CTATGCCAGA ACACGTCTTG ACGGGTGATT CTATACATGT 501 GTCTGACGTT GGACATGTTG GACATGTTGG ACATGTTGAA CATGTGGTTC 551 ATGATAGTGT AGTGGAAGCA GAAATTGTCA CTGATCCTCT GACTACCGAC 601 GTAGTTTCAG AAGAAGTATT GGTAGCAGAC TGTGCCTCTG AAGCAGTCAT 651 AGATGCCAAT GGGATCCCTG TGGACCAGCA GGATGATGAC AAAGGCAACT 701 GTGAGGACTA CCTTATGATT TCCTTGGATG ATGCTGGAAA AATAGAACAC 751 GATGGTTCTT CTGGAATGAC 
CATGGACACA GAGTCGGAAA TTGATCCTTG 801 TAAAGTGGAT GGCACTTGCC CTGAGGTCAT CAAGGTGTAC ATTTTTAAAG 851 CTGACCCTGG AGAAGATGAC TTAGGTGGAA CTGTAGACAT TGTGGAGAGT 901 GAGCCTGAGA ATGATCATGG AGTTGAACTG CTTGATCAGA ACAGCAGTAT 951 TCGTGTTCCC AGGGAAAAGA TGGTTTATAT GACTGTCAAT GACTCTCAGC 1001 CAGAAGATGA AGATTTAAAT GTTGCTGAAA TCGCTGACGA AGTTTATATG 1051 GAAGTGATCG TAGGAGAGGA GGATGCTGCA GCAGCAGGAC ACGCGCCGGT 1101 GCACGAGCAG CAAATGGATG ACAATGAAAT CAAAACCTTC ATGCCGATTG 1151 CATGGGCAGC AGCTTATGGT AATAATTCTG ATGGAATTGA AAACCGGAAT 1201 GGCACTGCAA GTGCCCTCTT GCACATAGAT GAGTCTGCTG GCCTCGGCAG 1251 ACTGGCTAAA CAAAAACCAA AGAAAAGGAG AAGACCTGAT TCCAGGCAGT 1301 ACCAAACAGC AATAATTATT GGCCCTGATG GACATCCTTT GACTGTCTAT 1351 CCTTGCATGA TTTGTGGGAA GAAGTTTAAG TCGAGAGGTT TTTTGAAAAG 1401 GCACATGAAA AACCATCCCG AACACCTTGC CAAGAAGAAA TACCGCTGTA 1451 CTGACTGTGA TTACACTACC AACAAGAAGA TAAGTTTACA CAACCACCTG 1501 GAGAGCCACA AGCTGACCAG CAAGGCAGAG AAGGCCATTG AATGCGATGA 1551 GTGTGGGAAG CATTTCTCTC ATGCAGGGGC TTTGTTTACT CACAAAATGG 1601 TGCATAAGGA AAAAGGAGCC AACAAAATGC ACAAGTGTAA ATTCTGTGAA 1651 TACGAGACAG CTGAACAAGG GTTATTGAAT CGCCACCTCT TGGCAGTCCA 1701 CAGCAAGAAC TTTCCTCATA TTTGTGTGGA GTGTGGTAAG GGTTTTCGTC 1751 ACCCGTCAGA GCTCAAAAAG CACATGAGAA TCCATACTGG GGAGAAGCCG 1801 TACCAATGCC AGTACTGCGA ATATAGGTCT GCAGACTCTT CTAACTTGAA 1851 AACGCATGTC AAAACTAAGC ATAGTAAAGA GATGCCATTC AAGTGTGACA 1901 TTTGTCTTCT GACTTTCTCG GATACCAAAG AGGTGCAGCA ACATGCTCTT 1951 ATCCACCAAG AAAGCAAAAC ACACCAGTGT TTGCATTGCG ACCACAAGAG 2001 TTCGAACTCA AGTGATTTGA AACGACACAT AATTTCAGTT CACACGAAAG 2051 ACTACCCCCA TAAGTGTGAC ATGTGTGATA AAGGCTTTCA CAGGCCTTCA 2101 GAACTCAAGA AACACGTGGC TGCCCACAAG GGCAAAAAAA TGCACCAGTG 2151 TAGACATTGT GACTTTAAGA TTGCAGATCC ATTTGTTCTA AGTCGCCATA 2201 TTCTCTCAGT TCACACAAAG GATCTTCCAT TTAGGTGCAA GAGATGTAGA 2251 AAGGGATTTA GGCAACAGAG TGAGCTTAAA AAGCATATGA AGACACACAG 2301 TGGCAGGAAA GTGTATCAGT GTGAGTACTG TGAGTATAGC ACTACAGATG 2351 CCTCAGGCTT TAAACGGCAC GTTATTTCCA TTCACACGAA AGACTATCCT 2401 CACCGGTGTG AGTACTGCAA GAAAGGCTTC CGAAGACCTT 
CAGAAAAGAA 2451 CCAGCACATA ATGCGACATC ATAAAGAAGT TGGCCTGCCC TAACAATACT 2501 TCTACAGAAC GTTTGTAGAG ATATTGGCCT TGAAGCAGAA AATTCATTTT 2551 AAAGCCAATC AGTCTCATTC ACATACAATA CTGTATATTG ATTTATGCTG 2601 TGTACAAATA GAATTATTAC TTCTAGTTGA CTTTTTTTTA AATATACATT 2651 TTGCTCAGTA GTGTGTTCTG AATTCTATTC AGTTTGTTTA ATAAATAGGG 2701 AAAACTGGCA ACATGCTAGT TACTTTTAAT AAAGTAATCC CTGATTCTAT 2751 ACCGAAGTTT TATATCTTAG AATTTTATAT TTATTTAAAT ATTTACCTTG 2801 CTTACCTTGA TGGTACTCTT CTAAGACCAT TAACTTAAGG TAACTTTATA 2851 TTGGTAACTC TGAAAGTATT CATGTTGACT CATTTTTTTC CCCATACATT 2901 TCTCACAATA AAATTGTCAG AGACATCTAC TAATATAAAT GGGAGATTTT 2951 ACAGTCAGGT CTAATTATCA TAACATGGAA GTCATTTACT TGTCTTGCTT 3001 AATATTTTCA GACCACTTGA CAGTGAAAGT TTCCATTTGA GCTGTTGCGT 3051 CCCTGGCTTT GCTGAGTAAA GAGCAGTGGC TGGGTTCGTG TTTACTTTTC 3101 AAATATACTT CCTTTTCGGC TTTCTTTGGA TTATTTACAT CTTTTGTCCA 3151 GCGTAGCAAA CTTTTAGAAA ACCTTATTGA AAAACTGTGC TTGCTCATGT 3201 TGTATTTTGA TTATTCTGTC TGTGCGGCTT CATCTTGGAA TGGTTGTGTG 3251 CTACAAATGA CACTTACTGA GGACTGCATT TTGGAATCTC CTAGAGGTAA 3301 CTCATGGCTT ATAGGATCTT TTGCAACTTT ATGTATGTAA ATGTACCCTG 3351 AATTATATAT ATACACATAT ATATCATGTA CCTGTGTGTA TTGCTTATTT 3401 TACATATTTA TACACACAAC CCCAAGTAGT AGTTGTTTAA AATCTATAAT 3451 GAAAAGTATT AAATTTACAA TAACATGAAA GATCCAGGGA TGCATGAGAG 3501 AGCATTTTGT AAGTCATGCT CTTCAGAGAG ACTACTCAGG TGAAGAATTA 3551 GAAGGAAAAT AAGGACACTA GTATTTTTAA AGAGTAAAGA TATTTTCTTT 3601 TAAATATCTT TGGTAATTGA AACATAGAGG TTAAGATGTT TCTAGGTAGA 3651 ATGTTTTCAT ACAATTTCAC CTCCATGTCT TTATGTTTTT CTGAAAAGCA 3701 AATGAGTATC CAGACATGAC TCCCACAGTT CTCTTTGAGA AGCCTGAGAG 3751 GGAACTCTGT CTTACCTAGT GAGGGGGATG GAAAAGAAGT GATGGCTCTG 3801 TGGACCAGAG AACGGGTGCT AATTATGACT TCACACTCGG CAAGTTCAGG 3851 CTGATCTGTT ATTTCTCAGT TACAGTTAGC AAACTTTAAA AACTTAACAC 3901 TCAAGTTGGC TTTGATTAAA AGGTAAAGAT GTGTTTTAAG TGGATAAGGA 3951 AAGTCTGAGG CCTTATTTGG AACATCACTA AGTCTTCCAC AAGGTTTTTT 4001 GTTTGTTTGT TTTTTTTTTG TTGTTTTTTT TTTCTTAAGA CGGAGTCTTG 4051 CTCTGTTGCC CAGGCTGGAC TGCAGTGGTG TGATCACTGC AAACCTCTGC 4101 
AGCTCACTGC AACCTCTGCT TCCTGGGTTC AAGCAATTAT CTGCCTCAGC 4151 CTCTCGAGTA GCTGGGATTA CAGGCGCCCA CCACCACGCC CGGCTAATTT 4201 TTTGTATTTT TAATAGAGAT GGGGTTTCAC CATCTTGGCC AGGCTGGTCT 4251 TGAACCCTGA CCTCGTGATC CACCTGCCTC GGCCTTCCAT AGTGCTGGGT 4301 TTACAGGCGT GAGCCACCGT GCCCAGACAC CACATAGGTC TGAATCAGTG 4351 TCATACATTC ATAAAACAAA CTCGGTTAAT TAGAACTTGG TTATGTTAAG 4401 ACGAATCTGG GAGAACAGAA AACAGTTTTT GGGGTCCCTT CAGTTGGCTA 4451 TTGGTCCGTA TGCATCTAGC ACATTGTAGG AGATTTAGAA ATTGTCTTCC 4501 CACCCGATAG CTGCCTTGTC ACCTCATTAT GGTGCTCCAT CCCCTGTGTG 4551 CTTAGGTTTT TACCTTTCAT CTTTCTCTTT GCCATTGATG TTTGTATTCA 4601 AGAGTTATAT TTTTAGGGTT AGAAATCAAA ATATTTGGTG TTTGGCAAAC 4651 CTCTGAAGTG CTAGACTGAT TTAGTCTAGT TTTAAACCAA GTGCTTTAGG 4701 CAGGTGTGAA CTCCAGCCCA AATGCCAGTC AAAGTCAAGG CATGGGTTTT 4751 CCTAGCCTAT CTTATAGGAA ATTCCTGTAC CTTCTTGGCC CCCATAATGT 4801 GTTTTTTTTT TTTTTTTTTT TTAAAACTAA CTTACAATTT TGTGATCCGT 4851 GATTCATTGC CCTGCGATTC TTGAAAGCTC TGTCTGTTTT TTTGTGAGAA 4901 CCTTTAAAAT CTCCCTTAAT TTTTATTTTC CCAGAAATAA TGTAAAAACA 4951 CTTAAATGAA AGTGGAAATG TATTAATTTT AAATCCTATA AAATTAATAC 5001 AGAAAATATA AATGATTGGG TCATTTAACT ATATTTTTTT AAATAAACTG 5051 AAAGATAAAG AACACAACAC TTCACACATT TTATATTTCT CTTACATACT 5101 CCGGAATCAT ACACAGTTCT TTTTAAAGCA CAACATTAAA ACCTTTAAAA 5151 GGTATTTAAG GGTTTGGTCA AGTGAATATG ATAAAACATA CTTGTCTGTA 5201 TAAAGAGAAA ATGAAATTGT AGTCACTGTT ATGTACTGAC ATTAGTTACA 5251 ACCTAGTTTT AATTCTTAAA ACATTTTGAT TAGCAAAGCT AAAAAAAAAT 5301 GGATGTTTCA GTTAAATGTT TTAAAGAGGT ACAGATTTTT ACAAGGACAT 5351 AATATAAGTT ATTGTTCTGT AGAAATATCC TATTAAATAT TGTATGTCCC 5401 TCCCTCTGTA CACTTTGTAA AGAAAGTAAA ATACATAAAA ACAAAATCAT 5451 ATAGGGATGT GTGACATTAT TGTAATTGTG TACTTGAGAA TAACGTGCAA 5501 AAATAAAAAT CAGAATATTT TCCTGTT // LOCUS AB020686 4312 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0879 protein, complete cds. ACCESSION AB020686 NID g4240246 VERSION AB020686.1 GI:4240246 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk07371. 
ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (6), 355-364 (1998) MEDLINE 99156230 REFERENCE 2 (bases 1 to 4312) AUTHORS Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (02-DEC-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4312 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk07371" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 49. .1410 /gene="KIAA0879" CDS 49. 
.1410 /gene="KIAA0879" /codon_start=1 /product="KIAA0879 protein" /protein_id="BAA74902.1" /db_xref="PID:d1038636" /db_xref="PID:g4240247" /db_xref="GI:4240247" /translation="MKLLVILLFSGLITGFRSDSSSSLPPKLLLVSFDGFRADYLKNY EFPHLQNFIKEGVLVEHVKNVFITKTFPNHYSIVTGLYEESHGIVANSMYDAVTKKHF SDSNDKDPFWWNEAVPIWVTNQLQENRSSAAAMWPGTDVPIHDTISSYFMNYNSSVSF EERLNNITMWLNNSNPPVTFATLYWEEPDASGHKYGPEDKENMSRVLKKIDDLIGDLV QRLKMLGLWENLNVIITSDHGMTQCSQDRLINLDSCIDHSYYTLIDLSPVAAILPKIN RTEVYNKLKNCSPHMNVYLKEDIPNRFYYQHNDRIQPIILVADEGWTIVLNESSQKLG DHGYDNSLPSMHPFLAAHGPAFHKGYKHSTINIVDIYPMMCHILGLKPHPNNGTFGHT KCLLVDQWCINLPEAIAIVIGSLLVLTMLTCLIIIMQNRLSVPRPFSRLQLQEDDDDP LIG" BASE COUNT 1411 a 675 c 761 g 1465 t ORIGIN 1 CGACCGCGGC GGCTGGAACC CTGATTGCTG TCCTTCAACG TGTTCATTAT 51 GAAGTTATTA GTAATACTTT TGTTTTCTGG ACTTATAACT GGTTTTAGAA 101 GTGACTCTTC CTCTAGTTTG CCACCTAAGT TACTACTAGT ATCCTTTGAT 151 GGCTTCAGAG CTGATTATCT GAAGAACTAT GAATTTCCTC ATCTCCAGAA 201 TTTTATCAAA GAAGGTGTTT TGGTAGAGCA TGTTAAAAAT GTTTTTATCA 251 CAAAAACATT TCCAAACCAC TACAGTATTG TGACAGGCTT GTATGAAGAA 301 AGCCATGGCA TTGTGGCTAA TTCCATGTAT GATGCAGTCA CAAAGAAACA 351 CTTTTCTGAC TCTAATGACA AGGATCCTTT TTGGTGGAAT GAGGCAGTAC 401 CTATTTGGGT GACCAATCAG CTTCAGGAAA ACAGATCAAG TGCTGCTGCT 451 ATGTGGCCTG GTACTGATGT ACCCATTCAC GATACCATCT CTTCCTATTT 501 TATGAATTAC AACTCCTCAG TGTCATTTGA GGAAAGACTA AATAATATTA 551 CTATGTGGCT AAACAATTCG AACCCACCAG TCACCTTTGC AACACTATAT 601 TGGGAAGAAC CAGATGCAAG TGGCCACAAA TACGGACCTG AAGATAAAGA 651 AAACATGAGC AGAGTGTTGA AAAAAATAGA TGATCTTATC GGTGACTTAG 701 TCCAAAGACT CAAGATGTTA GGGCTATGGG AAAATCTTAA TGTGATCATT 751 ACAAGTGATC ATGGGATGAC CCAGTGTTCT CAGGACAGAC TGATAAACCT 801 GGATTCCTGC ATCGATCATT CATACTACAC TCTTATAGAT TTGAGCCCAG 851 TTGCTGCAAT ACTTCCCAAA ATAAATAGAA CAGAGGTTTA TAACAAACTG 901 AAAAACTGTA GCCCTCATAT GAATGTTTAT CTCAAAGAAG ACATTCCTAA 951 CAGATTTTAT TACCAACATA ATGATCGAAT TCAGCCCATT ATTTTGGTTG 1001 CCGATGAAGG CTGGACAATT GTGCTAAATG AATCATCACA AAAATTAGGT 1051 GACCATGGTT ATGATAATTC TTTGCCTAGT ATGCATCCAT TTCTAGCTGC 1101 CCACGGACCT 
GCATTTCACA AAGGCTACAA GCATAGCACA ATTAACATTG 1151 TGGATATTTA TCCAATGATG TGCCACATCC TGGGATTAAA ACCACATCCC 1201 AATAATGGGA CCTTTGGTCA TACTAAGTGC TTGTTAGTTG ACCAGTGGTG 1251 CATTAATCTC CCAGAAGCCA TCGCGATTGT TATCGGTTCA CTCTTGGTGT 1301 TAACCATGCT AACATGCCTC ATAATAATCA TGCAGAATAG ACTTTCTGTA 1351 CCTCGTCCAT TTTCTCGACT TCAGCTACAA GAAGATGATG ATGATCCTTT 1401 AATTGGGTGA CATGTGCTAG GGCTTATACA AAGTGTCTTT GATTAATCAC 1451 AAAACTAAGA ATACATCCAA AGAATAGTGT TGTAACTATG AAAAAGAATA 1501 CTTTGAAAGA CAAAGAACTT AGACTAAGCA TGTTAAAATT ATTACTTTGT 1551 TTTCCTTGTG TTTTGTTTCG GTGCATTTGC TAATAAGATA ACGCTGACCA 1601 TAGTAAAATT GTTAGTAAAT CATTAGGTAA CATCTTGTGG TAGGAAATCA 1651 TTAGGTAACA TCAATCCTAA CTAGAAATAC TAAAAATGGC TTTTGAGAAA 1701 AATACTTCCT CTGCTTGTAT TTTGCGATGA AGATGTGATA CATCTTTAAA 1751 TGAAAATATA CCAAAATTTA GTAGGCATGT TTTTCTAATA AATTTATATA 1801 TTTGTAAAGA AAACAACAGA AATCTTTATG CAATTTGTGA ATTTTGTATA 1851 TTAGGGAGGA AAAGCTTCCT ATATTTTTAT ATTTACCTTT AATTAGTTTG 1901 TATCTCAAGT ACCCTCTTGA GGTAGGAAAT GCTCTGTGAT GGTAAATAAA 1951 ATTGGAGCAG ACAGAAAAGA TATAGCAAAT GAAGAAATAT TTTAAGGAAA 2001 CCTATTTGAA AAAAAAAGCA AAGACCATTT GATAAAAGCC TGAGTTGTCA 2051 CCATTATGTC TTAAGCTGTT AGTCTTAAAG ATTATTGTTA AAAAATTCAG 2101 AAGAAAAGAG AGACAAGTGC TCTTCTCTCT ATCTATGCTT AATGCCTTTA 2151 TGTAAGTTAC TTAGTTGTTT GCGTGTGCCT GTGCAAGTGT GTTTGTGTGT 2201 GGTTGTGTGG ACATTATGTG ATTTACTATA TAAGGAGGTC AGAGATGGAC 2251 TGTGGCCAGG CTTCCACATT CCTGAAGCAC ACAGATCTCA GGAAAGGTTA 2301 TTTTTGCACT TCATATTTGT TTACTTTCTC CTAACTCACA AGTTAAAATC 2351 ATAACTTAAT TTCATTAACT TTTATCATTT AACTCTCTCA TGTTTGTTGT 2401 AACCTGAGGT ATCCAAATGC TACAGAAAAA TTTATGACCC AAATACAAAT 2451 CTCAATTTGA CTGGGACAGA ATGAGGAATG GAGATTTTTG TATTTATCTT 2501 TGGGACTTTA TGCCTTACTT TTTAGGCTAT AGAATAGTTA AGAAATTTTA 2551 AACAAAATTT AGTATCTTTT GGTCTTTCAC ACCATTCATA TGTTAAGTGG 2601 CAGAATAGCC TTAGTGCTAC CTCCACTTTT TTTCTCCAGT ATTTGCATCA 2651 CAGAAATAAT CCCTCTGTTT AACATGTTTG TTCAGAGCCA AGGGTTTATT 2701 GTGAAGAACT GTCATCCTGC CTTTGCTAGC TGGTACCTTC TAGTAATCAA 2751 AATTAATATG AAGAAACTAG 
GTTGTGACAG ACTAGATTAT ATTTAGTAGG 2801 GGAAAAATTG GGCTCAAGAA CCATTCATCA GTACGTGAGA CAAGCAGTTA 2851 ATAGTATGAT CTTTAAAGTT TTGACAATAT AAAATAAACT TGGTAACTGT 2901 TTTACAAATA TAAAAGTATA ATAAATATGC AGCCCAGTTA AATATTGATT 2951 ATCTGTGATG GTAAAGAACA ACAGTGGTGC CAGTCATCAA ACATACAGTG 3001 CGTCCTATTG AGTCACTGCT AATTTCTTGA GCCTGGTATT TGCTGCCTAT 3051 TGTATTTGTG GTTGTTGAGA GGCATTTTCA AACCCTGTAT AAATAATCCA 3101 TGCTGTTGGT CATAAGTTAA CTGTATTAAG AACAGTAAAA TAAATAAAAA 3151 CCAATAGTAC TAATTTTGCT TTAAAAAAAT TTCTAATTTT TTTCACATAA 3201 AACAATTATC CTAAAGGTTA ATAGTTGATC GAAACAGAAT AATAGAAAAA 3251 TTCTACTTTA ATTTCCATTA AAAAGCAAAT AGCATTGACA CATTTAAAGC 3301 TTTTCATTTA AAGTAGTGGA TGTTTTTGAA GTATCTAAAA TAGTAGCAGA 3351 ATATTTTATA CTTGGTCCTT GCAATGGTGT GAGTTTTAAT GATTGCATTA 3401 TCGTGATTGG TGGTTATGAG TTTCAGAAAT CTATACTTGG CATCCAACTC 3451 ATGAGTGGAT TTTATATAGG ATGGAACAGG AAGGTATGTC CTGTCAGTAT 3501 CTTAACCCTT TCAACAAGAC ATTTACCTAT TTGTCTTTCC TTACGTTCTC 3551 AAAATATTAA CTCGAATTGT AAATTAAGCA AAAATTTAAA AAGTATATGT 3601 TGATGGGACA AGAAGAATAG TATTTATTTA ATAAAACATA TATTATATTG 3651 AACTATGTGT TAATTCATTT GTATCTTTTA AAAAATTATC ACTGTTAAAG 3701 CCATTGACTC CTTTAGTACA CTGAGAAAAA TCTTATAGTA AAACTAGCCT 3751 TTCACATTAA GGTTTTGGTG TGTATTTTGT TAAATAACTA ACATGCTGCT 3801 CTATTTTCTG GGTGTAGAAA GTATTTGGCT CTAGGAAACA TTTACTTGTT 3851 TGTGAAAACA ATACCCCAAG GTAATAGGAA AAGTTTGAGT TAAGTGTTTT 3901 TAATTCAGTC AGTGAATTCA GAATAAGTAC ATTCATGTAT AACATAGGGA 3951 CAGTTCTGCT GCTGTTATTT ATATGCAATT CTTCTGGTAA ATAGCAATAG 4001 AATAAAACAT ATTTCAATGT TTGTGTATAG GTTTTATATT ATTATTCCAC 4051 TAGGAATGGC ATAAGAATTT ATAGATAAAT TCTTGTAACA TTAAAGGATT 4101 AAAATGTTTT TACATTGTTT TTGGGTGTCT CCTTCTTGTG CCCATATCTG 4151 ATAAGCTTTA TGGATTATTG CATTTAATTC CTTTTATTTG GAGGGTTTTA 4201 CTTCCTTGTT AACATATAAA GTTATAAATG AAGGACAAGG AGGAGATGGA 4251 AAATGTGTAT TTATTGTTAA TTCTTAAAAT AGTGTGTAAA TAAAATAACA 4301 TCAGTGTGCT TT // LOCUS AF043976 3415 bp mRNA PRI 07-APR-1999 DEFINITION Homo sapiens CLCA homolog (hCLCA3) mRNA, complete cds. 
ACCESSION AF043976 NID g4572288 VERSION AF043976.1 GI:4572288 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3415) AUTHORS Gruber,A.D. and Pauli,B.U. TITLE Molecular cloning and biochemical characterization of a truncated, secreted member of the human family of Ca2+-activated Cl- channels JOURNAL Biochim. Biophys. Acta 1444 (3), 418-423 (1999) MEDLINE 99196715 REFERENCE 2 (bases 1 to 3415) AUTHORS Gruber,A.D., Elble,R.C. and Pauli,B.U. TITLE Direct Submission JOURNAL Submitted (21-JAN-1998) Department of Pathology, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853, USA FEATURES Location/Qualifiers source 1. .3415 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1. .3415 /gene="hCLCA3" CDS 19. .807 /gene="hCLCA3" /codon_start=1 /product="CLCA homolog" /protein_id="AAD23734.1" /db_xref="PID:g4572289" /db_xref="GI:4572289" /translation="MVFSLKVILFLSLLLSPVLKSSLVTLNNNGYDGIVIAINPSVPE DEKLIQNIKEMVTEASTHLFHATKQRAYFRNVSILIPMTYKSKSEYLIPKQETYDQAD VIVADLYLKYGDDPYTLQYGQCGDKGQYIHFTPNFLLTNNLATYGPRGKVFVHGWAHL RWGVFDEYNVDQPFYISRRNTTEATRCSTRITVYMVLNECKGASCIARPFRRDSQTGL YEAKCTFIPKRSQTAKESIVFMQNLDSVTEFCTEKTHNKEAPNL" BASE COUNT 1149 a 625 c 630 g 1011 t ORIGIN 1 TTTGTTTAAC ATGCAAGAAT GGTGTTCAGT CTGAAGGTGA TTCTCTTCCT 51 ATCCTTGCTT CTCTCGCCTG TATTGAAAAG CTCACTGGTA ACTTTGAATA 101 ACAATGGATA TGATGGCATT GTGATTGCAA TTAATCCCAG TGTACCAGAA 151 GATGAAAAAC TCATTCAAAA CATAAAGGAA ATGGTAACTG AAGCATCTAC 201 TCACCTGTTT CATGCCACCA AACAAAGAGC TTATTTCAGG AATGTAAGCA 251 TTTTAATTCC AATGACCTAC AAATCAAAAT CTGAGTACTT AATCCCAAAA 301 CAAGAAACAT ATGACCAGGC AGATGTCATA GTTGCTGATC TTTACCTGAA 351 ATACGGAGAT GATCCCTATA CACTTCAATA TGGACAATGT GGAGATAAAG 401 GACAATATAT ACATTTTACT CCAAACTTCT TGTTGACTAA TAACTTGGCT 451 ACCTATGGGC CTCGAGGTAA AGTATTTGTC CATGGGTGGG CCCATCTCCG 501 GTGGGGAGTA TTTGATGAGT ATAATGTGGA CCAGCCATTC TATATTTCCA 551 GAAGAAACAC TACTGAAGCA ACAAGATGTT 
CCACTCGTAT TACTGTTTAC 601 ATGGTTTTGA ACGAATGCAA GGGGGCCAGC TGTATAGCAC GACCATTCAG 651 ACGTGACTCA CAGACAGGGC TGTATGAAGC AAAATGTACA TTTATCCCAA 701 AGAGATCCCA GACTGCCAAG GAATCCATTG TGTTTATGCA AAATCTTGAT 751 TCTGTGACTG AATTTTGTAC TGAAAAAACA CACAATAAAG AAGCTCCAAA 801 CCTATAGAAC AAAATGTGCA ATCACAGAAG CACATGGGAT GTAATCATGA 851 GCTCTGAAGA TTTTCAGCAT TTATCTCCCA TGACAGAAAT AAATTTACCT 901 CGTCCTACAT TTTCATTGCT CAAGTCCAAA CAGCGTGTAG TCTGTTTGGT 951 ACTTGATAAA TCTGGAAGCA TGAATGCAGA AGACCGTCTC TTTCGAATGA 1001 ATCAAGCAGC AGAATTGTAC TTGATTCAAA TTATTGAAAA GGGATCCTTG 1051 GTTGGGTTGG TCACATTTGA CAGTTTTGCT AAAATCCAAA GTAAGCTCAT 1101 AAAAATAATT GATGATAACA CTTACCAAAA GATCACTGCA AACCTGCCTC 1151 AAGAAGCTGA TGGTGGCACT TCAATTTGCA GGGGACTCAA AGCAGGATTT 1201 CAGGCAATTC CCCAGAGTAA TCAGAGTACT TTCGGTTCTG AAATCATATT 1251 ACTAACAGAT GGGGAAGATT ATCAAATAAG CTTATGCTTT GGAGAGGTAA 1301 AACAAAGTGG CACAGTCATC CACACCATTG CTCTGGGGCC GTCTGCTGAC 1351 GAAGAACTGG AGACCCTGTC AAATATGACA GGATTACATA AGGGACACTG 1401 ATATACTGAA AGTTCATAAA GTGCTGGGAA GTTCATCTTT TGAGGACATC 1451 GTTTTTATGC CCATAAAAAC ATAAATGGCC TTATTGATGC TTTCAGCAGA 1501 ATTTCATCTA GAAGTGGCAG CATCTCTCAG CAGGCTCTTC AGTTGGAAAG 1551 TAAAACTTTG AATATCCCAG CGAAGAAATG GATAAATGGT ACAGTGCCTG 1601 TGGATAGTAC AGTTAGAAAT GATACTTCCT TTGTTGTCAC ATGGACGATA 1651 CAAAAGCCAG CAATAATTCT TCAAGATCCA AAAGGAAAAA AATATACTAC 1701 CTCAGATTTT CAAGAAGGTG AACTAAATAT TCGGTCTGCC CGTCTTCGAA 1751 TACCAGGTAT TGCAGAGACA GGCACTTGGA CTTACAGCGT TCGAAACAAT 1801 CATACCAAAT CTCAATTGCT AACTGTGACA ATGACCACTC GAGCAAGAAG 1851 CCCTACCACA CTCCCAGTAA TTGCAACTGC TCACATGAGT CAAAATACAG 1901 CTCATTACCC TAGCCCAGTG ATTGTTTATG CATGAGTCAG TCAAGGGTTT 1951 CTTCCTGTTC TGGGAATCAA TGTAACAGCC ATTATAGAAA ATGAAGAGGG 2001 ACATCAAGTA ACATTGGAGC TCTGCGACAA TGGCGCAGGT GCTGATTCTG 2051 TCAAGAATGA TGGCATCTAC TCAAGGTATT TTACAGATTA CCATGGAAAT 2101 GGTAGATACA GTTTAAAAGT GCTTACCCAG GCAAGAAAAA ACACAGCTAG 2151 GCTAAGTCAA CAACAGAATA AAGCTCTGTA TGTACCGCGC TATGCTGAAA 2201 ATGGAAAAAT TATACTGAAC CCATCCAAAC CTGAAGTCAC AGATGATGTG 2251 
GAAGGAGCTC AAACAGACGA CTTCAGCAGA CTCACCTCTG GAGGGTCGTT 2301 TACTGTATCA GGAGTGCCTC CTAATGGTAA TCATTCTCAG GTGTTCTCAC 2351 CTGGTAAAAT TGTAGACCTC GAGGCTAAGT TTCAAGGAGA TCATATTCAA 2401 CTTTCATGGA CTGCCCCTGG CAAGGTCCTC GATAAAGGAA GAGCTGAGAG 2451 CTACATTATA AGAATAAGTA AACATTTCCT GGACCTCCAA GAAGATTTTG 2501 ATAAAGCTGC TTTAATAAAT ACTTCTGGTC TGATACCTAA GGAGCCTGGT 2551 TCAGTAGAAA GTTTTGAATT TAAACCAGAA CCTTCTAAAA TAGAGAATGG 2601 TACGACATTC TATATTGCAA TTCAAGCCAT CCATGAAGCC AATGTCACCT 2651 CAGAGGTTTC AAACATTGCA CAAGCAACTA ACTTTATTCC TCCACAGGAA 2701 CCCAGCATTC CTGATCTGGG TACCAATATT TCTGCAATCA GTTTGGCAAT 2751 TTTTGGATTA GCTGTAATTT TATCTATATT TTAAACTAGA AATTATATTA 2801 GAACTCAAAT TCAATGTTAT ACATACTTGG TAAACATTTA TTTAAAATTT 2851 AATTTACTAT ACTTATTGTC TATTATAAAG CTCATTATAA TATAAAAAGT 2901 GAAGTACAAA AGTTGTAAGT TTCCTAATTA CTTGATTAAT TAATACTATT 2951 TGAGTTATTA AATGTTAATC AAAATGAGTA TATCATTTCC TGTCTGGAAT 3001 AATCCACTCA TTAATTTTTA ATATGAAAAG ATATATATTT GTACTTGTAA 3051 GCATTTTAAG AAACATTTTT AAAGTGTGCT ACAAATTCAT TTGGTGTACT 3101 AACATCAAAA TGTATCCAAG CCATTTAAAA AATATTTATA TATACATAGT 3151 AGCAAATAGT TTTATAGATT TATTTGTATC GCATTTTTTA TTACAAATGA 3201 ATATTTCATG TTTATATAAG CTGTAATCAA AAAGGACTAG TAGTAGTAGT 3251 AAGGAAGTCA AATTTGTTTT TTTATCATTG ATTATAAGTG GTATATTTGT 3301 TTTTTGTCAT TGATTAAAAG TGATTTTAGC CCTAGGCCCG AAATGACTAG 3351 CAAATATCAT TTTCTGTATG AATTGTGGAA CATCACAATA AAATTATTTC 3401 TGTGCTGATG CTAAA // LOCUS HSAPXL 7445 bp mRNA PRI 10-DEC-1995 DEFINITION H.sapiens APXL mRNA. ACCESSION X83543 NID g790999 VERSION X83543.1 GI:790999 KEYWORDS APXL gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7445) AUTHORS Schiaffino,M.V., Bassi,M.T., Rugarli,E.I., Renieri,A., Galli,L. and Ballabio,A. TITLE Cloning of a human homologue of the Xenopus laevis APX gene from the ocular albinism type 1 critical region JOURNAL Hum. Mol. Genet. 
4 (3), 373-382 (1995) MEDLINE 95315933 REFERENCE 2 (bases 1 to 7445) AUTHORS Schiaffino,V.M. TITLE Direct Submission JOURNAL Submitted (19-DEC-1994) V.M. Schiaffino, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY FEATURES Location/Qualifiers source 1. .7445 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /dev_stage="adult" /tissue_type="retina" /map="Xp22.2-22.3" gene 91. .4941 /gene="APXL" CDS 91. .4941 /gene="APXL" /codon_start=1 /protein_id="CAA58534.1" /db_xref="PID:g1181628" /db_xref="GI:1181628" /db_xref="SWISS-PROT:Q13796" /translation="MEGAEPRARPERLAEAETRAADGGRLVEVQLSGGAPWGFTLKGG REHGEPLVITKIEEGSKAAAVDKLLAGDEIVGINDIGLSGFRQEAICLVKGSHKTLKL VVKRRSELGWRPHSWHATKFSDSHPELAASPFTSTSGCPSWSGRHHASSSSHDLSSSW EQTNLQRTLDHFSSLGSVDSLDHPSSRLSVAKSNSSIDHLGSHSKRDSAYGSFSTSSS TPDHTLSKADTSSAENILYTVGLWEAPRQGGRQAQAAGDPQGSEEKLSCFPPRVPGDS GKGPRPEYNAEPKLAAPGRSNFGPVWYVPDKKKAPSSPPPPPPPLRSDSFAATKSHEK AQGPVFSEAAAAQHFTALAQAQPRGDRRPELTDRPWRSAHPGSLGKGSGGPGCPQEAH ADGSWPPSKDGASSRLQASLSSSDVRFPQSPHSGRHPPLYSDHSPLCADSLGQEPGAA SFQNDSPPQVRGLSSCDQKLGSGWQGPRPCVQGDLQAAQLWAGCWPSDTALGALESLP PPTVGQSPRHHLPQPEGPPDARETGRCYPLDKGAEGCSAGAQEPPRASRAEKASQRLA ASITWADGESSRICPQETPLLHSLTQEGKRRPESSPEDSATRPPPFDAHVGKPTRRSD RFATTLRNEIQMHRAKLQKSRSTVALTAAGEAEDGTGRWRAGLGGGTQEGPLAGTYKD HLKEAQARVLRATSFKRRDLDPNPGDLYPESLEHRMGDPDTVPHFWEAGLAQPPSSTS GGPHPPRIGGRRRFTAEQKLKSYSEPEKMNEVGLTRGYSPHQHPRTSEDTVGTFADRW KFFEETSKPVPQRPAQKQALHGIPRDKPERPRTAGRTCEGTEPWSRTTSLGDSLNAHS AAEKAGTSDLPRRLGTFAEYQASWKEQRKPLEARSSGRCHSADDILDVSLDPQERPQH VHGRSRSSPSTDHYKQEASVELRRQAGDPGEPREELPSAVRAEEGQSTPRQADAQCRE GSPGSQQHPPSQKAPNPPTFSELSHCRGAPELPREGRGRAGTLPRDYRYSEESTPADL GPRAQSPGSPLHARGQDSWPVSSALLSKRPAPQRPPPPKREPRRYRATDGAPADAPVG VLGRPFPTPSPASLDVYVARLSLSHSPSVFSSAQPQDTPKATVCERGSQHVSGDASRP LPEALLPPKQQHLRLQTATMETSRSPSPQFAPQKLTDKPPLLIQDEDSTRIERVMDNN TTVKMVPIKIVHSESQPEKESRQSLACPAEPPALPHGLEKDQIKTLSTSEQFYSRFCL YTRQGAEPEAPHRAQPAEPQPLGTQVPPEKDRCTSPPGLSYMKAKEKTVEDLKSEELA REIVGKDKSLADILDPSVKIKTTMDLMEGIFPKDEHLLEEAQQRRKLLPKIPSPRSTE 
ERKEEPSVPAAVSLATNSTYYSTSAPKAELLIKMKDLQEQQEHEEDSGSDLDHDLSVK KQELIESISRKLQVLREARESLLEDVQANTVLGAEVEAIVKGVCKPSEFDKFRMFIGD LDKVVNLLLSLSGRLARVENALNNLDDGASPGDRQSLLEKQRVLIQQHEDAKELKENL DRRERIVFDILANYLSEESLADYEHFVKMKSALIIEQRELEDKIHLGEEQLKCLLDSL QPERGK" BASE COUNT 1705 a 2198 c 2087 g 1455 t ORIGIN 1 TCTGCGGCGC TCGGAGCCTC CCTTGCGATC CCACGGCCGG GACTGCCCGG 51 AGTGCATGGG CGCGGGCCAG GGACGCTGAG CGGTCGCGCC ATGGAGGGCG 101 CCGAGCCCCG CGCGCGGCCC GAGCGCCTGG CCGAGGCCGA GACGCGGGCG 151 GCGGACGGCG GGCGCCTGGT GGAGGTGCAG CTGAGCGGCG GCGCCCCGTG 201 GGGCTTCACC CTGAAGGGCG GCCGCGAGCA CGGCGAGCCG CTGGTCATCA 251 CCAAGATTGA AGAGGGCAGT AAAGCCGCGG CGGTCGACAA GTTACTGGCT 301 GGAGATGAGA TCGTCGGCAT CAATGACATT GGTCTCTCAG GGTTTAGACA 351 GGAAGCGATT TGCCTGGTGA AGGGGTCCCA TAAGACCCTG AAGCTGGTCG 401 TCAAAAGGAG GAGCGAGCTG GGCTGGAGGC CTCACTCCTG GCATGCCACC 451 AAGTTCTCTG ACAGCCACCC CGAGCTAGCG GCCTCCCCGT TCACCTCCAC 501 CAGCGGCTGT CCTTCCTGGT CCGGCCGACA CCACGCGAGT TCTTCCTCCC 551 ACGACCTGTC CAGTTCCTGG GAGCAGACGA ACCTACAGCG CACCTTAGAT 601 CACTTCAGCT CCTTGGGGAG CGTTGACAGC CTGGACCACC CCTCCAGTCG 651 CCTCTCGGTG GCCAAGTCCA ACAGCAGCAT CGACCACCTG GGCAGCCACA 701 GCAAGCGCGA CTCGGCCTAC GGCTCCTTCT CCACCAGCTC TAGCACTCCT 751 GACCACACCT TGTCCAAAGC CGACACGTCC TCCGCAGAGA ACATCCTCTA 801 CACTGTGGGC CTCTGGGAGG CTCCCAGGCA GGGTGGCCGG CAGGCCCAGG 851 CCGCAGGCGA CCCTCAGGGC TCGGAGGAGA AGCTCAGTTG TTTCCCGCCC 901 AGGGTCCCCG GTGACAGCGG CAAAGGCCCC AGGCCAGAGT ACAATGCCGA 951 GCCCAAGCTG GCTGCCCCTG GGAGGTCCAA TTTTGGGCCA GTCTGGTATG 1001 TTCCCGATAA GAAGAAAGCA CCATCATCCC CACCTCCTCC CCCTCCCCCT 1051 CTCCGCAGTG ACAGCTTTGC TGCCACCAAG AGCCACGAGA AGGCCCAGGG 1101 CCCTGTGTTC TCAGAGGCGG CTGCGGCACA GCACTTTACG GCCCTGGCCC 1151 AGGCTCAGCC TCGTGGTGAC CGGAGACCAG AGCTCACCGA TCGGCCTTGG 1201 AGGTCAGCAC ACCCGGGGAG CCTCGGGAAG GGATCGGGAG GCCCGGGCTG 1251 CCCACAGGAG GCCCACGCAG ACGGCAGCTG GCCGCCCTCC AAGGATGGAG 1301 CTTCCAGTAG GCTGCAGGCC TCTCTGTCCA GCTCAGATGT GCGCTTCCCT 1351 CAGTCTCCTC ATAGCGGCCG ACACCCTCCC CTATACAGCG ACCACAGCCC 1401 CCTCTGTGCT GACAGCCTTG GGCAGGAGCC AGGGGCTGCC 
AGCTTCCAGA 1451 ACGACAGCCC TCCTCAGGTG AGGGGGCTCA GCAGCTGTGA CCAGAAGCTG 1501 GGGAGCGGCT GGCAGGGTCC CCGGCCCTGT GTGCAGGGAG ACCTGCAAGC 1551 AGCACAGCTC TGGGCGGGAT GCTGGCCTTC TGACACAGCC CTTGGAGCCC 1601 TCGAGAGTCT TCCCCCACCC ACGGTGGGCC AGAGCCCACG CCATCACCTA 1651 CCTCAGCCTG AGGGTCCTCC GGATGCCCGC GAGACAGGAC GGTGTTACCC 1701 GCTGGACAAA GGGGCCGAGG GCTGCTCCGC GGGAGCCCAG GAGCCTCCCA 1751 GGGCCAGCCG TGCAGAAAAA GCCAGCCAGA GGCTGGCAGC CAGCATCACG 1801 TGGGCAGATG GGGAGAGCAG CAGGATCTGC CCGCAGGAGA CGCCCCTGTT 1851 GCACTCCCTG ACCCAGGAGG GGAAGCGCCG GCCTGAGAGC AGTCCAGAGG 1901 ACAGCGCCAC CAGACCGCCA CCGTTCGACG CCCACGTGGG CAAGCCCACC 1951 CGAAGAAGCG ACCGCTTTGC CACCACCCTG CGGAATGAGA TCCAGATGCA 2001 TAGAGCCAAG CTGCAGAAGA GCCGGAGCAC AGTGGCTCTG ACTGCAGCAG 2051 GGGAGGCGGA GGATGGCACC GGCCGCTGGA GGGCCGGGTT GGGAGGTGGC 2101 ACCCAGGAAG GACCCCTCGC TGGCACCTAT AAAGACCACC TGAAAGAGGC 2151 CCAAGCCCGG GTCCTGAGGG CCACGTCCTT CAAGCGCCGC GACTTGGACC 2201 CCAACCCAGG AGACCTATAC CCGGAGTCAC TGGAACACCG GATGGGGGAT 2251 CCAGACACTG TCCCCCACTT CTGGGAGGCA GGCCTGGCCC AGCCACCCTC 2301 ATCTACAAGT GGCGGGCCCC ACCCGCCCCG CATCGGAGGC CGGAGACGGT 2351 TCACAGCTGA GCAGAAATTG AAGTCCTACT CGGAACCTGA GAAGATGAAC 2401 GAGGTGGGCC TCACGAGGGG CTACAGTCCT CACCAGCACC CCAGGACATC 2451 TGAGGATACT GTGGGCACGT TTGCTGACAG GTGGAAGTTT TTTGAGGAAA 2501 CGAGCAAACC TGTTCCCCAG AGGCCTGCCC AGAAGCAAGC TCTTCACGGA 2551 ATCCCGAGAG ACAAGCCAGA GAGGCCGCGG ACAGCGGGCC GCACATGTGA 2601 GGGCACGGAG CCCTGGTCGC GCACCACCTC CCTTGGGGAC AGCCTCAACG 2651 CTCACAGCGC AGCGGAGAAG GCAGGGACTT CAGACCTGCC GCGGAGGCTC 2701 GGCACCTTTG CAGAGTATCA GGCCTCTTGG AAGGAACAGA GGAAACCTCT 2751 GGAGGCCAGG AGCTCTGGGC GCTGCCACTC AGCGGATGAC ATCCTGGATG 2801 TGAGCCTGGA CCCACAGGAG AGGCCGCAGC ACGTTCATGG GAGGTCCCGG 2851 TCTTCACCGT CCACAGACCA CTACAAGCAG GAAGCTTCTG TCGAACTGCG 2901 AAGGCAGGCA GGGGACCCCG GCGAGCCCAG AGAAGAGCTT CCCTCCGCAG 2951 TCCGGGCCGA GGAGGGACAG TCCACGCCGA GACAAGCAGA TGCCCAGTGT 3001 CGGGAAGGCA GCCCAGGATC ACAGCAGCAC CCACCGAGTC AGAAGGCACC 3051 GAACCCACCC ACATTCTCTG AACTATCTCA CTGCCGGGGA GCCCCAGAGC 3101 
TGCCCCGGGA GGGCCGGGGC CGAGCGGGAA CCCTACCTCG AGATTATAGA 3151 TACTCGGAGG AGAGCACCCC AGCAGACTTG GGACCCCGAG CCCAGAGCCC 3201 TGGCTCACCC CTGCATGCTC GAGGACAAGA CTCGTGGCCA GTGAGCTCAG 3251 CCCTGCTCTC CAAGAGGCCA GCCCCACAGA GGCCACCGCC ACCCAAGCGC 3301 GAGCCCAGGA GATACAGGGC CACAGACGGC GCACCTGCTG ACGCCCCCGT 3351 GGGCGTCCTC GGCAGGCCCT TCCCAACGCC ATCCCCTGCG TCCCTGGATG 3401 TGTATGTGGC CCGCCTGTCC CTCTCCCACA GCCCCTCTGT GTTCAGCAGT 3451 GCCCAGCCCC AGGACACCCC GAAGGCCACT GTCTGTGAGC GTGGAAGCCA 3501 GCATGTGAGC GGGGACGCAT CACGTCCTCT GCCAGAAGCA CTGCTCCCTC 3551 CCAAGCAGCA GCACCTGCGC CTGCAGACGG CCACCATGGA GACCTCGCGC 3601 TCCCCCTCGC CCCAGTTCGC CCCCCAGAAA CTGACGGACA AACCTCCCCT 3651 GCTCATCCAG GATGAGGATT CAACCAGAAT TGAGCGGGTG ATGGACAACA 3701 ACACCACGGT GAAGATGGTG CCCATCAAGA TCGTGCACTC GGAGAGCCAG 3751 CCAGAGAAGG AGAGCCGCCA GAGCCTGGCA TGCCCCGCCG AGCCACCTGC 3801 CCTGCCCCAC GGGCTGGAGA AAGACCAGAT CAAGACGCTG AGCACATCTG 3851 AGCAGTTCTA CTCGCGCTTC TGTCTGTACA CGCGGCAGGG TGCTGAGCCC 3901 GAGGCCCCAC ATAGGGCCCA GCCGGCTGAG CCCCAGCCCC TGGGCACCCA 3951 GGTGCCCCCC GAGAAAGACC GCTGCACCTC CCCTCCAGGG CTCAGCTACA 4001 TGAAGGCCAA AGAGAAGACT GTGGAAGACC TGAAGTCGGA GGAGCTGGCC 4051 AGGGAGATCG TGGGGAAGGA TAAGTCCCTG GCCGACATCC TGGATCCCAG 4101 TGTGAAGATC AAAACCACTA TGGACTTGAT GGAAGGCATC TTCCCCAAAG 4151 ACGAGCACCT CCTGGAAGAA GCCCAGCAAC GGAGGAAGCT GCTCCCCAAA 4201 ATCCCCTCTC CTAGAAGCAC AGAGGAGAGG AAAGAGGAGC CCAGCGTGCC 4251 TGCGGCCGTG TCCCTGGCCA CCAATTCTAC CTACTACAGC ACGTCGGCCC 4301 CCAAGGCGGA GCTGCTGATC AAGATGAAGG ACCTGCAGGA GCAGCAGGAG 4351 CACGAAGAGG ATTCGGGAAG CGACTTGGAC CACGACCTGT CGGTGAAGAA 4401 GCAGGAGCTC ATCGAGAGCA TCAGCCGCAA GCTGCAGGTG CTCCGGGAGG 4451 CCCGCGAGAG CCTGCTGGAG GACGTGCAGG CCAACACCGT GCTGGGGGCC 4501 GAGGTGGAGG CCATCGTGAA AGGCGTCTGC AAGCCCAGCG AGTTTGACAA 4551 GTTCCGGATG TTCATTGGAG ACCTGGACAA AGTGGTGAAC CTCCTGCTGT 4601 CGCTGTCAGG CCGCCTGGCC CGGGTGGAGA ATGCCCTCAA TAATTTGGAC 4651 GACGGCGCTT CTCCCGGTGA TCGGCAATCA CTGCTTGAGA AGCAGAGAGT 4701 CCTGATCCAG CAGCACGAGG ACGCCAAGGA GCTCAAGGAG AACCTGGACC 4751 GCCGCGAGCG 
CATCGTCTTT GACATTTTGG CCAACTATCT GAGCGAGGAG 4801 AGCCTCGCGG ACTATGAGCA CTTCGTGAAG ATGAAGTCGG CCCTCATCAT 4851 CGAGCAGCGG GAGCTGGAAG ATAAAATCCA CCTTGGTGAA GAGCAGCTGA 4901 AGTGCTTATT GGACAGCCTT CAGCCCGAAA GGGGCAAATA AGAGACCAGT 4951 CCCCGGTGGA GGAGGGGCAC GGGGCCTCCG AGCTCCAGCT CCGTTCCCAA 5001 GGATACTCGT GAAGACCCCA TCTGTGTTCA TGGCCTGGAA AGAGACTTCT 5051 CCCATAGCAA AGAGGCTGTT ATAAAAGCAA TAACTTTTGT GTTTGTGTGG 5101 GATGATTTAT TTAATTTTTT AGTTTCCCCT TTGATTGCTG AGAGCCATTT 5151 TCCTTTACAC ATAACTACAC CTGACACCAG GCTCTGCTGG ATGTGAGTTT 5201 CCACTGCATG GGCTGTGGGC TGGGCCTGTG GTGCCTGCCG AGTGGTCACT 5251 GTCAGTGGGA AACCCGTTGT TCCTCCCGTC TTCAGATGCT GAGCCAACTG 5301 CTTGGACAGC AGCCAGCGCG TCATGACGTG CATGAGAGGG GGACCCTGGT 5351 GCTCATCTTC TCTTGTCATT CATCCAGGCA TGGGCTGCCA GGTTTTGTCC 5401 CTGCTCGTTC AACAGTGTGA GCATTTGTCT CTGTTATCTA ATGATGTTCT 5451 CTGACCCAGC AGAAATCATC ATCATGATGA TGATAATTTA TTAACTTTTT 5501 GGAAGGGTGA ATAGTTTCCT AATGGTTAAA AACCAACTGT GAAAGGAACC 5551 ACCTGTGTGG TTGGGTTCAC TCATTCTCAG ATTAAATTGC CACTTAAAGA 5601 AATAACGTGC ATGCTTTAAA AAACACAGTC ACGCACCAAG CAGGCAAATA 5651 GCTTTAGTCC TTCTCACCTC ACATCACAGT TGTTCTGCAA AGTAAAATTT 5701 TTTGGTTAAG AGCGTGTCCA GTAGTAATGT GCTTGTTAGC TGTTTCTCAA 5751 GACCAACAGA AGATTTTTTC AGTTACTTTC CCCCCATGTA TTTTGTATGC 5801 ATATGATTGT CCGTGATAAT TGGCTACTTT TCCATTGTTT CCTCCTTAAA 5851 TCGTTTAGCA TGGCATGAGG GCCACATTCC ATGGACGGGA AGACCCCTTC 5901 CTCTTCAGAG GTCCCGTGGA CTACACAGCT CCTGAGCTTG ATCTTTTTCT 5951 GCCATGAAGT TTAAAGATTC TATGCCCATT TCCTTGATTG AAATGGCAGG 6001 ATTCTAAAGA GAGCCTGGTT TGTTAAAAGA AAACACTGTC ATGCTGTCAG 6051 TTCCCAATTG ACAAGTCACA GACTGGGAGA AAATATTTGC AAATCGTGTA 6101 TCTGACAAAA GGTTTGTGTC CAGGATGTAC AAAGAACTCT CAAACCGGAT 6151 AGTAAGAAAA CAAACAGCCC AAGTGAAAAG CAGGCAAAAG ACTTGAATAG 6201 ACACTTCACC AAAGAGCATA CACGCGTGGC AAACAAGCAC ACGAAAAGAC 6251 GTTCAGCCGC CGATGGCTTG GTTATAATTT ATAACTTACT TATTTTTATC 6301 TAATAATTGT AGATTCAGTG TATTTCTTCA AAAAATGTTT AATTAAATGC 6351 ATGTTAATGG TGAGTGAATC CCTTGGGTGA CTTCGTGTTT AGGTCGTATT 6401 AGGGCATTTG TTGGATCAAC 
GGATCATTTT AACCCTGACT TCCCCTTATT 6451 CCCATAAAAG AAGTTTTCCA GTGGAATGGA GATTTCATTT TGTCAGCAGC 6501 AGTGACCACA GCCTTACCAA AGCAGACGCG TGCGCGTGCA CAGATGCACA 6551 CACACAGATG TCTTAAAAGA CTAGAATCCA CACTTCCTGA GCCAGAGGGG 6601 CCGTGTTGAC GGTAATGCAT TCTCTATAGA GCCAAGTCCA AACTGGCAAG 6651 CTCAATGATG CAGGCAATAA ACCGCCTTTT TGGCAGCCTA CCAATGCCAA 6701 AAGGATAAAT GTCTTTCCAA AAGTGTGTAT TCCTGTTAAA TTAAGCTCTT 6751 GCTAACTTGA AAAATCCCTG TTCTGCCAGC GAAGCTTCCT CCTCCTCTCC 6801 AGCTGGTAGT CACTTGCGTG AATGCTGGTC AGTCTGAAAA GGTGAAGCTG 6851 GCTGTGCACT TACCCCCATC TTTCTCCCTC GGGGAGACGA CCCAAGGAAT 6901 TTCAGAGTAT TTTGTTTGGC AGAGCTTTTA CCTGTTATTC TTTGCCCTCA 6951 AATACAGTAT TGTGGTCATT TTGATGATAT GTGTGTAAAA TGTGAATAAT 7001 CCAATTGGTG TCTGTACTCA GCCTTTTGAT GTCTTTTTAG GACTTTCTCT 7051 TCTACACAGC AATACGTCGT GCTCGAGTAT CCTTGTAGCA AAGCACATAG 7101 AGCCAGCTGT CCTGTCAGTT CCCCTGTTTG CCTCTGAAAC GTCTGGTTAG 7151 TGGGGACCCA AAGATTCTAG TGAGTCAACA TCCATAACTC TGTATCTAGT 7201 TGTATTATTC ATAGAAAATC AATCTGGTGC TAATGGTTGG CCCTGGTGTT 7251 GTTGGGTGGC AGCTGCTCCT TCGCCCTCTT GTAGTGTGGC TGTGGAGGGC 7301 TCTGCCTATG GGGGGTGGCC TGTGGCTTGT ATCCTTCAGT CCACCACAGC 7351 AAATGTGTGT AGATTTCATG CTCGACACTT ACCACTCACC TATCAACAGA 7401 TCATCCTGCT TGACTGTAAC AAAATAAATA GTGTCTCTTC AAGTG // LOCUS AF134803 2995 bp mRNA PRI 19-MAY-1999 DEFINITION Homo sapiens cofilin isoform 2 mRNA, complete cds. ACCESSION AF134803 NID g4868362 VERSION AF134803.1 GI:4868362 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2995) AUTHORS Jin,J., Li,G., Hu,S., Li,W., Yuan,J. and Qiang,B. TITLE Isolation of two isoforms of human cofilin cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 2995) AUTHORS Jin,J., Li,G., Hu,S., Li,W., Yuan,J. and Qiang,B. TITLE Direct Submission JOURNAL Submitted (08-MAR-1999) Biochemistry, Institute of Basic Medical Sciences, 5 Dong Dan San Tiao, Beijing 100005, P.R. 
China FEATURES Location/Qualifiers source 1. .2995 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 47. .547 /note="COF2" /codon_start=1 /product="cofilin isoform 2" /protein_id="AAD31281.1" /db_xref="PID:g4868364" /db_xref="GI:4868364" /translation="MASGVTVNDEVIKVFNDMKVRKSSTQEEIKKRKKAVLFCLSDDK RQIIVEEAKQILVGDIGDTVEDPYTSFVKLLPLNDCRYALYDATYETKESKKEDLVFI FWAPESAPLKSKMIYASSKDAIKKKFTGIKHEWQVNGLDDIKDRSTLGEKLGGNVVVS LEGKPL" BASE COUNT 959 a 465 c 549 g 1022 t ORIGIN 1 CAGAGCCGAA GCCCGAGCTG CCGCCGCAGC CACAGCCGAG GGCACTATGG 51 CTTCTGGAGT TACAGTGAAT GATGAAGTCA TCAAAGTTTT TAATGATATG 101 AAAGTAAGGA AATCTTCTAC ACAAGAGGAG ATCAAAAAGA GAAAGAAAGC 151 AGTTCTCTTC TGTTTAAGCG ATGACAAAAG ACAAATAATT GTAGAGGAAG 201 CAAAGCAGAT CTTGGTGGGT GACATTGGTG ATACTGTAGA GGACCCCTAC 251 ACATCTTTTG TGAAGTTGCT ACCTCTGAAT GATTGCCGAT ATGCTTTGTA 301 CGATGCCACA TACGAAACAA AAGAGTCTAA GAAAGAAGAC CTAGTATTTA 351 TATTCTGGGC TCCTGAAAGT GCACCTTTAA AAAGCAAGAT GATTTATGCT 401 AGCTCTAAAG ATGCCATTAA AAAGAAATTT ACAGGTATTA AACATGAGTG 451 GCAAGTAAAT GGCTTGGATG ATATTAAGGA CCGTTCGACA CTTGGAGAGA 501 AATTGGGAGG CAATGTAGTA GTTTCACTTG AAGGAAAACC ATTATAAAAT 551 GACAGTCAAG TGCCATCTGG ATCTTAAGGA GCTTCCATTT CTCCAGCTCA 601 GTCCATTGGA ATAGTATTAG GTTTTGGTTT TTTGTTGTAT TTCCCCCTTT 651 CCACTGGGCC CTTCCAACAC AATGAATGAA GGAAATATCA TTTATTTAAG 701 CAGCCTATCA GTGATTGCCA TTAGACTGTT GAATACTGTT ACTTTTATAT 751 AGAACCCAAG GAATGCCTTC CTGTCATATT TTAGCCAAAA CAACTGGTTA 801 TATGCCTCCC TTGCAGCAAG CACTACAATG TATGTGATCG TCAATGTGAA 851 TAGCTTAGAA TACTGCAAAG GATAAGCTAA TTGAATGCCT TGAAAGTATT 901 ATCCACTGGT CAGATGGTCA ACTTTTTTCA GTATTATTTA TAGTTGGCAC 951 TTGATGCAGT TCTGTGAGGC TTGAGCATTC ATACACCTCA CCTGCCTTGG 1001 CAAGCCTATT TTGCCTATTT TAGTGATATG GCAGCACGGA TATAACACTA 1051 TGCATTAAAA GCACTTTTTG TAATAAGTTT AATATCCTAA AAGGAATGCC 1101 AATTAAGTTT TGTTAACTGT GTCATCAACT TATCCTAGTA CCTCAGTGTT 1151 CATTCCTGTT ACCTGCATAT CTTCTTAAAA GAAATAGCTG TTATTAATGC 1201 CTTTTTGTTT TCCATTGAGT GTACACTACT GAATAAGTGT AGGAGTTTTA 1251 TGTTTACCAT GTGAGTCCTG CAACACTAAA GATATTTTGA 
ATATCAGTCA 1301 TGATGGCAAT TTCTGTATAA AAGAGCCTTA AATGGAACAT TGTTTTGAGA 1351 TCAAACTCCC CACCCTCACA AAAATGGCCA CGTTGCAATA AAAATTGTGG 1401 CATATTACAG AACGTTGCCT TGTTTTCCTT GGAAATTTTG CAAAATGTTA 1451 TGTGAAACAA CTTCTAGGGT AAAAACAGCT ATTACTAATC TCTGCACTGG 1501 TCATTTGAGA ATTTTTTTTG TACAGCATTC ATGTGTGATA TTTTCCAGAT 1551 TTGTTGGATC TATTTGGTTT AAAAAGTATT CTATCTTAAG GCCAACTAAT 1601 ATAAAATACC ATTGTTAAAG AATGGTACTT TTATAAACAT TAGTGTATTT 1651 ATTTCCTATG TGTTAATATG AAGATCAGAA ATTATTTTTT GCACTTTGGC 1701 ATAAATACTT TTCAATATCT GATTTGTTCT CTGGATAAAT TAGCATAGTT 1751 ATTTTTTTAT TCACATTTAC ATTTCTAAGT AGTTGTATAG TAGAAGCAGG 1801 AAGCTCTTAT TGCTTATTTG GTCGTAATGA AAATAATTTG TAAAATGTCC 1851 TTTAAAAGTT TAATGATACT TCTGATGTTT CGGAACAGTC ATTTCACCTA 1901 CTATTTCTGA ATATATTTTG CAAATTGAAT TGGAATAGGA ATTGATATAG 1951 CAGTCTTAAA CATTAGTAGT GGGATTTGGC TATGGTCCAG ACTGTGCTCC 2001 TTATAGAGAA TTTGATCTGC TCAGTGTGAG CGGTTTGCTG TTAGCCAGGG 2051 CTATTTATGG CAAACACATG CTTTTGTATC TTGTCATAGT TATCCACAAA 2101 GGCAAAACTG GACTTGATTC TACTGGTATG CAAAACAGGC ATGCTAGTAA 2151 GCAGTCAGTC GTGGCTCAGA ACTTAACCCC ATAGCTCAGA GGAATGCTTT 2201 TAGCAGAAAA CAGGAAAGAA AATATCCCTT AAAATTTTTT TTTGAATGTG 2251 TGGAAGTAAT TTTAGTATAA TTAGATTTTT TCCATATTTT TGAAAGATTT 2301 TTCAGATGTG AACATTAAAA ATAGGGATTA AATGTCTAGG CTTCCATTTA 2351 AAATTATATG AATGGTTTGG GATCTTTTTG CACTGAGCAA TTTTATTTCA 2401 GGCTTCCAGC TGTCCCTGTG AGTTATCCTG GACATTTCGA TGGTTTTTGG 2451 TAAGGCCAAA CTCTGATAAG CAAAACAGAG AATACTGACG TATACTTAAC 2501 CATATGTGTA ACTGATACTT GGCACCATGG AATTTTTCAT TGAGTTATTT 2551 CCTCATTCTT TTAAAAAATA AGGGACTATA AATCAGTTAT GTAGTATCTT 2601 TTGTTTTTGT AGCTGATTCC TTAACTTTCT TGTATGCCTC TAGTAATTTC 2651 AGAGATTAAA TATTGCTTTA AACTGTGATA CTTTGATTTG CTAGATTGAC 2701 AAAACTGATA CTAATATAAT TAAGTTCATC TTTGAAATAC ATCTTTGTGC 2751 GTAGAGCCAA AAAAAGAGAT AAAATTAATA ATAGTTCACT TGTTATTTGA 2801 GATTAATTTG GCATTTGAAA TGATCATTTT ATTTTACAAT CATTTATAAT 2851 GAATCAATGT TCCAGTTAGC TTTAAAAGGT ATACGGTGCT AATTAGTAAA 2901 ATATTGAAGG CAATATTTTA CTGCTAGCTT GCAAAGTTAT GAGAGTTTAA 2951 
AAAATAAAAT ATATGAAAAT ATGTAAAAAA AAAAAAAAAA AAAAA // LOCUS AF134802 3087 bp mRNA PRI 19-MAY-1999 DEFINITION Homo sapiens cofilin isoform 1 mRNA, complete cds. ACCESSION AF134802 NID g4868361 VERSION AF134802.1 GI:4868361 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3087) AUTHORS Jin,J., Li,G., Hu,S., Li,W., Yuan,J. and Qiang,B. TITLE Isolation of two isoforms of human cofilin cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 3087) AUTHORS Jin,J., Li,G., Hu,S., Li,W., Yuan,J. and Qiang,B. TITLE Direct Submission JOURNAL Submitted (08-MAR-1999) Biochemistry, Institute of Basic Medical Sciences, 5 Dong Dan San Tiao, Beijing 100005, P.R. China FEATURES Location/Qualifiers source 1. .3087 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 153. .653 /note="COF1" /codon_start=1 /product="cofilin isoform 1" /protein_id="AAD31280.1" /db_xref="PID:g4868363" /db_xref="GI:4868363" /translation="MASGVTVNDEVIKVFNDMKVRKSSTQEEIKKRKKAVLFCLSDDK RQIIVEEAKQILVGDIGDTVEDPYTSFVKLLPLNDCRYALYDATYETKESKKEDLVFI FWAPESAPLKSKMIYASSKDAIKKKFTGIKHEWQVNGLDDIKDRSTLGEKLGGNVVVS LEGKPL" BASE COUNT 984 a 477 c 564 g 1062 t ORIGIN 1 AAACATTTTT TGGATGGGAC AACTTGTATT TGTCCCTTTC GCTTCCACGT 51 CCAAACCCCT TTAAGAAGGA TGAATGGGCA GGATGAGTTA GACTCCTTCG 101 CTGTATCGTC TACTGATTCT TAAAATGTGA CAAATCTGAT TGGACGACTT 151 ACATGGCTTC TGGAGTTACA GTGAATGATG AAGTCATCAA AGTTTTTAAT 201 GATATGAAAG TAAGGAAATC TTCTACACAA GAGGAGATCA AAAAGAGAAA 251 GAAAGCAGTT CTCTTCTGTT TAAGCGATGA CAAAAGACAA ATAATTGTAG 301 AGGAAGCAAA GCAGATCTTG GTGGGTGACA TTGGTGATAC TGTAGAGGAC 351 CCCTACACAT CTTTTGTGAA GTTGCTACCT CTGAATGATT GCCGATATGC 401 TTTGTACGAT GCCACATACG AAACAAAAGA GTCTAAGAAA GAAGACCTAG 451 TATTTATATT CTGGGCTCCT GAAAGTGCAC CTTTAAAAAG CAAGATGATT 501 TATGCTAGCT CTAAAGATGC CATTAAAAAG AAATTTACAG GTATTAAACA 551 TGAGTGGCAA GTAAATGGCT TGGATGATAT TAAGGACCGT TCGACACTTG 601 GAGAGAAATT GGGAGGCAAT GTAGTAGTTT CACTTGAAGG 
AAAACCATTA 651 TAAAATGACA GTCAAGTGCC ATCTGGATCT TAAGGAGCTT CCATTTCTCC 701 AGCTCAGTCC ATTGGAATAG TATTAGGTTT TGGTTTTTTG TTGTATTTCC 751 CCCTTTCCAC TGGGCCCTTC CAACACAATG AATGAAGGAA ATATCATTTA 801 TTTAAGCAGC CTATCAGTGA TTGCCATTAG ACTGTTGAAT ACTGTTACTT 851 TTATATAGAA CCCAAGGAAT GCCTTCCTGT CATATTTTAG CCAAAACAAC 901 TGGTTATATG CCTCCCTTGC AGCAAGCACT ACAATGTGTG TGATCGTCAA 951 TGTGAATAGC TTAGAATACT GCAAAGGATA AGCTAATTGA ATGCCTTGAA 1001 AGTATTATCC ACTGGTCAGA TGGTCAACTT TTTTCAGTAT TATTTATAGT 1051 TGGCACTTGA TTGCAGTTCT GTGAGGCTTG AGCATTCATA CACCTCACCT 1101 GCCTTGGCAA GCCTATTTTA GTGATATGGC AGCACGGATA TAACACTATG 1151 CATTAAAAGC ACTTTTTGTA ATAAGTTTAA TATCCTAAAA GGAATGCCAA 1201 TTAAGTTTTG TTAACTGTGT CATCAACTTA TCCTAGTACC TCAGTGTTCA 1251 TTCCTGTTAC CTGCATATCT TCTTAAAAGA AATAGCTGTT ATTAATGCCT 1301 TTTTGTTTTC CATTGAGTGT ACACTACTGA ATAAGTGTAG GAGTTTTATG 1351 TTTACCATGT GAGTCCTGCA ACACTAAAGA TATTTTGAAT ATCAGTCATG 1401 ATGGCAATTT CTGTATAAAA GAGCCTTAAA TGGAACATTG TTTTGAGATC 1451 AAACTCCCCA CCCTCACAAA AATGGCCACG TTGCAATAAA AATTGTGGCA 1501 TATTACAGAA CGTTGCCTTG TTTTCCTTGG AAATTTTGCA AAATGTTATG 1551 TGAAACAACT TCTAGGGTAA AAACAGCCAT TACTAATCTC TGCACTGGTC 1601 ATTTGAGAAT TTTTTTTGTA CAGCATTCAT GTGTGATATT TTCCAGATTT 1651 GTTGGATCTA TTTGGTTTAA AAAGTATTCT ATCTTAAGGC CAACTAATAT 1701 AAAATACCAT TGTTAAAGAA TGGTACTTTT ATAAACATTA GTGTATTTAT 1751 TTCCTATGTG TTAATATGAA GATCAGAAAT TATTTTTTGC ACTTTGGCAT 1801 AAATACTTTT CAATATCTGA TTTGTTCTCT GGATAAATTA GCATAGTTAT 1851 TTTTTTATTC ACATTTACAT TTCTAAGTAG TTGTATAGTA GAAGCAGGAA 1901 GCTCTTATTG CTTATTTGGT CGTAATGAAA ATAATTTGTA AAATGTCCTT 1951 TAAAAGTTTA ATGATACTTC TGATGTTTCG GAACAGTCAT TTCACCTACT 2001 ATTTCTGAAT ATATTTTGCA AATTGAATTG GAATAGGAAT TGATATAGCA 2051 GTCTTAAACA TTAGTAGTGG GATTTGGCTA TGGTCCAGAC TGTGCTCCTT 2101 ATAGAGAATT TGATCTGCTC AGTGTGAGCG GTTTGCTGTT AGCCAGGGCT 2151 ATTTATGGCA AACACATGCT TTTGTATCTT GTCATAGTTA TCCACAAAGG 2201 CAAAACTGGA CTTGATTCTA CTGGTATGCA AAACAGGCAT GCTAGTAAGC 2251 AGTCAGTCGT GGCTCAGAAC TTAACCCCAT AGCTCAGAGG AATGCTTTTA 2301 GCAGAAAACA 
GGAAAGAAAA TATCCCTTAA AAATTTTTTT TGAATGTGTG 2351 GAAGTAATTT TAGTATAATT AGATTTTTTC CATATTTTTG AAAGATTTTT 2401 CAGATGTGAA CATTAAAAAT AGGGATTAAA TGTCTAGGCT TCCATTTAAA 2451 ATTATATGAA TGGTTTGGGA TCTTTTTGCA CTGAGCAATT TTATTTCAGG 2501 CTTCCAGCTG TCCCTGTGAG TTATCCTGGA CATTTCGATG GTTTTTGGTA 2551 AGGCCAAACT CTGATAAGCA AAACAGAGAA TACTGACGGT ATACTTAACC 2601 ATATGTGTAA CTGATACTTG GCACCATGGA ATTTTTCATT GAGTTATTTC 2651 CTCATTCTTT TAAAAAATAA GGGACTATAA ATCAGTTATT TAGTATCTTT 2701 TGTTTTTGTA GCTGATTCCT TAACTTTCTT GTATGCCTCT AGTAATTTCA 2751 GAGATTAAAT ATTGCTTTAA ACTGTGATAC TTTGATTTGC TAGATTGACA 2801 AAACTGATAC TAATATAATT AAGTTCATCT TTGAAATACA TCTTTGTGCG 2851 TAGAGCCAAA AAAAGAGATA AAATTAATAA TAGTTCACTT GTTATTTGAG 2901 ATTAATTTGG CATTTGAAAT GATCATTTTA TTTTACAATC ATTTATAATG 2951 AATCAATGTT CCAGTTAGCT TTAAAAGGTA TACGGTGCTA ATTAGTAAAA 3001 TATTGAAGGC AATATTTTAC TGCTAGCTTG CAAAGTTATG AGAGTTTAAA 3051 AAATAAAATA TATGAAAATA AAAAAAAAAA AAAAAAA // LOCUS AF030409 4452 bp mRNA PRI 09-MAR-1998 DEFINITION Homo sapiens sodium-hydrogen exchanger 6 (NHE-6) mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF030409 NID g2944232 VERSION AF030409.1 GI:2944232 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4452) AUTHORS Numata,M., Petrecca,K., Lake,N. and Orlowski,J. TITLE Identification of a mitochondrial Na+/H+ exchanger JOURNAL J. Biol. Chem. 273 (12), 6951-6959 (1998) MEDLINE 98175963 REFERENCE 2 (bases 1 to 4452) AUTHORS Numata,M. and Orlowski,J. TITLE Direct Submission JOURNAL Submitted (18-OCT-1997) Physiology, McGill University, 3655 Drummond St., Montreal, QC H3G 1Y6, Canada COMMENT This mRNA sequence extends the sequence in the file with GenBank Accession Number D87743. FEATURES Location/Qualifiers source 1. .4452 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1. .4452 /gene="NHE-6" CDS 36. 
.2045 /gene="NHE-6" /function="ion transporter" /codon_start=1 /product="sodium-hydrogen exchanger 6" /protein_id="AAC39643.1" /db_xref="PID:g2944233" /db_xref="GI:2944233" /translation="MARRGWRRAPLRRGVGSSPRARRLMRPLWLLLAVGVFDWAGASD GGGGEARAMDEEIVSEKQAEESHRQDSANLLIFILLLTLTILTIWLFKHRRARFLHET GLAMIYGLLVGLVLRYGIHVPSDVNNVTLSCEVQSSPTTLLVTFDPEVFFNILLPPII FYAGYSLKRRHFFRNLGSILAYAFLGTAISCFVIGSIMYGCVTLMKVTGQLAGDFYFT DCLLFGAIVSATDPVTVLAIFHELQVDVELYALLFGESVLNDAVAIVLSSSIVAYQPA GDNSHTFDVTAMFKSIGIFLGIFSGSFAMGAATGVVTALVTKFTKLREFQLLETGLFF LMSWSTFLLAEAWGFTGVVAVLFCGITQAHYTYNNLSTESQHRTKQLFELLNFLAENF IFSYMGLTLFTFQNHVFNPTFVVGAFVAIFLGRAANIYPLSLLLNLGRRSKIGSNFQH MMMFAGLRGAMAFALAIRDTATYARQMMFSTTLLIVFFTVWVFGGGTTAMLSCLHIRV GVDSDQEHLGVPENERRTTKAESAWLFRMWYNFDHNYLKPLLTHSGPPLTTTLPACCG PIARCLTSPQAYENQEQLKDDDSDLILNDGDISLTYGDSTVNTEPATSSAPRRFMGNS SEDALDRELAFGDHELVIRGTRLVLPMDDSEPPLNLLDNTRHGPA" BASE COUNT 1114 a 903 c 1008 g 1427 t ORIGIN 1 CGCCGGTGAG GTAGGGGCGG GAGGCGGGGG GAGACATGGC TCGGCGCGGC 51 TGGCGGCGGG CACCCCTCCG CCGTGGCGTC GGCAGCAGTC CCCGAGCCCG 101 CAGGCTCATG CGGCCCCTTT GGTTGCTCCT CGCAGTGGGC GTCTTTGACT 151 GGGCAGGGGC TTCGGACGGC GGCGGCGGAG AGGCTAGAGC CATGGACGAG 201 GAGATCGTGT CCGAGAAGCA AGCCGAGGAG AGCCACCGGC AGGACAGCGC 251 CAACCTGCTC ATCTTCATCC TGCTGCTCAC CCTCACCATT CTCACAATCT 301 GGCTCTTCAA GCACCGCCGG GCCCGCTTCC TGCACGAAAC CGGCCTGGCT 351 ATGATTTATG GTCTTTTGGT GGGCCTTGTG CTTCGGTATG GCATTCATGT 401 TCCGAGTGAT GTAAATAATG TGACCCTGAG CTGTGAAGTG CAGTCAAGTC 451 CAACTACCTT ACTGGTTACT TTTGATCCAG AAGTATTTTT CAACATATTA 501 CTTCCTCCTA TCATATTTTA TGCAGGTTAT AGCCTGAAAA GGAGACATTT 551 TTTTCGAAAT CTTGGGTCTA TCCTAGCATA CGCTTTTCTT GGAACAGCAA 601 TTTCTTGTTT CGTTATTGGG TCAATAATGT ATGGCTGTGT AACGCTGATG 651 AAGGTAACGG GACAACTTGC AGGAGATTTT TACTTTACAG ATTGCCTACT 701 GTTTGGTGCC ATTGTATCAG CAACTGATCC AGTGACTGTT CTTGCTATAT 751 TCCACGAGCT TCAAGTTGAT GTTGAACTCT ATGCACTTCT TTTTGGTGAA 801 AGTGTCCTCA ATGATGCTGT TGCCATAGTG CTGTCCTCCT CAATAGTGGC 851 ATACCAGCCA GCTGGAGACA ACAGTCACAC CTTTGATGTC ACAGCGATGT 901 TCAAGTCTAT TGGGATCTTC 
CTTGGAATCT TCAGTGGATC TTTTGCAATG 951 GGTGCTGCTA CTGGAGTGGT GACAGCTTTA GTGACAAAGT TCACCAAATT 1001 ACGGGAGTTC CAGTTGTTGG AGACAGGCCT GTTCTTCTTG ATGTCCTGGA 1051 GTACCTTCCT CTTGGCTGAA GCATGGGGCT TCACAGGTGT AGTTGCAGTA 1101 TTGTTTTGTG GCATCACACA AGCACATTAT ACGTATAATA ATTTGTCCAC 1151 GGAGTCTCAG CATAGAACTA AACAGTTGTT TGAGCTTCTC AATTTCTTGG 1201 CAGAGAATTT CATCTTCTCC TACATGGGGC TGACACTGTT CACCTTCCAG 1251 AACCATGTCT TTAACCCAAC ATTTGTAGTA GGAGCATTTG TTGCTATTTT 1301 CTTGGGAAGA GCTGCCAATA TTTACCCCTT GTCCCTCTTA CTTAATTTGG 1351 GTAGAAGAAG TAAGATTGGA TCAAATTTTC AACACATGAT GATGTTTGCT 1401 GGCCTTCGTG GTGCAATGGC ATTTGCCTTG GCCATTCGAG ATACTGCCAC 1451 TTATGCACGG CAAATGATGT TCAGCACCAC GCTTCTGATT GTGTTTTTTA 1501 CCGTGTGGGT ATTTGGTGGT GGCACCACTG CAATGCTGTC ATGCTTGCAT 1551 ATCAGGGTTG GTGTTGATTC AGACCAAGAA CACTTGGGTG TTCCTGAAAA 1601 TGAAAGGAGA ACTACCAAAG CAGAGAGTGC TTGGCTTTTC CGGATGTGGT 1651 ACAACTTTGA TCATAACTAT CTGAAGCCTC TGCTGACCCA CAGCGGGCCT 1701 CCGCTGACAA CAACACTCCC TGCCTGCTGT GGACCCATCG CCAGGTGCCT 1751 CACCAGCCCC CAGGCTTACG AAAACCAGGA ACAGTTGAAA GATGATGATT 1801 CTGATCTTAT TCTCAATGAT GGTGACATCA GTTTGACATA TGGAGATTCT 1851 ACTGTGAACA CTGAACCGGC CACATCCAGC GCCCCAAGGA GATTTATGGG 1901 AAACAGTTCT GAAGATGCCT TGGATCGGGA GCTTGCATTT GGGGACCATG 1951 AACTGGTCAT TCGAGGAACA CGCCTGGTTC TTCCAATGGA TGATTCTGAA 2001 CCCCCGCTAA ATTTGTTAGA TAATACGAGA CATGGTCCAG CCTAAGCTTA 2051 CTAATACTCA CTTAGTGATT TGTAAAATTT GCACATGTGA TTGTGAAGAA 2101 ATTTGTACTA CCTAAAAGTC CCAGTGCATG TCTCTGAATG TGTAAGCTAT 2151 ATAAATGCTA TTTATATGGC ATAGAAAGAA TATAAATATC CTGTACACGG 2201 CAGATTGTGA ACAAACTATA TTCCTTTAAG TTTTCCTGGT TGCACTCTGT 2251 AGACTGGATC TGTTTTAGGA AGTTACTTTC ACAGTGATGT TGTGTGTTCT 2301 GTTAGTTTTA TGTCTCAGTT AAAGTGTAAA AAGTGACGGA TTTTCTCTTT 2351 CTTAAACTTA CCTGACACTT AACCAGAGTA CCAGTTCTCG TGATGTGAAT 2401 TAATTTTTTT GTGTGCTAGG GGAGGGAGAG TGAGGAGGGA GTGTTATTTC 2451 CTTGGGAACC TAGGGAGGAG AGGTTCCTTT GTTGGGAAAC TTTTGTTGAT 2501 AGCTGCTGCC TTTGTCCTGA TCGTTTTCTT TCCCTTTTCT CTGGTGGCCT 2551 GTTGTGGTGC AACGAGCTGA TGGCATTTGA 
TCTTGCCCCA TTCAGGTTGG 2601 GGAGTGAAGT GTGGGGACCC TTTTCCCCCG CTTGCTGTGA AAGCACAGAT 2651 TCATTGACTA CAGTACACTG TTGTTCAGAA AAGAAGGCTG CAAATGACTT 2701 CTGAGACTTT ATGTCTTTTC TTCCAGACCA AGACCGTAGA AGGAGTCACA 2751 TCTAGCCGGC TTAGCCAAAG TACAGGTGTA TATAGTTCAG GGCACTTGAT 2801 TTAGATTTGG AGGGGCTGGG GTGGGCAGAG AGCAAGAGGC GAGTAAAGAG 2851 AATGGTGGTT TCAGAGATCT CTCTTCCCAA ATGTGTAAAT ATTCTATACC 2901 AGATAAGTTT AAATAAGAAA TTTAATTGCT GCTTAATTTT TGATTATGTA 2951 CTTTATCTGT ATAGCAGGCT TTGTCGTCAG AAGTTTTTAT ATCGATTTAA 3001 ATTGCTGCTC TTTAGCAGCC AAACAGGAGC AAAATGTAAA ATTTTTGAAC 3051 TTACTGTGTC TAATCATCAT TTGTTAGTCT GTAGTTAATG TCAACAGTTA 3101 ATTTATGAAC CCACGATCGT TCCACACTGC ACCAAAGTCA GTCATAAGAG 3151 AAATCGAATA TTCTGGAGCA CTGATTGCAG CAGGGTGGCT CCTTTGTGTG 3201 CAGCAGGTGT AGTAGTCTTC ATTTTCATGG TACGTTTTAA TATTAATTAC 3251 CTAAGCTGCC ATGCATTTTT TTTTTACAGT TCTCAAGGAA GAGCACAGAA 3301 CAATTTCTCA TTTCATATTT GGAGTATGAA AGTAGATTCT ATTTTGTAAT 3351 GCTGATAATA CCTAAAGATG CATTGAATGC TTGGAAGAAT GCTTTTTGAT 3401 GTTGATTTTG ACCTGTTCAT GATTCAGAAG AAAAACAAAC TTTTTTGGAT 3451 TTTTTTTCCC TCAGGTCTGA GTAGCATTGC CTTAAATCTT ATCCAGTTAG 3501 AACATTGATT TATTTACATG ATGTTCAGAT TTTCCAGTGA AAAATACCCT 3551 TCTGAACAAA ACATGTACTT ACTCTCCGAA AGGCATCTAT CTGTGCTATT 3601 GCAAACACTC CTTGAGATTT TAGGGGAATT CTAATGTTGT ACCCTTTCGT 3651 GGCAGCTTTG ACTGTTGGCA TAGCCATTTG TTATGTAGTG GTAGCGACTT 3701 TCCTGCTATG CAGGAATCCC TCCCATGACG TGTATGTTTT ACATGATGTG 3751 TGCCTCTTCA CGCAGTAAAT AGTTTCTTGT TAATGTATGT TTGAGGAGTT 3801 TGAACGTCAG TGTCACTTAC CCACAAAGTT ATTCAAGTTG TAAAAGGTTA 3851 TAAATAATTT AACAACTACC TTTTTTATTC TGTCGGGTTA CTGACCTCAC 3901 TTTATGTAAA TACTTCGCAT GACAAATTCA GTAACTCGTC TATTTCAGCA 3951 TGCATAAGAC TTTTCACTAG GGAAACTGAT AAAGCTTGAG TCAACTAAAT 4001 CTGCCTTCAT ACTTTATCAA GGGGAACCAA GCCTGCTGTG CTTACATCAG 4051 CATCTGGAAG ACTTTCCTCT CCTCTAATCT GTGTACACAT CTCCAAGCAA 4101 GGAAGAAAAA ACAAACTCTG CTCAGACGCC TATGAAACAC CTGAATGAAC 4151 TTTGATGAAG TACAGTCTGA GTTACCATCA TGCACAAGTA GAACTGCTCT 4201 TGGACTTGTT TTCCTGTTGT TTGTGGAACC TACGCGTTTG 
AATGGCTTGA 4251 ACGTTGCATC TTTTAAAGTT ATTTTTTAAG GTTTCTTGGC ATTTATCCTA 4301 GTTGTCCGTG TTTGGCAATG TGCTGTTAAA GTAATAGACT TTTAATCTTT 4351 ATGTATTTTT TGTTTTCTCT GGAGTACTTG GACAGATGTT ATAGTGGTTT 4401 CTTTTAGGAA AATCTGTCAT TAAAAAAGTT ATAGCCTTGC AAATAACCAC 4451 TC // LOCUS AF158555 4438 bp mRNA PRI 04-AUG-1999 DEFINITION Homo sapiens glutaminase C mRNA, complete cds. ACCESSION AF158555 NID g5690371 VERSION AF158555.1 GI:5690371 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4438) AUTHORS Elgadi,K.M., Meguid,R.A., Qian,M., Souba,W.W. and Abcouwer,S.F. TITLE Cloning and analysis of unique human glutaminase isoforms generated by tissue-specific alternative splicing JOURNAL Unpublished REFERENCE 2 (bases 1 to 4438) AUTHORS Elgadi,K.M., Meguid,R.A., Qian,M., Souba,W.W. and Abcouwer,S.F. TITLE Direct Submission JOURNAL Submitted (11-JUN-1999) Surgical Oncology Research Laboratories, Massachusetts General Hospital, 55 Fruit Street, J918, Boston, MA 02114-2696, USA FEATURES Location/Qualifiers source 1. .4438 /organism="Homo sapiens" /db_xref="taxon:9606" source 1. .879 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" CDS 245. 
.2041 /codon_start=1 /product="glutaminase C" /protein_id="AAD47056.1" /db_xref="PID:g5690372" /db_xref="GI:5690372" /translation="MMRLRGSGMLRDLLLRSPAGVSXTLRRAQPLVTLCRRPRGGGRP AAGPAAAARLHPWWGGGGWPAEPLARGLSSSPSEILQELGKGSTHPQPGVSPPAAPAA PGPKDGPGETDAFGNSEGKELVASGENKIKQGLLPSLEDLLFYTIAEGQEKIPVHKFI TALKSTGLRTSDPRLKECMDMLRLTLQTTSDGVMLDKDLFKKCVQSNIVLLTQAFRRK FVIPDFMSFTSHIDELYESAKKQSGGKVADYIPQLAKFSPDLWGVSVCTADGQRHSTG DTKVPFCLQSCVKPLKYAIAVNDLGTEYVHRYVGKEPSGLRFNKLFLNEDDKPHNPMV NAGAIVVTSLIKQGVNNAEKFDYVMQFLNKMAGNEYVGFSNATFQSERESGDRNFAIG YYLKEKKCFPEGTDMVGILDFYFQLCSIEVTCESASVMAATLANGGFCPITGERVLSP EAVRNTLSLMHSCGMYDFSGQFAFHVGLPAKSGVAGGILLVVPNVMGMMCWSPPLDKM GNSVKGIHFCHDLVSLCNFHNYDNLRHFAKKLDPRREGGDQRHSFGPLDYESLQQELA LKETVWKKVSPESNEDISTTVVYRMESLGEKS" source 347. .3529 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hGA10-1" /cell_line="HT-29/ino" source 2591. .4438 /organism="Homo sapiens" /db_xref="dbEST:247733" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="Soares Nb2HP" BASE COUNT 1297 a 756 c 914 g 1466 t 5 others ORIGIN 1 GCGGCCGCTG AATTCTAGGG CGGAGCGAAG AGAACCGGTC GCGGCAATCC 51 TAGCGCGCAG CAGCAGCAGC AGCAGCAGCA CCCGCATCCG CTGCGGGAGT 101 CCGAGCCGGA ACCACACCCA AGTAGCTGCC CTTTCCTCTT CTGTCATCTC 151 ACCGCCCCAC CACAGACCGC GTTCCCCGAG GAAACCGGCC GCCCACGCCC 201 GGAGCATCCT CCCCTGTTGA GCGGGCGCTG ACGGACCCGG CGGCATGATG 251 CGGCTGCGAG GCTCGGGGAT GCTGCGGGAC CTGCTCCTGC GGTCGCCCGC 301 CGGCGTGAGC GNGACTCTGC GGCGGGCACA GCCCTTGGTC ACCCTGTGCC 351 GGCGTCCCCG AGGCGGGGGA CGGCCGGCCG CGGGCCCGGC TGCCGCCGCG 401 CGACTCCACC CGTGGTGGGG CGGGGGCGGC TGGCCGGCGG AGCCCCTCGC 451 GCGGGGCCTG TCCAGCTCTC CTTCGGAGAT CTTGCAGGAG CTGGGCAAGG 501 GGAGCACGCA TCCCCAGCCC GGGGTGTCGC CACCCGCTGC CCCGGCGGCG 551 CCCGGCCCCA AGGACGGCCC CGGGGAGACG GACGCGTTTG GCAACAGCGA 601 GGGCAAAGAG CTGGTGGCCT CAGGTGAAAA CAAAATAAAA CAGGGTCTGT 651 TACCTAGCTT GGAAGATTTG CTGTTCTATA CAATTGCTGA AGGACAAGAG 701 AAAATACCTG TTCATAAATT TATTACAGCA CTCAAATCTA CAGGATTGCG 751 AACGTCTGAT CCCAGGTTGA AAGAGTGTAT GGATATGTTA AGATTAACTC 801 TTCAAACAAC 
ATCAGATGGT GTCATGCTAG ACAAAGATCT TTTTAAAAAA 851 TGTGTTCAGA GCAACATTGT TTTGTTGACA CAAGCATTTA GAAGAAAGTT 901 TGTGATTCCT GACTTTATGT CTTTTACCTC ACACATTGAT GAGTTATATG 951 AAAGTGCTAA AAAGCAGTCT GGAGGAAAGG TTGCAGATTA TATTCCTCAA 1001 CTGGCCAAAT TCAGTCCCGA TTTGTGGGGT GTGTCTGTTT GTACAGCAGA 1051 TGGACAGAGG CATTCTACTG GAGATACCAA AGTTCCCTTC TGTCTTCAGT 1101 CCTGTGTAAA ACCTTTGAAA TATGCCATTG CTGTTAATGA TCTTGGAACT 1151 GAATATGTGC ATCGATATGT TGGAAAAGAG CCGAGTGGAC TAAGATTCAA 1201 CAAACTATTT TTGAATGAAG ATGATAAACC ACATAATCCT ATGGTAAATG 1251 CTGGAGCAAT TGTTGTGACT TCACTAATAA AGCAAGGAGT AAATAATGCT 1301 GAAAAATTTG ACTATGTCAT GCAGTTTTTG AATAAGATGG CTGGTAATGA 1351 ATATGTTGGA TTCAGTAATG CAACGTTTCA GTCTGAAAGA GAAAGTGGAG 1401 ATCGAAATTT TGCAATAGGA TATTACTTAA AAGAAAAGAA GTGTTTTCCA 1451 GAAGGCACAG ACATGGTTGG TATATTAGAC TTCTACTTCC AGCTGTGCTC 1501 CATTGAAGTG ACTTGTGAAT CAGCCAGTGT GATGGCTGCG ACACTGGCTA 1551 ATGGTGGTTT CTGCCCAATT ACTGGTGAAA GAGTACTGAG CCCTGAAGCA 1601 GTTCGAAATA CATTGAGTTT GATGCATTCC TGTGGCATGT ATGACTTCTC 1651 AGGGCAGTTT GCTTTCCATG TTGGTCTTCC TGCAAAATCT GGAGTTGCTG 1701 GGGGCATTCT TTTAGTTGTC CCCAATGTTA TGGGTATGAT GTGCTGGTCT 1751 CCTCCTCTGG ATAAGATGGG CAACAGTGTT AAGGGAATTC ACTTTTGTCA 1801 CGATCTTGTT TCTCTGTGTA ATTTCCATAA CTATGATAAT TTGAGACACT 1851 TTGCAAAAAA ACTTGATCCT CGAAGAGAAG GTGGTGATCA AAGGCATTCC 1901 TTTGGACCAT TGGACTATGA AAGTCTCCAA CAAGAACTTG CTTTAAAAGA 1951 GACAGTATGG AAAAAAGTGT CACCTGAGTC AAATGAGGAC ATCTCTACAA 2001 CTGTAGTATA TAGAATGGAA AGTCTGGGAG AGAAAAGCTA AAGAAATGGG 2051 TTCTAGTTTC AGAATGTTTC TTCATTTAAT CTTTCAAACA TCTTTAGCTT 2101 TTTTTTGCAA GTTATAAATA TTTATTTGAG GTATTTTTTG TTCTCAATCT 2151 TGGGTGCTGG AGCCATAAAG CTTTTTTTTC CTTTTAATCT TTGTATAAAG 2201 GCAGTAGATT AAGAAGTGCA TTTGTTGGTC TTTAAAAAGT ATTTACAAGT 2251 ACATAAATTT GCTTTATTTT TAAAAATACA AAAAGGAAAA ATTTAAATTT 2301 TTTTTGATGT AATTAAAATG TTAACTATGT GGTCAGATAA TCCCATTTTA 2351 CAATAGTAAC AGAAAATTGT AATTCTTAGT TCTAAAATTC ACAAATTAAA 2401 CTCATAAGCT TTGTTGCATT TTGTTTTTTC TTTTCCATTT TTAAAACTAA 2451 TGTGATGTCT TTAGTGGCAA TAGAAGGTAC 
TTCTATGCTA AATACAAAAC 2501 TAAAAAGGCA AAATAATGAA CCCCAAATTA TTTTATTTAA AATAGCAGTG 2551 GATTATAAAA TTAGCTTGTG TTTACATTTA TGCCATTTTT GGTGATAGAT 2601 TGGCTTTACA TTTTAAAAAA TTTATTTAAA AATTTATCAA ATGCTTTAAA 2651 ATATGACTCC TACTTTTTTT ATTTTGCAAC TCCTCTGTTC TGTCAGAGTT 2701 GTTATATACA GGAGTGTCTT ATGTTACTAA AACATTCCAG CCAAAGAATT 2751 TCAGATGTGA GATAATGATG TTTCATCAAT AAAAAGCTAT AATGGTTAGT 2801 TACTCAGAAG GAGAAACAGT GAGTGTCTTC AAGTGAATTG TTCACCTAAA 2851 CAATTTTATT TTCATATTAT CCACATAACT TTTTCTATGT TATATTTAAA 2901 TATGAATGGC AAATTTTGGT TTTTAGCTTT TACATTTTAT TATCTTAATT 2951 TTATAAATGC TAATATTTCT TTTGTGATAA GTTATAGCAT CTCATAAAGT 3001 TTGTTCTATT TGAAGTTTTT TAGAGTACTT GAGAAATGAA TTTAGTCTGC 3051 AGGTAGTAAG TATGCTACTA AAATACGTTA GATCTAAATC CTTTTATTTG 3101 GTATAAAAAT GCAATATTGA GAATCAAAAC TTGTTTTTAA GAGAACTATA 3151 GATTCTACAC AACCTGATTT CAAGTAATTA TTCATAGTAT TTATAGTTGT 3201 CTTGGCAAAG TGATTGTAAA ATTCTGTAGG ACCTATTCAC ACTTCTTCCT 3251 TCTTCCATAT ACTTCTCTGG TTTTCCCCAT AGTTCCCCTA TAATTTCAAG 3301 TTTGTTGAAA CCTGTTAATT TTAGTGGGGG ATTAGAAGAA AAACTTGGTG 3351 GTTTCTTAGC ATGATGGTGT ATGTATGTGG TAATGGAAAG TCTGTAAAAG 3401 TAAATATAGT GTAGNAAAAA AGATTTCACT GAGTATTTTA GATACTAGTG 3451 NAAATAAAGA TAGAAAATCT TGATCATAAT GTCTTAAGTT TGGGAACTGT 3501 GATATTAAGA AAAGAAATTC CCTTCTAGAG GTGCTGGCCA AAAAGCCTTT 3551 TGGGCTAACT TAAGTATTAA ATTTATATAT TTAAATAATT ATATTTTAAG 3601 TTGTAGAGGA TTTTCCCAAG GATTTTATGC TTACTTGAAT GTTCTTTGAA 3651 TGTTCAGATG CATATCCTAA NCTGGATGCT TCTCAAGGCC TTACTGCATA 3701 TTTGTGTTGC ATATTTATGT TAGTTGCACC AGGGCCATTT GTAGTTTGGG 3751 CAACCGAATG CCTTAATTGG AAAAAAGGCA TTGTGGTTTC CCCTATTATC 3801 TAAATTGTTA CATTTTACCA TTTCATTCCG AAGTTGGTTT TACTTTATTA 3851 AATGAAGATT TAGTTTTCAT ATCGTATACN TAGCTGTATA GATTTCAAAA 3901 TTAGGTTGTT AATTTGTGTC ACTTACTATT TTTGTGTTGG TAATGCTTTA 3951 AATGCATACT TAAAAATGAA GTACTGTTAT CTAAGCTACT GTGTTTAGAA 4001 AATGTTAAGA ATGAGCAGAA ATTTTTATAG AAAAGATAAA CGGAAGAAGA 4051 GATAAGATAC TGCGAATAGG CCCTCAAACT TAAAAAAGAA AAAACTTTGC 4101 CAGTTTTAAG GACATATTTT GATTCTTTCA GTATCTTAAC 
ACCTTTTTAA 4151 ACAAAGTTCT TGATAGTACC ACTATTATTG GGTTTGTTTT ATGCCATTAT 4201 TGATTCTTGA TATTCAAGCA TTTACAATGT AGCATATTTG ATTTTCTTTT 4251 TTCTTTCTTT TTTTGGCATG TTTTTAGCAT AAATGTTGCA TTTTATCTTA 4301 GTGTTTGGAT GAAAACATTT GTGTTGTTTA GCTTTCATTT GCTTTGTATA 4351 TTTAATAATG TACCTTTATT TTCCAGTATG CCTACATTTT GTATTGCACA 4401 ATAAATTTAT TTTAAGCTGA AAAAAAAAAA AAAAAAAA // LOCUS HUMOCTF1A 3824 bp mRNA PRI 01-MAY-1995 DEFINITION Human octamer binding transcription factor 1 (OTF1) mRNA, complete cds. ACCESSION L20433 NID g418015 VERSION L20433.1 GI:418015 KEYWORDS octamer binding transcription factor 1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3824) AUTHORS Bhargava,A.K., Li,Z. and Weissman,S.M. TITLE Differential expression of four members of the POU family of proteins in activated and phorbol 12-myristate 13-acetate-treated Jurkat T cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (21), 10260-10264 (1993) MEDLINE 94052142 FEATURES Location/Qualifiers source 1. .3824 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkai" /cell_type="T-cell" /tissue_lib="lambda pSH4K" /map="1cen-q32" gene 235. .1497 /gene="OTF1" CDS 235. .1497 /gene="OTF1" /codon_start=1 /product="octamer binding transcription factor 1" /protein_id="AAA65605.1" /db_xref="PID:g418016" /db_xref="GI:418016" /translation="MMSMNSKQPHFAMHPTLPEHKYPSLHSSSEAIRRACLPTPPLQS NLFASLDETLLARAEALAAVDIAVSQGKSHPFKPDATYHTMNSVPCTSTSTVPLAHHH HHHHHHQALEPGDLLDHISSPSLALMAGAGGAGAAAGGGGAHDGPGGGGGPGGGGGPG GGGPGGGGGGGPGGGGGGPGGGLLGGSAHPHPHMHSLGHLSHPAAAAAMNMPSGLPHP GLVAAAAHHGAAAAAAAASAGQVAAASAAAAVVGAAGLASICDSDTDPRELEAFAERF KQRRIKLGVTQADVGSALANLKIPGVGSLSQSTICRFESLTLSHNNMIALKPILQAWL EEAEGAQREKMNKPELFNGGEKKRKRTSIAAPEKRSLEAYFAVQPRPSSEKIAAIAEK LDLKKNVVRVWFCNQRQKQKRMKFSATY" misc_feature 1031. .1469 /gene="OTF1" /label=POU 3'UTR 1498. 
.3824 BASE COUNT 896 a 1005 c 1067 g 856 t ORIGIN 1 GCGGGGCTAG AGCTGTCGGA GAAGCGGGAC CGCGAGGCCG GCGCGCGGCG 51 CTCTGCGCGG TCAGAGGGAG CGCCTGGCAG CAGCAGGAGC AGCAGCAGCA 101 GCCCGCGGCG GGGCCGCCGC CAGCCGCCGC GACCGCCGCG GCTGCAGCCT 151 CCGAAGGGAG GCCGGGTGAG CCGGCGTACG CACTTTCCCG CGGACTTTCG 201 GAGTGTTTGT GGATATACAT GCCAAGCCGC CACGATGATG TCCATGAACA 251 GCAAGCAGCC TCACTTTGCC ATGCATCCCA CCCTCCCTGA GCACAAGTAC 301 CCGTCGCTGC ACTCCAGCTC CGAGGCCATC CGGCGGGCCT GCCTGCCCAC 351 GCCGCCGCTG CAGAGCAACC TCTTCGCCAG CCTGGACGAG ACGCTGCTGG 401 CGCGGGCCGA GGCGCTGGCG GCCGTGGACA TCGCCGTGTC CCAGGGCAAG 451 AGCCATCCTT TCAAGCCGGA CGCCACGTAC CACACGATGA ACAGCGTGCC 501 GTGCACGTCC ACTTCCACGG TGCCTCTGGC GCACCACCAC CACCACCACC 551 ACCACCACCA GGCGCTCGAA CCCGGCGATC TGCTGGACCA CATCTCCTCG 601 CCGTCGCTCG CGCTCATGGC CGGCGCGGGC GGCGCGGGCG CGGCGGCCGG 651 CGGCGGCGGC GCCCACGACG GCCCGGGGGG CGGTGGCGGC CCGGGCGGCG 701 GCGGCGGCCC GGGCGGCGGC GGCCCCGGGG GAGGCGGCGG TGGCGGCCCG 751 GGGGGCGGCG GCGGCGGCCC GGGCGGCGGG CTCCTGGGCG GCTCCGCGCA 801 CCCTCACCCG CATATGCACA GCCTGGGCCA CCTGTCGCAC CCCGCGGCGG 851 CGGCCGCCAT GAACATGCCG TCCGGGCTGC CGCACCCCGG GCTGGTGGCG 901 GCGGCGGCGC ACCACGGCGC GGCAGCGGCA GCGGCGGCGG CGTCGGCCGG 951 GCAGGTGGCA GCGGCATCGG CGGCGGCGGC CGTGGTGGGC GCAGCGGGCC 1001 TGGCGTCCAT CTGCGACTCG GACACGGACC CGCGCGAGCT CGAGGCGTTC 1051 GCGGAGCGCT TCAAGCAGCG GCGCATCAAG CTGGGCGTGA CGCAGGCCGA 1101 CGTGGGCTCG GCGCTGGCCA ACCTCAAGAT CCCGGGCGTG GGCTCACTCA 1151 GCCAGAGCAC CATCTGCAGG TTCGAGTCGC TCACGCTCTC GCACAACAAC 1201 ATGATCGCGC TCAAGCCCAT CCTGCAGGCG TGGCTCGAGG AGGCCGAGGG 1251 CGCCCAGCGC GAGAAAATGA ACAAGCCTGA GCTCTTCAAC GGCGGCGAGA 1301 AGAAGCGCAA GCGGACTTCC ATCGCCGCGC CCGAGAAGCG CTCCCTCGAG 1351 GCCTACTTCG CCGTGCAGCC CCGGCCCTCG TCCGAGAAGA TCGCCGCCAT 1401 CGCCGAGAAA CTGGACCTCA AAAAGAACGT GGTGCGGGTG TGGTTTTGCA 1451 ACCAGAGACA GAAGCAGAAG CGGATGAAAT TCTCTGCCAC TTACTGAGGG 1501 GGCTGGGAGG TGTCGGGCGG GACAGAATGG GGAGCTGAGG AGGCATTTTT 1551 GGGGGGCTTT CCTCTGCTTG CCTCCCCTCG GATTTGGAGT GTCCGTTATC 1601 CTGCCTGCAT TTGGGGAGTC CCTTCTCGCT CTCTTTCCTC 
CACCCATTCT 1651 CTGATTTTCC TGCCTTTGCT GTCCCCTAGC CTTGAGGACT GGGGTGCTGG 1701 GTGTGGGGAT TGGAGTATAG GGTAGGGGAG AAGGGGGGGA GCATTCGGGG 1751 GAGTGGGGAG TGGGGGGAAG GAAAGCGGAG ACCCGAGCAG GGGTTTTAAG 1801 GAGCAGGATG GTTCTGGGGT TTGGGTGGGG GGAGACGCGG GAAGGGTAGG 1851 AAAATGGACT GTTTCTGACC AGAGACACTT ACCTAAATAT CCTGGGGACC 1901 AAGGAACTAT GTACAAAAAC AAACCTACCA ACCACCAAAA ACTAGACAAA 1951 TAAAGACAAA CTAAAACAAA ACAGAACAAA AGCAAAGGAA AATGCTTTAG 2001 AAATTTTAAC TCCGGGGAGC CATAATCTGC AACTTCATTT TCCCCCATAG 2051 AAGAGAAAAA AGAGCACCAC CATTATTACC ACCTCCCCAA CCCTACACGC 2101 ACGAACTGAG TCGAAAAACG AAAACCAAAC GAGCGAGAAG TTGAAGTTCT 2151 GGGTATCAAA GCTAGTTGTT CTGTCTGCGT GTTTAATTTT TCCCTCTCTC 2201 ACCTCCACCC CATCCATATC CTCTTTATTT CCTCCGTTCC AATGAGAGGC 2251 CTATGGCTGC TCTCCAATCC CGGGAAGTGA GTGGGAGCAC AGCTGAAAAG 2301 AGAGGGTCAG GGGGAGGCTG GCTGCTTGCT TAGGTGGAAT CCAACTTTTC 2351 CCGTGGCCCT GCCTATACTC TGGTGGCCTG GTCCTGTTGG GGTGGGGGTC 2401 TTTGGAGAGA AGGGCATAGT CTTTGAGCTA CTAAAAAGCA GAATTCCGGA 2451 GCTTCGAGAT ATCTTATTCT AGGAAAATGA AACAATTTTA ACAACAGTTT 2501 TTTTTCCTCT TATGTCGAAG ATCTAGTTTT AGACAATTTC AAAATAAGCT 2551 TTTCCCACTC ATAGAACTTT AACTTGCCCT TTCAGTTTTA TCTTTTTTTT 2601 AGAGAGAGGT TTAAACTACT GATTTTTCCT GTTGATTCAA ATAGACTAAT 2651 GGGGTGAAAG TTATTAGGAG AGATACTCTC TCCTGTTTTC TCCACTGAAC 2701 GAGACTCATC TTGCTCTTCT AGGTCCCGTT TCTTCCTCTC TTGGAGGACA 2751 TGAAATTATA GAAATGTTGA GAAGTTCCTG CTTTCTTTTG CGGTAGGACT 2801 TGGCTGTGAG AAAATCACCT AAATCCCAGA AAAGAGGAAG ACAGATTTAA 2851 AGTGCCCCCA CCCCCATTTG TTTCAAAGAG GTCTGCATGT TGGGCGAAAA 2901 CAGAACAACT GTGTTTCCTT TTACTTGTTC TTATTATTCA AGAGTCATTT 2951 ATTACAGGGG ATAAATGTTG GGTAGCAAGA ACTTTAATTT GCACTACCAG 3001 TCTCCCAAAT AGAAAATCAT GTATAGTATT TCATAGTAAT AATCAGGTAC 3051 CTTACAAGCT GCTGGTGGAT TTTAAAAAAT TAAGATAGTT GAAGGTGGTT 3101 AGGTAAAATG CCTGCTTTGT GTACAAGATA CTCTTTGGAT CTCTCGTAGA 3151 GATGGTTTGT TACCATCCTT TAATCATAAC TAAAACATTG AAAACAGAAC 3201 AAATGAGAAA AGAAAAAAAA CCTGCCGATT AACAAGACTG AAATCATGCA 3251 TGATCTGAAA GGTGTGGAAA GAAACACAAT TAGGTCTCAC TCTGGTTAGG 3301 
CATTATTTAT TTAATTATGT TGTATATCAT TGTTTGCAGG GCAAACATTC 3351 TATGCATTTG AAACTGAGCA CTAAACTGGG CTAGCTTTCT GGTAGACCGT 3401 TTTGTGGCTA GTGCGATTTC ACAGTCTACT GCCTGTTTCC ACTGAAAACA 3451 TTTTTGTCAT ATTCTTGTAT TCAAAGAAAA CAGGAAAAAA GTTATTGTAA 3501 ATATTTTATT TAATGCACAC ATTCACACAG TGGTAACAGA CTGCCAGTGT 3551 TCATCCTGAA ATGTCTCACG GATTGATCTA CCTGTCTATG TATGTCTGCT 3601 GAGCTTTCTC CTTGGTTATG TTTTTTCTCT TTTACCTTTC TCCTCCCTTA 3651 CTTCTATCAG AACCAATTCT ATGCGCCAAA TACAACAGGG GGATGTGTCC 3701 CAGTACACTT ACAAAATAAA ACATAACTGA AAGAAGAGCA GTTTTATGAT 3751 TTGGGTGCGT TTTTGTGTTT ATACTGGGCC AGGTCCTGGT AGAACCTTTC 3801 AACAAACAAC CAAACAAAAA AAAA // LOCUS HSRDC1MR 3492 bp mRNA PRI 23-SEP-1992 DEFINITION H.sapiens mRNA for RDC-1 POU domain containing protein. ACCESSION X64624 NID g35914 VERSION X64624.1 GI:35914 KEYWORDS POU domain; POU domain protein. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3492) AUTHORS Collum,R.G., Fisher,P.E., Datta,M., Mellis,S., Thiele,C., Huebner,K., Croce,C.M., Israel,M.A., Theil,T., Moroy,T. et,al. TITLE A novel POU homeodomain gene specifically expressed in cells of the developing mammalian nervous system JOURNAL Nucleic Acids Res. 20 (18), 4919-4925 (1992) MEDLINE 93027214 REFERENCE 2 (bases 1 to 3492) AUTHORS Alt,F.W. TITLE Direct Submission JOURNAL Submitted (25-FEB-1992) F.W. Alt, Howard Hughes Medical Institute, The Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1. .3492 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /cell_line="HeLa, CHP100" /clone_lib="HeLa genomic lib., CHP100 genomic lib." /map="p14-22" gene 278. .1273 /gene="RDC-1" CDS 278. 
.1273 /gene="RDC-1" /codon_start=1 /product="RDC-1" /protein_id="CAA45907.1" /db_xref="PID:g35915" /db_xref="GI:35915" /db_xref="SWISS-PROT:Q01851" /translation="MNSVPCHTSTVPLAHHHHHHHHHQALEPGDLLDHISSPSLALMA GAGARRGAGGGGAHDAAGGGGPRGGGGGPGGGGPGGGGGGAAGGGGGGPGGGLLGASA HPHPHMHSLGHLSHPAAAAAMNMPSGLPHPGLVAAAAHHGAAAAAAAAAAGQVAAASA AAVVGRAGLASICDSDTDPRELEAFGSFKQRRIKLGVTQADVGSALANLKIPGVGSLS QSTICRFESLTLSHNNMIALKPILQAWLEEAEGPSEKMNKPELFNGGEKKRKRTSIAA PEKRSLEAYFAVQPRPSSEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKFSATY" misc_feature 796. .1252 /gene="RDC-1" /note="POU domain" BASE COUNT 807 a 884 c 953 g 848 t ORIGIN 1 TCCACTGCCC CCAAACCCGT TGTTATTTTG GTTGCTTTGT GTTTGCCCTT 51 CGCATGCTGT GCTTTCCGGC CTGTGTGTGT GTTTCTTCTG TTGTTTTGTT 101 TTGCTCTTTT GGTTTTGTGG TTTTTTTGGT TTGGTTTGTG TCGCCTGCAG 151 CTGCACCGAG CAACCTCTTC GCCAGCCTGG ACGAGACGCT GCTGGCCGGG 201 CCGAGGCCTG GCGGCCGTGG ACATCGCCGT GTCCCAGGGC AAGAGCATCC 251 TTCAAGCCGG ACGCCACGTA CCACACGATG AACAGCGTGC CGTGTCACAC 301 TTCCACGGTG CCTCTGGCGC ACCACCACCA CCACCACCAC CACCACCAGG 351 CGCTCGAACC CGGCGATCTG CTGGACCACA TCTCCTCGCC GTCGCTCGCG 401 CTCATGGCCG GCGCGGGCGC GCGGCGCGGC GCCGGCGGCG GCGGCGCCCA 451 CGACGCCGCG GGGGGCGGTG GCCCGCGGGG CGGCGGCGGC GGCCCGGGCG 501 GCGGCGGCCC CGGGGGAGGC GGCGGTGGCG CCGCGGGGGG CGGCGGCGGC 551 GGCCCGGGCG GCGGGCTCCT GGGCGCGTCC GCGCACCCTC ACCCGCATAT 601 GCACAGCCTG GGCCACCTGT CGCACCCCGC GGCGGCGGCC GCCATGAACA 651 TGCCGTCCGG GCTGCCGCAC CCCGGGCTGG TGGCGGCGGC GGCGCACCAC 701 GGCGCGGCAG CGGCAGCGGC GGCGGCGGCG GCCGGGCAGG TGGCAGCGGC 751 ATCGGCGGCG GCCGTGGTGG GCCGAGCGGG CCTGGCGTCC ATCTGCGACT 801 CGGACACGGA CCCGCGCGAG CTCGAGGCGT TCGGGAGCTT CAAGCAGCGG 851 CGCATCAAGC TGGGCGTGAC GCAGGCCGAC GTGGGCTCGG CGCTGGCCAA 901 CCTCAAGATC CCGGGCGTGG GCTCACTCAG CCAGAGCACC ATCTGCAGGT 951 TCGAGTCGCT CACGCTCTCG CACAACAACA TGATCGCGCT CAAGCCCATC 1001 CTGCAGGCGT GGCTCGAGGA GGCCGAGGGG CCCAGCGAGA AAATGAACAA 1051 GCCTGAGCTC TTCAACGGCG GCGAGAAGAA GCGCAAGCGG ACTTCCATCG 1101 CCGCGCCCGA GAAGCGCTCC CTCGAGGCCT ACTTCGCCGT GCAGCCCCGG 1151 CCCTCGTCCG AGAAGATCGC CGCCATCGCC 
GAGAAACTGG ACCTCAAAAA 1201 GAACGTGGTG CGGGTGTGGT TTTGCAACCA GAGACAGAAG CAGAAGCGGA 1251 TGAAATTCTC TGCCACTTAC TGAGGGGGCT GGGAGGTGTC GGGCGGGACA 1301 GAATGGGGAG CTGAGGAGGC ATTTTTGGGG GGCTTTCCTC TGCTTGCCTC 1351 CCCTCGGATT TGGAGTGTCC GTTATCCTGC CTGCATTTGG GGAGTCCCTT 1401 CTCGCTCTCT TTCCTCCACC CATTCTCTGA TTTTCCTGCC TTTGCTGTCC 1451 CCTAGCCTTA GAGGACTGGG GTGCTGGGTG TGGGGATTGG AGTATAGGGT 1501 AGGGGAGAAG GGGGGGAGCA TTCGGGGGAG TGGGGAGTGG GGGGAAGGAA 1551 AGCGGAGACC CGAGCAGGGG TTTTAAGGAG CAGGATGGTT CTGGGGTTTG 1601 GGTGGGGGGA GACGCGGGAA GGGTAGGAAA ATGGACTGTT CTGACCAGAG 1651 ACACTTACCT AATATCCTGG GACAAGAACT ATGTACAAAA CAAACCTACC 1701 AACCACCAAA AACTAGACAA ATAAAGACAA ACTAAAACAA AACAGAACAA 1751 AAGCAAAGGA AAATGCTTTA GAAATTTTAA CTCCGGGGAG CCATAATCTG 1801 CAACTTCATT TTCCCCCATA GAAGAGAAAA AAGAGCACCA CCATTATTAC 1851 CACCTCCCCA ACCCTACACG CACGAACTGA GTCGAAAAAC GAAAACCAAA 1901 CGAGCGAGAA GTTGAAGTTC TGGGTATCAA AGCTAGTTGT TCTGTCTGCG 1951 TGTTTAATTT TTCCCTCTCT CACCTCCACC CCATCCATAT CCTCTTTATT 2001 TCCTCCGTTC CAATGAGACC TATGGCTGCT CTCCAATCCC GGGAAGTGAG 2051 TGGGAGACAG CTGAAAAGAG AGGGTCAGGG GGAGGCTGGC TGCTTGCTTA 2101 GGTGGAATCC AAGTTTTCCC GTGGCCCTGC CTATACTCTG GTGGCCTGGT 2151 CCTGTTGGGG TGGGGGTCTT TGGAGAGAAG GGCATAGTCT TTGAGCTACT 2201 AAAAAGCAGA ATTCCGGAGC TTCGAGATAT CTTATTCTAG GAAAATGAAA 2251 CAATTTTAAC AACAGTTTTT TTTCCTCTTA TGTCGAAGAT CTAGTTTTAG 2301 ACAATTTCAA AATAAGCTTT TCCCACTCAT AGAACTTTAA CTTGCCCTTT 2351 CAGTTTTATC TTTTTTTTAG AGAGAGGTTT AAACTACTGA TTTTGGCTGT 2401 TGATTCAAAT AGACTAATGG GGTGAAAGTT ATTAGGAGAG ATACTCTCTC 2451 CTGTTTCTCC ACTGAACGAG ACTCATCTTG CTCTTCTAGG TCCCGTTTCT 2501 TCCTCTCTTG GACATGAAAT TATAGAAATG TTGAGAAGTC TGCCTGCTTT 2551 CTTTTGCGGT AGGACTTGGC TGTGAGAAAA TCACCTAAAT CCCAGAAAAG 2601 AGGAAGACAG ATTTAAAGTG CCCCCACCCC CATTTGTTTC AAAGAGGTCT 2651 GCATGTTGGG CGAAAACAGA ACAACTGTGT TTCCTTTTAC TTGTTCTTAT 2701 TATTCAAGAG TCATTTATTA CAGGGGATAA ATGTTGGGTA GCAAGAACTT 2751 TAATTTGCAC TACCAGTCTC CCAAATAGAA AATCATGTAT AGTATTTCAT 2801 AGTAATAATC AGGTACCTTA CAAGCTGCGG TGGATTTTAA 
AAAATTAAGA 2851 TAGTTGAAGG TGGTTAGGTA AAATGCCTGC TTTGTGTACA AGATACTCTT 2901 TGGATCTCTC GTAGGAATGG TTTGTTACCA TCCTTTAATC ATAACTAAAA 2951 CATTGAAAAC AGAACAAATG AGAAAAGAAA AAAAACCTGC CGATTAACAA 3001 TGACGAAAAT CATGCATGAT CTGAAAGGTG TGGAAAGAAA CACAATTAGG 3051 TCTCACTCTG GTTAGGCATT ATTTATTTAA TTATGTTGTA TATCATTGTT 3101 TGCAGGGCAA CATTCTATGC ATTGAACTGA GCACTAACTG GGCTAGCTTC 3151 TGGTAGACGT TTGTGGCTAG TGCGATTCAC AGTCTACTGC CTGTTCCACT 3201 GAAACATTTT GTCATATTCT TGTATTCAAA GAAAAAAGGA AAAAAAGATT 3251 ATTGTAAATA TTTTATTTAA TGCACACATT CACACAGTGG TAACAGACTG 3301 CCAGTGTTCA TCCTGAAATG TCTCACGGAT TGATCTACCT GTCCATGTAT 3351 GTCTGCTGAG CTTTCTCCTT GGTTATGTTT TTTCTCTTTT ACCTTTCTCC 3401 TCCCTTACTT CTATCAGAAC CAATTCTATG CGCCAAAATA CAACAGGGGG 3451 ATGTGTCCCA GTACACTTAC AAATAAAACC TCGTGCCGAA TT // LOCUS HUMTHRSPO 5784 bp mRNA PRI 30-DEC-1993 DEFINITION Human thrombospondin 2 (THBS2) mRNA, complete cds. ACCESSION L12350 NID g307505 VERSION L12350.1 GI:307505 KEYWORDS thrombospondin 2. SOURCE Homo sapiens adult connective cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5784) AUTHORS LaBell,T.L., Milewicz,D.J., Disteche,C.M. and Byers,P.H. TITLE Thrombospondin II: partial cDNA sequence, chromosome location, and expression of a second member of the thrombospondin gene family in humans JOURNAL Genomics 12 (3), 421-429 (1992) MEDLINE 92217961 REFERENCE 2 (bases 1 to 5784) AUTHORS LaBell,T.L. and Byers,P.H. TITLE Sequence and characterization of the complete human thrombospondin 2 cDNA: potential regulatory role for the 3' untranslated region JOURNAL Genomics 17 (1), 225-229 (1993) MEDLINE 94010892 FEATURES Location/Qualifiers source 1. .5784 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /tissue_type="connective" mRNA 1. .5784 /gene="THBS2" /citation=[2] /evidence=experimental gene 1. .5784 /gene="THBS2" 5'UTR 1. 
.239 /gene="THBS2" /note="putative" /citation=[2] sig_peptide 240. .293 /gene="THBS2" /note="putative" /citation=[2] CDS 240. .3758 /gene="THBS2" /standard_name="TSP2" /citation=[2] /codon_start=1 /evidence=experimental /product="thrombospondin 2" /protein_id="AAA03703.1" /db_xref="PID:g307506" /db_xref="GI:307506" /translation="MVWRLVLLALWVWPSTQAGHQDKDTTFDLFSISNINRKTIGAKQ FRGPDPGVPAYRFVRFDYIPPVNADDLSKITKIMRQKEGFFLTAQLKQDGKSRGTLLA LEGPGLSQRQFEIVSNGPADTLDLTYWIDGTRHVVSLEDVGLADSQWKNVTVQVAGET YSLHVGCDLIGPVALDEPFYEHLQAEKSRMYVAKGSARESHFRGLLQNVHLVFENSVE DILSKKGCQQGQGAEINAISENTETLRLGPHVTTEYVGPSSERRPEVCERSCEELGNM VQELSGLHVLVNQLSENLKRVSNDNQFLWELIGGPPKTRNMSACWQDGRFFAENETWV VDSCTTCTCKKFKTICHQITCPPATCASPSFVEGECCPSCLHSVDGEEGWSPWAEWTQ CSVTCGSGTQQRGRSCDVTSNTCLGPSIQTRACSLSKCDTRIRQDGGWSHWSPWSSCS VTCGVGNITRIRLCNSPVPQMGGKNCKGSGRETKACQGAPCPIDGRWSPWSPWSACTV TCAGGIRERTRVCNSPEPQYGGKACVGDVQERQMCNKRSCPVDGCLSNPCFPGAQCSS FPDGSWSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRCVNTQPGFHCLPCPP RYRGNQPVGVGLEAAKTEKQVCEPENPCKDKTHNCHKHAECIYLGHFSDPMYKCECQT GYAGDGLICGEDSDLDGWPNLNLVCATNATYHCIKDNCPHLPNSGQEDFDKDGIGDAC DDDDDNDGVTDEKDNCQLLFNPRQADYDKDEVGDRCDNCPYVHNPAQIDTDNNGEGDA CSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPLVHNPDQTDVDNDLVGD QCDNNEDIDDDGHQNNQDNCPYISNANQADHDRDGQGDACDPDDDNDGVPDDRDNCRL VFNPDQEDLDGDGRGDICKDDFDNDNIPDIDDVCPENNAISETDFRNFQMVPLDPKGT TQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDRDDDYAGFVFG YQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRNALWHTGNT PGQVRTLWHDPRNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPIYDQTYA GGRLGLFVFSQEMVYFSDLKYECRDI" mat_peptide 294. .3755 /gene="THBS2" /citation=[2] /evidence=experimental /product="thrombospondin 2" 3'UTR 3759. .5784 /gene="THBS2" /note="putative" /citation=[2] polyA_signal 5761. 
.5766 /gene="THBS2" /note="putative" /citation=[2] polyA_site 5784 /gene="THBS2" BASE COUNT 1447 a 1460 c 1518 g 1359 t ORIGIN 1 ACGGCATCCA GTACAGAGGG GCTGGACTTG GACCCCTGCA GCAGCCCTGC 51 ACAGGAGAAG CGGCATATAA AGCCGCGCTG CCCGGGAGCC GCTCGGCCAC 101 GTCCACCGGA GCATCCTGCA CTGCAGGGCC GGTCTCTCGC TCCAGCAGAG 151 CCTGCGCCTT TCTGACTCGG TCCGGAACAC TGAAACCAGT CATCACTGCA 201 TCTTTTTGGC AAACCAGGAG CTCAGCTGCA GGAGGCAGGA TGGTCTGGAG 251 GCTGGTCCTG CTGGCTCTGT GGGTGTGGCC CAGCACGCAA GCTGGTCACC 301 AGGACAAAGA CACGACCTTC GACCTTTTCA GTATCAGCAA CATCAACCGC 351 AAGACCATTG GCGCCAAGCA GTTCCGCGGG CCCGACCCCG GCGTGCCGGC 401 TTACCGCTTC GTGCGCTTTG ACTACATCCC ACCGGTGAAC GCAGATGACC 451 TCAGCAAGAT CACCAAGATC ATGCGGCAGA AGGAGGGCTT CTTCCTCACG 501 GCCCAGCTCA AGCAGGACGG CAAGTCCAGG GGCACGCTGT TGGCTCTGGA 551 GGGCCCCGGT CTCTCCCAGA GGCAGTTCGA GATCGTCTCC AACGGCCCCG 601 CGGACACGCT GGATCTCACC TACTGGATTG ACGGCACCCG GCATGTGGTC 651 TCCCTGGAGG ACGTCGGCCT GGCTGACTCG CAGTGGAAGA ACGTCACCGT 701 GCAGGTGGCT GGCGAGACCT ACAGCTTGCA CGTGGGCTGC GACCTCATAG 751 GACCAGTTGC TCTGGACGAG CCCTTCTACG AGCACCTGCA GGCGGAAAAG 801 AGCCGGATGT ACGTGGCCAA AGGCTCTGCC AGAGAGAGTC ACTTCAGGGG 851 TTTGCTTCAG AACGTCCACC TAGTGTTTGA AAACTCTGTG GAAGATATTC 901 TAAGCAAGAA GGGTTGCCAG CAAGGCCAGG GAGCTGAGAT CAACGCCATC 951 AGTGAGAACA CAGAGACGCT GCGCCTGGGT CCGCATGTCA CCACCGAGTA 1001 CGTGGGCCCC AGCTCGGAGA GGAGGCCCGA GGTGTGCGAA CGCTCGTGCG 1051 AGGAGCTGGG AAACATGGTC CAGGAGCTCT CGGGGCTCCA CGTCCTCGTG 1101 AACCAGCTCA GCGAGAACCT CAAGAGAGTG TCGAATGATA ACCAGTTTCT 1151 CTGGGAGCTC ATTGGTGGCC CTCCTAAGAC AAGGAACATG TCAGCTTGCT 1201 GGCAGGATGG CCGGTTCTTT GCGGAAAATG AAACGTGGGT GGTGGACAGC 1251 TGCACCACGT GTACCTGCAA GAAATTTAAA ACCATTTGCC ACCAAATCAC 1301 CTGCCCGCCT GCAACCTGCG CCAGTCCATC CTTTGTGGAA GGCGAATGCT 1351 GCCCTTCCTG CCTCCACTCG GTGGACGGTG AGGAGGGCTG GTCTCCGTGG 1401 GCAGAGTGGA CCCAGTGCTC CGTGACGTGT GGCTCTGGGA CCCAGCAGAG 1451 AGGCCGGTCC TGTGACGTCA CCAGCAACAC CTGCTTGGGG CCCTCGATCC 1501 AGACACGGGC TTGCAGTCTG AGCAAGTGTG ACACCCGCAT CCGGCAGGAC 1551 GGCGGCTGGA GCCACTGGTC 
ACCTTGGTCT TCATGCTCTG TGACCTGTGG 1601 AGTTGGCAAT ATCACACGCA TCCGTCTCTG CAACTCCCCA GTGCCCCAGA 1651 TGGGGGGCAA GAATTGCAAA GGGAGTGGCC GGGAGACCAA AGCCTGCCAG 1701 GGCGCCCCAT GCCCAATCGA TGGCCGCTGG AGCCCCTGGT CCCCGTGGTC 1751 GGCCTGCACT GTCACCTGTG CCGGTGGGAT CCGGGAGCGC ACCCGGGTCT 1801 GCAACAGCCC TGAGCCTCAG TACGGAGGGA AGGCCTGCGT GGGGGATGTG 1851 CAGGAGCGTC AGATGTGCAA CAAGAGGAGC TGCCCCGTGG ATGGCTGTTT 1901 ATCCAACCCC TGCTTCCCGG GAGCCCAGTG CAGCAGCTTC CCCGATGGGT 1951 CCTGGTCATG CGGCTTCTGC CCTGTGGGCT TCTTGGGCAA TGGCACCCAC 2001 TGTGAGGACC TGGACGAGTG TGCCCTGGTC CCCGACATCT GCTTCTCCAC 2051 CAGCAAGGTG CCTCGCTGTG TCAACACTCA GCCTGGCTTC CACTGCCTGC 2101 CCTGCCCGCC CCGATACAGA GGGAACCAGC CCGTCGGGGT CGGCCTGGAA 2151 GCAGCCAAGA CGGAAAAGCA AGTGTGTGAG CCCGAAAACC CATGCAAGGA 2201 CAAGACACAC AACTGCCACA AGCACGCGGA GTGCATCTAC CTGGGTCACT 2251 TCAGCGACCC CATGTACAAG TGCGAGTGCC AGACAGGCTA CGCGGGCGAC 2301 GGGCTCATCT GCGGGGAGGA CTCGGACCTG GACGGCTGGC CCAACCTCAA 2351 TCTGGTCTGC GCCACCAACG CCACCTACCA CTGCATCAAG GATAACTGCC 2401 CCCATCTGCC AAATTCTGGG CAGGAAGACT TTGACAAGGA CGGGATTGGC 2451 GATGCCTGTG ATGATGACGA TGACAATGAC GGTGTGACCG ATGAGAAGGA 2501 CAACTGCCAG CTCCTCTTCA ATCCCCGCCA GGCTGACTAT GACAAGGATG 2551 AGGTTGGGGA CCGCTGTGAC AACTGCCCTT ACGTGCACAA CCCTGCCCAG 2601 ATCGACACAG ACAACAATGG AGAGGGTGAC GCCTGCTCCG TGGACATTGA 2651 TGGGGACGAT GTCTTCAATG AACGAGACAA TTGTCCCTAC GTCTACAACA 2701 CTGACCAGAG GGACACGGAT GGTGACGGTG TGGGGGATCA CTGTGACAAC 2751 TGCCCCCTGG TGCACAACCC TGACCAGACC GACGTGGACA ATGACCTTGT 2801 TGGGGACCAG TGTGACAACA ACGAGGACAT AGATGACGAC GGCCACCAGA 2851 ACAACCAGGA CAACTGCCCC TACATCTCCA ACGCCAACCA GGCTGACCAT 2901 GACAGAGACG GCCAGGGCGA CGCCTGTGAC CCTGATGATG ACAACGATGG 2951 CGTCCCCGAT GACAGGGACA ACTGCCGGCT TGTGTTCAAC CCAGACCAGG 3001 AGGACTTGGA CGGTGATGGA CGGGGTGATA TTTGTAAAGA TGATTTTGAC 3051 AATGACAACA TCCCAGATAT TGATGATGTG TGTCCTGAAA ACAATGCCAT 3101 CAGTGAGACA GACTTCAGGA ACTTCCAGAT GGTCCCCTTG GATCCCAAAG 3151 GGACCACCCA AATTGATCCC AACTGGGTCA TTCGCCATCA AGGCAAGGAG 3201 CTGGTTCAGA CAGCCAACTC GGACCCCGGC 
ATCGCTGTAG GTTTTGACGA 3251 GTTTGGGTCT GTGGACTTCA GTGGCACATT CTACGTAAAC ACTGACCGGG 3301 ACGACGACTA TGCTGGCTTC GTCTTTGGTT ACCAGTCAAG CAGCCGCTTC 3351 TATGTGGTGA TGTGGAAGCA GGTGACGCAG ACCTACTGGG AGGACCAGCC 3401 CACGCGGGCC TATGGCTACT CCGGCGTGTC CCTCAAGGTG GTGAACTCCA 3451 CCACGGGGAC GGGCGAGCAC CTGAGGAACG CGCTGTGGCA CACGGGGAAC 3501 ACGCCGGGGC AGGTGCGAAC CTTATGGCAC GACCCCAGGA ACATTGGCTG 3551 GAAGGACTAC ACGGCCTATA GGTGGCACCT GACTCACAGG CCCAAGACCG 3601 GCTACATCAG AGTCTTAGTG CATGAAGGAA AACAGGTCAT GGCAGACTCA 3651 GGACCTATCT ATGACCAAAC CTACGCTGGC GGGCGGCTGG GTCTATTTGT 3701 CTTCTCTCAA GAAATGGTCT ATTTCTCAGA CCTCAAGTAC GAATGCAGAG 3751 ATATTTAAAC AAGATTTGCT GCATTTCCGG CAATGCCCTG TGCATGCCAT 3801 GGTCCCTAGA CACCTCAGTT CATTGTGGTC CTTGCGGCTT CTCTCTCTAG 3851 CAGCACCTCC TGTCCCTTGA CCTTAACTCT GATGGTTCTT CACCTCCTGC 3901 CAGCAACCCC AAACCCAAGT GCCTTCAGAG GATAAATATC AATGGAACTC 3951 AGAGATGAAC ATCTAACCCA CTAGAGGAAA CCAGTTTGGT GATATATGAG 4001 ACTTTATGTG GAGTGAAAAT TGGGCATGCC ATTACATTGC TTTTTCTTGT 4051 TTGTTTAAAA AGAATGACGT TTACATATAA AATGTAATTA CTTATTGTAT 4101 TTATGTGTAT ATGGAGTTGA AGGGAATACT GTGCATAAGC CATTATGATA 4151 AATTAAGCAT GAAAAATATT GCTGAACTAC TTTTGGTGCT TAAAGTTGTC 4201 ACTATTCTTG AATTAGAGTT GCTCTACAAT GACACACAAA TCCCGCTAAA 4251 TAAATTATAA ACAAGGGTCA ATTCAAATTT GAAGTAATGT TTTAGTAAGG 4301 AGAGATTAGA AGACAACAGG CATAGCAAAT GACATAAGCT ACCGATTAAC 4351 TAATCGGAAC ATGTAAAACA GTTACAAAAA TAAACGAACT CTCCTCTTGT 4401 CCTACAATGA AAGCCCTCAT GTGCAGTAGA GATGCAGTTT CATCAAAGAA 4451 CAAACATCCT TGCAAATGGG TGTGACGCGG TTCCAGATGT GGATTTGGCA 4501 AAACCTCATT TAAGTAAAAG GTTAGCAGAG CAAAGTGCGG TGCTTTAGCT 4551 GCTGCTTGTG CCGTTGTGGC GTCGGGGAGG CTCCTGCCTG AGCTTCCTTC 4601 CCCAGCTTTG CTGCCTGAGA GGAACCAGAG CAGACGCACA GGCCGGAAAA 4651 GGCGCATCTA ACGCGTATCT AGGCTTTGGT AACTGCGGAC AAGTTGCTTT 4701 TACCTGATTT GATGATACAT TTCATTAAGG TTCCAGTTAT AAATATTTTG 4751 TTAATATTTA TTAAGTGACT ATAGAATGCA ACTCCATTTA CCAGTAACTT 4801 ATTTTAAATA TGCCTAGTAA CACATATGTA GTATAATTTC TAGAAACAAA 4851 CATCTAATAA GTATATAATC CTGTGAAAAT ATGAGGCTTG 
ATAATATTAG 4901 GTTGTCACGA TGAAGCATGC TAGAAGCTGT AACAGAATAC ATAGAGAATA 4951 ATGAGGAGTT TATGATGGAA CCTTAATATA TAATGTTGCC AGCGATTTTA 5001 GTTCAATATT TGTTACTGTT ATCTATCTGC TGTATATGGA ATTCTTTTAA 5051 TTCAAACGCT GAAAACGAAT CAGCATTTAG TCTTGCCAGG CACACCCAAT 5101 AATCAGTCAT GTGTAATATG CACAAGTTTG TTTTTGTTTT TGTTTTTTTT 5151 GTTGGTTGGT TTTTTTGCTT TAAGTTGCAT GATCTTTCTG CAGGAAATAG 5201 TCACTCATCC CACTCCACAT AAGGGGTTTA GTAAGAGAAG TCTGTCTGTC 5251 TGATGATGGA TAGGGGGCAA ATCTTTTTCC CCTTTCTGTT AATAGTCATC 5301 ACATTTCTAT GCCAAACAGG AACGATCCAT AACTTTAGTC TTAATGTACA 5351 CATTGCATTT TGATAAAATT AATTTTGTTG TTTCCTTTGA GGTTGATCGT 5401 TGTGTTGTTT TGCTGCACTT TTTACTTTTT TGCGTGTGGA GCTGTATTCC 5451 CGAGACAACG AAGCGTTGGG ATACTTCATT AAATGTAGCG ACTGTCAACA 5501 GCGTGCAGGT TTTCTGTTTC TGTGTTGTGG GGTCAACCGT ACAATGGTGT 5551 GGGAATGACG ATGATGTGAA TATTTAGAAT GTACCATATT TTTTGTAAAT 5601 TATTTATGTT TTTCTAAACA AATTTATCGT ATAGGTTGAT GAAACGTCAT 5651 GTGTTTTGCC AAAGACTGTA AATATTTATT TATGTGTTCA CATGGTCAAA 5701 ATTTCACCAC TGAAACCCTG CACTTAGCTA GAACCTCATT TTTAAAGATT 5751 AACAACAGGA AATAAATTGT AAAAAAGGTT TTCT // LOCUS AF064244 7247 bp mRNA PRI 21-NOV-1998 DEFINITION Homo sapiens intersectin long form mRNA, complete cds. ACCESSION AF064244 NID g3859854 VERSION AF064244.1 GI:3859854 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7247) AUTHORS Guipponi,M., Scott,H.S., Chen,H., Schebesta,A., Rossier,C. and Antonarakis,S.E. TITLE Two isoforms of a human intersectin (ITSN) protein are produced by brain-specific alternative splicing in a stop codon JOURNAL Genomics 53 (3), 369-376 (1998) MEDLINE 99017974 REFERENCE 2 (bases 1 to 7247) AUTHORS Guipponi,M., Scott,H.S., Chen,H., Schebesta,A., Rossier,C. and Antonarakis,S.E. 
TITLE Direct Submission JOURNAL Submitted (05-MAY-1998) Genetics and Microbiology, CMU, 1 rue Michel-Servet, Geneva 4 CH-1211, Switzerland FEATURES Location/Qualifiers source 1. .7247 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.1-q22.2" /dev_stage="fetus" /tissue_type="brain" CDS 107. .5272 /codon_start=1 /product="intersectin long form" /protein_id="AAC78611.1" /db_xref="PID:g3859855" /db_xref="GI:3859855" /translation="MAQFPTPFGGSLDIWAITVEERAKHDQQFHSLKPISGFITGDQA RNFFFQSGLPQPVLAQIWALADMNNDGRMDQVEFSIAMKLIKLKLQGYQLPSALPPVM KQQPVAISSAPPFGMGGIASMPPLTAVAPVPMGSIPVVGMSPTLVSSVPTAAVPPLAN GAPPVIQPLPAFAHPAATLPKSSSFSRSGPGSQLNTKLQKAQSFDVASVPPVAEWAVP QSSRLKYRQLFNSHDKTMSGHLTGPQARTILMQSSLPQAQLASIWNLSDIDQDGKLTA EEFILAMHLIDVAMSGQPLPPVLPPEYIPPSFRRVRSGSGISVISSTSVDQRLPEEPV LEDEQQQLEKKLPVTFEDKKRENFERGNLELEKRRQALLEQQRKEQERLAQLERAEQE RKERERQEQERKRQLELEKQLEKQRELERQREEERRKEIERREAAKRELERQRQLEWE RNRRQELLNQRNKEQEDIVVLKAKKKTLEFELEALNDKKHQLEGKLQDIRCRLTTQRQ EIESTNKSRELRIAEITHLQQQLQESQQMLGRLIPEKQILNDQLKQVQQNSLHRDSLV TLKRALEAKELARQHLRDQLDEVEKETRSKLQEIDIFNNQLKELREIHNKQQLQKQKS MEAERLKQKEQERKIIELEKQKEEAQRRAQERDKQWLEHVQQEDEHQRPRKLHEEEKL KREESVKKKDGEEKGKQEAQDKLGRLFHQHQEPAKPAVQAPWSTAEKGPLTISAQENV KVVYYRALYPFESRSHDEITIQPGDIVMVKGEWVDESQTGEPGWLGGELKGKTGWFPA NYAEKIPENEVPAPVKPVTDSTSAPAPKLALRETPAPLAVTSSEPSTTPNNWADFSST WPTSTNEKPETDNWDAWAAQPSLTVPSAGQLRQRSAFTPATATGSSPSPVLGQGEKVE GLQAQALYPWRAKKDNHLNFNKNDVITVLEQQDMWWFGEVQGQKGWFPKSYVKLISGP IRKSTSMDSGSSESPASLKRVASPAAKPVVSGEEFIAMYTYESSEQGDLTFQQGDVIL VTKKDGDWWTGTVGDKAGVFPSNYVRLKDSEGSGTAGKTGSLGKKPEIAQVIASYTAT GPEQLTLAPGQLILIRKKNPGGWWEGELQARGKKRQIGWFPANYVKLLNPGTSKITPT EPPKSTALAAVCQVIGMYDYTAQNDDELAFNKGQIINVLNKEDPDWWKGEVNGQVGLF PSNYVKLTTDMDPSQQWCSDLHLLDMLTPTERKRQGYIHELIVTEENYVNDLQLVTEI FQKPLMESELLTEKEVAMIFVNWKELIMCNIKLLKALRVRKKMSGEKMPVKMIGDILS AQLPHMQPYIRFCSRQLNGAALIQQKTDEAPDFKEFVKRLEMDPRCKGMPLSSFILKP MQRVTRYPLIIKNILENTPENHPDHSHLKHALEKAEELCSQVNEGVREKENSDRLEWI QAHVQCEGLSEQLVFNSVTNCLGPRKFLHSGKLYKAKNNKELYGFLFNDFLLLTQITK 
PLGSSGTDKVFSPKSNLQYKMYKTPIFLNEVLVKLPTDPSGDEPIFHISHIDRVYTLR AESINERTAWVQKIKAASELYIETEKKKREKAYLVRSQRATGIGRLMVNVVEGIELKP CRSHGKSNPYCEVTMGSQCHITKTIQDTLNPKWNSNCQFFIRDLEQEVLCITVFERDQ FSPDDFLGRTEIRVADIKKDQGSKGPVTKCLLLHEVPTGEIVVRLDLQLFDEP" misc_feature 167. .406 /note="encodes EH domain" misc_feature 767. .1936 /note="encodes EH domain" misc_feature 2324. .2524 /note="encodes SH3 domain" misc_feature 2843. .3019 /note="encodes SH3 domain" misc_feature 3110. .3286 /note="encodes SH3 domain" misc_feature 3326. .3520 /note="encodes SH3 domain" misc_feature 3569. .3748 /note="encodes SH3 domain" misc_feature 3836. .4390 /note="encodes GDS domain" misc_feature 4649. .4819 /note="encodes PH domain" misc_feature 4895. .5143 /note="encodes C2 domain" BASE COUNT 2156 a 1652 c 1747 g 1692 t ORIGIN 1 GCGTCCCTCC CAGCGGCGCG TGAGCGGCAC TGATTTGTCC CTGGGGCGGC 51 AGCGCGGACC CGCCCGGAGA TGAGGCGTCG ATTAGCAAGG TAAAAGTAAC 101 AGAACCATGG CTCAGTTTCC AACACCTTTT GGTGGCAGCC TGGATATCTG 151 GGCCATAACT GTAGAGGAAA GAGCGAAGCA TGATCAGCAG TTCCATAGTT 201 TAAAGCCAAT ATCTGGATTC ATTACTGGTG ATCAAGCTAG AAACTTTTTT 251 TTTCAATCTG GGTTACCTCA ACCTGTTTTA GCACAGATAT GGGCACTAGC 301 TGACATGAAT AATGATGGAA GAATGGATCA AGTGGAGTTT TCCATAGCTA 351 TGAAACTTAT CAAACTGAAG CTACAAGGAT ATCAGCTACC CTCTGCACTT 401 CCCCCTGTCA TGAAACAGCA ACCAGTTGCT ATTTCTAGCG CACCACCATT 451 TGGTATGGGA GGTATCGCCA GCATGCCACC GCTTACAGCT GTTGCTCCAG 501 TGCCAATGGG ATCCATTCCA GTTGTTGGAA TGTCTCCAAC CCTAGTATCT 551 TCTGTTCCCA CAGCAGCTGT GCCCCCCCTG GCTAACGGGG CTCCCCCTGT 601 TATACAACCT CTGCCTGCAT TTGCTCATCC TGCAGCCACA TTGCCAAAGA 651 GTTCTTCCTT TAGTAGATCT GGTCCAGGGT CACAACTAAA CACTAAATTA 701 CAAAAGGCAC AGTCATTTGA TGTGGCCAGT GTCCCACCAG TGGCAGAGTG 751 GGCTGTTCCT CAGTCATCAA GGCTGAAATA CAGGCAATTA TTCAATAGTC 801 ATGACAAAAC TATGAGTGGA CACTTAACAG GTCCCCAAGC AAGAACTATT 851 CTTATGCAGT CAAGTTTACC ACAGGCTCAG CTGGCTTCAA TATGGAATCT 901 TTCTGACATT GATCAAGATG GAAAACTTAC AGCAGAGGAA TTTATCCTGG 951 CAATGCACCT CATTGATGTA GCTATGTCTG GCCAACCACT GCCACCTGTC 1001 CTGCCTCCAG AATACATTCC 
ACCTTCTTTT AGAAGAGTTC GATCTGGCAG 1051 TGGTATATCT GTCATAAGCT CAACATCTGT AGATCAGAGG CTACCAGAGG 1101 AACCAGTTTT AGAAGATGAA CAACAACAAT TAGAAAAGAA ATTACCTGTA 1151 ACGTTTGAAG ATAAAAAGCG GGAGAACTTT GAACGTGGCA ACCTGGAACT 1201 GGAGAAACGA AGGCAAGCTC TCCTGGAACA GCAGCGCAAG GAGCAGGAGC 1251 GCCTGGCCCA GCTGGAGCGG GCGGAGCAGG AGAGGAAGGA GCGTGAGCGC 1301 CAGGAGCAAG AGCGCAAAAG ACAACTGGAA CTGGAGAAGC AACTGGAAAA 1351 GCAGCGGGAG CTAGAACGGC AGAGAGAGGA GGAGAGGAGG AAAGAAATTG 1401 AGAGGCGAGA GGCTGCAAAA CGGGAACTTG AAAGGCAACG ACAACTTGAG 1451 TGGGAACGGA ATCGAAGGCA AGAACTACTA AATCAAAGAA ACAAAGAACA 1501 AGAGGACATA GTTGTACTGA AAGCAAAGAA AAAGACTTTG GAATTTGAAT 1551 TAGAAGCTCT AAATGATAAA AAGCATCAAC TAGAAGGGAA ACTTCAAGAT 1601 ATCAGATGTC GATTGACCAC CCAAAGGCAA GAAATTGAGA GCACAAACAA 1651 ATCTAGAGAG TTGAGAATTG CCGAAATCAC CCATCTACAG CAACAATTAC 1701 AGGAATCTCA GCAAATGCTT GGAAGACTTA TTCCAGAAAA ACAGATACTC 1751 AATGACCAAT TAAAACAAGT TCAGCAGAAC AGTTTGCACA GAGATTCACT 1801 TGTTACACTT AAAAGAGCCT TAGAAGCAAA AGAACTAGCT CGGCAGCACC 1851 TACGAGACCA ACTGGATGAA GTGGAGAAAG AAACTAGATC AAAACTACAG 1901 GAGATTGATA TTTTCAATAA TCAGCTGAAG GAACTAAGAG AAATACACAA 1951 TAAGCAACAA CTCCAGAAGC AAAAGTCCAT GGAGGCTGAA CGACTGAAAC 2001 AGAAAGAACA AGAACGAAAG ATCATAGAAT TAGAAAAACA AAAAGAAGAA 2051 GCCCAAAGAC GAGCTCAGGA AAGGGACAAG CAGTGGCTGG AGCATGTGCA 2101 GCAGGAGGAC GAGCATCAGA GACCAAGAAA ACTCCACGAA GAGGAAAAAC 2151 TGAAAAGGGA GGAGAGTGTC AAAAAGAAGG ATGGCGAGGA AAAAGGCAAA 2201 CAGGAAGCAC AAGACAAGCT GGGTCGGCTT TTCCATCAAC ACCAAGAACC 2251 AGCTAAGCCA GCTGTCCAGG CACCCTGGTC CACTGCAGAA AAAGGTCCAC 2301 TTACCATTTC TGCACAGGAA AATGTAAAAG TGGTGTATTA CCGGGCACTG 2351 TACCCCTTTG AATCCAGAAG CCATGATGAA ATCACTATCC AGCCAGGAGA 2401 CATAGTCATG GTTAAAGGGG AATGGGTGGA TGAAAGCCAA ACTGGAGAAC 2451 CCGGCTGGCT TGGAGGAGAA TTAAAAGGAA AGACAGGGTG GTTCCCTGCA 2501 AACTATGCAG AGAAAATCCC AGAAAATGAG GTTCCCGCTC CAGTGAAACC 2551 AGTGACTGAT TCAACATCTG CCCCTGCCCC CAAACTGGCC TTGCGTGAGA 2601 CCCCCGCCCC TTTGGCAGTA ACCTCTTCAG AGCCCTCCAC GACCCCTAAT 2651 AACTGGGCCG ACTTCAGCTC CACGTGGCCC 
ACCAGCACGA ATGAGAAACC 2701 AGAAACGGAT AACTGGGATG CATGGGCAGC CCAGCCCTCT CTCACCGTTC 2751 CAAGTGCCGG CCAGTTAAGG CAGAGGTCCG CCTTTACTCC AGCCACGGCC 2801 ACTGGCTCCT CCCCGTCTCC TGTGCTAGGC CAGGGTGAAA AGGTGGAGGG 2851 GCTACAAGCT CAAGCCCTAT ATCCTTGGAG AGCCAAAAAA GACAACCACT 2901 TAAATTTTAA CAAAAATGAT GTCATCACCG TCCTGGAACA GCAAGACATG 2951 TGGTGGTTTG GAGAAGTTCA AGGTCAGAAG GGTTGGTTCC CCAAGTCTTA 3001 CGTGAAACTC ATTTCAGGGC CCATAAGGAA GTCTACAAGC ATGGATTCTG 3051 GTTCTTCAGA GAGTCCTGCT AGTCTAAAGC GAGTAGCCTC TCCAGCAGCC 3101 AAGCCGGTCG TTTCGGGAGA AGAATTTATT GCCATGTACA CTTACGAGAG 3151 TTCTGAGCAA GGAGATTTAA CCTTTCAGCA AGGGGATGTG ATTTTGGTTA 3201 CCAAGAAAGA TGGTGACTGG TGGACAGGAA CAGTGGGCGA CAAGGCCGGA 3251 GTCTTCCCTT CTAACTATGT GAGGCTTAAA GATTCAGAGG GCTCTGGAAC 3301 TGCTGGGAAA ACAGGGAGTT TAGGAAAAAA ACCTGAAATT GCCCAGGTTA 3351 TTGCCTCATA CACCGCCACC GGCCCCGAGC AGCTCACTCT CGCCCCTGGT 3401 CAGCTGATTT TGATCCGAAA AAAGAACCCA GGTGGATGGT GGGAAGGAGA 3451 GCTGCAAGCA CGTGGGAAAA AGCGCCAGAT AGGCTGGTTC CCAGCTAATT 3501 ATGTAAAGCT TCTAAACCCT GGGACGAGCA AAATCACTCC AACAGAGCCA 3551 CCTAAGTCAA CAGCATTAGC GGCAGTGTGC CAGGTGATTG GGATGTACGA 3601 CTACACCGCG CAGAATGACG ATGAGCTGGC CTTCAACAAG GGCCAGATCA 3651 TCAACGTCCT CAACAAGGAG GACCCTGACT GGTGGAAAGG AGAAGTCAAT 3701 GGACAAGTGG GGCTCTTCCC ATCCAATTAT GTGAAGCTGA CCACAGACAT 3751 GGACCCAAGC CAGCAATGGT GTTCAGACTT ACATCTCTTG GATATGTTGA 3801 CCCCAACTGA AAGAAAGCGA CAAGGATACA TCCACGAGCT CATTGTCACC 3851 GAGGAGAACT ATGTGAATGA CCTGCAGCTG GTCACAGAGA TTTTTCAAAA 3901 ACCCCTGATG GAGTCTGAGC TGCTGACAGA AAAAGAGGTT GCTATGATTT 3951 TTGTGAACTG GAAGGAGCTG ATTATGTGTA ATATCAAACT ACTAAAAGCG 4001 CTGAGAGTCC GCAAGAAGAT GTCCGGGGAG AAGATGCCTG TGAAGATGAT 4051 TGGAGACATC CTGAGCGCAC AGCTGCCGCA CATGCAGCCC TACATCCGCT 4101 TCTGCAGCCG CCAGCTCAAC GGGGCTGCCC TGATCCAGCA GAAGACGGAC 4151 GAGGCCCCAG ACTTCAAGGA GTTCGTCAAA AGATTGGAAA TGGATCCTCG 4201 GTGTAAAGGG ATGCCACTCT CTAGTTTTAT ACTGAAGCCT ATGCAACGGG 4251 TAACAAGATA CCCACTGATC ATTAAAAATA TCCTGGAAAA CACCCCTGAA 4301 AACCACCCGG ACCACAGCCA CTTGAAGCAC GCCCTGGAGA 
AGGCGGAAGA 4351 GCTCTGTTCC CAGGTGAACG AAGGGGTGCG GGAGAAGGAG AACTCTGACC 4401 GGCTGGAGTG GATCCAGGCC CACGTGCAGT GTGAAGGCCT GTCTGAGCAA 4451 CTTGTGTTCA ATTCAGTGAC CAATTGCTTG GGGCCGCGCA AATTTCTGCA 4501 CAGTGGGAAG CTCTACAAGG CCAAGAACAA CAAGGAGCTG TATGGCTTCC 4551 TTTTCAACGA CTTCCTCCTG CTGACTCAGA TCACGAAGCC TTTGGGGTCT 4601 TCTGGCACCG ACAAAGTCTT CAGCCCCAAA TCAAACCTGC AGTATAAAAT 4651 GTATAAAACA CCTATTTTCC TAAATGAGGT TCTAGTAAAA TTACCCACCG 4701 ACCCTTCTGG AGACGAGCCC ATCTTCCACA TCTCCCACAT TGACCGCGTC 4751 TATACTCTCC GAGCAGAAAG CATAAATGAA AGGACTGCCT GGGTGCAGAA 4801 AATCAAAGCT GCTTCTGAAC TTTACATAGA GACTGAGAAA AAGAAGCGCG 4851 AGAAAGCGTA CCTGGTCCGT TCCCAAAGGG CAACAGGCAT TGGAAGGTTG 4901 ATGGTGAACG TGGTTGAAGG CATCGAGTTG AAACCCTGTC GGTCACATGG 4951 AAAGAGCAAC CCGTACTGTG AGGTGACCAT GGGTTCCCAG TGCCACATCA 5001 CCAAGACGAT CCAGGACACT CTGAACCCCA AGTGGAATTC CAACTGCCAG 5051 TTCTTCATCC GAGACCTGGA GCAGGAAGTC CTCTGCATCA CTGTGTTCGA 5101 GAGGGACCAG TTCTCACCAG ATGATTTTTT GGGTCGGACG GAGATCCGTG 5151 TGGCGGACAT CAAGAAAGAC CAGGGCTCCA AAGGTCCAGT TACGAAGTGT 5201 CTTCTGCTGC ACGAAGTCCC CACGGGAGAG ATTGTGGTCC GCTTGGACCT 5251 GCAGTTGTTT GATGAGCCGT AGGCAGCGGG CTCAGGGTGT GCTCAGCAGG 5301 GTCCCAGCCC ACGGCCACAC ATGCTGTCTG GAAATTGTAT TCCTTTTCTA 5351 AGAAACCACC ATTTGGTATT CAGTCACAGG GATATGGGAT GGCAAAGACA 5401 GGCCCCTCAA AGCTCCTAGG AATCATTCTC GACAATCCTC CCTGCCCCGA 5451 AACAATTTCC TGTTTCATGA AACAAAGCTG TGTTTTCCTT TGTCCTCACT 5501 ACAGGTCTCA TTATGGCTTC TAGGGTCGCT GAAATCCCAT AGCCCTCAAC 5551 AGGGTGCAGC TGGGAGTCTA GCCCCTTCCC GGGCTTGAGG GATGGGTCTG 5601 GTTACTATAA AATAGATTTA TAAATGCAAT GTCTATATTT TTGGAGAACT 5651 CATGTAACCC TCCTGTTTCT TACATCCACC AGTCCCCAAG TAGACTTCTT 5701 GGCCTACAAT GCCCAGTCCT TGGTGTGAGT TTAGAAACAA TTATGACGGT 5751 CCTGTCATTG CTTCAGAATC CCATCTCTCC TGCAGGGAAA TGCTGCCTAG 5801 AGCTGATCAC TCGGTGAGAC GGTCTGATCA GGCCCTGGCT TAGCTCTTTG 5851 AAGAGCTGGT CTATGGAAGT TTCCAGCATG TGCACCGTTA TAGCCGTTCC 5901 TTCCCCCTCT AGGCCTTGTA TTAATATATG TCAATGAAAA CACACTGGTG 5951 TATTGTTGCG TGGATTCAGT TCTGATTCCC AGCATGCTTA GAATATGGTC 6001 
ACAGAAAGTC ATTATCTAGA AAGTCACCCC TCTGCTGGAT CAGATCACTA 6051 CAGGTCACTG GAAAGGCAAC TTTACAATGT TGGGTCACTG GGTCTCGGTT 6101 GGCAGCCATG TTGGAAAAAT CTCTTTTGGC TCGGAGGCCT GTGATATTTC 6151 ATAGCAGCAG TCGTTGCTGG TGACCTGTTC TGTGCTTGAA TGTGCTGAAT 6201 CCTGATTGTT GTAGGACATT TCAACAGCTC TTTTTGGTAC GTTCCCCAAA 6251 AAGCCATGTC CTAGATCCCC AAGGCGTGAA AAGGAAAAAT ATCAAGCTGG 6301 AGGTTGGGAA AGAAAATGAA GGCAGTCCAT TATGTGGTGG GTGAAAGACC 6351 CTAGGAGGAT GCAAGCCCCG CACATCCCGG GGCAAAGACC TAAGACACTT 6401 TTCCACCCTC CACCACCCCA ACCTCACATA ATATGCTTGT TGCAAGAGTC 6451 AGGACTTTAT GACTATGTGC CAAGCTGTTT GGTTTGAGTT CTTTAATTTT 6501 TTTTTCCCTT AAATGCCAGG AGATCATCTG GTTAGTTAGA TAGTAACTTG 6551 ATTTGCTAAT GAAAAGTGGG GGCCGTGTTT TGTTTGCATG TTAATATTCT 6601 CATAATCCTA GTTTGTTGTG GTCATGAAAT GCCCTTTGCA TGTTCTGTTG 6651 GTACTGGAGT CTAGCTTTCC TGTACTAGAT GGTGTTCTCT TTGATTGTAG 6701 GTCCTTAGAC TTTAATTAGG GTTATCAAAG TGCTTTCTAA ATGATAGCAT 6751 CAGCGTTGTG GCAGAGTACC TCCTTTGCTG GGAACTGAAT GTGTAGGGTT 6801 ATCATTTCCC ATGAGAGCCC GGTCATACTT CAAGCAATTT TTTTAAAAGT 6851 GTGTGTTGGA AAGGACAACA AAGTTTACAT TTCATACTTT TAAGAAATAC 6901 TTTATTATTT ATTTATTGAA GATAGTGTAG AATTTTGTAT CAAGAACAAC 6951 AGACATAAGT ATTTTTTGAA ACAAGCAAAT ATACCCTGTA GTTAGAAACT 7001 TTCAACTGAA CATGTTAGAG ACCAAGTTTA ACTTCAGGCA TGCATTTGTT 7051 TACCATTTCC CAGCAGAAAA CATGGTTAAA ATACTTTAAG TTTATATTTT 7101 TTGATGTTGT TAAGAAACTT TTAAATTAAA TCTATAAATA GACATGCAAC 7151 TCATGCTTTC CTATTTCTAT AACCAACACC GTTTGTTTAG TGTATTTATG 7201 AAAGATATGC TACCATGGTA GAAAGAAAAG TATTCAATGT GTAAATT // LOCUS AF077599 3129 bp mRNA PRI 28-JUL-1998 DEFINITION Homo sapiens hypothetical SBBI03 protein mRNA, complete cds. ACCESSION AF077599 NID g3342561 VERSION AF077599.1 GI:3342561 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3129) AUTHORS Zhang,W., Cao,X., Wan,T., Yuan,Z., He,L., Li,N., Zhu,X. and Tao,Q. 
TITLE Hypothetical protein SBBI03 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3129) AUTHORS Zhang,W., Cao,X., Wan,T., Yuan,Z., He,L., Li,N., Zhu,X. and Tao,Q. TITLE Direct Submission JOURNAL Submitted (12-JUL-1998) Department of Immunology, Shanghai Brilliance Biotechnology Institute & Second Military Medical University, 800 Xiangyin Road, Shanghai 200433, China FEATURES Location/Qualifiers source 1. .3129 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 214. .1167 /note="SBBI03" /codon_start=1 /product="hypothetical SBBI03 protein" /protein_id="AAC27647.1" /db_xref="PID:g3342562" /db_xref="GI:3342562" /translation="MGYDVTRFQGDVDEDLICPICSGVLEEPVQAPHCEHAFCNACIT QWFSQQQTCPVDRSVVTVAHLRPVPRIMRNMLSKLQIACDNAVFGCSAVVRLDNLMSH LSDCEHNPKRPVTCEQGCGLEMPKDELPNHNCIKHLRSVVQQQQTRIAELEKTSAEHK HQLAEQKRDIQLLKAYMRAIRSVNPNLQNLEETIEYNEILEWVNSLQPARVTRWGGMI STPDAVLQAVIKRSLVESGCPASIVNELIENAHERSWPQGLATLETRQMNRRYYENYV AKRIPGKQAVVVMACENQHMGDDMVQEPGLVMIFAHGVEEI" BASE COUNT 773 a 758 c 775 g 822 t 1 others ORIGIN 1 GGGCGAGTAC TCCTGATTGT GACATCACAT TCATCCCCTG GGCGATGGAG 51 CTTGTCACTG GGAAGGAATA CTCAGTCGGA GAATAGCCAA CAAGATGGGT 101 TACTGGGAGA ATCTCTTCAG TGGCACTGAG TGGAGGCATC AGGGGGTTGG 151 AGCCTTGTGA ACAGGGAACC TGCCCCCCAA CACTTGGAAG GACCTGGGTT 201 TCAGTGATGA GACATGGGGT ATGATGTAAC CCGTTTCCAG GGGGATGTTG 251 ACGAAGATCT TATCTGCCCT ATTTGCAGTG GAGTCTTGGA GGAGCCAGTA 301 CAGGCACCTC ATTGTGAACA TGCTTTCTGC AACGCCTGCA TCACCCAGTG 351 GTTCTCTCAG CAACAGACAT GTCCAGTGGA CCGTAGTGTT GTGACGGTCG 401 CCCATCTGCG CCCAGTACCT CGGATCATGC GGAACATGTT GTCAAAGCTG 451 CAGATTGCCT GTGACAACGC TGTGTTCGGC TGTAGTGCCG TTGTCCGGCT 501 TGACAACCTC ATGTCTCACC TCAGCGACTG TGAGCACAAC CCGAAGCGGC 551 CTGTGACCTG TGAACAGGGC TGTGGCCTGG AGATGCCCAA AGATGAGCTG 601 CCCAACCATA ACTGCATTAA GCACCTGCGC TCAGTGGTAC AGCAGCAGCA 651 GACACGCATC GCAGAGCTGG AGAAGACGTC AGCTGAACAC AAACACCAGC 701 TGGCGGAGCA GAAGCGAGAC ATCCAGCTGC TAAAGGCATA CATGCGTGCA 751 ATCCGCAGTG TCAACCCCAA CCTTCAGAAC CTGGAGGAGA CAATTGAATA 801 CAACGAGATC CTAGAGTGGG TGAACTCCCT TCAGCCAGCA 
AGAGTGACCC 851 GCTGGGGAGG GATGATCTCG ACTCCTGATG CTGTGCTCCA GGCTGTAATC 901 AAGCGCTCCC TGGTGGAGAG TGGCTGTCCT GCTTCTATTG TCAACGAGCT 951 GATTGAAAAT GCCCACGAGC GTAGCTGGCC CCAGGGTCTG GCCACACTAG 1001 AGACTAGACA GATGAACCGA CGCTACTATG AGAACTACGT GGCCAAGCGC 1051 ATCCCTGGCA AGCAGGCTGT TGTCGTGATG GCCTGTGAGA ACCAGCACAT 1101 GGGGGATGAC ATGGTGCAAG AGCCAGGCCT TGTCATGATA TTTGCGCATG 1151 GCGTGGAAGA GATATAAGAG AACTCGACTG GCTATCAGGA AGAGATGGAA 1201 ATCAGAAAAT CCCATCACTC CAGCAGCTGG GACCTGAGTC CTACCCACCA 1251 TTCTTAATAC TGTGGCTTAT ACCTGAGCCA CACATCTCCC TGCCCTTCTG 1301 GCACTGAAGG GCCTTGGGGT AGTTTGCTCA GCCTTTCAGG TGGGAAACCC 1351 AGATTTCCTC CCTTTGCCAT ATTCCCCTAA AATGTCTATA AATTATCAGT 1401 CTGGGTGGGA AAGCCCCCAC CTCCATCCAT TTTCCTGCTT AGGGTCCCTG 1451 GTTCCAGTTA TTTTCAGAAA GCACAAAGAG ATTCAATTTC CCTGGAGGAT 1501 CAGGACAGAG GAAGGAATCT CTAATCGTCC CTCTCCTCCA AAACCAGGGA 1551 ATCAGAGCAG TCAGGCCTGT TGACTCTAAG CAGCAGACAT CCTGAAGAAA 1601 TGGTAAGGGT GGAGCCAAAT CTCTAGAAAT AAGTAGTGAG GCCGTTAATT 1651 GGCCATCACT GATGGCCCTT AGGGAAAGAC TGGACCTCTG TGCCAAGCAG 1701 TATCCCTGTT CAGCCCACCT TAAAGGTGTA GGCACCCACT GGGTCTACCA 1751 GTATGCAGGT TGGGATACTG AAAATTTCCA GATGAGCTCT TCTTTCCTAC 1801 AAGTTTTCAT AATTAGGGAA TGCCAGGGTT TAGGGTAGGG GTTAATCTGT 1851 TGGGGGTTGA TGTGTTTAGC AAGAAGCTAC TCCTAGCTTT TGCTAAAATA 1901 TGGTTGGCAC TGCCTCTTGT GGCACAGGCC ATAATTGTTC CATAGACCCC 1951 TCTCTAGCCC TGTGACTGTA GTTAGTTACT TTGATGATTT TCTTTGGCCA 2001 TTGTTTGTTT ATATTTCACA AACTCCACCT ACTGCCCCCC CCCCTCTTTT 2051 TTTTAAGAAT GGCCTGATCA TGGCTATCTC AGCCACATTG TTGGCAATTT 2101 AATTTATTTA CTTCCTTTTT TTTTTTTAAG AAAGGAAAAA AGAAAAAAAA 2151 ATCAAACTTG AAACTTTTCT TTTGATGTTC CTATTGTGGG GGTTCTGGAT 2201 AGGGTGGGAC AGGGATGGGG GTGTGTTTTA TATTTTTTCC TTTTCAGCAC 2251 AACCTTTGGC TTTAATATAG GAAGAGCCAA GGGAGTCCTC GGCTGAACTT 2301 ACGATATCTG CCCCAAACCT CTGTAACCCC AACTGAAATG AGGAGCTTCC 2351 TCTCTTCCTG TGAAGGATAT GACAGTCCAG CATCGATGCC TGTGCCCTCT 2401 GGAAAAATTT CCTCCTAGCC CTTCCAGGGC CTTATCATAA AACTCTGGAT 2451 TTAGAGTATT CATTTTGAAG GCAACTCCCC CTTCCCCAAG TTTCCTTGGA 2501 
GCTGTATAGC TGGGTTCTAA GCTTCACCAT GCAAATCAGA AATTTTATCT 2551 CTAAGTACAG GCTGTGCCGT GTCTCACCCA CACCCCCCTG GGGACTTCAG 2601 TTCCATTTCA GGTTACCTGG GGTATACCTT GATCCCTAGA GTGACTGGCA 2651 GAGTAAGAGA AGGGGAGAGA TAATAGGTGT GATTATTTTA ATATGAAGGT 2701 GGNAGTGTGG TTGGAGATAG AAAGGCTCCT CCCCACCATG TAATGGCTTC 2751 CTCTCAGAAT TTTATTCCAG GCTAGCTTGC TGCAGGTCTG GGTAGTTGGA 2801 TCATGGCTCC ACTGGGATTG GGGTGGAAAG CTTGAGGGGA GTAGGGTTCC 2851 AGCTCTGGGA CATTGTGCTC AGGAATTTGA AAACGCTGCT ATACTTACTC 2901 TGGTTACTAC ATTTCTTCCA CTCCCCTTTC CCCTACCTGC CTTAACCAAG 2951 GCTCATACTG TCCTGTCCTT ACCCTCAGAT GGAGCCAGGA AGCTCAGTGA 3001 AAGGCTTCCC TACCCTTTGC ACTAGTGTCT CTGCAGGTTG CTGGTTGTGT 3051 TGTATGTGCT GTTCCATGGT GTTGACTGCA CTAATAATAA ACCTTTTACT 3101 CAACTCTCTA AAAAAAAAAA AAAAAAAAA // LOCUS HSU17714 3145 bp mRNA PRI 22-DEC-1998 DEFINITION Homo sapiens putative tumor suppressor ST13 (ST13) mRNA, complete cds. ACCESSION U17714 NID g4049267 VERSION U17714.1 GI:4049267 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3145) AUTHORS Mo,Y., Zheng,S. and Shen,D. TITLE Differential expression of HSU17714 gene in colorectal cancer and normal colonic mucosa JOURNAL Chung-Hua Chung Liu Tsa Chih 18 (4), 241-243 (1996) MEDLINE 98048555 REFERENCE 2 (bases 1 to 3145) AUTHORS Cao,J., Cai,X., Zheng,L., Geng,L., Shi,Z., Pao,C.C. and Zheng,S. TITLE Characterization of colorectal-cancer-related cDNA clones obtained by subtractive hybridization screening JOURNAL J. Cancer Res. Clin. Oncol. 123 (8), 447-451 (1997) MEDLINE 97436735 REFERENCE 3 (bases 1 to 3145) AUTHORS Xinhan,C., Yanming,Z., Liyi,G., Jiang,C. and Shu,Z. 
TITLE Assignment of a novel colorectal cancer-associated gene HSU17714 gene to human chromosome band 22q13 by in situ hybridization JOURNAL Chung-Hua Chung Liu Tsa Chih 19 (3), 177-179 (1997) REFERENCE 4 (bases 1 to 3145) AUTHORS Zheng,S., Cai,X., Cao,J., Zheng,L., Geng,L., Zhang,Y., Gu,J. and Shi,Z. TITLE Screening and identification of down-regulated genes in colorectal carcinoma by subtractive hybridization: a method to identify putative tumor suppressor genes JOURNAL Chin. Med. J. 110 (7), 543-547 (1997) MEDLINE 98256570 REFERENCE 5 (bases 1 to 3145) AUTHORS Zheng,S. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) Shu Zheng, Cancer Institute, Zhejiang Medical University, Hangzhou, 310009, People's Republic of China COMMENT On Dec 22, 1998 this sequence version replaced gi:1549233. FEATURES Location/Qualifiers source 1. .3145 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="subtraction library between normal colon mucosa and colon cancer mucosa" /tissue_type="colon mucosa" /chromosome="22" /map="22q13" gene 1. .3145 /gene="ST13" CDS 75. 
.1184 /gene="ST13" /note="expressed less in colon cancer tissue than in normal colon mucosa" /codon_start=1 /product="putative tumor suppressor ST13" /protein_id="AAC97526.1" /db_xref="PID:g4049268" /db_xref="GI:4049268" /translation="MDPRKVNELRAFVKMCKQDPSVLHTEEMRFLREWVESMGGKVPP ATQKAKSEENTKEEKPDSKKVEEDLKADEPSSEESDLEIDKEGVIEPDTDAPQEMGDE NAEITEEMMDQANDKKVAAIEALNDGELQKAIDLFTDAIKLNPRLAILYAKRASVFVK LQKPNAAIRDCDRAIEINPDSAQPYKWRGKAHRLLGHWEEAAHDLALACKLDYDEDAS AMLKEVQPRAQKIAEHRRKYERKREEREIKERIERVKKAREEHERAQREEEARRQSGA QYGSFPGGFPGGMPGNFPGGMPGMGGGMPGMAGMPGLNEILSDPEVLAAMQDPEVMVA FQDVAQNPANMSKYQSNPKVMNLISKLSAKFGGQA" BASE COUNT 958 a 573 c 665 g 949 t ORIGIN 1 GCGGTCACGC CGAGCCAGCG CCTGGGCCTG GAACCGGGCC GTAGCCCCCC 51 AGTTTCGCCC ACCACCTCCC TACCATGGAC CCCCGCAAAG TGAACGAGCT 101 TCGGGCCTTT GTGAAAATGT GTAAGCAGGA TCCGAGCGTT CTGCACACCG 151 AGGAAATGCG CTTCCTGAGG GAGTGGGTGG AGAGCATGGG TGGTAAAGTA 201 CCACCTGCTA CTCAGAAAGC TAAATCAGAA GAAAATACCA AGGAAGAAAA 251 ACCTGATAGT AAGAAGGTGG AGGAAGACTT AAAGGCAGAC GAACCATCAA 301 GTGAGGAAAG TGATCTAGAA ATTGATAAAG AAGGTGTGAT TGAACCAGAC 351 ACTGATGCTC CTCAAGAAAT GGGAGATGAA AATGCGGAGA TAACGGAGGA 401 GATGATGGAT CAGGCAAATG ATAAAAAAGT GGCTGCTATT GAAGCCCTAA 451 ATGATGGTGA ACTCCAGAAA GCCATTGACT TATTCACAGA TGCCATCAAG 501 CTGAATCCTC GCTTGGCCAT TTTGTATGCC AAGAGGGCCA GTGTCTTCGT 551 CAAATTACAG AAGCCAAATG CTGCCATCCG AGACTGTGAC AGAGCCATTG 601 AAATAAATCC TGATTCAGCT CAGCCTTACA AGTGGCGGGG GAAAGCACAC 651 AGACTTCTAG GCCACTGGGA AGAAGCAGCC CATGATCTTG CCCTTGCCTG 701 TAAATTGGAT TATGATGAAG ATGCTAGTGC AATGCTGAAA GAAGTTCAAC 751 CTAGGGCACA GAAAATTGCA GAACATCGGA GAAAGTATGA GCGAAAACGT 801 GAAGAGCGAG AGATCAAAGA AAGAATAGAA CGAGTTAAGA AGGCTCGAGA 851 AGAGCATGAG AGAGCCCAGA GGGAGGAAGA AGCCAGACGA CAGTCAGGAG 901 CTCAGTATGG CTCTTTTCCA GGTGGCTTTC CTGGGGGAAT GCCTGGTAAT 951 TTTCCCGGAG GAATGCCTGG AATGGGAGGG GGCATGCCTG GAATGGCTGG 1001 AATGCCTGGA CTCAATGAAA TTCTTAGTGA TCCAGAGGTT CTTGCAGCCA 1051 TGCAGGATCC AGAAGTTATG GTGGCTTTCC AGGATGTGGC TCAGAACCCA 1101 GCAAATATGT CAAAATACCA GAGCAACCCA AAGGTTATGA 
ATCTCATCAG 1151 TAAATTGTCA GCCAAATTTG GAGGTCAAGC GTAATGTCCT TCTGATAAAT 1201 AAAGCCCTTG CTGAAGGAAA AGCAACCTAG ATCACCTTAT GGATGTCGCA 1251 ATAATACAAA CCAGTGTACC TCTGACCTTC TCATCAAGAG AGCTGGGGTG 1301 CTTTGAAGAT AATCCCTACC CCTCTCCCCC AAATGCAGCT GAAGCATTTT 1351 ACAGTGGTTT GCCATTAGGG TATTCATTCA GATAATGTTT TCCTACTAGG 1401 AATTACAAAC TTTAAACACT TTTTAAATCT TCAAAATATT TAAAACAAAT 1451 TTAAAGGGCC TGTTAATTCT TATATTTTTC TTTACTAATC ATTTTGGATT 1501 TTTTTCTTTG AATTATTGGC AGGGAATATA CTTATGTATG GAAGATTACT 1551 GCTCTGAGTG AAATAAAAGT TATTAGTGCG AGGCAAACAT AACTCATTTG 1601 AGGATAAAGT TTGTGTTGGA TATGTGGTTC CTGATGCATT TTGACTTGTC 1651 TTTTTAAATG CTTTATCTTT TTCTTTAAAG ATTTATTTCA ATAAAACTAA 1701 TTGGGACCAC CCGTATTTCA GTAGGACCTG GGTAGGGATT GGAAGTACTT 1751 GGCAGGGCAG CAGCAATCTT GCTGTGTTTG ATATAACATG CATCCTTGGG 1801 CAGGTTGCCC TTAAATCTTA CACTGTGGTG AAGGGATGTT TTTTTTGTAA 1851 TGCTGCAGTA GAGTTGGAGT ACTTAGTTCT CTTGTTGTCC AGTATATCTA 1901 ATAAGTGTTT TTCATATTAT TTCCACGTAA GGGAAATAAG GTAGTACTTT 1951 TCTTTTTATA TTTCTATGCT TAAAATTCTC TTTCCTAGTC AAAAATTGCC 2001 CAAATCTGTG TTTGCTTTCT GCTTGCTACA TTTGTCTCCC TTACTTTTCT 2051 TGAGCTAAAG ACAGGCTTTT TCCACCGGCA TCATCACTGC TATCATCATT 2101 AACAGCGTAA TTATACAAGC ATATTTAATG CTGAGTTTAA TTTAATATGT 2151 AATACATATG GTAATTGTAG GGTAATACCC ACAACAACTG TAGTTTCTTA 2201 CTTGGCCAAG AGAATGCTTA TTTAAGTGTT AGACTTCCAT TCTGGCAAAA 2251 TCTTGCCTTA TCAGAAGACA TTGGAAAGAG GGATTCCCTT TGGTGTTTGG 2301 TCTTCTACTT AGAAAAACCT ATTGCAGTTA GTTTATCTTG TAGTATTCAT 2351 CTTTGTATTC TGAAGATAAG GTTTGAATTA AATTGATACA CACAGAGGGG 2401 AACCGATTTT TTTTATCCAA TGTGAATTAT AAATGAGATA ATCCACAGTT 2451 ATTCATTGTG GAGTTGTTGA GACTATGAAA GACTCATTGT CTTTGTATTC 2501 AGCTCTTAAA TAGTGTAACT ATATCCCCAC CTCTGCTTGC TTTCTTTCCC 2551 TCCCCTCCAA TGATAAAGAA AATGATAAAT TTTCTGTTGT GCATTCAATT 2601 CTTATTTTAA ATAAGACTAA GTATAGGCAT TGTACCTGAC ATTGCTACGT 2651 TTCTACCAGT GTTTCAATTT AAAGTGCTAG TGTTTAAAAA CATTTTCAAG 2701 GGATAAGGCC TTCTGTACTT TGCTTATTTG AAGAATCAGT GGTAGGAGCA 2751 GTGAAGTAAA TTCTATGGAG TACATTTCTA AAATACCACA TTTCTGAAAT 2801 
CATAAATAAG TTTATTCAGG TTCTAACCCT TTGCTGTACA CAAGCAGACA 2851 GAAATGCATC TGTTACATAA ATGAGAAAAA GCTATTATGC TGATGGAGCA 2901 TGCTTTTTAA ATCCTTTAAA AACACTCACC ATATAAACTT GCATTTGAGC 2951 TTGTGTGTTC TTTTGTTAAT GTGTAGAGTT CTCCTTTCTC GAAATTGCCA 3001 GTGTGTACTT GGCTTAACTC AAGAACAGTT TCTTCTGGAT TCCTTATTTG 3051 ATTTATTTAA CCTAATTATA TTCTAATATT GCAAATATTA CCATAAGTGG 3101 GTAAAAGTAA AATTCCTCTT CTGAAAAAAA AAAAAAAAAA AAAAA // LOCUS HSNOV 2588 bp mRNA PRI 04-SEP-1996 DEFINITION H.sapiens mRNA for novel gene in Xq28 region. ACCESSION X92396 NID g1150415 VERSION X92396.1 GI:1150415 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2588) AUTHORS D'Esposito,M., Ciccodicola,A., Gianfrancesco,F., Esposito,T., Flagiello,L., Mazzarella,R., Schlessinger,D. and D'Urso,M. TITLE A synaptobrevin-like gene in the Xq28 pseudoautosomal region undergoes X inactivation JOURNAL Nat. Genet. 13 (2), 227-229 (1996) MEDLINE 96225453 REFERENCE 2 (bases 1 to 2588) AUTHORS D'Urso,M. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) M. D'Urso, International Institute of Genetics and, Biophysics,, CNR, Via Marconi, 10, I- 80125 Naples, ITALY FEATURES Location/Qualifiers source 1. .2588 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /cell_line="NTERA2" /clone="D1" /clone_lib="phage10" /map="q28" gene 115. .777 /gene="ORF" CDS 115. 
.777 /gene="ORF" /codon_start=1 /protein_id="CAA63133.1" /db_xref="PID:e206118" /db_xref="PID:g1150416" /db_xref="GI:1150416" /db_xref="SWISS-PROT:P51809" /translation="MAILFAVVARGTTILAKHAWCGGNFLEVTEQILAKIPSENNKLT YSHGNYLFHYICQDRIVYLCITDDDFERSRAFNFLNEIKKRFQTTYGSRAQTALPYAM NSEFSSVLAAQLKHHSENKGLDKVMETQAQVDELKGIMVRNIDLVAQRGERLELLIDK TENLVDSSVTFKTTSRNLARAMCMKNLKLTIIIIIVSIVFIYIIVSPLCGGFTWPSCV KK" BASE COUNT 763 a 486 c 500 g 839 t ORIGIN 1 GAATTCGCCG GTCCAGCCTC CTCTGGGAGC GGGCAGTTGG CGACCCTGCA 51 CTGACCCGCG TCCCTCCGTC CCGAGCCCGC GCGCCCTCAG AGGGTGCCCG 101 GACAGACTGA AGCCATGGCG ATTCTTTTTG CTGTTGTTGC CAGGGGGACC 151 ACTATCCTTG CCAAACATGC TTGGTGTGGA GGAAACTTCC TGGAGGTGAC 201 AGAGCAGATT CTGGCTAAGA TACCTTCTGA AAATAACAAA CTAACGTACT 251 CACATGGCAA TTATTTGTTT CATTACATCT GCCAAGACAG GATTGTATAT 301 CTTTGTATCA CTGATGATGA TTTTGAACGT TCCCGAGCCT TTAATTTTCT 351 GAATGAGATA AAGAAGAGGT TCCAGACTAC TTACGGTTCA AGAGCACAGA 401 CAGCACTTCC ATATGCCATG AATAGCGAGT TCTCAAGTGT CTTAGCTGCA 451 CAGCTGAAGC ATCACTCTGA GAATAAGGGC CTAGACAAAG TGATGGAGAC 501 TCAAGCCCAA GTGGATGAAC TGAAAGGAAT CATGGTCAGA AACATAGATC 551 TGGTAGCTCA GCGAGGAGAA AGATTGGAAT TATTGATTGA CAAAACAGAA 601 AATCTTGTGG ATTCTTCTGT CACCTTCAAA ACTACCAGCA GAAATCTTGC 651 TCGAGCCATG TGTATGAAGA ACCTCAAGCT CACTATTATC ATCATCATCG 701 TATCAATTGT GTTCATCTAT ATCATTGTTT CACCTCTCTG TGGTGGATTT 751 ACATGGCCAA GCTGTGTGAA GAAATAGGAA AGAAGAAGTT ACCATTAACC 801 AAGGATATGA GAGAACAAGG AGTTAAAAGC AATCCATGTG ACTCAAGCCT 851 TTCACATACT GACAGATGGT ATCTGCCAGT CTCTTCAACC CTCTTCTCAC 901 TTTTTAAAAT CTTGTTCCAT GCCTCCAGGT TTATCTTTGT CTTATCTACC 951 AGTTTATTCC TGTGAACTTC AGATTGAACC ATTCATTGCA GCAGTAGCCT 1001 TAAAAAGGCT TTTGTTTATT TCTTTGGTTT GTTAACTAGT GTCATCTATT 1051 TAGAGAAACA TTTTTGTTTT TAATTGCTCA AAGCTGTCGC CGCTAGTCTT 1101 ATGAGCTATC TACTAAAACT ATGGAGAAAC TTTGTATGTG CACACAAAAG 1151 TATTCAAGAG ACAGTATTGC TAACATCTCA TCTTAATGTC TTTTGTTATT 1201 GAGAAGTTTT AGGTGCTTCA AAACAATATA AATGGATAAT AGTTGTTATT 1251 TGGGGAATTG TAATGATGTT GGTGCTGCTT CCTTCTAAGA GCTCAGACAA 1301 GTAAAGTATG 
AAACATTCTT ATTTCAGTTA GATGGGGAAC ATTTTGCTAG 1351 CCCATTAGAA GCACACAGAA TTATCCTTGT CCTCCTAATA TTGACTTTCA 1401 GGAATAAAGT TCAGTGTGCT GATCATTCAC AATACAGTGG ATAGCTTGAT 1451 ATCTTCTGTT TTCCCATTGC AGTTGATTTG AGAAGATGAA GGTTTAAATA 1501 TTGTTGAAAG TTGCAGTTTT TTAAATGTGT TCCTTTTTCT TCTGTGAATA 1551 TTTAGGGCAA TCGTGTCGCT AATAGAATAT GTAGTAGAGG GGGTGGGGAG 1601 GTAAATTCCT CTGACTTGCC AAAGAAAAAG AAGGGAACCA CAGTGGATAT 1651 GCTAGCATTT TAGCTGTGCA AAGGGAGGTA GTGTGGGAAA AGTGTTTCCA 1701 TTCTGGGAAA AGCCCAAACC GAATACGGTC AGCAGTCAAC TCCAGGGTTT 1751 GGGCTTGATT CCTGTTGAAT AATAGTTTTG AGCATTCTTT GTGGTTAAAT 1801 AAATTCTTAA ATCTGCCTAG TTTTGATGAA TTCTTTTGTG AAACTTGAAA 1851 GAGAATAGAC AGTATGACAT ATAGAATTAA TACAAAACAG TTTAACAACC 1901 ATTTAACTGC AGTGTAAGAA AATTGGACTG TAATCATATC GCTACTGGCA 1951 TCTGTTATCT AGTATGCATT TCTGGTGTGT ATCTGAAAGG AAGACATTTT 2001 CTACCCTAGA TCCAATTGCA TTTATTTATC AATAAGTGCC ATTAAATTGA 2051 AATTATATTA CATTTTACAC TTTCTCAATG AATGAACAAA TTAGTCTGTA 2101 GAATCTAGCC ACCTGTTTAG CCTAGTCATG TGCCTTGAAC ATATATGTGT 2151 CCCATAATCT GGCTCATGGT ACCTGTTCTT CTATCCAAAC CTTTCAATTC 2201 ATGCTACCTG ATTCATTTAT TTGACATAGA TCTTAGGCCC ACTTGAACTC 2251 TTTTCTTGTT TATCTAGCAT AGCACAAACG TTTTTCCAGT CTTCTTTATC 2301 AACACTAATG CCTCTTAATT GCATCAGTAT TTCCTATTGG AAAATACATC 2351 TGTTCCAGAA AAACATTTGG CATTCCTGAA TAATTTCCAA ATGTTTTTAA 2401 TCCAAAGAAA AAGGTTTAAA GCTTATTTCC CTTTCTTATA CACACCTGAA 2451 TAAAATTGAT GTGCATGTTT TAGGGATCAA TTACCTAACT GTTCCTTGGT 2501 CTATTTATGT ATAAGAATGC TTTTTAAAGC ACATGTCTCA TTTTAAATGA 2551 CGCACAAACT GAAGATGTTA ATAAAATTTA AGGAATTC // LOCUS AB020642 5533 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0835 protein, complete cds. ACCESSION AB020642 NID g4240158 VERSION AB020642.1 GI:4240158 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj05861. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (6), 355-364 (1998) MEDLINE 99156230 REFERENCE 2 (bases 1 to 5533) AUTHORS Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (02-DEC-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .5533 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hj05861" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 365. .3730 /gene="KIAA0835" CDS 365. .3730 /gene="KIAA0835" /codon_start=1 /product="KIAA0835 protein" /protein_id="BAA74858.1" /db_xref="PID:d1038592" /db_xref="PID:g4240159" /db_xref="GI:4240159" /translation="MSLENEDKRARTRSKALRGPPETTAADLSCPTPGCTGSGHVRGK YSRHRSLQSCPLAKKRKLEGAEAEHLVSKRKSHPLKLALDEGYGVDSDGSEDTEVKDA SVSDESEGTLEGAEAETSGQDEIHRPETAEGRSPVKSHFGSNPIGSATASSKGSYSSY QGIIATSLLNLGQIAEETLVEEDLGQAAKPGPGIVHLLQEAAEGAASEEGEKGLFIQP EDAEEVVEVTTERSQDLCPQSLEDAASEESSKQKGILSHEEEDEEEEEEEEEEEEDEE EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAPDVIFQEDTSHTSAQKAPELRGPESP SPKPEYSVIVEVRSDDDKDEDTHSRKSTVTDESEMQDMMTRGNLGLLEQAIALKAEQV RTVCEPGCPPAEQSQLGLGEPGKAAKPLDTVRKSYYSKDPSRAEKREIKCPTPGCDGT GHVTGLYPHHRSLSGCPHKDRIPPEILAMHENVLKCPTPGCTGQGHVNSNRNTHRSLS GCPIAAAEKLAKSHEKQQPQTGDPSKSSSNSDRILRPMCFVKQLEVPPYGSYRPNVAP ATPRANLAKELEKFSKVTFDYASFDAQVFGKRMLAPKIQTSETSPKAFQCFDYSQDAE AAHMAATAILNLSTRCWEMPENLSTKPQDLPSKSVDIEVDENGTLDLSMHKHRKRENA FPSSSSCSSSPGVKSPDASQRHSSTSAPSSSMTSPQSSQASRQDEWDRPLDYTKPSRL REEEPEESEPAAHSFASSEADDQEVSEENFEERKYPGEVTLTNFKLKFLSKDIKKELL 
TCPTPGCDGSGHITGNYASHRSLSGCPLADKSLRNLMAAHSADLKCPTPGCDGSGHIT GNYASHRSLSGCPRAKKSGVKVAPTKDDKEDPELMKCPVPGCVGLGHISGKYASHRSA SGCPLAARRQKEGSLNGSSFSWKSLKNEGPTCPTPGCDGSGHANGSFLTHRSLSGCPR ATFAGKKGKLSGDEVLSPKFKTSDVLENDEEIKQLNQEIRDLNESNSEMEAAMVQLQS QISSMEKNLKNIEEENKLIEEQNEALFLELSGLSQALIQSLANIRLPHMEPICEQNFD AYVSTLTDMYSNQDPENKDLLESIKQAVRGIQV" BASE COUNT 1383 a 1474 c 1537 g 1139 t ORIGIN 1 CTCCGCCAGC CCGTGCCACC GCTGCTAATG AGAGCAGTCA TTAAGTAAAT 51 GAGACGTCGC CTTTAGCTGG CTTAGGAGTT CGCACACTAA GGGGAGAAGA 101 TATTTAATTG AAACCCGCAC GCAGGCTTCC CCACATGTGA CCGCTGTACG 151 GGAGGCAGCT GCCTTCCCTC TCCTCCCCCA GTCCACCCTG CACCCCCCAT 201 GTAAATTTCA TGATTGCTTT CCGTGATGTC ATTTTGAAAG AGGACAGACA 251 ATAGCTGTGG GGAAAGAAAA TGAGTTCCGA GGTGAGCTGT TAAATCAGAG 301 GTGGACACAC GGAGGCAAGG CCAGCAGCTG CAGGACCTCA AGACACCTGG 351 GCAGACACTG CAAGATGAGC TTAGAAAATG AAGACAAGCG AGCTCGCACC 401 CGATCCAAGG CCCTGCGAGG ACCCCCAGAG ACCACAGCTG CAGACCTCAG 451 CTGCCCCACC CCAGGATGCA CAGGCTCAGG GCACGTCCGG GGCAAGTACT 501 CCAGGCACCG AAGTTTACAG AGCTGCCCCC TGGCCAAGAA GAGGAAGCTG 551 GAGGGCGCTG AGGCTGAGCA CCTGGTGTCC AAGAGGAAGT CACACCCCCT 601 GAAGCTGGCT CTGGACGAGG GCTATGGTGT GGACAGCGAC GGCAGTGAGG 651 ACACTGAGGT GAAGGACGCC TCTGTTTCGG ATGAATCGGA AGGAACTCTG 701 GAGGGGGCCG AGGCTGAGAC GTCAGGACAG GACGAGATTC ATCGCCCCGA 751 GACAGCTGAA GGAAGGAGCC CCGTCAAGTC CCATTTTGGA TCCAACCCCA 801 TCGGCAGCGC CACTGCCTCC TCCAAGGGCA GCTACAGCAG CTACCAGGGA 851 ATCATCGCAA CTTCTCTCCT GAACTTGGGT CAAATTGCTG AAGAGACCCT 901 GGTGGAAGAG GACTTGGGCC AGGCGGCCAA GCCAGGTCCT GGCATTGTGC 951 ACCTGCTTCA GGAGGCTGCA GAGGGAGCTG CCAGCGAGGA GGGTGAAAAG 1001 GGCCTCTTCA TCCAGCCAGA GGATGCCGAG GAGGTCGTCG AAGTCACCAC 1051 CGAGCGCTCC CAGGACCTGT GTCCCCAGTC CCTGGAGGAT GCAGCCAGTG 1101 AGGAGTCCAG CAAGCAGAAA GGCATCCTGA GTCACGAAGA GGAGGACGAG 1151 GAGGAGGAGG AGGAGGAAGA GGAGGAGGAG GAGGATGAAG AAGAGGAAGA 1201 GGAAGAGGAG GAGGAAGAGG AAGAGGAGGA GGAGGAAGAG GAAGAGGAGG 1251 AGGAAGAGGA AGAGGAAGAG GAGGAGGAGG AGGCAGCTCC TGATGTGATC 1301 TTTCAGGAAG ACACCTCTCA CACCTCTGCC CAGAAGGCCC CTGAGCTCCG 1351 GGGCCCAGAA TCACCCAGTC 
CCAAGCCTGA GTACTCTGTT ATTGTGGAGG 1401 TCCGCTCGGA TGATGACAAG GACGAGGACA CCCACTCCCG GAAGTCAACA 1451 GTCACTGACG AGTCGGAGAT GCAGGACATG ATGACCCGGG GAAACCTGGG 1501 CCTCCTGGAG CAGGCCATCG CCCTGAAGGC TGAACAGGTG CGCACAGTCT 1551 GCGAGCCGGG CTGCCCGCCT GCCGAGCAGA GCCAGCTGGG CCTGGGAGAG 1601 CCAGGGAAGG CAGCAAAGCC CCTGGACACT GTGCGGAAGA GTTACTACAG 1651 TAAAGATCCT TCAAGAGCTG AGAAGCGTGA GATCAAGTGT CCAACACCAG 1701 GCTGTGATGG CACTGGCCAC GTTACCGGGT TGTACCCTCA CCACCGCAGC 1751 CTTTCTGGCT GTCCCCACAA GGATAGGATC CCCCCAGAGA TCTTAGCCAT 1801 GCATGAGAAC GTGCTGAAGT GCCCCACTCC TGGCTGCACA GGCCAGGGTC 1851 ACGTGAACAG CAACCGCAAC ACGCACAGAA GTTTGTCTGG GTGTCCCATT 1901 GCTGCCGCCG AAAAATTAGC CAAATCCCAT GAGAAGCAGC AGCCGCAGAC 1951 AGGAGATCCT TCCAAGAGTA GCTCCAATTC CGATCGGATC CTCAGGCCCA 2001 TGTGCTTCGT GAAGCAGCTC GAGGTCCCTC CATATGGGAG CTACCGGCCC 2051 AACGTGGCCC CCGCCACACC CAGGGCCAAC TTGGCCAAGG AGCTGGAGAA 2101 GTTCTCCAAG GTCACCTTTG ACTACGCAAG TTTCGATGCT CAGGTTTTTG 2151 GCAAACGCAT GCTTGCCCCA AAGATTCAGA CCAGCGAAAC CTCACCTAAA 2201 GCCTTTCAAT GCTTTGACTA CTCGCAGGAC GCCGAGGCTG CACACATGGC 2251 TGCCACTGCC ATCCTGAACC TCTCCACGCG CTGCTGGGAG ATGCCTGAGA 2301 ACCTCAGCAC GAAGCCACAG GACCTCCCCA GCAAGTCTGT GGATATCGAG 2351 GTAGACGAAA ATGGAACCCT GGACTTGAGC ATGCACAAAC ACCGCAAACG 2401 AGAAAATGCT TTCCCCAGCA GCAGCAGCTG CAGCAGCAGC CCCGGTGTGA 2451 AGTCTCCCGA CGCCTCCCAG CGCCACAGCA GCACCAGCGC CCCCAGCAGC 2501 TCCATGACCT CTCCCCAGTC CAGCCAGGCC TCCCGCCAGG ACGAGTGGGA 2551 CCGGCCCCTG GACTACACCA AGCCTAGCCG CCTGAGAGAG GAGGAACCTG 2601 AGGAGTCAGA GCCAGCAGCC CATTCTTTTG CTTCTTCTGA AGCAGATGAC 2651 CAGGAAGTGT CGGAAGAGAA TTTTGAGGAG CGGAAGTATC CGGGGGAAGT 2701 CACCCTGACC AACTTTAAGC TGAAGTTTCT CTCCAAGGAC ATAAAGAAGG 2751 AGCTGCTCAC CTGTCCCACC CCTGGCTGTG ACGGCAGCGG CCACATCACC 2801 GGGAACTACG CCTCCCACCG CAGCCTCTCT GGTTGCCCTC TTGCTGACAA 2851 GAGCCTCAGA AACCTCATGG CTGCCCACTC TGCTGACCTC AAGTGCCCCA 2901 CGCCCGGCTG TGACGGCTCT GGCCACATCA CAGGGAACTA CGCTTCACAC 2951 CGGAGCTTGT CCGGCTGCCC TCGTGCAAAG AAAAGTGGAG TCAAGGTGGC 3001 ACCCACCAAG GACGACAAGG AGGACCCCGA 
GCTGATGAAG TGCCCAGTTC 3051 CAGGCTGTGT GGGGCTCGGT CACATCAGCG GGAAATACGC CTCTCACAGG 3101 AGCGCATCCG GCTGCCCACT GGCCGCCCGC AGGCAGAAGG AAGGGTCCCT 3151 CAATGGCTCG TCATTCTCCT GGAAGTCCCT GAAGAATGAA GGACCGACCT 3201 GCCCCACCCC GGGCTGTGAC GGCTCTGGCC ACGCCAATGG GAGTTTCCTC 3251 ACCCACCGGA GTTTGTCAGG CTGTCCCAGA GCAACCTTTG CTGGAAAGAA 3301 GGGAAAACTG TCAGGGGATG AGGTCCTCAG TCCAAAGTTC AAGACTAGCG 3351 ACGTGTTGGA GAATGATGAG GAGATCAAGC AGCTGAACCA GGAGATCCGA 3401 GACCTGAACG AGTCCAACTC GGAGATGGAG GCTGCCATGG TGCAGCTGCA 3451 GTCCCAGATC TCCTCCATGG AGAAGAACCT GAAGAACATC GAGGAGGAGA 3501 ACAAGCTCAT TGAGGAGCAG AATGAAGCCC TGTTTCTGGA GCTGTCCGGC 3551 CTGAGCCAGG CCCTCATCCA AAGTCTCGCC AATATCCGCC TTCCGCACAT 3601 GGAGCCAATA TGCGAACAGA ATTTCGATGC CTATGTGAGC ACCCTCACCG 3651 ACATGTACTC CAACCAGGAC CCGGAGAACA AGGACCTCCT GGAGAGCATC 3701 AAGCAGGCTG TGAGGGGCAT CCAGGTCTAG GCCGTGTGGT ACCCAGAAGT 3751 GTCCCAGCCC ACCACACCGT TTACCTCCCT CGCCCTGCCC CGCACCGTGG 3801 GGATGCCCAA CTCACAGTGA CTTCCCGTTT GGGGCCCGGT GTGGCCGCGG 3851 GCGGGTTTAT CCAAAGGGAT GGCTGGAAAT CGGCCGCTCC CACGAGGCTC 3901 CCTCCAGGCT TGGGGCCGTG GTGGCCCTAT CTGTGTGCAT AGGGGCACTG 3951 AAGAATTACA AAGTGATTTA TTTTTGTTTT CTGAAAGAAA TCTGAAGAGC 4001 AGCTCAAAGT CTCCAGTGGA AGCTCATGGA CAAGGTTCTC AGGGAAGTTT 4051 TGGAGTTTGC AACCACAGTA TTCCTTTGTC TGTCGAGGCT GGGAGGGTAG 4101 CCGTGAGCGT GGTGGGTGGG TGGTGTGAGT GGCATCTTGG CCTGGAGGAC 4151 ACGCCTGGGG CAGCGTGTCT GTGCTCAATG AGGGCCGCTG AAGAAAGTGC 4201 GTTTTCTGTT TTCATTTAAT TCAGTTTTGT GTTAAGGGTG GAACAAAGCT 4251 CTGTTCTAGA GGGAAAGTTA GGAATAGGCC TCCCAGAGGA TACTCCAGAC 4301 AGTTGACAAG CATCGCAGCC CCTGTCAGGT CAAGGAATTT TAACCTGGGA 4351 GACTCCTGAA AATCAGCCTA ATGTCTGGGG GCAGCTGGCG CTGAGGCTGT 4401 GCAGATGTGT ATGGCCTTGG GTCACAGCTC CCTACTGGGG CTGCAGCCTT 4451 TAGACCCTGG GTCTGGGATC CACGTCTAGG GACCACTCTG GCTTCTGTGC 4501 ATCCACCCAG GCCCCCGACC CATCACTGCC ACATTCTCCG GGTCATGGCC 4551 GAGCTTGACC TCTGCATCCA TGGTGACGGA GGTGGCAATG ACATCATCAA 4601 GGTCAGAGGG TAAATTCCTA ACAGGAGACC ACAAAGGTTT CACTTCTTTT 4651 AGATCCATCC AATATGCGAA AAAAGGGTAC ATTTTAATTT 
AATGTGTTAA 4701 AAACACAATG TCCCTACCGT GTAAATAGAA TCCAAGTCAA TTGTGACTGC 4751 AATTCCAAGT TAGTATTTAA AAATAGAGAA ATTGCACTTC CTGGAAAAAA 4801 TTAAAAAGTA GTTAGTTTAA TTATTCTGTA AATTATTTAA TTTTGTCAAG 4851 ATGTAAGTAT TTATATTCAC AGCCTCTTAT GTTAACGTTT GCTATTTATT 4901 TAACATTTTT ATTTATTAAT TTTATACTAT TTCAACTAGA CCCCACACCA 4951 GGGCACCCTA AGAGGGGAAA AAAGTGAATT CAAAGCAAAA ATAACTATTT 5001 GACTATAATA TAGGCCACAA AAAATCTTGA TATAACTCCT AGTTATAATT 5051 TTGATGCAAC CAAGTTATTT TTTATATATT TTGTTATTTA TTCTTTTAAT 5101 AATGGAATTT TTTTAAAAAG TATTTTGTAA CTGAGGAATT TTGATAATTT 5151 GGGGATTTTG TTTTCCCTTG GTCTAATGGA GAGAAAGACA TTTTGGTTTT 5201 CTTTTTCCAT TTTGGGTTCC TTTTGTTTCA GGCACGGATA ACAGCAAATG 5251 GAATTCTGTA TTTATTATTA TTTAAGTGTA GCGCAGATGT GTGTGCTTGG 5301 GCGTGTTTCT CGTGTCGCGT TTGCGTGTCG GCTCTTGCGG TGGAGTCCTG 5351 TTGCTGTGAG GGAGTGCTGG CCGCAGGCAG CCTCGAGTCA CCGCCAGGCT 5401 CACAGGCTGT GCCACCCGTC CCTCCGACAG CTCCAATACT GTGACCCTCC 5451 TTCCTCAGGA AGCGCGTTGA ACCCACAGCA CTCCGTGGTG TGTTTGCAGG 5501 GTGATTTCCT AATAAAAGTC CACTCTGACT GTG // LOCUS HUMTHM 3653 bp mRNA PRI 07-AUG-1995 DEFINITION Human endothelial cell thrombomodulin mRNA, complete cds. ACCESSION M16552 NID g339656 VERSION M16552.1 GI:339656 KEYWORDS glycoprotein; thrombomodulin. SOURCE Homo sapiens (clone: lambda-HTm[15,12].) umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3653) AUTHORS Wen,D.Z., Dittman,W.A., Ye,R.D., Deaven,L.L., Majerus,P.W. and Sadler,J.E. TITLE Human thrombomodulin: complete cDNA sequence and chromosome localization of the gene JOURNAL Biochemistry 26 (14), 4350-4357 (1987) MEDLINE 88024950 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by D.Wen, 04-AUG-1987. FEATURES Location/Qualifiers source 1. .3653 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-HTm[15,12]." 
/cell_type="endothelial" /tissue_type="umbilical vein" /map="20p12-cen" mRNA 1. .3653 /note="thrombomodulin mRNA" gene 147. .1874 /gene="THBD" CDS 147. .1874 /gene="THBD" /note="thrombomodulin" /codon_start=1 /db_xref="GDB:G00-119-613" /protein_id="AAB59508.1" /db_xref="PID:g339657" /db_xref="GI:339657" /translation="MLGVLVLGALALAGLGFPAPAEPQPGGSQCVEHDCFALYPGPAT FLNASQICDGLRGHLMTVRSSVAADVISLLLNGDGGVGRRRLWIGLQLPPGCGDPKRL GPLRGFQWVTGDNNTSYSRWARLDLNGAPLCGPLCVAVSAAEATVPSEPIWEEQQCEV KADGFLCEFHFPATCRPLAVEPGAAAAAVSITYGTPFAARGADFQALPVGSSAAVAPL GLQLMCTAPPGAVQGHWAREAPGAWDCSVENGGCEHACNAIPGAPRCQCPAGAALQAD GRSCTASATQSCNDLCEHFCVPNPDQPGSYSCMCETGYRLAADQHRCEDVDDCILEPS PCPQRCVNTQGGFECHCYPNYDLVDGECVEPVDPCFRANCEYQCQPLNQTSYLCVCAE GFAPIPHEPHRCQMFCNQTACPADCDPNTQASCECPEGYILDDGFICTDIDECENGGF CSGVCHNLPGTFECICGPDSALARHIGTDCDSGKVDGGDSGSGEPPPSPTPGSTLTPP AVGLVHSGLLIGISIASLCLVVALLALLCHLRKKQGAARAKMEYKCAAPSKEVVLQHV RTERTPQRL" sig_peptide 147. .209 /gene="THBD" /note="thrombomodulin signal peptide" mat_peptide 210. .1871 /gene="THBD" /note="thrombomodulin mature peptide" repeat_region 879. .986 /note="EGF-like repeat 1" repeat_region 1008. .1115 /note="EGF-like repeat 2" repeat_region 1131. .1358 /note="EGF-like repeat 3" repeat_region 1251. .1463 /note="EGF-like repeat 4" repeat_region 1368. .1463 /note="EGF-like repeat 5" repeat_region 1479. 
.1586 /note="EGF-like repeat 6" variation 1564 /gene="THBD" /note="c in lambda-HTm15; t in lambda-HTm12" /replace="t" BASE COUNT 758 a 1069 c 976 g 850 t ORIGIN 1 GGCAGCGCGC AGCGGCAAGA AGTGTCTGGG CTGGGACGGA CAGGAGAGGC 51 TGTCGCCATC GGCGTCCTGT GCCCCTCTGC TCCGGCACGG CCCTGTCGCA 101 GTGCCCGCGC TTTCCCCGGC GCCTGCACGC GGCGCGCCTG GGTAACATGC 151 TTGGGGTCCT GGTCCTTGGC GCGCTGGCCC TGGCCGGCCT GGGGTTCCCC 201 GCACCCGCAG AGCCGCAGCC GGGTGGCAGC CAGTGCGTCG AGCACGACTG 251 CTTCGCGCTC TACCCGGGCC CCGCGACCTT CCTCAATGCC AGTCAGATCT 301 GCGACGGACT GCGGGGCCAC CTAATGACAG TGCGCTCCTC GGTGGCTGCC 351 GATGTCATTT CCTTGCTACT GAACGGCGAC GGCGGCGTTG GCCGCCGGCG 401 CCTCTGGATC GGCCTGCAGC TGCCACCCGG CTGCGGCGAC CCCAAGCGCC 451 TCGGGCCCCT GCGCGGCTTC CAGTGGGTTA CGGGAGACAA CAACACCAGC 501 TATAGCAGGT GGGCACGGCT CGACCTCAAT GGGGCTCCCC TCTGCGGCCC 551 GTTGTGCGTC GCTGTCTCCG CTGCTGAGGC CACTGTGCCC AGCGAGCCGA 601 TCTGGGAGGA GCAGCAGTGC GAAGTGAAGG CCGATGGCTT CCTCTGCGAG 651 TTCCACTTCC CAGCCACCTG CAGGCCACTG GCTGTGGAGC CCGGCGCCGC 701 GGCTGCCGCC GTCTCGATCA CCTACGGCAC CCCGTTCGCG GCCCGCGGAG 751 CGGACTTCCA GGCGCTGCCG GTGGGCAGCT CCGCCGCGGT GGCTCCCCTC 801 GGCTTACAGC TAATGTGCAC CGCGCCGCCC GGAGCGGTCC AGGGGCACTG 851 GGCCAGGGAG GCGCCGGGCG CTTGGGACTG CAGCGTGGAG AACGGCGGCT 901 GCGAGCACGC GTGCAATGCG ATCCCTGGGG CTCCCCGCTG CCAGTGCCCA 951 GCCGGCGCCG CCCTGCAGGC AGACGGGCGC TCCTGCACCG CATCCGCGAC 1001 GCAGTCCTGC AACGACCTCT GCGAGCACTT CTGCGTTCCC AACCCCGACC 1051 AGCCGGGCTC CTACTCGTGC ATGTGCGAGA CCGGCTACCG GCTGGCGGCC 1101 GACCAACACC GGTGCGAGGA CGTGGATGAC TGCATACTGG AGCCCAGTCC 1151 GTGTCCGCAG CGCTGTGTCA ACACACAGGG TGGCTTCGAG TGCCACTGCT 1201 ACCCTAACTA CGACCTGGTG GACGGCGAGT GTGTGGAGCC CGTGGACCCG 1251 TGCTTCAGAG CCAACTGCGA GTACCAGTGC CAGCCCCTGA ACCAAACTAG 1301 CTACCTCTGC GTCTGCGCCG AGGGCTTCGC GCCCATTCCC CACGAGCCGC 1351 ACAGGTGCCA GATGTTTTGC AACCAGACTG CCTGTCCAGC CGACTGCGAC 1401 CCCAACACCC AGGCTAGCTG TGAGTGCCCT GAAGGCTACA TCCTGGACGA 1451 CGGTTTCATC TGCACGGACA TCGACGAGTG CGAAAACGGC GGCTTCTGCT 1501 CCGGGGTGTG CCACAACCTC CCCGGTACCT TCGAGTGCAT CTGCGGGCCC 
1551 GACTCGGCCC TTGCCCGCCA CATTGGCACC GACTGTGACT CCGGCAAGGT 1601 GGACGGTGGC GACAGCGGCT CTGGCGAGCC CCCGCCCAGC CCGACGCCCG 1651 GCTCCACCTT GACTCCTCCG GCCGTGGGGC TCGTGCATTC GGGCTTGCTC 1701 ATAGGCATCT CCATCGCGAG CCTGTGCCTG GTGGTGGCGC TTTTGGCGCT 1751 CCTCTGCCAC CTGCGCAAGA AGCAGGGCGC CGCCAGGGCC AAGATGGAGT 1801 ACAAGTGCGC GGCCCCTTCC AAGGAGGTAG TGCTGCAGCA CGTGCGGACC 1851 GAGCGGACGC CGCAGAGACT CTGAGCGGCC TCCGTCCAGG AGCCTGGCTC 1901 CGTCCAGGAG CCTGTGCCTC CTCACCCCCA GCTTTGCTAC CAAAGCACCT 1951 TAGCTGGCAT TACAGCTGGA GAAGACCCTC CCCGCACCCC CCAAGCTGTT 2001 TTCTTCTATT CCATGGCTAA CTGGCGAGGG GGTGATTAGA GGGAGGAGAA 2051 TGAGCCTCGG CCTCTTCCGT GACGTCACTG GACCACTGGG CAATGATGGC 2101 AATTTTGTAA CGAAGACACA GACTGCGATT TGTCCCAGGT CCTCACTACC 2151 GGGCGCAGGA GGGTGAGCGT TATTGGTCGG CAGCCTTCTG GGCAGACCTT 2201 GACCTCGTGG GCTAGGGATG ACTAAAATAT TTATTTTTTT TAAGTATTTA 2251 GGTTTTTGTT TGTTTCCTTT GTTCTTACCT GTATGTCTCC AGTATCCACT 2301 TTGCACAGCT CTCCGGTCTC TCTCTCTCTA CAAACTCCCA CTTGTCATGT 2351 GACAGGTAAA CTATCTTGGT GAATTTTTTT TTCCTAGCCC TCTCACATTT 2401 ATGAAGCAAG CCCCACTTAT TCCCCATTCT TCCTAGTTTT CTCCTCCCAG 2451 GAACTGGGCC AACTCACCTG AGTCACCCTA CCTGTGCCTG ACCCTACTTC 2501 TTTTGCTCTT AGCTGTCTGC TCAGACAGAA CCCCTACATG AAACAGAAAC 2551 AAAAACACTA AAAATAAAAA TGGCCATTTG CTTTTTCACC AGATTTGCTA 2601 ATTTATCCTG AAATTTCAGA TTCCCAGAGC AAAATAATTT TAAACAAAGG 2651 TTGAGATGTA AAAGGTATTA AATTGATGTT GCTGGACTGT CATAGAAATT 2701 ACACCCAAAG AGGTATTTAT CTTTACTTTT AAACAGTGAG CCTGAATTTT 2751 GTTGCTGTTT TGATTTGTAC TGAAAAATGG TAATTGTTGC TAATCTTCTT 2801 ATGCAATTTC CTTTTTTGTT ATTATTACTT ATTTTTGACA GTGTTGAAAA 2851 TGTTCAGAAG GTTGCTCTAG ATTGAGAGAA GAGACAAACA CCTCCCAGGA 2901 GACAGTTCAA GAAAGCTTCA AACTGCATGA TTCATGCCAA TTAGCAATTG 2951 ACTGTCACTG TTCCTTGTCA CTGGTAGACC AAAATAAAAC CAGCTCTACT 3001 GGTCTTGTGG AATTGGGAGC TTGGGAATGG ATCCTGGAGG ATGCCCAATT 3051 AGGGCCTAGC CTTAATCAGG TCCTCAGAGA ATTTCTACCA TTTCAGAGAG 3101 GCCTTTTGGA ATGTGGCCCC TGAACAAGAA TTGGAAGCTG CCCTGCCCAT 3151 GGGAGCTGGT TAGAAATGCA GAATCCTAGG CTCCACCCCA TCCAGTTCAT 3201 GAGAATCTAT 
ATTTAACAAG ATCTGCAGGG GGTGTGTCTG CTCAGTAATT 3251 TGAGGACAAC CATTCCAGAC TGCTTCCAAT TTTCTGGAAT ACATGAAATA 3301 TAGATCAGTT ATAAGTAGCA GGCCAAGTCA GGCCCTTATT TTCAAGAAAC 3351 TGAGGAATTT TCTTTGTGTA GCTTTGCTCT TTGGTAGAAA AGGCTAGGTA 3401 CACAGCTCTA GACACTGCCA CACAGGGTCT GCAAGGTCTT TGGTTCAGCT 3451 AAGCTAGGAA TGAAATCCTG CTTCAGTGTA TGGAAATAAA TGTATCATAG 3501 AAATGTAACT TTTGTAAGAC AAAGGTTTTC CTCTTCTATT TTGTAAACTC 3551 AAAATATTTG TACATAGTTA TTTATTTATT GGAGATAATC TAGAACACAG 3601 GCAAAATCCT TGCTTATGAC ATCACTTGTA CAAAATAAAC AAATAACAAT 3651 GTG // LOCUS HUMPBXPROA 3504 bp mRNA PRI 04-JAN-1994 DEFINITION Homo sapiens paired box protein mRNA, complete cds. ACCESSION L25597 NID g438649 VERSION L25597.1 GI:438649 KEYWORDS paired box protein. SOURCE Homo sapiens - juvenile (child) kidney cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3504) AUTHORS Ward,T.A., Nebel,A. and Eccles,M.R. TITLE An unusual alternative PAX2 transcript in human fetal and juvenile kidney JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1. .3504 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="-" /cell_type="-" /dev_stage="juvenile (child)" /haplotype="-" /sex="-" /tissue_type="kidney cortex" /map="10q22.1-q24.3" CDS 544. .1734 /note="octapeptide sequence from bp 1096 to bp 1120" /codon_start=1 /product="paired box protein" /protein_id="AAA36417.1" /db_xref="PID:g438650" /db_xref="GI:438650" /translation="MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVEL AHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEY KRQNPTMFAWEIRDRLLAEGICDNDTVPSVSSINRIIRTKVQQPFHPTPDGAGTGVTA PGHTIVPSTASPPVSSASNDPVGSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQ SGVDSLRKHLRADTFTQQQLEALDRVFERPSYPDVFQASEHIKSEQGNEYSLPALTPG LDEVKSSLSASTNPELGSNVSGTQTYPVVTGRDMASTTLPGYPPHVPPTGQGSYPTST LAGMVPEAAVGPSSSLMSKPGRKLAEVPPCVQPTGASSPATRTATPSTRPTTRLGDSA TPPY" misc_feature 589. .979 /function="'paired box'" exon 1565. 
.1647 /function="'alternatively spliced 83 bp exon'" BASE COUNT 609 a 1291 c 952 g 652 t ORIGIN 1 CGGGGGCCTG GCCGCGCGCT CCCCTCCCGC AGGCGCCACC TCGGACATCC 51 CCGGGATTGC TACTTCTCTG CCAACTTCGC CAACTCGCCA GCACTTGGAG 101 AGGCCCGGCT CCCCTCCCGG CGCCCTCTGA CCGCCCCCGC CCCGCGGCGC 151 TCTCCGACCA CCGCCTCTCG GATGACCAGG TTCCAGGGGA GCTGAGCGAG 201 TCGCCTCCCC CGCCCAGCTT CAGCCCTGGC TGCAGCTGCA GCGCGAGCCA 251 TGCGCCCCCA GTGCACCCCG GCCCACCGCC CCGGGGCCAT TCTGCTGACC 301 GCCCAGCCCC GAGCCCCGAC AGTGGCAAGT TGCGGCTACT GCAGTTGCAA 351 GCTCCGGCCA ACCCGGAGGA GCCCCACGGG GAAGGCAGTC GTGCGCCCCC 401 CGCCCCCGGG CGCCCCGCAG CAGCCGGGCG TTCACTCATC CTCCCTCCCC 451 CACCGTCCCT CCCTTTTCTC CTCAAGTCCT GAAGTTGAGT TTGAGAGGCG 501 ACACGGCGGC GGCGCCGCGC TGCTCCCGCT CCTCTGCCTC CCCATGGATA 551 TGCACTGCAA AGCAGACCCC TTCTCCGCGA TGCACCCAGG GCACGGGGGT 601 GTGAACCAGC TCGGGGGGGT GTTTGTGAAC GGCCGGCCCC TACCCGACGT 651 GGTGAGGCAG CGCATCGTGG AGCTGGCCCA CCAGGGTGTG CGGCCCTGTG 701 ACATCTCCCG GCAGCTGCGG GTCAGCCACG GCTGTGTCAG CAAAATCCTG 751 GGCAGGTACT ACGAGACCGG CAGCATCAAG CCGGGTGTGA TCGGTGGCTC 801 CAAGCCCAAA GTGGCGACGC CCAAAGTGGT GGACAAGATT GCTGAATACA 851 AACGACAGAA CCCGACTATG TTCGCCTGGG AGATTCGAGA CCGGCTCCTG 901 GCCGAGGGCA TCTGTGACAA TGACACAGTG CCCAGCGTCT CTTCCATCAA 951 CAGAATCATC CGGACCAAAG TTCAGCAGCC TTTCCACCCA ACGCCGGATG 1001 GGGCTGGGAC AGGAGTGACC GCCCCTGGCC ACACCATTGT TCCCAGCACG 1051 GCCTCCCCTC CTGTTTCCAG CGCCTCCAAT GACCCAGTGG GATCCTACTC 1101 CATCAATGGG ATCCTGGGGA TTCCTCGCTC CAATGGTGAG AAGAGGAAAC 1151 GTGATGAAGA TGTGTCTGAG GGCTCAGTCC CCAATGGAGA TTCCCAGAGT 1201 GGTGTGGACA GTTTGCGGAA GCACTTGCGA GCTGACACCT TCACCCAGCA 1251 GCAGCTGGAA GCTTTGGATC GGGTCTTTGA GCGTCCTTCC TACCCTGACG 1301 TCTTCCAGGC ATCAGAGCAC ATCAAATCAG AACAGGGGAA CGAGTACTCC 1351 CTCCCAGCCC TGACCCCTGG GCTTGATGAA GTCAAGTCGA GTCTATCTGC 1401 ATCCACCAAC CCTGAGCTGG GCAGCAACGT GTCAGGCACA CAGACATACC 1451 CCGTTGTGAC TGGTCGTGAC ATGGCGAGCA CCACTCTGCC TGGTTACCCC 1501 CCTCACGTGC CCCCCACTGG CCAGGGAAGC TACCCCACCT CCACCCTGGC 1551 AGGAATGGTG CCTGAGGCTG CAGTTGGTCC CTCATCCTCC CTCATGAGCA 1601 
AGCCGGGGAG GAAGCTTGCA GAAGTGCCCC CTTGTGTGCA ACCCACTGGA 1651 GCGAGTTCTC CGGCAACCCG TACAGCCACC CCCAGTACAC GGCCTACAAC 1701 GAGGCTTGGA GATTCAGCAA CCCCGCCTTA CTAAGTTCCC CTTATTATTA 1751 TAGTGCCGCC CCCCGGTCCG CCCCTGCCGC TCGTGCCGCT GCCTATGACC 1801 GCCACTAGTT ACCGCGGGGA CCACATCAAG CTTCAGGCCG ACAGCTTCGG 1851 CCTCCACATC GTCCCCGTCT GACCCCACCC CGGAGGAGGG AGGACCGACG 1901 CGACGCATGC CTCCCGGCCA CCGCCCCAGC CTCACCCCAT CCCACGACCC 1951 CCGCAACCCT TCACATCACC CCCCTCGAAG GTCGGACAGG ACGGGTGGAG 2001 CCGCGGGGCG GGACCCTCAG GCCCGGGCCC ACCGCCCCCA GCCCCGCCTG 2051 CCGCCCCTCC CCGCCTGCCT GGACTGCGCG GCGCCGTGAG GGGGATTCGG 2101 CCCAGCTCGT CCCGGCCTCC ACCAAGCCAG CCCCGAAGCC CGCCAGCCAC 2151 CCTGCCGTAC TCGGGCGCGA CCTGCTGGTG CGCGCCGGAT GTTTCTGTGA 2201 CACACAATCA GCGCGGACCG CAGCGCGGCC CAGCCCCGGG CACCCGCCTC 2251 GGACGCTCGG GCGCCAGGAG CTTCGCTGGA GGGGCTGGGC CAAGGAGATT 2301 AAGAAGAAAA CGACTTTCTG CAGGAGGAAG AGCCCGCTGC CGAATCCCTG 2351 GGAAAAATTC TTTTCCCCCA GTGCCAGCCG GACTGCCCTC GCCTTCCGGG 2401 TGTGCCCTGT CCCAGAAGAT GGAATGGGGG TGTGGGGGTC CGGCTCTAGG 2451 AACGGGCTTT GGGGGCGTCA GGTCTTTCCA AGGTTGGGAC CCAAGGATCG 2501 GGGGGCCCAG CAGCCCGCAC CGATCGAGCC GGACTCTCGG CTCTTCACTG 2551 CTCCTCCTGG CCTGCCTAGT TCCCCAGGGC CCGGCACCTC CTGCTGCGAG 2601 ACCCGGCTCT CAGCCCTGCC TTGCCCCTAC CTCAGCGTCT CTTCCACCTG 2651 CTGGCCTCCC AGTTTCCCCT CCTGCCAGTC CTTCGCCTGT CCCTTGACGC 2701 CCTGCATCCT CCTCCCTGAC TCGCAGCCCC ATCGGACGCT CTCCCGGGAC 2751 CGCCGCAGGA CCAGTTTCCA TAGACTGCGG ACTGGGGTCT TCCTCCAGCA 2801 GTTACTTGAT GCCCCCTCCC CCGACACAGA CTCTCAATCT GCCGGTGGTA 2851 AGAACCGGTT CTGAGCTGGC GTCTGAGCTG CTGCGGGGTG GAAGTGGGGG 2901 GCTGCCCACT CCACTCCTCC CATCCCCTCC CAGCCTCCTC CTCCGGCAGG 2951 AACTGAACAG AACCACAAAA AGTCTACATT TATTTAATAT GATGGTCTTT 3001 GCAAAAAGGA ACAAAACAAC ACAAAAGCCC ACCAGGCTGC TGCTTTGTGG 3051 AAAGACGGTG TGTGTCGTGT GAAGGCGAAA CCCGGTGTAC ATAACCCCTC 3101 CCCCTCCGCC CCGCCCCGCC CGGCCCCGTA GAGTCCCTGT CGCCCGCCGG 3151 CCCTGCCTGT AGATACGCCC CGCTGTCTGT GCTGTGAGAG TCGCCGCTCG 3201 CTGGGGGGGA AGGGGGGGAC ACAGCTACAC GCCCATTAAA GCACAGCACG 3251 TCCTGGGGGA 
GGGGGGCATT TTTTATGTTA CAAAAAAAAA TTACGAAGAA 3301 AGAATCTCAT TTGCAAAATA GCGAACATGG TCTGTGACTC CTCTGGCCTG 3351 TTTGTTGGCT CTTTCTCTGT AATTCCGTGT TTTCGCTTTT TCCTCCCTGC 3401 CCCTCTCTCC CTCTGCCCCT CTCTCCTCTC CGCTTCTCTC CCCCTCTGTC 3451 TCTGTCTCTC TCCGTCTCTG TCGCTCTTGT CTGTCTGTCT CTGCTCTTTC 3501 TCGC // LOCUS HSU48436 13695 bp mRNA PRI 02-DEC-1998 DEFINITION Homo sapiens fragile X mental retardation protein FMR2p (FMR2) mRNA, complete cds. ACCESSION U48436 NID g3776600 VERSION U48436.1 GI:3776600 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13695) AUTHORS Gecz,J., Gedeon,A.K., Sutherland,G.R. and Mulley,J.C. TITLE Identification of the gene FMR2, associated with FRAXE mental retardation JOURNAL Nat. Genet. 13 (1), 105-108 (1996) MEDLINE 96241583 REFERENCE 2 (bases 1 to 13695) AUTHORS Gecz,J., Bielby,S., Sutherland,G.R. and Mulley,J.C. TITLE Gene structure and subcellular localization of FMR2, a member of a new family of putative transcription activators JOURNAL Genomics 44 (2), 201-213 (1997) MEDLINE 97446139 REFERENCE 3 (bases 1 to 13695) AUTHORS Gecz,J. and Mulley,J.C. TITLE Characterisation of a large, 13.7kb FMR2 isoform showing lack of MRX phenotype JOURNAL Eur. J. Hum. Genet. (1998) In press REFERENCE 4 (bases 1 to 13695) AUTHORS Gecz,J. TITLE Direct Submission JOURNAL Submitted (06-FEB-1996) Cytogenetics and Molecular Genetics, Women's and Children's Hospital, 72 King William Road, North Adelaide, SA 5001, Australia REFERENCE 5 (bases 1 to 13695) AUTHORS Gecz,J. TITLE Direct Submission JOURNAL Submitted (21-OCT-1998) Cytogenetics and Molecular Genetics, Women's and Children's Hospital, 72 King William Road, North Adelaide, SA 5001, Australia REMARK Sequence update by submitter COMMENT On Oct 21, 1998 this sequence version replaced gi:2228247. FEATURES Location/Qualifiers source 1. 
.13695 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq27.3-28" /clone_lib="fetal brain (Clontech HL3003b) and placental cDNA (Clontech HL3007a)" /tissue_type="brain; placenta" gene 1. .13695 /gene="FMR2" repeat_region 1. .13695 /note="MER63A; MER63B" /rpt_family="MER" /rpt_type=dispersed repeat_region 21. .65 /note="microsatellite; folate sensitive fragile site FRAXE" /rpt_type=tandem /rpt_unit=ccg CDS 480. .4415 /gene="FMR2" /note="fragile X mental retardation gene associated with FRAXE mild or borderline mental retardation" /codon_start=1 /product="fragile X mental retardation protein FMR2p" /protein_id="AAC82513.1" /db_xref="PID:g2228248" /db_xref="GI:2228248" /translation="MDLFDFFRDWDLEQQCHYEQDRSALKKREWERRNQEVQQEDDLF SSGFDLFGEPYKVAEYTNKGDALANRVQNTLGNYDEMKNLLTNHSNQNHLVGIPKNSV PQNPNNKNEPSFFPEQKNRIIPPHQDNTHPSAPMPPPSVVILNSTLIHSNRKSKPEWS RDSHNPSTVLASQASGQPNKMQTLTQDQSQAKLEDFFVYPAEQPQIGEVEESNPSAKE DSNPNSSGEDAFKEIFQSNSPEESEFAVQAPGSPLVASSLLAPSSGLSVQNFPPGLYC KTSMGQQKPTAYVRPMDGQDQAPDISPTLKPSIEFENSFGNLSFGTLLDGKPSAASSK TKLPKFTILQTSEVSLPSDPSCVEEILREMTHSWPTPLTSMHTAGHSEQSTFSIPGQE SQHLTPGFTLQKWNDPTTRASTKSVSFKSMLEDDLKLSSDEDDLEPVKTLTTQCTATE LYQAVEKAKPRNNPVNPPLATPQPPPAVQASGGSGSSSESESSSESDSDTESSTTDSE SNEAPRVATPEPEPPSTNKWQLDKWLNKVTSQNKSFICGPNETPMETISLPPPIIQPM EVQMKVKTNASQVPAEPKERPLLSLIREKARPRPTQKIPETKALKHKLSTTSETVSQR TIGKKQPKKVEKNTSTDEFTWPKPNITSSTPKEKESVELHDPPRGRNKATAHKPAPRK EPRPNIPLAPEKKKYRGPGKIVPKSREFIETDSSTSDSNTDQEETLQIKVLPPCIISG GNTAKSKEICGASLTLSTLMSSSGSNNNLSISNEEPTFSPIPVMQTEILSPLRDHENL KNLWVKIDLDLLSRVPGHSSLHAAPAKPDHKETATKPKRQTAVTAVEKPAPKGKRKHK PIEVAEKIPEKKQRLEEATTICLLPPCISPAPPHKPPNTRENNSSRRANRRKEEKLFP PPLSPLPEDPPRRRNVSGNNGPFGQDKNIAMTGQITSTKPKRTEGKFCATFKGISVNE GDTPKKASSATITVTNTAIATATVTATAIVTTTVTATATATATTTTTTTTISTITSTI TTGLMDSSHLEMTSWAALPLLSSSSTNVRRPKLTFDDSVHNADYYMQEAKKLKHKADA LFEKFGKAVNYADAALSFTECGNAMERDPLEAKSPYTMYSETVELLRYAMRLKNFASP LASDGDKKLAVLCYRCLSLLYLRMFKLKKDHAMKYSRSLMEYFKQNASKVAQIPSPWV SNGKNTPSPVSLNNVSPINAMGNCNNGPVTIPQRIHHMAASHVNITSNVLRGYEHWDM 
ADKLTRENKEFFGDLDTLMGPLTQHSSMTNLVRYVRQGLCWLRIDAHLL" repeat_region 12612. .12679 /note="microsatellite" /rpt_type=tandem /rpt_unit=tg BASE COUNT 3924 a 3128 c 2794 g 3849 t ORIGIN 1 CGCCGCCTGT GCAGCCGCTG CCGCCGCCGC CGCCGCCGCC GCCGCCGCCG 51 CCGCCGCCGC CGCCGCTGCC GCCCCGGCTG CCGCGCCGCG CCGCTGCCTC 101 TGCCCCGGCC GCCCCCGCCG CCGCTGCCGC CGCCGGCCCG CAGCCAGCCA 151 GGCGGGCGGC CCAGCCCGCC TGAGCCCGCA GCGGCTGCCG CCGCAGCGTC 201 GGGTCGCTGG GTGCGCGGGC TACCGCGGAC CGAGCGGACC CGAGTGGGCG 251 ACCAGGCGCT TGCCCGCCCA GTGCCACTGC CGCCGCTTCC TCGCCGGAGC 301 ACAGGACCAG ACACCTCCAG CGCCCGCTGC TGCTGCCGAT GCGGCCCGGA 351 CACTTTTAGC TGGGCGGGAG GGCTGGAGAG CCGGGGGCCG CCGAGAACCG 401 CCAGCGAGCT GTGCCGAGAG CCGCGCCGAC CCGCTGCGAT CAGGGACAGG 451 CGCCCGCCCG CCGCCGCCGC CTGGCCGCTA TGGATCTATT CGACTTTTTC 501 AGAGACTGGG ACTTGGAGCA GCAGTGTCAC TATGAACAAG ACCGTAGTGC 551 ACTTAAAAAA AGGGAATGGG AGCGGAGGAA TCAAGAAGTC CAGCAAGAAG 601 ACGATCTCTT TTCTTCAGGC TTTGATCTTT TTGGGGAGCC ATACAAGGTA 651 GCTGAATATA CAAACAAAGG TGATGCACTT GCCAACCGAG TCCAGAACAC 701 GCTTGGAAAC TATGATGAAA TGAAGAATTT GCTAACTAAC CATTCTAATC 751 AGAATCACCT AGTGGGAATT CCAAAGAATT CTGTGCCCCA GAATCCCAAC 801 AACAAAAATG AACCAAGCTT TTTTCCAGAA CAAAAGAACA GAATAATTCC 851 ACCTCACCAG GATAATACCC ATCCTTCAGC ACCAATGCCT CCACCTTCTG 901 TTGTGATACT GAATTCAACT CTAATACACA GCAACAGAAA ATCAAAACCT 951 GAGTGGTCAC GTGATAGTCA TAACCCTAGC ACTGTACTGG CAAGCCAGGC 1001 CAGTGGTCAG CCAAACAAGA TGCAGACTTT GACACAGGAC CAGTCTCAAG 1051 CCAAACTGGA AGACTTCTTT GTCTACCCAG CTGAACAGCC CCAGATTGGA 1101 GAAGTTGAAG AGTCAAACCC ATCTGCAAAG GAAGACAGTA ACCCTAATTC 1151 TAGTGGAGAA GATGCTTTCA AAGAAATCTT TCAATCCAAT TCACCGGAAG 1201 AATCTGAATT CGCCGTGCAA GCGCCTGGGT CTCCCCTAGT GGCTTCCTCT 1251 TTATTAGCTC CTAGCAGTGG CCTTTCAGTT CAAAACTTCC CACCAGGGCT 1301 TTACTGCAAA ACAAGCATGG GGCAGCAAAA GCCAACTGCA TACGTCAGAC 1351 CCATGGATGG CCAGGACCAG GCACCGGACA TCTCACCAAC ACTGAAACCT 1401 TCAATTGAAT TTGAGAACAG CTTTGGGAAT CTGTCATTTG GAACACTCTT 1451 GGATGGAAAA CCCAGTGCAG CCAGTTCAAA GACTAAACTG CCAAAGTTCA 1501 CCATCCTCCA AACAAGTGAA GTAAGCCTTC 
CCAGTGATCC AAGCTGTGTT 1551 GAAGAAATCT TGCGGGAGAT GACCCATTCC TGGCCTACTC CTCTCACTTC 1601 CATGCATACT GCTGGACACT CTGAGCAGAG CACCTTTTCC ATCCCAGGAC 1651 AGGAATCGCA GCATCTGACC CCAGGATTCA CCTTACAAAA GTGGAATGAC 1701 CCAACCACCA GAGCTTCTAC AAAGTCAGTG TCTTTCAAAT CGATGCTTGA 1751 GGATGACCTG AAGCTGAGCA GTGATGAAGA TGACCTTGAG CCTGTGAAGA 1801 CCTTGACCAC TCAGTGCACT GCCACTGAGC TCTACCAGGC TGTTGAAAAG 1851 GCAAAACCTA GGAATAATCC TGTGAACCCA CCCTTGGCCA CTCCCCAGCC 1901 CCCACCTGCA GTGCAAGCCA GCGGGGGTTC TGGCAGCTCC AGCGAATCGG 1951 AGAGCAGCTC TGAGTCGGAT TCAGACACTG AAAGTAGCAC CACTGACAGC 2001 GAATCTAATG AGGCACCTCG TGTGGCAACT CCAGAGCCTG AGCCACCCTC 2051 AACCAACAAG TGGCAACTGG ATAAATGGCT TAACAAAGTG ACATCCCAGA 2101 ACAAGTCTTT TATTTGTGGC CCAAATGAAA CACCCATGGA GACTATTTCT 2151 CTGCCTCCTC CAATCATCCA ACCAATGGAA GTCCAGATGA AAGTGAAGAC 2201 GAATGCCAGT CAGGTCCCAG CTGAACCCAA AGAAAGGCCT CTCCTCAGTC 2251 TCATTAGGGA GAAAGCCCGT CCACGGCCCA CTCAGAAAAT TCCAGAAACA 2301 AAGGCTTTGA AGCATAAGTT GTCAACAACT AGTGAGACAG TGTCTCAAAG 2351 GACAATTGGG AAAAAACAGC CCAAAAAAGT TGAGAAGAAC ACCAGCACTG 2401 ACGAGTTTAC CTGGCCCAAA CCAAATATTA CCAGCAGCAC TCCCAAAGAA 2451 AAAGAAAGTG TGGAGCTTCA TGACCCACCA AGAGGCCGCA ACAAAGCCAC 2501 TGCCCACAAA CCAGCCCCTA GGAAAGAACC AAGACCTAAC ATCCCTTTGG 2551 CTCCCGAGAA GAAGAAGTAC AGAGGGCCTG GCAAGATTGT GCCAAAGTCT 2601 CGGGAATTCA TTGAAACAGA TTCATCTACA TCTGACTCCA ACACAGATCA 2651 GGAAGAGACC CTGCAAATCA AAGTCCTGCC TCCGTGCATT ATTTCTGGAG 2701 GTAATACTGC CAAATCCAAG GAAATCTGTG GTGCCAGCCT GACCCTCAGC 2751 ACCTTAATGA GTAGCAGTGG CAGCAACAAC AACTTATCCA TCAGTAATGA 2801 AGAGCCAACA TTTTCACCTA TTCCTGTCAT GCAAACTGAA ATCCTGTCCC 2851 CTCTGCGAGA TCATGAGAAC CTGAAAAACC TCTGGGTGAA GATTGACCTT 2901 GACTTACTCT CTAGAGTACC TGGCCACAGC TCACTCCATG CAGCACCTGC 2951 CAAGCCAGAC CACAAGGAGA CTGCCACAAA ACCCAAGCGT CAGACAGCTG 3001 TCACAGCTGT GGAGAAACCA GCCCCTAAGG GCAAACGTAA GCACAAGCCA 3051 ATAGAAGTTG CAGAGAAGAT CCCTGAGAAG AAGCAGCGCC TGGAGGAGGC 3101 CACAACTATC TGCTTGCTCC CTCCTTGCAT CTCACCAGCC CCACCCCACA 3151 AGCCTCCCAA CACTAGAGAA AATAATTCAT CCAGGAGAGC 
AAATAGAAGA 3201 AAGGAAGAAA AACTATTTCC TCCTCCACTT TCCCCACTGC CAGAGGACCC 3251 TCCACGCCGC AGAAATGTCA GTGGCAATAA TGGTCCCTTT GGTCAAGACA 3301 AAAACATCGC CATGACTGGA CAAATCACAT CTACCAAACC TAAGAGAACT 3351 GAAGGCAAAT TCTGTGCTAC TTTCAAAGGG ATATCGGTAA ATGAGGGAGA 3401 CACTCCAAAA AAGGCATCCT CTGCCACCAT CACTGTCACC AATACTGCTA 3451 TTGCCACTGC TACTGTCACT GCTACTGCCA TTGTCACCAC CACTGTCACA 3501 GCTACTGCCA CCGCCACGGC CACCACCACA ACTACTACCA CTACCATTTC 3551 CACCATCACC TCTACCATCA CTACTGGCCT CATGGATAGC AGTCACCTGG 3601 AGATGACGTC CTGGGCGGCT CTGCCCCTTC TATCCAGCAG CAGCACTAAT 3651 GTCCGGAGAC CCAAGCTCAC TTTTGATGAC TCGGTTCACA ATGCTGATTA 3701 TTACATGCAA GAAGCTAAGA AGCTGAAGCA CAAAGCTGAT GCACTGTTCG 3751 AGAAATTTGG CAAAGCTGTG AATTATGCTG ATGCCGCCCT CTCCTTCACT 3801 GAATGTGGCA ATGCCATGGA ACGCGACCCT CTGGAAGCAA AGTCCCCATA 3851 CACCATGTAC TCTGAGACTG TGGAGCTCCT CAGGTATGCA ATGAGGCTGA 3901 AGAACTTTGC AAGTCCCTTG GCTTCGGATG GGGACAAAAA GCTAGCAGTA 3951 CTATGCTACC GATGTTTATC ACTCCTCTAT TTGAGAATGT TTAAGCTGAA 4001 GAAGGACCAT GCTATGAAGT ACTCCAGATC ACTGATGGAA TATTTTAAGC 4051 AAAATGCTTC AAAAGTCGCA CAGATACCCT CTCCATGGGT AAGCAATGGA 4101 AAGAACACTC CATCCCCAGT GTCTCTCAAC AACGTCTCCC CCATCAACGC 4151 AATGGGGAAC TGTAACAATG GCCCAGTCAC CATTCCCCAG CGCATTCACC 4201 ACATGGCTGC CAGCCACGTC AACATCACTA GCAATGTGTT ACGGGGCTAT 4251 GAACACTGGG ATATGGCCGA CAAACTGACA AGAGAAAACA AAGAATTCTT 4301 TGGTGATCTG GACACGCTGA TGGGGCCTCT GACCCAGCAC AGCAGCATGA 4351 CCAATCTTGT CCGCTACGTT CGCCAAGGAC TGTGTTGGCT GCGCATCGAT 4401 GCCCACTTGT TGTAGTGGGT GTTCTCAGAT CTCTAGCATC ACGACCCATC 4451 ACTCTACCTC TACCAGCGCA CTGATGGTCA CTGGTGGAAC TCCACTCACT 4501 GGGGAACGTT CTCTTTGGTT ATGTTTGTTT TTATGCTTCT TTTGTTATCT 4551 GTAAAAAACA GAAGTCATTG TAAGTTGACA CTACAACTTA AGGGCAGTGT 4601 ACGTTTTATT ACTTAGTCAT TTTTTTTCTT TTAGCATTTG ATATGCATTT 4651 CTCAGATTCC ACCATCTTTT TGTGCTTTAT GGAATGACAG TCCCTACAAT 4701 ATTGTTTTAA GCCCACACTA CCCAAAACAA AGAATGGGAA GCACTTGTGA 4751 TAAAGACAGG CTCCTGAGAA ATGCAACAAG TGGTCTTACA TATACATGAG 4801 AACTTAGACA CAAGGGACCA TCCCCCAAAC TCTACTCTTA TACCCAGAAA 4851 
AGAACATATT TCAGAATCTG TCAAACTTTT GTGTATCCCA CAGATTCAAT 4901 CTTCAGGTGA GAATTTTCAT TGTCAAAACC CACTGGTTAG ATGTTGTAGC 4951 AACATCATAA AATCAAGAGT ATCAAGAAAA TAAATGAGCA TAGCAATGCT 5001 ACTCTTAAAA AGATGCTATG CCACACAACC AGAGGACTTT CTTGTTAGCA 5051 TCCCTTTCCT GATTCCCTAT TTTGTTAATT TTAATGATAA GAAGAAAGGG 5101 TGACATTTAT TTTGACAAGT TTTAGGCATC AGCTGGCATC AGTGTTTTTC 5151 AACTCCATTA TTTGAAGTGT AAATCCTCAC CTGGGGTTCT CTGTGTGCAA 5201 AGCTGTCCTT TTGAAGAACA GTTTGGTTGA TGCATGCCTT AGTAGCCAAA 5251 ATGCTACACT CTAGACTTAC AAGTGGGAGT TAAGAGAGGT CTGGAAAGTG 5301 TCCAACAAGG AATTCACACC TCTGCCTCCT TTGCAACAAC AACATTTACA 5351 CAGTTGGTAA GTGGGTCCAT AACTGGCAGG ATTTTTAAAT TGTATTTTGC 5401 TCAAATCTAT GGGAACAAAA GTCAAGGTAT CACTACCTAG AAGTAATGAT 5451 ATACAGTTTT CTTCCTAGTG GCTTGAAAAT CTGGACTTCC TCAATTATTA 5501 TTCACATTTT CTCTCTTATA GGTTTTCTGT TTTCTACTTT CTTTTTTCTC 5551 TTATCTGTGT TTCCCTTTCC TTTGTTTGGC TCATTAACTT TTGACTGAAT 5601 TACAATTACT CCTTTTATTA AAGTCCATAT TATTGTGAAT CATTTCCATG 5651 AAAATTTCTA AGAAAACTCC AAACTCTCTA AATAGTAGCT AACTTTTATT 5701 TTTTTAAAAT GAGTCGTGGG GTAGTGCTTC ACCTTGAGAT GCTTTGAAAG 5751 AGCCCTAAAC ATTGGGAACC ATTCACCTAA TTTGGAGACA TTTCTCACTG 5801 GTTGTGACTA CCCCCTTATG ATCCTTCACA TTCATTTTAT GTCCCTAAAC 5851 ATCACAATGT AAATATCATT TTTGATGTTC CAGCTCACCA GAAGATTCTT 5901 ACACTTGGGG TAAACACTAT CCATGCATTA CTTACTGGTA ATTACCTGCT 5951 GGTATATAAT TCCATGTAGC CTTTAATATG CTGGGTTATC AAATTCTGTT 6001 CACTGAGTTA TGACCAGATA AATAATAGAT ATGCACATGA AAGATGCAAA 6051 CTTGTGTGAT TATTAAAGCC AGCCATGCAG GTCCATGATA GAAACAGCAG 6101 GTGATGACTC TGCACTCTCA TTGTCAAGGT TAGCTATATC CCCAGTTGCA 6151 AAACAGCCAG ACTTGAGCTG TGCTCTGGTC ATCTTTGAGT TTAAGGCCTT 6201 TTGTTGTATA AGGCTGTGGA AGTTGTACTC CAATGGCTGA AGCCATGTTG 6251 TTAATATGGC TGATGGGAGC ATCCCTGCAG CTGAACCCAG CACTTTTTAT 6301 GCTCCCACTG TGGTTGAGCT TTATGTTTAC AGTCTCAGCA ACAACACTTA 6351 TGCATCCAAA CACTCACAAA TGAAACCTGA AAGAATCTTT TCTGAGCCTC 6401 TTAAAAGAGG AAAATGATGA TAACATTAAA GACTCTGAAC ACCCAAGGTT 6451 GGTGTCACAT ATAAAAATTA AGCTGATGAC TTTGCAGTGA CTCAAGTTGT 6501 CTCTTTATCA 
TGGTTTACCA GGTAGAGTGC CTGGCTATTA CTATATAATG 6551 AAGCCCACTG GCTTGACTTG TAAGTTCAAC CTAAACCACA ATCCTAGACC 6601 ATCATGGATT TAGGAGTAGA TTCTTCTTGA AATCCCACAT CCAGAAACTA 6651 GACATTAGAA TGTTGAGGCA GTTTCCCAGA GAAACAAGCA TATTGCCTCA 6701 TGGATGAAAG ACTTGTAGTT CTAGTTTCAG TGACTTGTTA TATCTACTTA 6751 CATACAACAG GGAGGCAAGA GGATTCTCTG TCATCTCTGG TGACTGAGTG 6801 TAAAATATGT GCCAAGTCTG CAGCACAGTG ACCAAATCTG ACAATCGAGC 6851 TCTGGATCAC CACTTGATTA TGTAGTAGAC TCATTTATAA AGCAGCTTAG 6901 GAACTAATTA AACATGGAGG ATGAATTACC TTCCTATCCC TTGAGATAAG 6951 ACATCTTTCA GTTTCATGAT TAAGGATTGT TGCTGTTTTA TAGTTACTCT 7001 GTTCATCACA GTGTAAATGG TGATGCGTGT CGTAGGTGTG CAGCTATTTG 7051 AGGGACTAAG GGATGGAGAT ATTCTGTCAA ATGAATCTCT TCAGTATACC 7101 AGTTTGTGGG AGGGATATGA GACATGTGGA TGGCAGTGAG AGATCGTGCC 7151 TCTAGATCTT GATGGAGGCT TGGTGAGACA CACTTAAATA AGCACGTGGA 7201 GGTTAGAATA GAGGGCAGAG TAAAAGGAAG CTCCATCTGA GCAAGTACAC 7251 CAAATGATCT CAGCCCTGCA ACTTGACCCA GGTAGGGCCA CCACTACGCC 7301 TTCACTTGTC ACCCAAGCTC CAACCACAGA GAGTTTGACA AGTTTGTGTT 7351 ATGATGTTGG CTTGGCTTTG TATTTTTAAT TAACTTTGGA TTTTTAGTGG 7401 TTTTGTCATA TAACTGTCTG AGTTTGGTAG GTAGGATTAC TTTGAAAAGG 7451 GTTTACTAGT GTGGTCCTCC GGGTAGAATT TAGCTGTAAC ATGTTGTTAG 7501 CCAGCCTGTA GACTGTTAAT TACTTAATAA TCTCATTGGG AAAATACTAG 7551 TAGTTTTATA TTTGGATGAC ATAATTGGAA AAAGCAGATT AGCTGCTACT 7601 ACTTTTAAAA GACTTAAGGT CGGGATGCCT TTTTTTCCAT GTAAGGAAAT 7651 GAAAAGACCC AAAATCTTCA GGCAAAAAGC AAGTTGCAAA ATTAGAAACC 7701 ATTGGCTAAA AATGTGTTTT GTTGAGTTTC CAAATGGATG AATTTTCATT 7751 TGGACATTAC ATCACTAAAT TCATTAGATT TTGTCTGCAT TGGAAAGATA 7801 CTCTTCTAGC ATATCTTTCC CAAAGATATC TAATTTGGAT TCTGTTTCAT 7851 GCAAATTTGC ATCCCGGAGG TTGAAGTTGG AGTTTGAGGT TGGAAAATAT 7901 CTTTGAAGGC AGAATCAGTT GAGTTGTGAG GGTGAAGCCT CACATACTTC 7951 TCAACAGACA TGATAAAATT CACCTGCATG AGTTGGCAGG TGGGAGAACC 8001 AAACTGGATC ACTGGGTAAG ACTACTCAGT AAAGCAATGA ACTGCTTGCT 8051 TAGAGAAGCA TCACTATCCC CATTGAGAAA AATGTGTGGC AAGATGATAC 8101 AGCTACACAG TATCAAATGA ATGGGTCAAT TCAGCACCCC CAAATTTAAT 8151 TCTGTGGGGA AAAATTATTG 
AGCCAGTTGT CAGTGTTCTG TTACATGACT 8201 GGCAGACTAA ATTCTTCATC GTTGTTGTTA TTGTTGTTGT TGTTTCTCAT 8251 TTTCACTCGC ACGGCCTTAT TCTCATAATT AAAATCTAAT TCATTTTCTC 8301 TTTAGTGTTA GTAGACTCCA ACAACAGAAG TGGCATCTGT GTATTCATAA 8351 TCAGCATTTA CCCTGGCAGG AGACTAATCA GATAGGCCGG TCTCAGACAT 8401 TAATCCTACC ATCTGATATT TTTGGTGAAG GAAAAAGTAT TAATTCTCTT 8451 TCCATCCTCC TCCTCAGAAA TATAGAAGCC CTCTTTACCA AAATCATCAC 8501 ATTTTACTCT GTAATCTACC AGCTAAAAGA AAATTGCATT GAAGCCCCAC 8551 AAAGCCAGAT TGCAGTTCTT GCCCCTTTTT GCGTCTGACA TGAGATGTTA 8601 AAGAATTATT CATTGTGCTC ACATTGGGTT AGGGGACACT GAACTGCTTT 8651 TTAGATCCAT GATCAGTCAT CATTCTTCTA AGAGATTGGA GCTTTGCTGT 8701 TTCATTAACT GTGCAGTGTA GACTAATGGT GTTTAATAAA AATCATTCAA 8751 AATTTCAAAC TCTTTTGCCA GTGACCTCAA TTTTGTTGGC TCTGTGATTT 8801 GTATCAGACT TTGAGGAGGG AAGGGGGAAG TGAAGGAAGC CTACGTCCAG 8851 GCCCCTGACA GGATGCTGCA GTAGCAAGCT CAAGCTCGCC TGCCTGCCAG 8901 CAGTTGCTGG TGAGCAGCAG CATGCAGACC AGCTGTGGGA AGCCTCCTGA 8951 AGAATGCCCC AGCTGATGCT TTCAGCTGGG AATAGTTTGT TCCTATTGGG 9001 GAACTCATTG TTCTCCAGTC TCTGCAGCAG GAAGCCAGCT GTCATATTCG 9051 GAGGGAATTT CAGATGCTTT ACCTTTTTGG TTTTGTCCTG CATCACTCAT 9101 GTGGCTACGA AAGTGTCTCT GAGAATAGAG CCCAATGTGG TGACAATGGG 9151 TAGTCAAATG CACCCCAGAT GCTCAAGCCC TGTTGTGGTT CTGCAGTGTT 9201 TATGAAATTG GGAGGAAGGA GACCCTGGAC AGTAAGCAAA ATTGGAGACA 9251 CTCCAACGAG GCTAAGTTAA TGCCGTGTTG CCCAGAACAA GATCTAGCTT 9301 CTCATTTGGT CAGCCTAGCA TGCAACCAGT GGTGTGCTGG TAAAATGTTT 9351 AACAACCAGC TCGCTGAGAA TAGAAAGCAC CTGGTTTGCA CCATTTGCCA 9401 ATTTCCATGG CATAAATACT ACCACTTTAG ATGATTTTAA GCTACCAACT 9451 GTGATGTCAC TGAACACATG GTTGGAAAGA GATGCACGCA GTTGGCTCTT 9501 GCAAGCCTGG GCAAAAATGC TTCAACACGC CACTGGATGC AGCCAGTCAG 9551 AGGGTTCATA TTTAATATAT GTGTTCATGT GGACACACAC AGACACACAC 9601 ACACAAACTC ACCCTTACAC ACACACTTCG ATGACTAAAA CAATTACATA 9651 GTTTTAAGAT ATGAATCAAT GTGTGAATGT AGAAAGCTTA TGATAAGGCC 9701 CTAGAGGTAT GGGTTGCCCT GGAAGCCTAG GTTTTAAGCA GGAGAATAGC 9751 TGAGAAGAAT GAAGCCCTCC TGAGCTGAAA GGAGAGATGG ATCAATGGAG 9801 ATGGTTCCAT CATCTCCTTC CATATCTCAC 
AGGTAAAATG GGCACTCAGA 9851 AAACCCTCAC GATTGATTTT TTAAAAAGAT AAGTGAGTGT TTTTTATTTT 9901 ATTATTATTG TCATCATTAT TTTGATTTAC AAATGCTATT TGTAACTTTT 9951 ACATGTAACT AGGATAAAGT ATTTACGGGA ACTCTATGGA GAATAGCACA 10001 ATCCAGAATT TACTGTGTTT TTCTTTTATG TGACGTGGAA ACTCAGTAAT 10051 TCTCCCACCT TCACATTGTT GTTCATAAGA ATTTTACTTT AGTTATTAGG 10101 GAATCTAAGT TTTTTGTTAA CATTTGTTTT TAGTTAAAAG TATCTACTTA 10151 CTGTTTTAGC TCTGAACTCA AACCAGAATA TCTCTGTATC AATTGCATGA 10201 CTATTCAGAA ACAATAATCC AAACCAAAAT AATTCTTTTT CCACCCAGTA 10251 CGAAGAAAAC TAAGCTCAGT AACAAGAAGG CATAAACTAA AGTATATAAT 10301 GAGGCTTTCA TTAAATACAC ACACACACAC ACTCACACAC ACACACATAC 10351 ACTTTTTAAA TTTTTAAATT AGGCCTCCAC ACATAAATCA TTTTGAAAGT 10401 AGAATAGAAA ATCTCAAAGA ATTCATTCTC CTGGTCCTGT GCATCTTCTG 10451 CAGTTAATAA GAGGTTTGTA TCTGGAAAGA TGGAAGAACT TGTTCTAAAA 10501 TCTTATTTTT CAAAAAAAAA TTTCCATTTT CTCTCTGGGC CTGTATCCAT 10551 GGTTGAATGT TAGCCCTGGA GGAGATCCAT GTCTTACTCG CTCTTTCTGG 10601 CCCTTCTGTC TTTTGCCTCT GCAATTCTTT TTGTAGCTGG CACGATAGCA 10651 GGGACTGGGG GTCTATCCTT TCATGGTATT GCTACAATAT TTGTCCTTAC 10701 TGGAAAATGG TAACATCCGG GTCTGATTTA ATTGGCATTA CACTTACACA 10751 GGGACTCTGA GCACCCCCGT CACCACACCA GACAGTGGAC CAGTTTTCAC 10801 AGCTACAAAG AGCTAGAAAT GTGTTTAACA TCATCCAGTG CATCCCCTAA 10851 TTCAAAACCA TCCTCACTAA TCAATCATAT TCACCCATAA ATATTACAAA 10901 TGAGATTGAT TCCATCTCAA GACAATTTGT CAAATACTTA ATTTTCTTCC 10951 TGGATGATTC TACTTACTGG ATATTTTAGA AAGAGAAATG TCTGAGATAA 11001 AATCCCTCAC ATTTACTCAA TATAACAAAT TACTGTTTCT ACTCCTATTC 11051 TGAGTAGTGC TTCTGAAGAT TGTTTGCTGT AGTGTTGTCT TTGATAAAAT 11101 GAATGTCAGT AGTGAGCCTT TTAGAGATAC CATGCTCAGA CATCCTCTTT 11151 GGGATCAGAA GATACCTAAA ATTCTCCCCT TTTGCCCACT TGGTTAGATG 11201 AGTGATATAT TCTTTGGATC CTGCAAAGAA GAGATTGGTT TCTTTTCTTT 11251 TCTGGTGGTG GTAGTGGTTG TATCTGTGGC TGTGATGGTT GTTGTTACTT 11301 GTCTCTCTCT CTCTCTGGCT CTGGCTTTTG CTTTCCTGCT AGTGTTCTTT 11351 CTCTTTCCAA ACAAATAGTT AAATTAAACG TGAGCTTCTG AATTGTACTT 11401 GTTCATACTT TCAAAACATA ACAGATTAAT AAAAATAGAT GTGTCCTGAT 11451 TTAAAACATG CCCCCTGGAA 
AGGCATGCTG TATTATGAAA TCGTGATAAT 11501 ATAACTGCAT TATTACATGG CAGTATAAAT ATTAGTCTGT TGAATTCATT 11551 TGTCCAATTG TATAACTTTG TGGAGCAGTG TTTTGACCTT TGATACATAA 11601 TTCTGGAGCA AGTGGAGTGG TTGCAGGCAG ATGAGACAGT GTTATATCAG 11651 GATTTTTCAA TCAACTTTAG TTGGAGGCCT GGCAATTACA AACATCTTCA 11701 GATGTTTCTG TAACCATTAT AAATATGAAA AAAACCTCTT CAAAAAATTT 11751 CCCATAGTAC TTCAGTCAAG ACTTTTTAGG TTTATCTTTT TTTTTTCATT 11801 TCTCCTTTTC CTTTTCCATT ATTTTTCGAT GGGGGGGTTG TTATCATTGA 11851 CTGAAGAAAT ATTTTGATTG CAATGGTCTC TCTCTCTCTC CCCCTCTCTC 11901 TCTCTCTCCT CTATTCTTTC CTCCTTCCCT CTGTCCATCA CCCCTCATTA 11951 AAATATTGAA ATCTGGAGTC TTTGATAAAT CTGCATTAGA CCAGGCTATA 12001 TGCTAGGAAT GAAATCTGGG CAAATATCGA TGGGTTTTCA AAGAATGCTC 12051 CATGTTCATT GGGCCCTTTC ACACCCCACA GTGATAAATG AAAAGGATAG 12101 AGGTAGTTTT TTCAAAAGAG CACTTTAATA ATATCCTCTG AGACCTAATG 12151 CAGTTTAACA AATGACTCCA CCTATTTTTC CAGTAGGTAA ATTGACTGAG 12201 ACTTGCAAAA TACCCCTGAG AGTTGTCAGG GGTGTCTTCT GCCTGGTCTA 12251 TAGCGTGTGT GTTTGCTTTG TATCTAACAG GCACATTCAC GTCTCGTGTA 12301 CTCATATGAA GTATTTCCTA ACATTCCCAT TAGCCTGTAT ATAAGAATCA 12351 GAAAGATAAT CCCAACATGT TGTAAATGAA GATGTGACTC TATAACCTTT 12401 CTCTTCTTCC TGGAAAAAAA AGGACATTTT CATGCATATT TTAAACAGAA 12451 ATTTTGTATA TTTAAGTGTC ATAGAAAATA TTTATTGAGT AACTGGGACA 12501 CAAATGGGAA TTTAATTGTC ATCATATGCT TTGTGTGTGG GGATGCTTAC 12551 CAACACCATG TCGCTGGACC ATTGTGGCAA GCCATAACTG CACAAAGAGT 12601 ACACATCGTC AGTGTGTGTG TGTGTGTGTG TGTGCGCGCA CGCACGTGCG 12651 TGTGTGTGTC CCTGCATGTG CAACATGTCT AGCTTGCTGT CCTTCATGGG 12701 ATTTTAGCTT TCCCTTCTTG AAAAACATTA TTTTACAGTT CCAGGAGGCC 12751 CTGGTTACAT TACTATATGA AGGCAGTGAT TTGAAATGAA AATTCCTTTC 12801 CTCTTGGAAG CTTTGGTCAT AATATCATGG TTCAATTAAA CGGATTCCAC 12851 CGGACTTTGT GATGAAAAAG GCTCTGTTAA AATCCAATTG AGTTTCCAAG 12901 AGGAAATTGT AGTAGGTCAA GATGCATGAG AGGGAAGATG GAGGCCACCT 12951 CAGCTGGAGA ACATGAGCTG AGTTGAGCCC TCAGTGTTGA AGTTGACTTG 13001 CTCCAAGCTG CAGTCTAAAA CCCTGGGGCC CGTGCCTGGC CTATGCTCCC 13051 TCCCAAGTAA GTAGAGGAGC AGAACCATCA GGAACAGCCT GCCTGGCTCC 13101 
TATGAAGAAA ACTTCCTGAC GTCCTGTCCC CAAAGGAAGA CCCTTTCCCC 13151 AAGGGCACCC CAGGTGGCCA TTAAATTGTG ATGATCATTC AGAAAGTGCC 13201 CCCTTGGCTT TATGAGAATC CAATTAGTCT TCTGAACCAC CTTTTCTTGG 13251 GTGCAGATTT CCAACATTCA TGCTCATTGC AGATCCACCA ACTGTCACTG 13301 TTCTTAACAA GCATGCTCGT CTTGTCAGAA TTTCAGTAAG TTCCAATTTC 13351 CTGTACAGAC CAGGGTAAAC TGTTCTAAAA TCAATCAATT AATGAAATGT 13401 TATCTGGTTT TTAAAAGCTG GTTTCATGTG CTTTATGTGT ATAAAACTAT 13451 ATCTGCCTGT GTGGCTTTGC ATTTCAAATG TGTGGCGCAC AAGCGTTTTG 13501 TTGGTGCTTT GTTCTCAGTA CAGTAACTCT GTGTACAAAC ATTTTAATGT 13551 GGTTTTGTTG TTTTCCAACA AGATGTCTCT GTAAAAATGA TATTGGCTGA 13601 GCTGGTGCGT TGGTTTCTCT CATAGAGGCA TTAACTATAC TGCCAATGCA 13651 TTGAATTATT TAAAAATGCA AAATAAAATT TTTATGAAAA TCTCA // LOCUS HUMGALC 3777 bp mRNA PRI 10-MAR-1994 DEFINITION Homo sapiens galactocerebrosidase (GALC) mRNA, complete cds. ACCESSION L23116 NID g431309 VERSION L23116.1 GI:431309 KEYWORDS galactocerebrosidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3777) AUTHORS Chen,Y.Q., Rafi,M.A., de Gala,G. and Wenger,D.A. TITLE Cloning and expression of cDNA encoding human galactocerebrosidase, the enzyme deficient in globoid cell leukodystrophy JOURNAL Hum. Mol. Genet. 2 (11), 1841-1845 (1993) MEDLINE 94108435 FEATURES Location/Qualifiers source 1. .3777 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, kidney" /tissue_lib="Clontech" gene 48. .3777 /gene="GALC" CDS 48. 
.2057 /gene="GALC" /codon_start=1 /product="galctocerebrosidase" /protein_id="AAA16645.1" /db_xref="PID:g431310" /db_xref="GI:431310" /translation="MTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA VSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMH YALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVV TWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESIS ASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRIL NQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQF TQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQF ATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTL TTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPG EHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRV NKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHF ASGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR" polyA_site 3777 /gene="GALC" BASE COUNT 1040 a 744 c 813 g 1180 t ORIGIN 1 TGGCTGAGTG GCTACTCTCG GCTTCCTGGC AACGCCGAGC GAAAGCTATG 51 ACTGCGGCCG CGGGTTCGGC GGGCCGCGCC GCGGTGCCCT TGCTGCTGTG 101 TGCGCTGCTG GCGCCCGGCG GCGCGTACGT GCTCGACGAC TCCGACGGGC 151 TGGGCCGGGA GTTCGACGGC ATCGGCGCGG TCAGCGGCGG CGGGGCAACC 201 TCCCGACTTC TAGTAAATTA CCCAGAGCCC TATCGTTCTC AGATATTGGA 251 TTATCTCTTT AAGCCGAATT TTGGTGCCTC TTTGCATATT TTAAAAGTGG 301 AAATAGGTGG TGATGGGCAG ACAACAGACG GCACTGAGCC CTCCCACATG 351 CATTATGCAC TAGATGAGAA TTATTTCCGA GGATACGAGT GGTGGTTGAT 401 GAAAGAAGCT AAGAAGAGGA ATCCCAATAT TACACTCATT GGGTTGCCAT 451 GGTCATTCCC TGGATGGCTG GGAAAAGGTT TCGACTGGCC TTATGTCAAT 501 CTTCAGCTGA CTGCCTATTA TGTCGTGACC TGGATTGTGG GCGCCAAGCG 551 TTACCATGAT TTGGACATTG ATTATATTGG AATTTGGAAT GAGAGGTCAT 601 ATAATGCCAA TTATATTAAG ATATTAAGAA AAATGCTGAA TTATCAAGGT 651 CTCCAGCGAG TGAAAATCAT AGCAAGTGAT AATCTCTGGG AGTCCATCTC 701 TGCATCCATG CTCCTTGATG CCGAACTTTT CAAGGTGGTT GATGTTATAG 751 GGGCTCATTA TCCTGGAACC CATTCAGCAA AAGATGCAAA GTTGACTGGG 801 AAGAAGCTTT GGTCTTCTGA AGACTTTAGC ACTTTAAATA GTGACATGGG 851 TGCAGGCTGC TGGGGTCGCA TTTTAAATCA GAATTATATC AATGGCTATA 901 TGACTTCCAC AATCGCATGG AATTTAGTGG 
CTAGTTACTA TGAACAGTTG 951 CCTTATGGGA GATGCGGGTT GATGACGGCC CAAGAGCCAT GGAGTGGGCA 1001 CTACGTGGTA GAATCTCCTG TCTGGGTATC AGCTCATACC ACTCAGTTTA 1051 CTCAACCTGG CTGGTATTAC CTGAAGACAG TTGGCCATTT AGAGAAAGGA 1101 GGAAGCTACG TAGCTCTGAC TGATGGCTTA GGGAACCTCA CCATCATCAT 1151 TGAAACCATG AGTCATAAAC ATTCTAAGTG CATACGGCCA TTTCTTCCTT 1201 ATTTCAATGT GTCACAACAA TTTGCCACCT TTGTTCTTAA GGGATCTTTT 1251 AGTGAAATAC CAGAGCTACA GGTATGGTAT ACCAAACTTG GAAAAACATC 1301 CGAAAGATTT CTTTTTAAGC AGCTGGATTC TCTATGGCTC CTTGACAGTG 1351 ATGGCAGTTT CACACTGAGC CTGCATGAAG ATGAGCTGTT CACACTCACC 1401 ACTCTCACCA CTGGTCGCAA AGGCAGCTAC CCGCTTCCTC CAAAATCCCA 1451 GCCCTTCCCA AGTACCTATA AGGATGATTT CAATGTTGAT TACCCATTTT 1501 TTAGTGAAGC TCCAAACTTT GCTGATCAAA CTGGTGTATT TGAATATTTT 1551 ACAAATATTG AAGACCCTGG CGAGCATCAC TTCACGCTAC GCCAAGTTCT 1601 CAACCAGAGA CCCATTACGT GGGCTGCCGA TGCATCCAAC ACAATCAGTA 1651 TTATAGGAGA CTACAACTGG ACCAATCTGA CTATAAAGTG TGATGTTTAC 1701 ATAGAGACCC CTGACACAGG AGGTGTGTTC ATTGCAGGAA GAGTAAATAA 1751 AGGTGGTATT TTGATTAGAA GTGCCAGAGG AATTTTCTTC TGGATTTTTG 1801 CAAATGGATC TTACAGGGTT ACAGGTGATT TAGCTGGATG GATTATATAT 1851 GCTTTAGGAC GTGTTGAAGT TACAGCAAAA AAATGGTATA CACTCACGTT 1901 AACTATTAAG GGTCATTTCG CCTCTGGCAT GCTGAATGAC AAGTCTCTGT 1951 GGACAGACAT CCCTGTGAAT TTTCCAAAGA ATGGCTGGGC TGCAATTGGA 2001 ACTCACTCCT TTGAATTTGC ACAGTTTGAC AACTTTCTTG TGGAAGCCAC 2051 ACGCTAATAC TTAACAGGGC ATCATAGAAT ACTCTGGATT TTCTTCCCTT 2101 CTTTTTGGTT TTGGTTCAGA GCCAATTCTT GTTTCATTGG AACAGTATAT 2151 GAGGCTTTTG AGACTAAAAA TAATGAAGAG TAAAAGGGGA GAGAAATTTA 2201 TTTTTAATTT ACCCTGTGGA AGATTTTATT AGAATTAATT CCAAGGGGAA 2251 AACTGGTGAA TCTTTAACAT TACCTGGTGT GTTCCCTAAC ATTCAAACTG 2301 TGCATTGGCC ATACCCTTAG GAGTGGTTTG AGTAGTACAG ACCTCGAAGC 2351 CTTGCTGCTA ACACCTGAGG TAGCTCTCTT CATCTTATTT GCGAGCGGTC 2401 TCTGTAGAGT GGCAGTAACT TGATCATCAC TGAGATGTAT TGTATGCATG 2451 CTGACCGTGT GTCCAAGTGA GCCAGTGTCT GTCATCACAA GATGATGCTG 2501 CCATAATAGA AAGCTGAAGA ACACTAGAAG TAGCTTCTTG AAAACCACTT 2551 CAACCTGTTA TGCTTTATGC TCTAAAAAGT ATTTTTTTAT 
TTTCCTTTTT 2601 AAGATGATAC TTTTGAAATG CAGGATATGG ATGAGTGGGA TGATTTTAAA 2651 AACGCCTGTT TAATAAACTA CCTCTAACAC TATTTCTGCG GTAATAGATA 2701 TTAGCAGATT AATTGGGTTA TTTGCATTAT TTAATTTTTT TGATTCCAAG 2751 GTTTTGGTCT TGTAACCACT ATCACTCTCT GTGAACGTTT TTCCAGGTGG 2801 CTGGAAGAAG GAAGAAAACC TGATATAGCC AATGCTGTTG TAGTCGTTTC 2851 CTCAGCCTCA TCTCACTGTG CTGTGGTCTG TCCTCACATG TGCACTGGTA 2901 ACAGACTCAC ACAGCTGATG AATGCTTTTC TCTCCTTATG TGTGGAAGGA 2951 GGGGAGCACT TAGACATTTG CTAACTCCCA GAGTTGGATC ATCTCCTAAG 3001 ATGTACTTAC TTTTTAAAGT CCAAATATGT TTATATTTAA ATATACGTGA 3051 GCATGTTCAT CATGTTGTAT GATTTATACT AAGCATTAAT GTGGCTCTAT 3101 GTAGCAAATC AGTTATTCAT GTAGGTAAAG TAAATCTAGA ATTATTTATA 3151 AGAATTACTC ATTGAACTAA TTCTACTATT TAGGAATTTG TAAGAGTCTA 3201 ACATAGGCTT AGCTACAGTG AAGTTTTGCA TTGCTTTTGA AGACAAGAAA 3251 AGTGCTAGAA TAAATAAGAT TACAGAGAAA ATTTTTTGTT AAAACCAAGT 3301 GATTTCCAGC TGATGTATCT AATATTTTTT AAAACAAACA TTATAGAGGT 3351 GTAATTTATT TACAATAAAA TGTTCCTACT TTAAATATAC AATTCAGTGA 3401 GTTTTGATAA ATTGATATAC CCATGTAACC AACACTCCAG TCAAGCTTCA 3451 GAATATTTCC ATCACCCCAG AAGGTTCTCT TGTATACCTG CTCAGTCAGT 3501 TCCTTTCACT CCCAATTGTT GGCAGCCATT GATAGGAATT CTATCACTAT 3551 AGGTTAGTTT TCTTTGTTCC AGAACATCAT GAAAGCGGCG TCATGTACTG 3601 TGTATTCTTA TGAATGGTTT CTTTCCATCA GCATAATGCT TTGAGATTGG 3651 TCCATGTTGT GTGATTCAGT GGTTTGTTCC TTCTTATTTC TGAAAAGTTT 3701 TCCATTGTAT GAATATACCA CAATTTGTTT CCTCCCCACC AGTTTCTGAT 3751 ACTACAATTA AAACTGTCTA CATTTAC // LOCUS HUMPAX2A 3421 bp mRNA PRI 07-JAN-1995 DEFINITION Human paired-box protein (PAX2) mRNA, complete cds. ACCESSION M89470 NID g409138 VERSION M89470.1 GI:409138 KEYWORDS paired-box protein. SOURCE Homo sapiens (tissue library: lanbda-gt10 of Graham Bell and Clontech) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3421) AUTHORS Eccles,M.R., Wallis,L.J., Fidler,A.E., Spurr,N.K., Goodfellow,P.J. and Reeve,A.E. 
TITLE Expression of the PAX2 gene in human fetal kidney and Wilms' tumor JOURNAL Cell Growth Differ. 3 (5), 279-289 (1992) MEDLINE 92338102 COMMENT On Oct 21, 1993 this sequence version replaced gi:189630. FEATURES Location/Qualifiers source 1. .3421 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="lanbda-gt10 of Graham Bell and Clontech" /map="10q22.1-q24.3" gene 544. .1725 /gene="PAX2" CDS 544. .1725 /gene="PAX2" /note="octapeptide sequence bp 1096. .1120; paired box domain bp 589. .979" /codon_start=1 /product="paired-box protein" /protein_id="AAA60024.1" /db_xref="PID:g409139" /db_xref="GI:409139" /translation="MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVEL AHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEY KRQNPTMFAWEIRDRLLAEGICDNDTVPSVSSINRIIRTKVQQPFHPTPDGAGTGVTA PGHTIVPSTASPPVSSASNDPVGSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQ SGVDSLRKHLRADTFTQQQLEALDRVFERPSYPDVFQASEHIKSEQGNEYSLPALTPG LDEVKSSLSASTNPELGSNVSGTQTYPVVTGRDMASTTLPGYPPHVPPTGQGSYPTST LAGMVPGSEFSGNPYSHPQYTAYNEAWRFSNPALLSSPYYYSAAPRSAPAARAAAYDR H" BASE COUNT 593 a 1264 c 929 g 635 t ORIGIN 1 CGGGGGCCTG GCCGCGCGCT CCCCTCCCGC AGGCGCCACC TCGGACATCC 51 CCGGGATTGC TACTTCTCTG CCAACTTCGC CAACTCGCCA GCACTTGGAG 101 AGGCCCGGCT CCCCTCCCGG CGCCCTCTGA CCGCCCCCGC CCCGCGGCGC 151 TCTCCGACCA CCGCCTCTCG GATGACCAGG TTCCAGGGGA GCTGAGCGAG 201 TCGCCTCCCC CGCCCAGCTT CAGCCCTGGC TGCAGCTGCA GCGCGAGCCA 251 TGCGCCCCCA GTGCACCCCG GCCCACCGCC CCGGGGCCAT TCTGCTGACC 301 GCCCAGCCCC GAGCCCCGAC AGTGGCAAGT TGCGGCTACT GCAGTTGCAA 351 GCTCCGGCCA ACCCGGAGGA GCCCCACGGG GAAGGCAGTC GTGCGCCCCC 401 CGCCCCCGGG CGCCCCGCAG CAGCCGGGCG TTCACTCATC CTCCCTCCCC 451 CACCGTCCCT CCCTTTTCTC CTCAAGTCCT GAAGTTGAGT TTGAGAGGCG 501 ACACGGCGGC GGCGCCGCGC TGCTCCCGCT CCTCTGCCTC CCCATGGATA 551 TGCACTGCAA AGCAGACCCC TTCTCCGCGA TGCACCCAGG GCACGGGGGT 601 GTGAACCAGC TCGGGGGGGT GTTTGTGAAC GGCCGGCCCC TACCCGACGT 651 GGTGAGGCAG CGCATCGTGG AGCTGGCCCA CCAGGGTGTG CGGCCCTGTG 701 ACATCTCCCG GCAGCTGCGG GTCAGCCACG GCTGTGTCAG CAAAATCCTG 751 GGCAGGTACT 
ACGAGACCGG CAGCATCAAG CCGGGTGTGA TCGGTGGCTC 801 CAAGCCCAAA GTGGCGACGC CCAAAGTGGT GGACAAGATT GCTGAATACA 851 AACGACAGAA CCCGACTATG TTCGCCTGGG AGATTCGAGA CCGGCTCCTG 901 GCCGAGGGCA TCTGTGACAA TGACACAGTG CCCAGCGTCT CTTCCATCAA 951 CAGAATCATC CGGACCAAAG TTCAGCAGCC TTTCCACCCA ACGCCGGATG 1001 GGGCTGGGAC AGGAGTGACC GCCCCTGGCC ACACCATTGT TCCCAGCACG 1051 GCCTCCCCTC CTGTTTCCAG CGCCTCCAAT GACCCAGTGG GATCCTACTC 1101 CATCAATGGG ATCCTGGGGA TTCCTCGCTC CAATGGTGAG AAGAGGAAAC 1151 GTGATGAAGA TGTGTCTGAG GGCTCAGTCC CCAATGGAGA TTCCCAGAGT 1201 GGTGTGGACA GTTTGCGGAA GCACTTGCGA GCTGACACCT TCACCCAGCA 1251 GCAGCTGGAA GCTTTGGATC GGGTCTTTGA GCGTCCTTCC TACCCTGACG 1301 TCTTCCAGGC ATCAGAGCAC ATCAAATCAG AACAGGGGAA CGAGTACTCC 1351 CTCCCAGCCC TGACCCCTGG GCTTGATGAA GTCAAGTCGA GTCTATCTGC 1401 ATCCACCAAC CCTGAGCTGG GCAGCAACGT GTCAGGCACA CAGACATACC 1451 CCGTTGTGAC TGGTCGTGAC ATGGCGAGCA CCACTCTGCC TGGTTACCCC 1501 CCTCACGTGC CCCCCACTGG CCAGGGAAGC TACCCCACCT CCACCCTGGC 1551 AGGAATGGTG CCTGGGAGCG AGTTCTCCGG CAACCCGTAC AGCCACCCCC 1601 AGTACACGGC CTACAACGAG GCTTGGAGAT TCAGCAACCC CGCCTTACTA 1651 AGTTCCCCTT ATTATTATAG TGCCGCCCCC CGGTCCGCCC CTGCCGCTCG 1701 TGCCGCTGCC TATGACCGCC ACTAGTTACC GCGGGGACCA CATCAAGCTT 1751 CAGGCCGACA GCTTCGGCCT CCACATCGTC CCCGTCTGAC CCCACCCCGG 1801 AGGAGGGAGG ACCGACGCGA CGCATGCCTC CCGGCCACCG CCCCAGCCTC 1851 ACCCCATCCC ACGACCCCCG CAACCCTTCA CATCACCCCC CTCGAAGGTC 1901 GGACAGGACG GGTGGAGCCG CGGGGCGGGA CCCTCAGGCC CGGGCCCACC 1951 GCCCCCAGCC CCGCCTGCCG CCCCTCCCCG CCTGCCTGGA CTGCGCGGCG 2001 CCGTGAGGGG GATTCGGCCC AGCTCGTCCC GGCCTCCACC AAGCCAGCCC 2051 CGAAGCCCGC CAGCCACCCT GCCGTACTCG GGCGCGACCT GCTGGTGCGC 2101 GCCGGATGTT TCTGTGACAC ACAATCAGCG CGGACCGCAG CGCGGCCCAG 2151 CCCCGGGCAC CCGCCTCGGA CGCTCGGGCG CCAGGAGCTT CGCTGGAGGG 2201 GCTGGGCCAA GGAGATTAAG AAGAAAACGA CTTTCTGCAG GAGGAAGAGC 2251 CCGCTGCCGA ATCCCTGGGA AAAATTCTTT TCCCCCAGTG CCAGCCGGAC 2301 TGCCCTCGCC TTCCGGGTGT GCCCTGTCCC AGAAGATGGA ATGGGGGTGT 2351 GGGGGTCCGG CTCTAGGAAC GGGCTTTGGG GGCGTCAGGT CTTTCCAAGG 2401 TTGGGACCCA AGGATCGGGG GGCCCAGCAG 
CCCGCACCGA TCGAGCCGGA 2451 CTCTCGGCTC TTCACTGCTC CTCCTGGCCT GCCTAGTTCC CCAGGGCCCG 2501 GCACCTCCTG CTGCGAGACC CGGCTCTCAG CCCTGCCTTG CCCCTACCTC 2551 AGCGTCTCTT CCACCTGCTG GCCTCCCAGT TTCCCCTCCT GCCAGTCCTT 2601 CGCCTGTCCC TTGACGCCCT GCATCCTCCT CCCTGACTCG CAGCCCCATC 2651 GGACGCTCTC CCGGGACCGC CGCAGGACCA GTTTCCATAG ACTGCGGACT 2701 GGGGTCTTCC TCCAGCAGTT ACTTGATGCC CCCTCCCCCG ACACAGACTC 2751 TCAATCTGCC GGTGGTAAGA ACCGGTTCTG AGCTGGCGTC TGAGCTGCTG 2801 CGGGGTGGAA GTGGGGGGCT GCCCACTCCA CTCCTCCCAT CCCCTCCCAG 2851 CCTCCTCCTC CGGCAGGAAC TGAACAGAAC CACAAAAAGT CTACATTTAT 2901 TTAATATGAT GGTCTTTGCA AAAAGGAACA AAACAACACA AAAGCCCACC 2951 AGGCTGCTGC TTTGTGGAAA GACGGTGTGT GTCGTGTGAA GGCGAAACCC 3001 GGTGTACATA ACCCCTCCCC CTCCGCCCCG CCCCGCCCGG CCCCGTAGAG 3051 TCCCTGTCGC CCGCCGGCCC TGCCTGTAGA TACGCCCCGC TGTCTGTGCT 3101 GTGAGAGTCG CCGCTCGCTG GGGGGGAAGG GGGGGACACA GCTACACGCC 3151 CATTAAAGCA CAGCACGTCC TGGGGGAGGG GGGCATTTTT TATGTTACAA 3201 AAAAAAATTA CGAAGAAAGA ATCTCATTTG CAAAATAGCG AACATGGTCT 3251 GTGACTCCTC TGGCCTGTTT GTTGGCTCTT TCTCTGTAAT TCCGTGTTTT 3301 CGCTTTTTCC TCCCTGCCCC TCTCTCCCTC TGCCCCTCTC TCCTCTCCGC 3351 TTCTCTCCCC CTCTGTCTCT GTCTCTCTCC GTCTCTGTCG CTCTTGTCTG 3401 TCTGTCTCTG CTCTTTCTCG C // LOCUS AB023207 4565 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0990 protein, complete cds. ACCESSION AB023207 NID g4589623 VERSION AB023207.1 GI:4589623 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk05454. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 
6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 4565) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4565 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk05454" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 495. .2903 /gene="KIAA0990" CDS 495. .2903 /gene="KIAA0990" /codon_start=1 /product="KIAA0990 protein" /protein_id="BAA76834.1" /db_xref="PID:d1040587" /db_xref="PID:g4589624" /db_xref="GI:4589624" /translation="MAARGRRAWLSVLLGLVLGFVLASRLVLPRASELKRAGPRRRAS PEGCRSGQAAASQAGGARGDARGAQLWPPGSDPDGGPRDRNFLFVGVMTAQKYLQTRA VAAYRTWSKTIPGKVQFFSSEGSDTSVPIPVVPLRGVDDSYPPQKKSFMMLKYMHDHY LDKYEWFMRADDDVYIKGDRLENFLRSLNSSEPLFLGQTGLGTTEEMGKLALEPGENF CMGGPGVIMSREVLRRMVPHIGKCLREMYTTHEDVEVGRCVRRFAGVQCVWSYEMQQL FYENYEQNKKGYIRDLHNSKIHQAITLHPNKNPPYQYRLHSYMLSRKISELRHRTIQL HREIVLMSKYSNTEIHKEDLQLGIPPSFMRFQPRQREEILEWEFLTGKYLYSAVDGQP PRRGMDSAQREALDDIVMQVMEMINANAKTRGRIIDFKEIQYGYRRVNPMYGAEYILD LLLLYKKHKGKKMTVPVRRHAYLQQTFSKIQFVEHEELDAQELAKRINQESGSLSFLS NSLKKLVPFQLPGSKSEHKEPKDKKINILIPLSGRFDMFVRFMGNFEKTCLIPNQNVK LVVLLFNSDSNPDKAKQVELMTDYRIKYPKADMQILPVSGEFSRALALEVGSSQFNNE SLLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQYDPKIVYSGKVPSDNHFAFT QKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLEDVDLFNKVVQAGLKTFRSQEV GVVHVHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQLAEMWLEKNDPSYSKSSNNNGS VRTA" BASE COUNT 1091 a 1084 c 1200 g 1190 t ORIGIN 1 GGCGAGCTAA GCCGGAGGAT GTGCAGCTGC GGCGGCGGCG CCGGCTACGA 51 AGAGGACGGG GACAGGCGCC GTGCGAACCG AGCCCAGCCA GCCGGAGGAC 101 GCGGGCAGGG CGGGACGGGA GCCCGGACTC GTCTGCCGCC GCCGTCGTCG 151 CCGTCGTGCC GGCCCCGCGT CCCCGCGCGC GAGCGGGAGG AGCCGCCGCC 201 ACCTCGCGCC CGAGCCGCCG CTAGCGCGCG CCGGGCATGG TCCCCTCTTA 251 AAGGCGCAGG CCGCGGCGGC GGGGGCGGGC 
GTGCGGAACA AAGCGCCGGC 301 GCGGGGCCTG CGGGCGGCTC GGGGGCCGCG ATGGGCGCGG CGGGCCCGCG 351 GCGGCGGCGG CGCTGCCCGG GCCGGGCCTC GCGGCGCTAG GGCGGGCTGG 401 CCTCCGCGGG CGGGGGCAGC GGGCTGAGGG CGCGCGGGGC CTGCGGCGGC 451 GGCGGCGGCG GCGGCGGCGG CCCGGCGGGC GGAGCGGCGC GGGCATGGCC 501 GCGCGCGGCC GGCGCGCCTG GCTCAGCGTG CTGCTCGGGC TCGTCCTGGG 551 CTTCGTGCTG GCCTCGCGGC TCGTCCTGCC CCGGGCTTCC GAGCTGAAGC 601 GAGCGGGCCC ACGGCGCCGC GCCAGCCCCG AGGGCTGCCG GTCCGGGCAG 651 GCGGCGGCTT CCCAGGCCGG CGGGGCGCGC GGCGATGCGC GCGGGGCGCA 701 GCTCTGGCCG CCCGGCTCGG ACCCAGATGG CGGCCCGCGC GACAGGAACT 751 TTCTCTTCGT GGGAGTCATG ACCGCCCAGA AATACCTGCA GACTCGGGCC 801 GTGGCCGCCT ACAGAACATG GTCCAAGACA ATTCCTGGGA AAGTTCAGTT 851 CTTCTCAAGT GAGGGTTCTG ACACATCTGT ACCAATTCCA GTAGTGCCAC 901 TACGGGGTGT GGACGACTCC TACCCGCCCC AGAAGAAGTC CTTCATGATG 951 CTCAAGTACA TGCACGACCA CTACTTGGAC AAGTATGAAT GGTTTATGAG 1001 AGCAGATGAT GACGTGTACA TCAAAGGAGA CCGTCTGGAG AACTTCCTGA 1051 GGAGTTTGAA CAGCAGCGAG CCCCTCTTTC TTGGGCAGAC AGGCCTGGGC 1101 ACCACGGAAG AAATGGGAAA ACTGGCCCTG GAGCCTGGTG AGAACTTCTG 1151 CATGGGGGGG CCTGGCGTGA TCATGAGCCG GGAGGTGCTT CGGAGAATGG 1201 TGCCGCACAT TGGCAAGTGT CTCCGGGAGA TGTACACCAC CCATGAGGAC 1251 GTGGAGGTGG GAAGGTGTGT CCGGAGGTTT GCAGGGGTGC AGTGTGTCTG 1301 GTCTTATGAG ATGCAGCAGC TTTTTTATGA GAATTACGAG CAGAACAAAA 1351 AGGGGTACAT TAGAGATCTC CATAACAGTA AAATTCACCA AGCTATCACA 1401 TTACACCCCA ACAAAAACCC ACCCTACCAG TACAGGCTCC ACAGCTACAT 1451 GCTGAGCCGC AAGATATCCG AGCTCCGCCA TCGCACAATA CAGCTGCACC 1501 GCGAAATTGT CCTGATGAGC AAATACAGCA ACACAGAAAT TCATAAAGAG 1551 GACCTCCAGC TGGGAATCCC TCCCTCCTTC ATGAGGTTTC AGCCCCGCCA 1601 GCGAGAGGAG ATTCTGGAAT GGGAGTTTCT GACTGGAAAA TACTTGTATT 1651 CGGCAGTTGA CGGCCAGCCC CCTCGAAGAG GAATGGACTC CGCCCAGAGG 1701 GAAGCCTTGG ACGACATTGT CATGCAGGTC ATGGAGATGA TCAATGCCAA 1751 CGCCAAGACC AGAGGGCGCA TCATTGACTT CAAAGAGATC CAGTACGGCT 1801 ACCGCCGGGT GAACCCCATG TATGGGGCTG AGTACATCCT GGACCTGCTG 1851 CTTCTGTACA AAAAGCACAA AGGGAAGAAA ATGACGGTCC CTGTGAGGAG 1901 GCACGCGTAT TTACAGCAGA CTTTCAGCAA AATCCAGTTT GTGGAGCATG 1951 
AGGAGCTGGA TGCACAAGAG TTGGCCAAGA GAATCAATCA GGAATCTGGA 2001 TCCTTGTCCT TTCTCTCAAA CTCCCTGAAG AAGCTCGTCC CCTTTCAGCT 2051 CCCTGGGTCG AAGAGTGAGC ACAAAGAACC CAAAGATAAA AAGATAAACA 2101 TACTGATTCC TTTGTCTGGG CGTTTCGACA TGTTTGTGAG ATTTATGGGA 2151 AACTTTGAGA AGACGTGTCT TATCCCCAAT CAGAACGTCA AGCTCGTGGT 2201 TCTGCTTTTC AATTCTGACT CCAACCCTGA CAAGGCCAAA CAAGTTGAAC 2251 TGATGACAGA TTACCGCATT AAGTACCCTA AAGCCGACAT GCAGATTTTG 2301 CCTGTGTCTG GAGAGTTTTC AAGAGCCCTG GCCCTGGAAG TAGGATCCTC 2351 CCAGTTTAAC AATGAATCTT TGCTCTTCTT CTGCGACGTC GACCTCGTCT 2401 TTACTACAGA ATTCCTTCAG CGATGTCGAG CAAATACAGT TCTGGGCCAA 2451 CAAATATATT TTCCAATCAT CTTCAGCCAG TATGACCCAA AGATTGTTTA 2501 TAGTGGGAAA GTTCCCAGTG ACAACCATTT TGCCTTTACT CAGAAAACTG 2551 GCTTCTGGAG AAACTATGGG TTTGGCATCA CGTGTATTTA TAAGGGAGAT 2601 CTTGTCCGAG TGGGTGGCTT TGATGTTTCC ATCCAAGGCT GGGGGCTGGA 2651 GGATGTGGAC CTTTTCAACA AGGTTGTCCA GGCAGGTTTG AAGACGTTTA 2701 GGAGCCAGGA AGTAGGAGTA GTCCACGTCC ACCATCCTGT CTTTTGTGAT 2751 CCCAATCTTG ACCCCAAACA GTACAAAATG TGCTTGGGGT CCAAAGCATC 2801 GACCTATGGG TCCACACAGC AGCTGGCTGA GATGTGGCTG GAAAAAAATG 2851 ATCCAAGTTA CAGTAAAAGC AGCAATAATA ATGGCTCAGT GAGGACAGCC 2901 TAATGTCCAG CTTTGCTGGA AAAGACGTTT TTAATTATCT AATTTATTTT 2951 TCAAAAATTT TTTGTATGAT CAGTTTTTGA AGTCCGTATA CAAGGATATA 3001 TTTTACAAGT GGTTTTCTTA CATAGGACTC CTTTAAGATT GAGCTTTCTG 3051 AACAAGAAGG TGATCAGTGT TTGCCTTTGA ACACATCTTC TTGCTGAACA 3101 TTATGTAGCA GACCTGCTTA ACTTTGACTT GAAATGTACC TGATGAACAA 3151 AACTTTTTTA AAAAAATGTT TTCTTTTGAG ACCCTTTGCT CCAGTCCTAT 3201 GGCAGAAAAC GTGAACATTC CTGCAAAGTA TTATTGTAAC AAAACACTGT 3251 AACTCTGGTA AATGTTCTGT TGTGATTGTT AACATTCCAC AGATTCTACC 3301 TTTTGTGTTT TGTTTTTTTT TTTTTACAAT TGTTTTAAAG CCATTTCATG 3351 TTCCAGTTGT AAGATAAGGA AATGTGATAA TAGCTGTTTC ATCATTGTCT 3401 TCAGGAGAGC TTTCCAGAGT TGATCATTTC CCCTCATGGT ACTCTGCTCA 3451 GCATGGCCAC GTAGGTTTTT TGTTTGTTTT GTTTTGTTCT TTTTTTGAGA 3501 CGGAGTCTCA CTCTGTTACC CAGGCTGGAA TGCAGTGGCG CAATCTTGGC 3551 TCACTTTAAC CTCCACTTCC CTGGTTCAAG CAATTCCCCT GCCTTTGCCT 3601 CCCGAGTAGC 
TGGGATTACA GGCACACACC ACCACGCCCA GCTAGTTTTT 3651 TTGTATTTTT AGTAGAGACG GGGTTTCACC ATGCAAGCCC AGCTGGCCAC 3701 GTAGGTTTTA AAGCAAGGGG CGTGAAGAAG GCACAGTGAG GTATGTGGCT 3751 GTTCTCGTGG TAGTTCATTC GGCCTAAATA GACCTGGCAT TAAATTTCAA 3801 GAAGGATTTG GCATTTTCTC TTCTTGACCC TTCTCTTTAA AGGGTAAAAT 3851 ATTAATGTTT AGAATGACAA AGATGAATTA TTACAATAAA TCTGATGTAC 3901 ACAGACTGAA ACACACACAC ATACACCCTA ATCAAAACGT TGGGGAAAAA 3951 TGTATTTGGT TTTGTTCCTT TCATCCTGTC TGTGTTATGT GGGTGGAGAT 4001 GGTTTTCATT CTTTCATTAC TGTTTTGTTT TATCCTTTGT ATCTGAAATA 4051 CCTTTAATTT ATTTAATATC TGTTGTTCAG AGCTCTGCCA TTTCTTGAGT 4101 ACCTGTTAGT TAGTATTATT TATGTGTATC GGGAGTGTGT TTAGTCTGTT 4151 TTATTTGCAG TAAACCGATC TCCAAAGATT TCCTTTTGGA AACGCTTTTT 4201 CCCCTCCTTA ATTTTTATAT TCCTTACTGT TTTACTAAAT ATTAAGTGTT 4251 CTTTGACAAT TTTGGTGCTC ATGTGTTTTG GGGACAAAAG TGAAATGAAT 4301 CTGTCATTAT ACCAGAAAGT TAAATTCTCA GATCAAATGT GCCTTAATAA 4351 ATTTGTTTTC ATTTAGATTT CAAACAGTGA TAGACTTGCC ATTTTAATAC 4401 ACGTCATTGG AGGGCTGCGT ATTTGTAAAT AGCCTGATGC TCATTTGGAA 4451 AAATAAACCA GTGAACAATA TTTTTCTATT GTACTTTTCA GAACCATTTT 4501 GTCTCATTAT TCCTGTTTTA GCTGAAGAAT TGTATTACAT TTGGAGAGTA 4551 AAAAACTTAA ACACG // LOCUS HSCSIST 2498 bp mRNA PRI 12-SEP-1993 DEFINITION Human mRNA for c-sis gene (clone pSM-1). ACCESSION X02744 NID g30246 VERSION X02744.1 GI:30246 KEYWORDS oncogene cellular; platelet-derived growth factor; sis cellular oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2498) AUTHORS Ratner,L., Josephs,S.F., Jarrett,R., Reitz,M.S. Jr. and Wong-Staal,F. TITLE Nucleotide sequence of transforming human c-sis cDNA clones with homology to platelet-derived growth factor JOURNAL Nucleic Acids Res. 13 (14), 5007-5018 (1985) MEDLINE 85269623 FEATURES Location/Qualifiers source 1. .2498 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 118. 
.843 /note="open reading frame (c-sis) (aa 1-241)" /codon_start=1 /protein_id="CAA26524.1" /db_xref="PID:g30247" /db_xref="GI:30247" /db_xref="SWISS-PROT:P01127" /translation="MNRCWALFLSLCCYLRLVSAEGDPIPEELYEMLSDHSIRSFDDL QRLLHGDPGEEDGAELDLNMTRSHSGGELESLARGRRSLGSLTIAEPAMIAECKTRTE VFEISRRLIDRTNANFLVWPPCVEVQRCSGCCNNRNVQCRPTQVQLRPVQVRKIEIVR KKPIFKKATVTLEDHLACKCETVAAARPVTRSPGGSQEQRAKTPQTRVTIRTVRVRRP PKGKHRKFKHTHDKTALKETLGA" misc_feature 180. .181 /note="RNA splice (-1)-1 site" misc_feature 277. .278 /note="RNA splice 1-2 site" misc_feature 361. .687 /note="PDGF homology region" misc_feature 367. .368 /note="RNA splice 2-3 site" misc_feature 552. .553 /note="RNA splice 3-4 site" misc_feature 717. .718 /note="RNA splice 4-5 site" misc_feature 871. .872 /note="RNA splice 5-6 site" polyA_site 2498 /note="polyadenylation site" BASE COUNT 505 a 771 c 680 g 542 t ORIGIN 1 TGATCGCCGC GGACCCGAGC CGAGCCCACC CCCCTCCCCA GCCCCCCACC 51 CTGGCCGCGG GGGCGGCGCG CTCGATCTAC GCGTCCGGGG CCCCGCGGGG 101 CCGGGCCCGG AGTCGGCATG AATCGCTGCT GGGCGCTCTT CCTGTCTCTC 151 TGCTGCTACC TGCGTCTGGT CAGCGCCGAG GGGGACCCCA TTCCCGAGGA 201 GCTTTATGAG ATGCTGAGTG ACCACTCGAT CCGCTCCTTT GATGATCTCC 251 AACGCCTGCT GCACGGAGAC CCCGGAGAGG AAGATGGGGC CGAGTTGGAC 301 CTGAACATGA CCCGCTCCCA CTCTGGAGGC GAGCTGGAGA GCTTGGCTCG 351 TGGAAGAAGG AGCCTGGGTT CCCTGACCAT TGCTGAGCCG GCCATGATCG 401 CCGAGTGCAA GACGCGCACC GAGGTGTTCG AGATCTCCCG GCGCCTCATA 451 GACCGCACCA ACGCCAACTT CCTGGTGTGG CCGCCCTGTG TGGAGGTGCA 501 GCGCTGCTCC GGCTGCTGCA ACAACCGCAA CGTGCAGTGC CGCCCCACCC 551 AGGTGCAGCT GCGACCTGTC CAGGTGAGAA AGATCGAGAT TGTGCGGAAG 601 AAGCCAATCT TTAAGAAGGC CACGGTGACG CTGGAAGACC ACCTGGCATG 651 CAAGTGTGAG ACAGTGGCAG CTGCACGGCC TGTGACCCGA AGCCCGGGGG 701 GTTCCCAGGA GCAGCGAGCC AAAACGCCCC AAACTCGGGT GACCATTCGG 751 ACGGTGCGAG TCCGCCGGCC CCCCAAGGGC AAGCACCGGA AATTCAAGCA 801 CACGCATGAC AAGACGGCAC TGAAGGAGAC CCTTGGAGCC TAGGGGCATC 851 GGCAGGAGAG TGTGTGGGCA GGGTTATTTA ATATGGTATT TGCTGTATTG 901 CCCCCATGGG GCCTTCGGAG CGATAATATT GTTTCCCTCG TCCGTCTGTC 951 
TCGATGCCTG ATTCGGACGG CCAATGGTGC TTCCCCCACC CCTCCACGTG 1001 TCCGTCCACC CTTCCATCAG CGGGTCTCCT CCCAGCGGCC TCCGGTCTTG 1051 CCCAGCAGCT CAAGAAGAAA AAGAAGGACT GAACTCCATC GCCATCTTCT 1101 TCCCTTAACT CCAAGAACTT GGGATAAGAG TGTGAGAGAG ACTGATGGGG 1151 TCGCTCTTTG GGGGAAACGG GTTCCTTCCC CTGCACCTGG CCTGGGCCAC 1201 ACCTGAGCGC TGTGGACTGT CCTGAGGAGC CCTGAGGACC TCTCAGCATA 1251 GCCTGCCTGA TCCCTGAACC CCTAGCCAGC TCTGAGGGGA GGCACCTCCA 1301 GGCAGGCCAG GCCAGGCTGC CTCGGACTCC ATGGCTAAGA CCACAGACGG 1351 GCACACAGAC TGGAGAAAAC CCCTCCCACG GTGCCCAAAC ACCAGTCACT 1401 CGTCTCCCTG TGCCTCTGTG CACAGTGGCT TCTTTTCGTT TTCGTTTTGA 1451 AGACGTGGAC TCCTCTTGGT GGGTGTGGCC AGCACACCAA GTGGCTGGGT 1501 GCCCTCTCAG GTGGGTTAGA GATGGAGTTT GCTGTTGAGG TGGCTGTAGA 1551 TGGTGACCTG GGTATCCCCT GCCTCCTGCC ACCCCTTCCT CCCCACACTC 1601 CACTCTGATT CACCTCTTCC TCTGGTTCCT TTCATCTCTC TACCTCCACC 1651 CTGCATTTTC CTCTTGTCCT GGCCCTTCAG TCTGCTCCAC CAAGGGGCTC 1701 TTGAACCCCT TATTAAGGCC CCAGATGATC CCAGTCACTC CTCTCTAGGG 1751 CAGAAGACTA GAGGCCAGGG CAGCAAGGGA CCTGCTCATC ATATTCCAAC 1801 CCAGCCACGA CTGCCATGTA AGGTTGTGCA GGGTGTGTAC TGCACAAGGA 1851 CATTGTATGC AGGGAGCACT GTTCACATCA TAGATAAAGC TGATTTGTAT 1901 ATTTATTATG ACAATTTCTG GCAGATGTAG GTAAAGAGGA AAAGGATCCT 1951 TTCCTAATTC ACACAAAGAC TCCTTGTGGA CTGGCTGTGC CCCTGATGCA 2001 GCCTGTGGCT TGGAGTGGCC AAATAGGAGG GAGACTGTGG TAGGGGCAGG 2051 GAGGCAACAC TGCTGTCCAC ATGACCTCCA TTTCCCAAAG TCCTCCGCTC 2101 CAGCAACTGC CCTTCTAGGT GGGTGTGGGA CACTTGGGAG AAGGTCTCCA 2151 AGGGAGGGTG CAGCCCTCTT GCCCGCACCC CTCCCTGCTT GCACACTTCC 2201 CCATCTTTGA TCCTTCCGAG CTCCACCTCC GGCGGCTCCT CCTAGGAAAC 2251 CAGCTCGTGG GCCGGGAACG GGGGAGAGAA GGGAAAAGAT TCCCAAGACC 2301 CCCTGGGGTG GGATCTGAGC TCCCACCTCC CTTCCCACCT ACTGCACTTT 2351 CCCCCTTCCC GCCTTCCAAA ACCTGCTTCC TTCAGTTTGT AAAGTCGGTG 2401 ATTATATTTT TGGGGGCTTT CCTTTTATTT TTTAAATGTA AAATTTATTT 2451 ATATTTCGTA TTTAAAGTTG TAAAAAAAAA TAACCACAAA ACAAAACC // LOCUS HUMGUABIND 3334 bp mRNA PRI 12-JUN-1993 DEFINITION Human nucleotide binding protein mRNA, complete cds. 
ACCESSION L04510 NID g292069 VERSION L04510.1 GI:292069 KEYWORDS ADP-ribosylation factor; DNA-binding; guanine nucleotide-binding protein; nucleotide binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3334) AUTHORS Mishima,K., Tsuchiya,M., Nightingale,M.S., Moss,J. and Vaughan,M. TITLE ARD 1, a 64-kDa guanine nucleotide-binding protein with a carboxyl- terminal ADP-ribosylation factor (ARF) domain JOURNAL J. Biol. Chem. 268, 8801-8807 (1993) MEDLINE 93232038 FEATURES Location/Qualifiers source 1. .3334 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 23. .1747 /standard_name="ARD 1" /note="64 kDa protein; contains ADP-ribosylation factor domain" /codon_start=1 /product="nucleotide binding protein" /protein_id="AAA35940.1" /db_xref="PID:g292070" /db_xref="GI:292070" /translation="MATLVVNKLGAGVDSGRQGSRGTAVVKVLECGVCEDVFSLQGDK VPRLLLCGHTVCHDCLTRLPLHGRAIRCPFDRQVTDLGDSGVWGLKKNFALLELLERL QNGPIGQYGAAEESIGISGESIIRCDEDEAHLASVYCTVCATHLCSECSQVTHSTKTL AKHRRVPLADKPHEKTMCSQHQVHAIEFVCLEEGCQTSPLMCCVCKEYGKHQGHKHSV LEPEANQIRASILDMAHCIRTFTEEISDYSRKLVGIVQHIEGGEQIVEDGIGMAHTEH VPGTAENARSCIRAYFYDLHETLCRQEEMALSVVDAHVREKLIWLRQQQEDMTILLSE VSAACLHCEKTLQQDDCRVVLAKQEITRLLETLQKQQQQFTEVADHIQLDASIPVTFT KDNRVHIGPKMEIRVVTLGLDGAGKTTILFKLKQDEFMQPIPTIGFNVETVEYKNLKF TIWDVGGKHKLRPLWKHYYLNTQAVVFVVDSSHRDRISEAHSELAKLLTEKELRDALL LIFANKQDVAGALSVEEITELLSLHKLCCGRSWYIQGCDARSGMGLYEGLDWLSRQLV AAGVLDVA" BASE COUNT 1068 a 507 c 718 g 1041 t ORIGIN 1 CTGTGGCGCT TCCCCTGCGA GGATGGCTAC CCTGGTTGTA AACAAGCTCG 51 GAGCGGGAGT AGACAGTGGC CGGCAGGGCA GCCGGGGGAC AGCTGTAGTG 101 AAGGTGCTAG AGTGTGGAGT TTGTGAAGAT GTCTTTTCTT TGCAAGGAGA 151 CAAAGTTCCC CGTCTTTTGC TTTGTGGCCA TACCGTCTGT CATGACTGTC 201 TCACTCGCCT ACCTCTTCAT GGAAGAGCAA TCCGTTGCCC ATTTGATCGA 251 CAAGTAACAG ACCTAGGTGA TTCAGGTGTC TGGGGATTGA AAAAAAATTT 301 TGCTTTATTG GAGCTTTTGG AACGACTGCA GAATGGGCCT ATTGGTCAGT 351 
ATGGAGCTGC AGAAGAATCC ATTGGGATAT CTGGAGAGAG CATCATTCGT 401 TGTGATGAAG ATGAAGCTCA CCTTGCCTCT GTATATTGCA CTGTGTGTGC 451 AACTCATTTG TGCTCTGAGT GTTCTCAAGT TACTCATTCT ACAAAGACAT 501 TAGCAAAGCA CAGGCGAGTT CCTCTAGCTG ATAAACCTCA TGAGAAAACT 551 ATGTGCTCTC AGCACCAGGT GCATGCCATT GAGTTTGTTT GCTTGGAAGA 601 AGGTTGTCAA ACTAGCCCAC TCATGTGCTG TGTCTGCAAA GAATATGGAA 651 AACACCAGGG TCACAAGCAT TCAGTATTGG AACCAGAAGC TAATCAGATC 701 CGAGCATCAA TTTTAGATAT GGCTCACTGC ATACGGACCT TCACAGAGGA 751 AATCTCAGAT TATTCCAGAA AATTAGTTGG AATTGTGCAG CACATTGAAG 801 GAGGAGAACA AATCGTGGAA GATGGAATTG GAATGGCTCA CACAGAACAT 851 GTACCAGGGA CTGCAGAGAA TGCCCGGTCA TGTATTCGAG CTTATTTTTA 901 TGATCTACAT GAAACTCTGT GTCGTCAAGA AGAAATGGCT CTAAGTGTTG 951 TTGATGCTCA TGTTCGTGAA AAATTGATTT GGCTCAGGCA GCAACAAGAA 1001 GATATGACTA TTTTGTTGTC AGAGGTTTCT GCAGCCTGCC TCCACTGTGA 1051 AAAGACTTTG CAGCAGGATG ATTGTAGAGT TGTCTTGGCA AAACAGGAAA 1101 TTACAAGGTT ACTGGAAACA TTGCAGAAAC AGCAGCAGCA GTTTACAGAA 1151 GTTGCAGATC ACATTCAGTT GGATGCCAGC ATCCCTGTCA CTTTTACAAA 1201 GGATAATCGA GTTCACATTG GACCAAAAAT GGAAATTCGG GTCGTTACGT 1251 TAGGATTGGA TGGTGCTGGA AAAACTACTA TCTTGTTTAA GTTAAAACAG 1301 GATGAATTCA TGCAGCCCAT TCCAACAATT GGTTTTAACG TGGAAACTGT 1351 AGAATATAAA AATCTAAAAT TCACTATTTG GGATGTAGGT GGAAAACACA 1401 AATTAAGACC ATTGTGGAAA CATTATTACC TCAATACTCA AGCTGTTGTG 1451 TTTGTTGTAG ATAGCAGTCA TAGAGACAGA ATTAGTGAAG CACACAGCGA 1501 ACTTGCAAAG TTGTTAACGG AAAAAGAACT CCGAGATGCT CTGCTCCTGA 1551 TTTTTGCTAA CAAACAGGAT GTTGCTGGAG CACTGTCAGT AGAAGAAATC 1601 ACTGAACTAC TCAGTCTCCA TAAATTATGC TGTGGCCGTA GCTGGTATAT 1651 TCAGGGCTGT GATGCTCGAA GTGGTATGGG ACTGTATGAA GGGTTGGACT 1701 GGCTCTCACG GCAACTTGTA GCTGCTGGAG TATTGGATGT TGCTTGATTT 1751 TAAAGGCAGC AGTTGTTTGA AGTTTTGTGG TTAAAAGTAA CTTTGCACAT 1801 AGTATGTTTT AAGAAATTAT ACATCTCAAA AGATGGTAAT TTAGGATGCA 1851 TATATATATA TATATATATA AAGGAATCTT GGATTGGGAA TTCAGTACTT 1901 TGCTTTAAAA AAATTTTGTG GCAGAATTAA ATTTCTAATT GAGCAGATTA 1951 GATTGAATTA AATAGAAACT TATTGAATAT ACATTCTTTT AAAAAGTATA 2001 TTTGTTATTT AAGTTTTTCA 
GATAATATGT GACCAATATA CTGGGAAAGA 2051 GGTAGTCACA GAGAAAGGGT AAGTGAAGGT TTATTCTTTC AGTGAAAAAA 2101 GAATAGCCAA TTGAGTGCCT AATGAGACCT CTGTGTGAAG CAAGTGAAGT 2151 ATAGCTGCTT CTTTTAACCT GCCTTTTCAC TGAATGTTGG CAGCATTTAG 2201 TAGTAGAAAT GACAGTTGCT TAATGAAATA GAATCCAAAC TACATATTTG 2251 GATAATAGGA TTACTTTATG TTTATGTTCA GAGTTAACAG AACACCTTTA 2301 ATGCTAAGAA CTATAAGGTA CAGAAAATTA ATACTTTATA TAGTGTTTTA 2351 TTAACTTTCT CCTACAGCAT TTTGTATAAA ACACAATGAG GGAGTGAAAT 2401 GTTACCCAAT TAGGCTTGTC AGGTTAGTAA TAAACTGAAC AGTAATAAAA 2451 CTGTGGAAGT AATTGGATCT GAATTTATGA AAGACCCATT TCCAGGACTG 2501 AACCTAGGTC AGAGCTCTAA ATTGGTCCTT CTATTTTTCA ACAAATTTAA 2551 AGTAATATTT CTTTCTAATA TAATATTGCA TCCTTTGTGG GAATGACTAT 2601 AGGTAAAATG TAGTAAGTAA CGCAGAACCA GGGTTGGCTT TATTTAAAAG 2651 CTAGTGACCT AAATAGAAAG CGAACTTCAA GAGAAGTTGT AAGTACAGTG 2701 GCAAATGCTT ATTACTTACT TCAAACTGTT TCCCAAAATA AGTGCATTTA 2751 TTTTGACAAT AAAACTTAAG GCTGTTCATG AGAAGGCCTT GAAAAGTTAC 2801 TCTAGAGGAA AAATGTCTAA AGAAAAAAAA AATTCAAAAA GTTTACATTA 2851 ATTATTCAGT GTTGTGAGTA AATAAAAATG TGTGCTCTTT ACTGTTTTTC 2901 ATTTTTAAAG AATATTATTA TGGAAGCACG ATTTATTTAA ATAGGTACAT 2951 TGAGACTTTT TTTTTTAATG TTCTGATACA TTAGGATGAA GTTAAATCTT 3001 AAATCTTATT AGTTGAATTG TTGTAAGGAC AGTGATGTCT GGTAACAAGA 3051 TGTGACTTTT TGGTAGCACT GTTGTGGTTC ATTCTTTTCA AATCTATTTT 3101 TGTTTAAAAA CAATACAAGT TTTAGAAAAC AAAGCATTAA AAAAAAAGCC 3151 TATCAGTATT ATGGGCAATA TGTAAATAAA TAAATGTAAT ATTTCATCCT 3201 TTATTTTTCA GGTAAAAGGT CATGCTGTTA CAGGTGTAGT TTGTGTGCAT 3251 AAATAATACT TCCGAATTAA ATTATTTAAT ATTTGACTGA TTTCAATAAC 3301 TGTGAAAATA AAAAGGTGTT GTATTGCTTG TGAG // LOCUS HSHE6 4665 bp mRNA PRI 21-MAY-1997 DEFINITION H.sapiens mRNA for HE6 Tm7 receptor. ACCESSION X81892 NID g2117160 VERSION X81892.1 GI:2117160 KEYWORDS HE6 gene; HE6 receptor; seven transmembrane-domain receptor. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4665) AUTHORS Osterhoff,C., Ivell,R. 
and Kirchhoff,C. TITLE Cloning of a human epididymis-specific mRNA, HE6, encoding a novel member of the seven transmembrane-domain receptor superfamily JOURNAL DNA Cell Biol. 16 (4), 379-389 (1997) MEDLINE 97294669 REFERENCE 2 (bases 1 to 4665) AUTHORS Osterhoff,C. TITLE Direct Submission JOURNAL Submitted (13-AUG-1996) C. Osterhoff, Institute for Hormone & Fertility Res, Grandweg 64, D- 22529 Hamburg, FRG FEATURES Location/Qualifiers source 1. .4665 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="epididymis" /clone_lib="lambda Uni-ZAP" gene 73. .3117 /gene="HE6" CDS 73. .3117 /gene="HE6" /note="human epididymis gene 6" /codon_start=1 /product="seven transmembrane-domain receptor" /protein_id="CAA57479.1" /db_xref="PID:e274706" /db_xref="PID:g2117161" /db_xref="GI:2117161" /db_xref="SPTREMBL:O00406" /translation="MVFSVRQCGHVGRTEEVLLTFKIFLVIICLHVVLVTSLEEDTDN SSLSPPPAKLSVVSFAPSSNEVETTSLNDVTLSLLPSNETEKTKITIVKTFNASGVKP QRNICNLSSICNDSAFFRGEIMFQYDKESTVPQNQHITNGTLTGVLSLSELKRSELNK TLQTLSETYFIMCATAEAQSTLNCTFTIKLNNTMNACAAIAALERVKIRPMEHCCCSV RIPCPSSPEELGKLQCDLQDPIVCLADHPRGPPFSSSQSIPVVPRATVLSQVPKATSF AEPPDYSPVTHNVPSPIGEIQPLSPQPSAPIASSPAIDMPPQSETISSPMPQTHVSGT PPPVKASFSSPTVSAPANVNTTSAPPVQTDIVNTSSISDLENQVLQMEKALSLGSLEP NLAGEMINQVSRLLHSPPDMLAPLAQRLLKVVDDIGLQLNFSNTTISLTSPSLALAVI RVNASSFNTTTFVAQDPANLQVSLETQAPENSIGTITLPSSLMNNLPAHDMELASRVQ FNFFETPALFQDPSLENLSLISYVISSSVANLTVRNLTRNVTVTLKHINPSQDELTVR CVFWDLGRNGGRGGWSDNGCSVKDRRLNETICTCSHLTSFGVLLDLSRTSVLPAQMMA LTFITYIGCGLSSIFLSVTLVTYIAFEKIRRDYPSKILIQLCAALLLLNLVFLLDSWI ALYKMQGLCISVAVFLHYFLLVSFTWMGLEAFHMYLALVKVFNTYIRKYILKFCIVGW GVPAVVVTIILTISPDNYGLGSYGKFPNGSPDDFCWINNNAVFYITVVGYFCVIFLLN VSMFIVVLVQLCRIKKKKQLGAQRKTSIQDLRSIAGLTFLLGITWGFAFFAWGPVNVT FMYLFAIFNTLQGFFIFIFYCVAKENVRKQWRRYLCCGKLRLAENSDWSKTATNGLKK QTVNQGVSSSSNSLQSSSNSTNSTTLLVNNDCSVHASGNGNASTERNGVSFSVQNGDV CLHDFTGKQHMFNEKEDSCNGKGRMALRRTSKRGSLHFIEQM" polyA_signal 4647. 
.4652 BASE COUNT 1295 a 1070 c 939 g 1361 t ORIGIN 1 AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT 51 GTTTTCTGAA CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC 101 ATGTTGGCAG AACTGAAGAA GTTTTACTGA CGTTCAAGAT ATTCCTTGTC 151 ATCATTTGTC TTCATGTCGT TCTGGTAACA TCCCTGGAAG AAGATACTGA 201 TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT GTCAGTTTTG 251 CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 301 AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA 351 AACCTTCAAT GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT 401 CATCTATTTG CAATGACTCA GCATTTTTTA GAGGTGAGAT CATGTTTCAA 451 TATGATAAAG AAAGCACTGT TCCCCAGAAT CAACATATAA CGAATGGCAC 501 CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA GAGCTCAACA 551 AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 601 GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC 651 AATGAATGCA TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC 701 CAATGGAACA CTGCTGCTGT TCTGTCAGGA TACCCTGCCC TTCCTCCCCA 751 GAAGAGTTGG GAAAGCTTCA GTGTGACCTG CAGGATCCCA TTGTCTGTCT 801 TGCTGACCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA TCCATCCCAG 851 TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 901 GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT 951 AGGGGAGATT CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA 1001 GCCCTGCCAT TGACATGCCC CCACAGTCTG AAACGATCTC TTCCCCTATG 1051 CCCCAAACCC ATGTCTCCGG CACCCCACCT CCTGTGAAAG CCTCATTTTC 1101 CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC AGCGCACCTC 1151 CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1201 CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA 1251 CCTCGCAGGA GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC 1301 CTGACATGCT GGCCCCTCTG GCTCAAAGAT TGCTGAAAGT AGTGGATGAC 1351 ATTGGCCTAC AGCTGAACTT TTCAAACACG ACTATAAGTC TAACCTCCCC 1401 TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT TTCAACACAA 1451 CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1501 CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT 1551 GAATAATTTA CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTCA 1601 ATTTTTTTGA AACACCTGCT TTGTTTCAGG ATCCTTCCCT 
GGAGAACCTC 1651 TCTCTGATCA GCTACGTCAT ATCATCGAGT GTTGCAAACC TGACCGTCAG 1701 GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC AACCCGAGCC 1751 AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1801 GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT 1851 GAATGAAACC ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC 1901 TGGACCTATC TAGGACATCT GTGCTGCCTG CTCAAATGAT GGCTCTGACG 1951 TTCATTACAT ATATTGGTTG TGGGCTTTCA TCAATTTTTC TGTCAGTGAC 2001 TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT TACCCTTCCA 2051 AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2101 CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC 2151 AGTGGCTGTA TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG 2201 GCCTAGAAGC ATTCCATATG TACCTGGCCC TTGTCAAAGT ATTTAATACT 2251 TACATCCGAA AATACATCCT TAAATTCTGC ATTGTCGGTT GGGGGGTACC 2301 AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT AACTATGGGC 2351 TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2401 ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT 2451 GATATTTTTG CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT 2501 GTCGAATTAA AAAGAAGAAG CAACTGGGAG CCCAGCGAAA AACCAGTATT 2551 CAAGACCTCA GGAGTATCGC TGGCCTTACA TTTTTACTGG GAATAACTTG 2601 GGGCTTTGCC TTCTTTGCCT GGGGACCAGT TAACGTGACC TTCATGTATC 2651 TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2701 TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG 2751 TGGAAAGTTA CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA 2801 ATGGTTTAAA GAAGCAGACT GTAAACCAAG GAGTGTCCAG CTCTTCAAAT 2851 TCCTTACAGT CAAGCAGTAA CTCCACTAAC TCCACCACAC TGCTAGTGAA 2901 TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT TCTACAGAGA 2951 GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3001 TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG 3051 GAAAGGCCGT ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT 3101 TTATTGAGCA AATGTGATTC CTTTCTTCTA AAATCAAAGC ATGATGCTTG 3151 ACAGTGTGAA ATGTCCAATT TTACCTTTTA CACAATGTGA GATGTATGAA 3201 AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT AAGCTAATTA 3251 AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3301 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG 3351 GTTTTAAACA ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG 3401 TAGTGAACTT TCAGCTACCT TTTAAAGAGG CTAAGTTATC TTTGATAACA 3451 TCATATAAAG CAACTGTTGA CTTCAGCCTG TTGGTGAGTT TAGTTGTGCA 3501 TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT GTCAAAAATC 3551 TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3601 TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG 3651 CTCGTTCCGC AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA 3701 ATACAGAAAC CTCAGTGAAA TCAAGAAATA ATGATCCCAG CCAGACTGAG 3751 AAAATGTAAG CAGACAGTGC CACAGTTAGC TCATACAGTG CCTTTGAGCA 3801 AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA TGGGTCATGG 3851 TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3901 GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC 3951 CTGACCGCAT CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA 4001 GCTTTATACC TTGGCCACAG AGAGGGATGA ACTGTCATCC AGACCATGTG 4051 TCAGGAAAAT TGTGAACGTA GATGAGGTAC ATACACTGCC GCTTCTCAAA 4101 TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC CTTCTCTTAA 4151 AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4201 ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC 4251 AGAGCTATAA TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG 4301 GGGAGTAGCT ATTGGGAACA GGGAACTGTC CTACACTGCT ATTGTTGCTA 4351 CATGTATCGA GCCTTGATTG CTCCTAGTTA TATACAGGGT CTATCTTGCT 4401 TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC TTATTAGGAA 4451 CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4501 ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT 4551 TCTGTGAGTG CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA 4601 TTTTCTTGTA TTTATATGTT AAAATCAAAA ATGTTAAAAT CAATGAAATA 4651 AATTTGCAGT TAAGA // LOCUS HSCGM2ANT 2292 bp mRNA PRI 30-JUN-1997 DEFINITION H.sapiens mRNA for carcinoembryonic antigen family member 2, CGM2. ACCESSION X98311 NID g1524059 VERSION X98311.1 GI:1524059 KEYWORDS carcinoembryonic antigen; carcinoembryonic antigen family member 2; CGM2 gene. SOURCE human. 
ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2292) AUTHORS Thompson,J., Seitz,M., Chastre,E., Ditter,M., Aldrian,C., Gespach,C. and Zimmermann,W. TITLE Down-regulation of carcinoembryonic antigen family member 2 expression is an early event in colorectal tumorigenesis JOURNAL Cancer Res. 57 (9), 1776-1784 (1997) MEDLINE 97280695 REFERENCE 2 (bases 1 to 2292) AUTHORS Zimmermann,W. TITLE Direct Submission JOURNAL Submitted (24-MAY-1996) W. Zimmermann, Albert-Ludwigs-University, Institute of Immunobiology, Stefan-Meier-Strasse 8, D-79104 Freiburg, FRG FEATURES Location/Qualifiers source 1. .2292 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /germline /dev_stage="adult" /map="q13.2" /tissue_type="normal colonic mucosa" gene 3. .800 /gene="CGM2" CDS 3. .800 /gene="CGM2" /note="family member 2" /codon_start=1 /product="carcinoembryonic antigen" /protein_id="CAA66955.1" /db_xref="PID:e249945" /db_xref="PID:g1524060" /db_xref="GI:1524060" /db_xref="SWISS-PROT:Q14002" /translation="MGSPSACPYRVCIPWQGLLLTASLLTFWNLPNSAQTNIDVVPFN VAEGKEVLLVVHNESQNLYGYNWYKGERVHANYRIIGYVKNISQENAPGPAHNGRETI YPNGTLLIQNVTHNDAGFYTLHVIKENLVNEEVTRQFYVFSEPPKPSITSNNFNPVEN KDIVVLTCQPETQNTTYLWWVNNQSLLVSPRLLLSTDNRTLVLLSATKNDIGPYECEI QNPVGASRSDPVTLNVRYESVQASSPDLSAGTAVSIMIGVLAGMALI" BASE COUNT 661 a 537 c 457 g 637 t ORIGIN 1 CCATGGGTTC CCCTTCAGCC TGTCCATACA GAGTGTGCAT TCCCTGGCAG 51 GGGCTCCTGC TCACAGCCTC GCTTTTAACC TTCTGGAACC TGCCAAACAG 101 TGCCCAGACC AATATTGATG TCGTGCCGTT CAATGTCGCA GAAGGGAAGG 151 AGGTCCTTCT AGTAGTCCAT AATGAGTCCC AGAATCTTTA TGGCTACAAC 201 TGGTACAAAG GGGAAAGGGT GCATGCCAAC TATCGAATTA TAGGATATGT 251 AAAAAATATA AGTCAAGAAA ATGCCCCAGG GCCCGCACAC AACGGTCGAG 301 AGACAATATA CCCCAATGGA ACCCTGCTGA TCCAGAACGT TACCCACAAT 351 GACGCAGGAT TCTATACCCT ACACGTTATA AAAGAAAATC TTGTGAATGA 401 AGAAGTAACC AGACAATTCT ACGTATTCTC GGAGCCACCC AAGCCCTCCA 451 TCACCAGCAA CAACTTCAAT CCGGTGGAGA ACAAAGATAT 
TGTGGTTTTA 501 ACCTGTCAAC CTGAGACTCA GAACACAACC TACCTGTGGT GGGTAAACAA 551 TCAGAGCCTC CTGGTCAGTC CCAGGCTGCT GCTCTCCACT GACAACAGGA 601 CCCTCGTTCT ACTCAGCGCC ACAAAGAATG ACATAGGACC CTATGAATGT 651 GAAATACAGA ACCCAGTGGG TGCCAGCCGC AGTGACCCAG TCACCCTGAA 701 TGTCCGCTAT GAGTCAGTAC AAGCAAGTTC ACCTGACCTC TCAGCTGGGA 751 CCGCTGTCAG CATCATGATT GGAGTACTGG CTGGGATGGC TCTGATATAG 801 CAGCCTTGGT GTAGTTTCTG CATTTCGGGA AGAGTGTTTT TATTATCCAC 851 CTGCAGACTG GACTGGATTC TTCTAGCTCC TTCAATCCCA TTTTCTCCTG 901 TGGCATCACT AAGTATAAGA CCTGCTCTCT TCCTGAAGAC CTATAAGCTG 951 GAGGTGGACA ACTCAATGTA AATTTCAAGG AAAAACCCTC ATGCCTGAGA 1001 TGTGGGCCAC TCAGAGCTAA CCAAAATGTT CAACACCATA ACTAGAGACA 1051 CTCAAATTGC CAACCAGGAC AAGAAGTTGA TGACTTCATG CTGTGGACAG 1101 TTTTTCCCAA GATGTCCCAA GCCTCATCGT GACGAGGCTC TTATCCCACT 1151 CCATTTTTCC CTGCTCATGC CTGCCTCTTT AATTTGGTAA GATAATGCTG 1201 TAACTAGAAT TTCACAATCA GCGCCTTGTG CAGGCAATTT GACAGAGTGT 1251 TGGATGTGTC ATGTCATCAT GTCAAACCCA AATATTTGAC CTAAGGGATC 1301 CTTTATTCTG CCCAGTGGCT AACTTTAACA ACATCCCTAA TACAACTGTT 1351 TATTCAAATG CACGGTGGTC CCTGTTAGAG TTAGACCTCT AGACTCACCT 1401 GTTCTCACGC CCTGTTTTAA TTTAACCCAG CTATGGGATG CCAGATAACA 1451 GAATTGCTGC CTACGAGCTG AACAGGGAGG AGTTTGTGCA GTTGCTGACA 1501 CTTCTTGTTG CACATAAATA AATACAGTGG GTACTATAGA GACTCAGTTG 1551 CAAAAATTAA CAAATATGCT GCTTGATTAA AATGGGTAGG CTTCTCATGT 1601 GGCTCATTCT TTAATCTATT CTCTTTTATT TGGTTTGGTT CATGGGGTCT 1651 CTGCCTATGG ATCATACTTC AAACTCTTGG TGTGATCCTC CTGATTGTCA 1701 CAATATTAGT TACCCTGGTG TGCTGTATTC TCTAAAACCT TTAAATGTTT 1751 GCATGCAGCC ATTCGTCAAA TGTCAAATAT TCTCTCTTTG GCTGGAATGA 1801 CAAAAACTCA AATAAATGTA TGATTAGGAG GACATCATAA CCTATGAATG 1851 ATGGAAGTCC AAAATGATGG TAACTGACAG TAGTGTTAAT GCCTTATGTT 1901 TAGTCAAACT CTCATTTAGG TGACAGCCTG GTGACTCCAG AATGGAGCCA 1951 GTCATGCTAA ATGCCATATA CTCACACTGA AACATGAGGA AGCAGGTAGA 2001 TCCCAGAACA GACAAAACTT TCCTAAAAAC ATGAGAGTCC AGGCTGTCTG 2051 AGTCAGCACA GTAAGAAAGT CCTTTCTGCT TTAACTCTTA GAAAAAAGTA 2101 ATATGAAGTA TTCTGAAATT AACCAATCAG TTTATTTAAA TCAATTTATT 2151 TATATTCTTC 
TGTTCCTGGA TTCCCATTTT ACAAAACCCA CTGTTCTACT 2201 GTTGTATTGC CCAGTAGGAG CTATCACTAT ATTTTGCAGA ATGGAAACTG 2251 CCCTGACTCT TGAATCACAA ATAAAAGCCA ATTGTATCTG TT // LOCUS HUMHBEGF 2360 bp mRNA PRI 27-APR-1993 DEFINITION Human heparin-binding EGF-like growth factor mRNA, complete cds. ACCESSION M60278 NID g183866 VERSION M60278.1 GI:183866 KEYWORDS heparin-binding EGF-like growth factor. SOURCE Human histiocytic lymphoma derived cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2360) AUTHORS Higashiyama,S., Abraham,J.A., Miller,J.L., Fiddes,J.C. and Klagsbrun,M. TITLE A heparin-binding growth factor secreted by macrophage-like cells that is related to EGF JOURNAL Science 251, 936-939 (1991) MEDLINE 91157008 FEATURES Location/Qualifiers source 1. .2360 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="histiocytic lymphoma derived" sig_peptide 262. .318 /note="putative" CDS 262. .888 /note="putative" /codon_start=1 /product="heparin-binding EGF-like growth factor" /protein_id="AAA35956.1" /db_xref="PID:g183867" /db_xref="GI:183867" /translation="MKLLPSVVLKLFLAAVLSALVTGESLERLRRGLAAGTSNPDPPT VSTDQLLPLGGGRDRKVRDLQEADLDLLRVTLSSKPQALATPNKEEHGKRKKKGKGLG KKRDPCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLSLPVENRLYTYDHT TILAVVAVVLSSVCLLVIVGLLMFRYHRRGGYDVENEEKVKLGMTNSH" mat_peptide 481. 
.702 /evidence=experimental /product="heparin-binding EGF-like growth factor" BASE COUNT 599 a 579 c 605 g 577 t ORIGIN 1 GCTACGCGGG CCACGCTGCT GGCTGGCCTG ACCTAGGCGC GCGGGGTCGG 51 GCGGCCGCGC GGGCGGGCTG AGTGAGCAAG ACAAGACACT CAAGAAGAGC 101 GAGCTGCGCC TGGGTCCCGG CCAGGCTTGC ACGCAGAGGC GGGCGGCAGA 151 CGGTGCCCGG CGGAATCTCC TGAGCTCCGC CGCCCAGCTC TGGTGCCAGC 201 GCCCAGTGGC CGCCGCTTCG AAAGTGACTG GTGCCTCGCC GCCTCCTCTC 251 GGTGCGGGAC CATGAAGCTG CTGCCGTCGG TGGTGCTGAA GCTCTTTCTG 301 GCTGCAGTTC TCTCGGCACT GGTGACTGGC GAGAGCCTGG AGCGGCTTCG 351 GAGAGGGCTA GCTGCTGGAA CCAGCAACCC GGACCCTCCC ACTGTATCCA 401 CGGACCAGCT GCTACCCCTA GGAGGCGGCC GGGACCGGAA AGTCCGTGAC 451 TTGCAAGAGG CAGATCTGGA CCTTTTGAGA GTCACTTTAT CCTCCAAGCC 501 ACAAGCACTG GCCACACCAA ACAAGGAGGA GCACGGGAAA AGAAAGAAGA 551 AAGGCAAGGG GCTAGGGAAG AAGAGGGACC CATGTCTTCG GAAATACAAG 601 GACTTCTGCA TCCATGGAGA ATGCAAATAT GTGAAGGAGC TCCGGGCTCC 651 CTCCTGCATC TGCCACCCGG GTTACCATGG AGAGAGGTGT CATGGGCTGA 701 GCCTCCCAGT GGAAAATCGC TTATATACCT ATGACCACAC AACCATCCTG 751 GCCGTGGTGG CTGTGGTGCT GTCATCTGTC TGTCTGCTGG TCATCGTGGG 801 GCTTCTCATG TTTAGGTACC ATAGGAGAGG AGGTTATGAT GTGGAAAATG 851 AAGAGAAAGT GAAGTTGGGC ATGACTAATT CCCACTGAGA GAGACTTGTG 901 CTCAAGGAAT CGGCTGGGGA CTGCTACCTC TGAGAAGACA CAAGGTGATT 951 TCAGACTGCA GAGGGGAAAG ACTTCCATCT AGTCACAAAG ACTCCTTCGT 1001 CCCCAGTTGC CGTCTAGGAT TGGGCCTCCC ATAATTGCTT TGCCAAAATA 1051 CCAGAGCCTT CAAGTGCCAA ACAGAGTATG TCCGATGGTA TCTGGGTAAG 1101 AAGAAAGCAA AAGCAAGGGA CCTTCATGCC CTTCTGATTC CCCTCCACCA 1151 AACCCCACTT CCCCTCATAA GTTTGTTTAA ACACTTATCT TCTGGATTAG 1201 AATGCCGGTT AAATTCCATA TGCTCCAGGA TCTTTGACTG AAAAAAAAAA 1251 AGAAGAAGAA GAAGGAGAGC AAGAAGGAAA GATTTGTGAA CTGGAAGAAA 1301 GCAACAAAGA TTGAGAAGCC ATGTACTCAA GTACCACCAA GGGATCTGCC 1351 ATTGGGACCC TCCAGTGCTG GATTTGATGA GTTAACTGTG AAATACCACA 1401 AGCCTGAGAA CTGAATTTTG GGACTTCTAC CCAGATGGAA AAATAACAAC 1451 TATTTTTGTT GTTGTTGTTT GTAAATGCCT CTTAAATTAT ATATTTATTT 1501 TATTCTATGT ATGTTAATTT ATTTAGTTTT TAACAATCTA ACAATAATAT 1551 TTCAAGTGCC TAGACTGTTA CTTTGGCAAT 
TTCCTGGCCC TCCACTCCTC 1601 ATCCCCACAA TCTGGCTTAG TGCCACCCAC CTTTGCCACA AAGCTAGGAT 1651 GGTTCTGTGA CCCATCTGTA GTAATTTATT GTCTGTCTAC ATTTCTGCAG 1701 ATCTTCCGTG GTCAGAGTGC CACTGCGGGA GCTCTGTATG GTCAGGATGT 1751 AGGGGTTAAC TTGGTCAGAG CCACTCTATG AGTTGGACTT CAGTCTTGCC 1801 TAGGCGATTT TGTCTACCAT TTGTGTTTTG AAAGCCCAAG GTGCTGATGT 1851 CAAAGTGTAA CAGATATCAG TGTCTCCCCG TGTCCTCTCC CTGCCAAGTC 1901 TCAGAAGAGG TTGGGCTTCC ATGCCTGTAG CTTTCCTGGT CCCTCACCCC 1951 CATGGCCCCA GGCCACAGCG TGGGAACTCA CTTTCCCTTG TGTCAAGACA 2001 TTTCTCTAAC TCCTGCCATT CTTCTGGTGC TACTCCATGC AGGGGTCAGT 2051 GCAGCAGAGG ACAGTCTGGA GAAGGTATTA GCAAAGCAAA AGGCTGAGAA 2101 GGAACAGGGA ACATTGGAGC TGACTGTTCT TGGTAACTGA TTACCTGCCA 2151 ATTGCTACCG AGAAGGTTGG AGGTGGGGAA GGCTTTGTAT AATCCCACCC 2201 ACCTCACCAA AACGATGAAG GTATGCTGTC ATGGTCCTTT CTGGAAGTTT 2251 CTGGTGCCAT TTCTGAACTG TTACAACTTG TATTTCCAAA CCTGGTTCAT 2301 ATTTATACTT TGCAATCCAA ATAAAGATAA CCCTTATTCC ATAAAAAAAA 2351 AAAAAAAAAA // LOCUS HSU65785 4503 bp mRNA PRI 24-JAN-1997 DEFINITION Human 150 kDa oxygen-regulated protein ORP150 mRNA, complete cds. ACCESSION U65785 NID g1794218 VERSION U65785.1 GI:1794218 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4503) AUTHORS Ikeda,J., Kaneda,S., Kuwabara,K., Ogawa,S., Kobayashi,T., Matsumoto,M., Yura,T. and Yanagi,H. TITLE Cloning and expression of cDNA encoding the human 150 kDa oxygen-regulated protein, ORP150 JOURNAL Biochem. Biophys. Res. Commun. 230 (1), 94-99 (1997) MEDLINE 97148579 REFERENCE 2 (bases 1 to 4503) AUTHORS Ikeda,J., Kobayashi,T., Kuwabara,K., Ogawa,S., Yura,T. and Yanagi,H. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) HSP Research Institute, Kyoto Research Park, Shimogyo-ku, Kyoto 600, Japan FEATURES Location/Qualifiers source 1. .4503 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="astrocytoma U373" CDS 103. 
.3102 /function="proposed ER chaperone" /codon_start=1 /product="150 kDa oxygen-regulated protein ORP150" /protein_id="AAC50947.1" /db_xref="PID:g1794219" /db_xref="GI:1794219" /translation="MADKVRRQRPRRRVCWALVAVLLADLLALSDTLAVMSVDLGSES MKVAIVKPGVPMEIVLNKESRRKTPVIVTLKENERFFGDSAASMAIKNPKATLRYFQH LLGKQADNPHVALYQARFPEHELTFDPQRQTVHFQISSQLQFSPEEVLGMVLNYSRSL AEDFAEQPIKDAVITVPVFFNQAERRAVLQAARMAGLKVLQLINDNTATALSYGVFRR KDINTTAQNIMFYDMGSGSTVCTIVTYQMVKTKEAGMQPQLQIRGVGFDRTLGGLEME LRLRERLAGLFNEQRKGQRAKDVRENPRAMAKLLREANRLKTVLSANADHMAQIEGLM DDVDFKAKVTRVEFEELCADLFERVPGPVQQALQSAEMSLDEIEQVILVGGATRVPRV QEVLLKAVGKEELGKNINADEAAAMGAVYQAAALSKAFKVKPFVVRDAVVYPILVEFT REVEEEPGIHSLKHNKRVLFSRMGPYPQRKVITFNRYSHDFNFHINYGDLGFLGPEDL RVFGSQNLTTVKLKGVGDSFKKYPDYESKGIKAHFNLDESGVLSLDRVESVFETLVED SAEEESTLTKLGNTISSLFGGGTTPDAKENGTDTVQEEEESPAEGSKDEPGEQVELKE EAEAPVEDGSQPPPPEPKGDATPEGEKATEKENGDKSEAQKPSEKAEAGPEGVAPAPE GEKKQKPARKRRMVEEIGVELVVLDLPDLPEDKLAQSVQKLQDLTLRDLEKQEREKAA NSLEAFIFETQDKLYQPEYQEVSTEEQREEISGKLSAASTWLEDEGVGATTVMLKEKL AELRKLCQGLFFRVEERKKWPERLSALDNLLNHSSMFLKGARLIPEMDQIFTEVEMTT LEKVINETWAWKNATLAEQAKLPATEKPVLLSKDIEAKMMALDREVQYLLNKAKFTKP RPRPKDKNGTRAEPPLNASASDQGEKVIPPAGQTEDAEPISEPEKVETGSEPGDTEPL ELGGPGAEPEQKEQSTGQKRPLKNDEL" BASE COUNT 1045 a 1177 c 1311 g 970 t ORIGIN 1 TTGTGAAGGG CGCGGGTGGG GGGCGCTGCC GGCCTCGTGG GTACGTTCGT 51 GCCGCGTCTG TCCCAGAGCT GGGGCCGCAG GAGCGGAGGC AAGAGGGGCA 101 CTATGGCAGA CAAAGTTAGG AGGCAGAGGC CGAGGAGGCG AGTCTGTTGG 151 GCCTTGGTGG CTGTGCTCTT GGCAGACCTG TTGGCACTGA GTGATACACT 201 GGCAGTGATG TCTGTGGACC TGGGCAGTGA GTCCATGAAG GTGGCCATTG 251 TCAAACCTGG AGTGCCCATG GAAATTGTCT TGAATAAGGA ATCTCGGAGG 301 AAAACACCGG TGATCGTGAC CCTGAAAGAA AATGAAAGAT TCTTTGGAGA 351 CAGTGCAGCA AGCATGGCGA TTAAGAATCC AAAGGCTACG CTACGTTACT 401 TCCAGCACCT CCTGGGGAAG CAGGCAGATA ACCCCCATGT AGCTCTTTAC 451 CAGGCCCGCT TCCCGGAGCA CGAGCTGACT TTCGACCCAC AGAGGCAGAC 501 TGTGCACTTT CAGATCAGCT CGCAGCTGCA GTTCTCACCT GAGGAAGTGT 551 TGGGCATGGT TCTCAATTAT TCTCGTTCTC TAGCTGAAGA TTTTGCAGAG 601 CAGCCCATCA AGGATGCAGT GATCACCGTG 
CCAGTCTTCT TCAACCAGGC 651 CGAGCGCCGA GCTGTGCTGC AGGCTGCTCG TATGGCTGGC CTCAAAGTGC 701 TGCAGCTCAT CAATGACAAC ACCGCCACTG CCCTCAGCTA TGGTGTCTTC 751 CGCCGGAAAG ATATTAACAC CACTGCCCAG AATATCATGT TCTATGACAT 801 GGGCTCAGGC AGCACCGTAT GCACCATTGT GACCTACCAG ATGGTGAAGA 851 CTAAGGAAGC TGGGATGCAG CCACAGCTGC AGATCCGGGG AGTAGGATTT 901 GACCGTACCC TGGGGGGCCT GGAGATGGAG CTCCGGCTTC GAGAACGCCT 951 GGCTGGGCTT TTCAATGAGC AGCGCAAGGG TCAGAGAGCA AAGGATGTGC 1001 GGGAGAACCC GCGTGCCATG GCCAAGCTGC TGCGTGAGGC TAATCGGCTC 1051 AAAACCGTCC TCAGTGCCAA CGCTGACCAC ATGGCACAGA TTGAAGGCCT 1101 GATGGATGAT GTGGACTTCA AGGCAAAAGT GACTCGTGTG GAATTTGAGG 1151 AGTTGTGTGC AGACTTGTTT GAGCGGGTGC CTGGGCCTGT ACAGCAGGCC 1201 CTCCAGAGTG CCGAAATGAG TCTGGATGAG ATTGAGCAGG TGATCCTGGT 1251 GGGTGGGGCC ACTCGGGTCC CCAGAGTTCA GGAGGTGCTG CTGAAGGCCG 1301 TGGGCAAGGA GGAGCTGGGG AAGAACATCA ATGCAGATGA AGCAGCCGCC 1351 ATGGGGGCAG TGTACCAGGC AGCTGCGCTC AGCAAAGCCT TTAAAGTGAA 1401 GCCATTTGTC GTCCGAGATG CAGTGGTCTA CCCCATCCTG GTGGAGTTCA 1451 CGAGGGAGGT GGAGGAGGAG CCTGGGATTC ACAGCCTGAA GCACAATAAA 1501 CGGGTACTCT TCTCTCGGAT GGGGCCCTAC CCTCAACGCA AAGTCATCAC 1551 CTTTAACCGC TACAGCCATG ATTTCAACTT CCACATCAAC TACGGCGACC 1601 TGGGCTTCCT GGGGCCTGAA GATCTTCGGG TATTTGGCTC CCAGAATCTG 1651 ACCACAGTGA AGCTAAAAGG GGTGGGTGAC AGCTTCAAGA AGTATCCTGA 1701 CTACGAGTCC AAGGGCATCA AGGCTCACTT CAACCTGGAT GAGAGTGGCG 1751 TGCTCAGTCT AGACAGGGTG GAGTCTGTAT TTGAGACACT GGTAGAGGAC 1801 AGCGCAGAAG AGGAATCTAC TCTCACCAAA CTTGGCAACA CCATTTCCAG 1851 CCTGTTTGGA GGCGGTACCA CACCAGATGC CAAGGAGAAT GGTACTGATA 1901 CTGTCCAGGA GGAAGAGGAG AGCCCTGCAG AGGGGAGCAA GGACGAGCCT 1951 GGGGAGCAGG TGGAGCTCAA GGAGGAAGCT GAGGCCCCAG TGGAGGATGG 2001 CTCTCAGCCC CCACCCCCTG AACCTAAGGG AGATGCAACC CCTGAGGGAG 2051 AAAAGGCCAC AGAAAAAGAA AATGGGGACA AGTCTGAGGC CCAGAAACCA 2101 AGTGAGAAGG CAGAGGCAGG GCCTGAGGGC GTCGCTCCAG CCCCAGAGGG 2151 AGAGAAGAAG CAGAAGCCCG CCAGGAAGCG GCGAATGGTA GAGGAGATCG 2201 GGGTGGAGCT GGTTGTTCTG GACCTGCCTG ACTTGCCAGA GGATAAGCTG 2251 GCTCAGTCGG TGCAGAAACT TCAGGACTTG ACACTCCGAG ACCTGGAGAA 2301 
GCAGGAACGG GAAAAAGCTG CCAACAGCTT GGAAGCGTTC ATATTTGAGA 2351 CCCAGGACAA GCTGTACCAG CCCGAGTACC AGGAAGTGTC CACAGAGGAG 2401 CAGCGTGAGG AGATCTCTGG GAAGCTCAGC GCCGCATCCA CCTGGCTGGA 2451 GGATGAGGGT GTTGGAGCCA CCACAGTGAT GTTGAAGGAG AAGCTGGCTG 2501 AGCTGAGGAA GCTGTGCCAA GGGCTGTTTT TTCGGGTAGA GGAGCGCAAG 2551 AAGTGGCCCG AACGGCTGTC TGCCCTCGAT AATCTCCTCA ACCATTCCAG 2601 CATGTTCCTC AAGGGGGCCC GGCTCATCCC AGAGATGGAC CAGATCTTCA 2651 CTGAGGTGGA GATGACAACG TTAGAGAAAG TCATCAATGA GACCTGGGCC 2701 TGGAAGAATG CAACTCTGGC CGAGCAGGCT AAGCTGCCCG CCACAGAGAA 2751 GCCTGTGTTG CTCTCAAAAG ACATTGAAGC TAAGATGATG GCCCTGGACC 2801 GAGAGGTGCA GTATCTGCTC AATAAGGCCA AGTTTACCAA GCCCCGGCCC 2851 CGGCCTAAGG ACAAGAATGG GACCCGGGCA GAGCCACCCC TCAATGCCAG 2901 TGCCAGTGAC CAGGGGGAGA AGGTCATCCC TCCAGCAGGC CAGACTGAAG 2951 ATGCAGAGCC CATTTCAGAA CCTGAGAAAG TAGAGACTGG ATCCGAGCCA 3001 GGAGACACTG AGCCTTTGGA GTTAGGAGGT CCTGGAGCAG AACCTGAACA 3051 GAAAGAACAA TCGACAGGAC AGAAGCGGCC TTTGAAGAAC GACGAACTAT 3101 AACCCCCACC TCTGTTTTCC CCATTCATCT CCACCCCCTT CCCCCACCAC 3151 TTCTATTTAT TTAACATCGA GGGTTGGGGG AGGGGTTGGT CCTGCCCTCG 3201 GCTGGAGTTC CTTTCTCACC CCTGTGATTT GGAGGTGTGG AGAAGGGGAA 3251 GGGAGGGACA GCTCACTGGT TCCTTCTGCA GTACCTCTGT GGTTAAAAAT 3301 GGAAACTGTT CTCCTCCCCA GCCCCACTCC CTGTTCCCTA CCCATATAGG 3351 CCCTAAATTT GGGAAAAATC ACTATTAATT TCTGAATCCT TTGCCTGTGG 3401 GTAGGAAGAG AATGGCTGCC AGTGGCTGAT GGGTCCCGGT GATGGGAAGG 3451 GTATCAGGTT GCTGGGGAGT TTCCACTCTT CTCTGGTGAT TGTTCCTTCC 3501 CTCCCTTCCT CTCCCACCAT GCGATGAGCA TCCTTTCAGG CCAGTGTCTG 3551 CAGAGCCTCA GTTACCAGGT TTGGTTTCTG AGTGCCTATC TGTGCTCTTT 3601 CCTCCCTCTG CGGGCTTCTC TTGCTCTGAG CCTCCCTTCC CCATTCCCAT 3651 GCAGCTCCTT TCCCCCTGGG TTTCCTTGGC TTCCTGCAGC AAATTGGGCA 3701 GTTCTCTGCC CCTTGCCTAA AAGCCTGTAC CTCTGGATTG GCGGAAGTAA 3751 ATCTGGAAGG ATTCTCACTC GTATTTCCCA CCCCTAGTGG CCAGAGGAGG 3801 GAGGGGCACA GTGAAGAAGG GAGCCCACCA CCTCTCCGAA GAGGAAAGCC 3851 ACGTAGAGTG GTTGGCATGG GGTGCCAGCA TCGTGCAAGC TCTGTCATAA 3901 TCTGCATCTT CCCAGCAGCC TGGTACCCCA GGTTCCTGTA ACTCCCTGCC 3951 TCCTCCTCTC 
TTCTGCTGTT CTGCTCCTCC CAGACAGAGC CTTTCCCTCA 4001 CCCCCTGACC CCCTGGGCTG ACCAAAATGT GCTTTCTACT GTGAGTCCCT 4051 ATCCCAAGAT CCTGGGGAAA GGAGAGACCA TGGTGTGAAT GTAGAGATGC 4101 CACCTCCCTC TCTCTGAGGC AGGCCTGTGG ATGAAGGAGG AGGGTCAGGG 4151 CTGGCCTTCC TCTGTGCATC ACTCTGCTAG GTTGGGGGCC CCCGACCCAC 4201 CATACCTACG CCTAGGGAGC CCGTCCTCCA GTATTCCGTC TGTAGCAGGA 4251 GCTAGGGCTG CTGCCTCAGC TCCAAGACAA GAATGAACCT GGCTGTGTCA 4301 GTCATTTTGT CTTTTCCTTT TTTTTTTTTT GCCACATTGG CAGAGATGGG 4351 ACCTAAGGGT CCCACCCCTC ACCCCACCCC CACCTCTTCT GTATGTTTGA 4401 ATTCTTTCAG TAGCTGTTGA TGCTGGTTGG ACAGGTTTGA GTCAAATTGT 4451 ACTTTGCTCC ATTGTTAATT GAGAAACTGT TTCAATAAAA TATTCTTTTC 4501 TAC // LOCUS AB014588 4301 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0688 protein, complete cds. ACCESSION AB014588 NID g3327189 VERSION AB014588.1 GI:3327189 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HK03410. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4301) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .4301 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HK03410" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 401. 
.2914 /gene="KIAA0688" CDS 401. .2914 /gene="KIAA0688" /codon_start=1 /product="KIAA0688 protein" /protein_id="BAA31663.1" /db_xref="PID:d1032624" /db_xref="PID:g3327190" /db_xref="GI:3327190" /translation="MSQTGSHPGRGLAGRWLWGAQPCLLLPIVPLSWLVWLLLLLLAS LLPSARLASPLPREEEIVFPEKLNGSVLPGSGTPARLLCRLQAFGETLLLELEQDSGV QVEGLTVQYLGQAPELLGGAEPGTYLTGTINGDPESVASLHWDGGALLGVLQYRGAEL HLQPLEGGTPNSAGGPGAHILRRKSPASGQGPMCNVKAPLGSPSPRPRRAKRFASLSR FVETLVVADDKMAAFHGAGLKRYLLTVMAAAAKAFKHPSIRNPVSLVVTRLVILGSGE EGPQVGPSAAQTLRSFCAWQRGLNTPEDSDPDHFDTAILFTRQDLCGVSTCDTLGMAD VGTVCDPARSCAIVEDDGLQSAFTAAHELGHVFNMLHDNSKPCISLNGPLSTSRHVMA PVMAHVDPEEPWSPCSARFITDFLDNGYGHCLLDKPEAPLHLPVTFPGKDYDADRQCQ LTFGPDSRHCPQLPPPCAALWCSGHLNGHAMCQTKHSPWADGTPCGPAQACMGGRCLH MDQLQDFNIPQAGGWGPWGPWGDCSRTCGGGVQFSSRDCTRPVPRNGGKYCEGRRTRF RSCNTEDCPTGSALTFREEQCAAYNHRTDLFKSFPGPMDWVPRYTGVAPQDQCKLTCQ ARALGYYYVLEPRVVDGTPCSPDSSSVCVQGRCIHAGCDRIIGSKKKFDKCMVCGGDG SGCSKQSGSFRKFRYGYNNVVTIPAGATHILVRQQGNPGHRSIYLALKLPDGSYALNG EYTLMPSPTDVVLPGAVSLRYSGATAASETLSGHGPLAQPLTLQVLVAGNPQDTRLRY SFFVPRPTPSTPRPTPQDWLHRRAQILEILRRRPWAGRK" BASE COUNT 844 a 1267 c 1232 g 958 t ORIGIN 1 CACATATGCA CGAGAGAGAC AGAGGAGGAA AGAGACAGAG ACAAAGGCAC 51 AGCGGAAGAA GGCAGAGACA GGGCAGGCAC AGAAGCGGCC CAGACAGAGT 101 CCTACAGAGG GAGAGGCCAG AGAAGCTGCA GAAGACACAG GCAGGGAGAG 151 ACAAAGATCC AGGAAAGGAG GGCTCAGGAG GAGAGTTTGG AGAAGCCAGA 201 CCCCTGGGCA CCTCTCCCAA GCCCAAGGAC TAAGTTTTCT CCATTTCCTT 251 TAACGGTCCT CAGCCCTTCT GAAAACTTTG CCTCTGACCT TGGCAGGAGT 301 CCAAGCCCCC AGGCTACAGA GAGGAGCTTT CCAAAGCTAG GGTGTGGAGG 351 ACTTGGTGCC CTAGACGGCC TCAGTCCCTC CCAGCTGCAG TACCAGTGCC 401 ATGTCCCAGA CAGGCTCGCA TCCCGGGAGG GGCTTGGCAG GGCGCTGGCT 451 GTGGGGAGCC CAACCCTGCC TCCTGCTCCC CATTGTGCCG CTCTCCTGGC 501 TGGTGTGGCT GCTTCTGCTA CTGCTGGCCT CTCTCCTGCC CTCAGCCCGG 551 CTGGCCAGCC CCCTCCCCCG GGAGGAGGAG ATCGTGTTTC CAGAGAAGCT 601 CAACGGCAGC GTCCTGCCTG GCTCGGGCAC CCCTGCCAGG CTGTTGTGCC 651 GCTTGCAGGC CTTTGGGGAG ACGCTGCTAC TAGAGCTGGA GCAGGACTCC 701 GGTGTGCAGG TCGAGGGGCT GACAGTGCAG TACCTGGGCC AGGCGCCTGA 751 
GCTGCTGGGT GGAGCAGAGC CTGGCACCTA CCTGACTGGC ACCATCAATG 801 GAGATCCGGA GTCGGTGGCA TCTCTGCACT GGGATGGGGG AGCCCTGTTA 851 GGCGTGTTAC AATATCGGGG GGCTGAACTC CACCTCCAGC CCCTGGAGGG 901 AGGCACCCCT AACTCTGCTG GGGGACCTGG GGCTCACATC CTACGCCGGA 951 AGAGTCCTGC CAGCGGTCAA GGTCCCATGT GCAACGTCAA GGCTCCTCTT 1001 GGAAGCCCCA GCCCCAGACC CCGAAGAGCC AAGCGCTTTG CTTCACTGAG 1051 TAGATTTGTG GAGACACTGG TGGTGGCAGA TGACAAGATG GCCGCATTCC 1101 ACGGTGCGGG GCTAAAGCGC TACCTGCTAA CAGTGATGGC AGCAGCAGCC 1151 AAGGCCTTCA AGCACCCAAG CATCCGCAAT CCTGTCAGCT TGGTGGTGAC 1201 TCGGCTAGTG ATCCTGGGGT CAGGCGAGGA GGGGCCCCAA GTGGGGCCCA 1251 GTGCTGCCCA GACCCTGCGC AGCTTCTGTG CCTGGCAGCG GGGCCTCAAC 1301 ACCCCTGAGG ACTCGGACCC TGACCACTTT GACACAGCCA TTCTGTTTAC 1351 CCGTCAGGAC CTGTGTGGAG TCTCCACTTG CGACACGCTG GGTATGGCTG 1401 ATGTGGGCAC CGTCTGTGAC CCGGCTCGGA GCTGTGCCAT TGTGGAGGAT 1451 GATGGGCTCC AGTCAGCCTT CACTGCTGCT CATGAACTGG GTCATGTCTT 1501 CAACATGCTC CATGACAACT CCAAGCCATG CATCAGTTTG AATGGGCCTT 1551 TGAGCACCTC TCGCCATGTC ATGGCCCCTG TGATGGCTCA TGTGGATCCT 1601 GAGGAGCCCT GGTCCCCCTG CAGTGCCCGC TTCATCACTG ACTTCCTGGA 1651 CAATGGCTAT GGGCACTGTC TCTTAGACAA ACCAGAGGCT CCATTGCATC 1701 TGCCTGTGAC TTTCCCTGGC AAGGACTATG ATGCTGACCG CCAGTGCCAG 1751 CTGACCTTCG GGCCCGACTC ACGCCATTGT CCACAGCTGC CGCCGCCCTG 1801 TGCTGCCCTC TGGTGCTCTG GCCACCTCAA TGGCCATGCC ATGTGCCAGA 1851 CCAAACACTC GCCCTGGGCC GATGGCACAC CCTGCGGGCC CGCACAGGCC 1901 TGCATGGGTG GTCGCTGCCT CCACATGGAC CAGCTCCAGG ACTTCAATAT 1951 TCCACAGGCT GGTGGCTGGG GTCCTTGGGG ACCATGGGGT GACTGCTCTC 2001 GGACCTGTGG GGGTGGTGTC CAGTTCTCCT CCCGAGACTG CACGAGGCCT 2051 GTCCCCCGGA ATGGTGGCAA GTACTGTGAG GGCCGCCGTA CCCGCTTCCG 2101 CTCCTGCAAC ACTGAGGACT GCCCAACTGG CTCAGCCCTG ACCTTCCGCG 2151 AGGAGCAGTG TGCTGCCTAC AACCACCGCA CCGACCTCTT CAAGAGCTTC 2201 CCAGGGCCCA TGGACTGGGT TCCTCGCTAC ACAGGCGTGG CCCCCCAGGA 2251 CCAGTGCAAA CTCACCTGCC AGGCCCGGGC ACTGGGCTAC TACTATGTGC 2301 TGGAGCCACG GGTGGTAGAT GGGACCCCCT GTTCCCCGGA CAGCTCCTCG 2351 GTCTGTGTCC AGGGCCGATG CATCCATGCT GGCTGTGATC GCATCATTGG 2401 CTCCAAGAAG AAGTTTGACA 
AGTGCATGGT GTGCGGAGGG GACGGTTCTG 2451 GTTGCAGCAA GCAGTCAGGC TCCTTCAGGA AATTCAGGTA CGGATACAAC 2501 AATGTGGTCA CTATCCCCGC GGGGGCCACC CACATTCTTG TCCGGCAGCA 2551 GGGAAACCCT GGCCACCGGA GCATCTACTT GGCCCTGAAG CTGCCAGATG 2601 GCTCCTATGC CCTCAATGGT GAATACACGC TGATGCCCTC CCCCACAGAT 2651 GTGGTACTGC CTGGGGCAGT CAGCTTGCGC TACAGCGGGG CCACTGCAGC 2701 CTCAGAGACA CTGTCAGGCC ATGGGCCACT GGCCCAGCCT TTGACACTGC 2751 AAGTCCTAGT GGCTGGCAAC CCCCAGGACA CACGCCTCCG ATACAGCTTC 2801 TTCGTGCCCC GGCCGACCCC TTCAACGCCA CGCCCCACTC CCCAGGACTG 2851 GCTGCACCGA AGAGCACAGA TTCTGGAGAT CCTTCGGCGG CGCCCCTGGG 2901 CGGGCAGGAA ATAACCTCAC TATCCCGGCT GCCCTTTCTG GGCACCGGGG 2951 CCTCGGACTT AGCTGGGAGA AAGAGAGAGC TTCTGTTGCT GCCTCATGCT 3001 AAGACTCAGT GGGGAGGGGC TGTGGGCGTG AGACCTGCCC CTCCTCTCTG 3051 CCCTAATGCG CAGGCTGGCC CTGCCCTGGT TTCCTGCCCT GGGAGGCAGT 3101 GATGGGTTAG TGGATGGAAG GGGCTGACAG ACAGCCCTCC ATCTAAACTG 3151 CCCCCTCTGC CCTGCGGGTC ACAGGAGGGA GGGGGAAGGC AGGGAGGGCC 3201 TGGGCCCCAG TTGTATTTAT TTAGTATTTA TTCACTTTTA TTTAGCACCA 3251 GGGAAGGGGA CAAGGACTAG GGTCCTGGGG AACCTGACCC CTGACCCCTC 3301 ATAGCCCTCA CCCTGGGGCT AGGAAATCCA GGGTGGTGGT GATAGGTATA 3351 AGTGGTGTGT GTATGCGTGT GTGTGTGTGT GTGAAAATGT GTGTGTGCTT 3401 ATGTATGAGG TACAACCTGT TCTGCTTTCC TCTTCCTGAA TTTTATTTTT 3451 TGGGAAAAGA AAAGTCAAGG GTAGGGTGGG CCTTCAGGGA GTGAGGGATT 3501 ATCTTTTTTT TTTTTTCTTT CTTTCTTTCT TTTTTTTTTT TGAGACAGAA 3551 TCTCGCTCTG TCGCCCAGGC TGGAGTGCAA TGGCACAATC TCGGCTCACT 3601 GCATCCTCCG CCTCCCGGGT TCAAGTGATT CTCATGCCTC AGCCTCCTGA 3651 GTAGCTGGGA TTACAGGCTC CTGCCACCAC GCCCAGCTAA TTTTTGTTTT 3701 GTTTTGTTTG GAGACAGAGT CTCGCTATTG TCACCAGGGC TGGAATGATT 3751 TCAGCTCACT GCAACCTTCG CCACCTGGGT TCCAGCAATT CTCCTGCCTC 3801 AGCCTCCCGA GTAGCTGAGA TTATAGGCAC CTACCACCAC GCCCGGCTAA 3851 TTTTTGTATT TTTAGTAGAG ACGGGGTTTC ACCATGTTGG CCAGGCTGGT 3901 CTCGAACTCC TGACCTTAGG TGATCCACTC GCCTTCATCT CCCAAAGTGC 3951 TGGGATTACA GGCGTGAGCC ACCGTGCCTG GCCACGCCCA ACTAATTTTT 4001 GTATTTTTAG TAGAGACAGG GTTTCACCAT GTTGGCCAGG CTGCTCTTGA 4051 ACTCCTGACC TCAGGTAATC GACCTGCCTC 
GGCCTCCCAA AGTGCTGGGA 4101 TTACAGGTGT GAGCCACCAC GCCCGGTACA TATTTTTTAA ATTGAATTCT 4151 ACTATTTATG TGATCCTTTT GGAGTCAGAC AGATGTGGTT GCATCCTAAC 4201 TCCATGTCTC TGAGCATTAG ATTTCTCATT TGCCAATAAT AATACCTCCC 4251 TTAGAAGTTT GTTGTGAGGA TTAAATAATG TAAATAAAGA ACTAGCATAA 4301 C // LOCUS HSSCA1 10660 bp mRNA PRI 09-FEB-1995 DEFINITION H.sapiens SCA1 mRNA for ataxin. ACCESSION X79204 NID g529661 VERSION X79204.1 GI:529661 KEYWORDS ataxin; SCA1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10660) AUTHORS Banfi,S., Servadio,A., Chung,M.Y., Kwiatkowski,T.J. Jr., McCall,A.E., Duvick,L.A., Shen,Y., Roth,E.J., Orr,H.T. and Zoghbi,H.Y. TITLE Identification and characterization of the gene causing type 1 spinocerebellar ataxia JOURNAL Nat. Genet. 7 (4), 513-520 (1994) MEDLINE 95038838 REFERENCE 2 (bases 1 to 10660) AUTHORS Banfi,S. TITLE Direct Submission JOURNAL Submitted (11-MAY-1994) S. Banfi, Baylor College of Medicine, Dept of Pediatrics, One Baylor Plaza, Houston TX 77030, USA FEATURES Location/Qualifiers source 1. .10660 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /dev_stage="adult, fetal" /tissue_type="cerebellum, brain" /map="6p22" gene 936. .3386 /gene="SCA1" CDS 936. 
.3386 /gene="SCA1" /codon_start=1 /product="ataxin-1" /protein_id="CAA55793.1" /db_xref="PID:g529662" /db_xref="GI:529662" /db_xref="SWISS-PROT:P54253" /translation="MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTA WLPGNPGGRGHGGGRHGPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTL PAAYATPQPGTPVSPVQYAHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVA SAAGATTPSQRSQLEAYSTLLANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQ QQQQQQQQHLSRAPGLITPGSPPPAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQT MIPHTLTLGPPSQVVMQYADSGSHFVPREATKKAESSRLQQAIQAKEVLNGEMEKSRR YGAPSSADLGLGKAGGKSVPHPYESRHVVVHPSPSDYSSRDPSGVRASVMVLPNSNTP AADLEVQQATHREASPSTLNDKSGLHLGKPGHRSYALSPHTVIQTTHSASEPLPVGLP ATAFYAGTQPPVIGYLSGQQQAITYAGSLPQHLVIPGTQPLLIPVGSTDMEASGAAPA IVTSSPQFAAVPHTFVTTALPKSENFNPEALVTQAAYPAMVQAQIHLPVVQSVASPAA APPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIEDSHS PGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGDV CISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSRHRYAEQENGINQGSAQMLS ENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESRKLEKSEDEPPLTLPKPS LIPQEVKICIEGRSNVGK" BASE COUNT 2807 a 2515 c 2424 g 2914 t ORIGIN 1 CTACTACAGT GGCGGACGTA CAGGACCTGT TTCACTGCAG GGGGATCCAA 51 AACAAGCCCC GTGGAGCAAC AGCCAGAGCA ACAGCAGCTG CAAGACATTG 101 TTTCTCTCCC TCTGCCCCCC CTTCCCCACG CAACCCCAGA TCCATTTACA 151 CTTTACAGTT TTACCTCACA AAAACTACTA CAAGCACCAA GCTCCCTGAT 201 GGAAAGGAGC ATCGTGCATC AAGTCACCAG GGTGGTCCAT TCAAGCTGCA 251 GATTTGTTTG TCATCCTTGT ACAGCAATCT CCTCCTCCAC TGCCACTACA 301 GGGAAGTGCA TCACATGTCA GCATACTGGA GCATAGTGAA AGAGTCTATT 351 TTGAAGCTTC AAACTTAGTG CTGCTGCAGA CCAGGAACAA GAGAGAAAGA 401 GTGGATTTCA GCCTGCACGG ATGGTCTTGA AACACAAATG GTTTTTGGTC 451 TAGGCGTTTT ACACTGAGAT TCTCCACTGC CACCCTTTCT ACTCAAGCAA 501 AATCTTCGTG AAAAGATCTG CTGCAAGGAA CTGATAGCTT ATGGTTCTCC 551 ATTGTGATGA AAGCACATGG TACAGTTTTC CAAAGAAATT AGACCATTTT 601 CTTCGTGAGA AAGAAATCGA CGTGCTGTTT TCATAGGGTA TTTCTCACTT 651 CTCTGTGAAA GGAAGAAAGA ACACGCCTGA GCCCAAGAGC CCTCAGGAGC 701 CCTCCAGAGC CTGTGGGAAG TCTCCATGGT GAAGTATAGG CTGAGGCTAC 751 CTGTGAACAG TACGCAGTGA ATGTTCATCC AGAGCTGCTG TTGGCGGATT 801 GTACCCACGG 
GGAGATGATT CCTCATGAAG AGCCTGGATC CCCTACAGAA 851 ATCAAATGTG ACTTTCCGTT TATCAGACTA AAATCAGAGC CATCCAGACA 901 GTGAAACAGT CACCGTGGAG GGGGGACGGC GAAAAATGAA ATCCAACCAA 951 GAGCGGAGCA ACGAATGCCT GCCTCCCAAG AAGCGCGAGA TCCCCGCCAC 1001 CAGCCGGTCC TCCGAGGAGA AGGCCCCTAC CCTGCCCAGC GACAACCACC 1051 GGGTGGAGGG CACAGCATGG CTCCCGGGCA ACCCTGGTGG CCGGGGCCAC 1101 GGGGGCGGGA GGCATGGGCC GGCAGGGACC TCGGTGGAGC TTGGTTTACA 1151 ACAGGGAATA GGTTTACACA AAGCATTGTC CACAGGGCTG GACTACTCCC 1201 CGCCCAGCGC TCCCAGGTCT GTCCCCGTGG CCACCACGCT GCCTGCCGCG 1251 TACGCCACCC CGCAGCCAGG GACCCCGGTG TCCCCCGTGC AGTACGCTCA 1301 CCTGCCGCAC ACCTTCCAGT TCATTGGGTC CTCCCAATAC AGTGGAACCT 1351 ATGCCAGCTT CATCCCATCA CAGCTGATCC CCCCAACCGC CAACCCCGTC 1401 ACCAGTGCAG TGGCCTCGGC CGCAGGGGCC ACCACTCCAT CCCAGCGCTC 1451 CCAGCTGGAG GCCTATTCCA CTCTGCTGGC CAACATGGGC AGTCTGAGCC 1501 AGACGCCGGG ACACAAGGCT GAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 1551 CAGCAGCAGC ATCAGCATCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1601 GCAGCAGCAG CAGCACCTCA GCAGGGCTCC GGGGCTCATC ACCCCGGGGT 1651 CCCCCCCACC AGCCCAGCAG AACCAGTACG TCCACATTTC CAGTTCTCCG 1701 CAGAACACCG GCCGCACCGC CTCTCCTCCG GCCATCCCCG TCCACCTCCA 1751 CCCCCACCAG ACGATGATCC CACACACGCT CACCCTGGGG CCCCCCTCCC 1801 AGGTCGTCAT GCAATACGCC GACTCCGGCA GCCACTTTGT CCCTCGGGAG 1851 GCCACCAAGA AAGCTGAGAG CAGCCGGCTG CAGCAGGCCA TCCAGGCCAA 1901 GGAGGTCCTG AACGGTGAGA TGGAGAAGAG CCGGCGGTAC GGGGCCCCGT 1951 CCTCAGCCGA CCTGGGCCTG GGCAAGGCAG GCGGCAAGTC GGTTCCTCAC 2001 CCGTACGAGT CCAGGCACGT GGTGGTCCAC CCGAGCCCCT CAGACTACAG 2051 CAGTCGTGAT CCTTCGGGGG TCCGGGCCTC TGTGATGGTC CTGCCCAACA 2101 GCAACACGCC CGCAGCTGAC CTGGAGGTGC AACAGGCCAC TCATCGTGAA 2151 GCCTCCCCTT CTACCCTCAA CGACAAAAGT GGCCTGCATT TAGGGAAGCC 2201 TGGCCACCGG TCCTACGCGC TCTCACCCCA CACGGTCATT CAGACCACAC 2251 ACAGTGCTTC AGAGCCACTC CCGGTGGGAC TGCCAGCCAC GGCCTTCTAC 2301 GCAGGGACTC AACCCCCTGT CATCGGCTAC CTGAGCGGCC AGCAGCAAGC 2351 AATCACCTAC GCCGGCAGCC TGCCCCAGCA CCTGGTGATC CCCGGCACAC 2401 AGCCCCTGCT CATCCCGGTC GGCAGCACTG ACATGGAAGC GTCGGGGGCA 2451 GCCCCGGCCA TAGTCACGTC ATCCCCCCAG 
TTTGCTGCAG TGCCTCACAC 2501 GTTCGTCACC ACCGCCCTTC CCAAGAGCGA GAACTTCAAC CCTGAGGCCC 2551 TGGTCACCCA GGCCGCCTAC CCAGCCATGG TGCAGGCCCA GATCCACCTG 2601 CCTGTGGTGC AGTCCGTGGC CTCCCCGGCG GCGGCTCCCC CTACGCTGCC 2651 TCCCTACTTC ATGAAAGGCT CCATCATCCA GTTGGCCAAC GGGGAGCTAA 2701 AGAAGGTGGA AGACTTAAAA ACAGAAGATT TCATCCAGAG TGCAGAGATA 2751 AGCAACGACC TGAAGATCGA CTCCAGCACC GTAGAGAGGA TTGAAGACAG 2801 CCATAGCCCG GGCGTGGCCG TGATACAGTT CGCCGTCGGG GAGCACCGAG 2851 CCCAGGTCAG CGTTGAAGTT TTGGTAGAGT ATCCTTTTTT TGTGTTTGGA 2901 CAGGGCTGGT CATCCTGCTG TCCGGAGAGA ACCAGCCAGC TCTTTGATTT 2951 GCCGTGTTCC AAACTCTCAG TTGGGGATGT CTGCATCTCG CTTACCCTCA 3001 AGAACCTGAA GAACGGCTCT GTTAAAAAGG GCCAGCCCGT GGATCCCGCC 3051 AGCGTCCTGC TGAAGCACTC AAAGGCCGAC GGCCTGGCGG GCAGCAGACA 3101 CAGGTATGCC GAGCAGGAAA ACGGAATCAA CCAGGGGAGT GCCCAGATGC 3151 TCTCTGAGAA TGGCGAACTG AAGTTTCCAG AGAAAATGGG ATTGCCTGCA 3201 GCGCCCTTCC TCACCAAAAT AGAACCCAGC AAGCCCGCGG CAACGAGGAA 3251 GAGGAGGTGG TCGGCGCCAG AGAGCCGCAA ACTGGAGAAG TCAGAAGACG 3301 AACCACCTTT GACTCTTCCT AAGCCTTCTC TAATTCCTCA GGAGGTTAAG 3351 ATTTGCATTG AAGGCCGGTC TAATGTAGGC AAGTAGAGGC AGCGTGGGGG 3401 AAAGGAAACG TGGCTCTCCC TTATCATTTG TATCCAGATT ACTGTACTGT 3451 AGGCTAAAAT AACACAGTAT TTACATGTTA TCTTCTTAAT TTTAGGTTTC 3501 TGTTCTAACC TTGTCATTAG AGTTACAGCA GGTGTGTCGC AGGAGACTGG 3551 TGCATATGCT TTTTCCACGA GTGTCTGTCA GTGAGCGGGC GGGAGGAAGG 3601 GCACAGCAGG AGCGGTCAGG GCTCCAGGCA TCCCCGGGGA AGAAAGGAAC 3651 GGGGCTTCAC AGTGCCTGCC TTCTCTAGCG GCACAGAAGC AGCCGGGGGC 3701 GCTGACTCCC GCTAGTGTCA GGAGAAAAGT CCCGTGGGAA GAGTCCTGCA 3751 GGGGTGCAGG GTTGCACGCA TGTGGGGGTG CACAGGCGCT GTGGCGGCGA 3801 GTGAGGGTCT CTTTTTCTCT GCCTCCCTCT GCCTCACTCT CTTGCTATCG 3851 GCATGGGCCG GGGGGGTTCA GAGCAGTGTC CTCCTGGGGT TCCCACGTGC 3901 AAAATCAACA TCAGGAACCC AGCTTCAGGG CATCGCGGAG ACGCGTCAGA 3951 TGGCAGATTT GGAAAGTTAA CCATTTAAAA GAACATTTTT CTCTCCAACA 4001 TATTTTACAA TAAAAGCAAC TTTTAATTGT ATAGATATAT ATTTCCCCCT 4051 ATGGGGCCTG ACTGCACTGA TATATATTTT TTTTAAAGAG CAACTGCCAC 4101 ATGCGGGATT TCATTTCTGC TTTTTACTAG TGCAGCGATG 
TCACCAGGGT 4151 GTTGTGGTGG ACAGGGAAGC CCCTGCTGTC ATGGCCCCAC ATGGGGTAAG 4201 GGGGGTTGGG GGTGGGGGAG AGGGAGAGAG CGAACACCCA CGCTGGTTTC 4251 TGTGCAGTGT TAGGAAAACC AATCAGGTTA TTGCATTGAC TTCACTCCCA 4301 AGAGGTAGAT GCAAACTGCC CTTCAGTGAG AGCAACAGAA GCTCTTCACG 4351 TTGAGTTTGC GAAATCTTTT TGTCTTTGAA CTCTAGTACT GTTTATAGTT 4401 CATGACTATG GACAACTCGG GTGCCACTTT TTTTTTTTTC AGATTCCAGT 4451 GTGACATGAG GAATTAGATT TTGAAGATGA GCATATATTA CTATCTTTAA 4501 GCATTTAAAA ATACTGTTCA CACTTTATTA CCAAGCATCT TGGTCTCTCA 4551 TTCAACAAGT ACTGTATCTC ACTTTAAACT CTTTGGGGAA AAAACAAAAA 4601 CAAAAAAAAC TAAGTTGCTT TCTTTTTTTC AACACTGTAA CTACATTTCA 4651 GCTCTGCAGA ATTGCTGAAG AGCAAGATAT TGAAAGTTTC AATGTGGTTT 4701 AAAGGGATGA ATGTGAATTA TGAACTAGTA TGTGACAATA AATGACCACC 4751 AAGTACTACC TGACGGGAGG CACTTTTCAC TTTGATGTCT GAGAATCAGT 4801 TCAAGGCATA TGCAGAGTTG GCAGAGAAAC TGAGAGAAAA GGGATGGAGA 4851 AGAGAATACT CATTTTTGTC CAGTGTTTTT CTTTTTAAGA TGAACTTTTA 4901 AAGAACCTTG CGATTTGCAC ATATTGAGTT TATAACTTGT GTGATATTCC 4951 TGCAGTTTTT ATCCAATAAC ATTGTGGGAA AGGTTTGGGG GACTGAACGA 5001 GCATAAATAA ATGTAGCAAA ATTTCTTTCT AACCTGCCTA AACTCTAGGC 5051 CATTTTATAA GGTTATGTTC CTTTGAAAAT TCATTTTGGT CTTTTTACCA 5101 CATCTGTCAC AAAAAGCCAG GTCTTAGCGG GCTCTTAGAA ACTCTGAGAA 5151 TTTTCTTCAG ATTCATTGAG AGAGTTTTCC ATAAAGACAT TTATATATGT 5201 GAGCAAGATT TTTTTTAAAC AATTACTTTA TTATTGTTGT TATTAATGTT 5251 ATTTTCAGAA TGGCTTTTTT TTTCTATTCA AAATCAAATC GAGATTTAAT 5301 GTTTGGTACA AACCCAGAAA GGGTATTTCA TAGTTTTTAA ACCTTTCATT 5351 CCCAGAGATC CGAAATATCA TTTGTGGGTT TTGAATGCAT CTTTAAAGTG 5401 CTTTAAAAAA AAGTTTTATA AGTAGGGAGA AATTTTTAAA TATTCTTACT 5451 TGGATGGCTG CAACTAAACT GAACAAATAC CTGACTTTTC TTTTACCCCA 5501 TTGAAAATAG TACTTTCTTC GTTTCACAAA TTAAAAAAAA AATCTGGTAT 5551 CAACCCACAT TTTGGCTGTC TAGTATTCAT TTACATTTAG GGTTCACCAG 5601 GACTAATGAT TTTTATAAAC CGTTTTCTGG GGTGTACCAA AAACATTTGA 5651 ATAGGTTTAG AATAGCTAGA ATAGTTCCTT GACTTTCCTC GAATTTCATT 5701 ACCCTCTCAG CATGCTTGCA GAGAGCTGGG TGGGCTCATT CTTGCAGTCA 5751 TACTGCTTAT TTAGTGCTGT ATTTTTTAAA CGTTTCTGTT CAGAGAACTT 5801 
GCTTAATCTT CCATATATTC TGCTCAGGGC ACTTGCAATT ATTAGGTTTT 5851 GTTTTTCTTT TTGTTTTTTA GCCTTTGATG GTAAGAGGAA TACGGGCTGC 5901 CACATAGACT TTGTTCTCAT TAATATCACT ATTTACAACT CATGTGGACT 5951 CAGAAAAACA CACACCACCT TTTGGCTTAC TTCGAGTATT GAATTGACTG 6001 GATCCACTAA ACCAACACTA AGATGGGAAA ACACACATGG TTTGGAGCAA 6051 TAGGAACATC ATCATAATTT TTGTGGTTCT ATTTCAGGTA TAGGAATTAT 6101 AAAATAATTG GTTCTTTCTA AACACTTGTC CCATTTCATT CTCTTGCTTT 6151 TTTAGCATGT GCAATACTTT CTGTGCCAAT AGAGTCTGAC CAGTGTGCTA 6201 TATAGTTAAA GCTCATTCCC TTTTGGCTTT TTCCTTGTTT GGTTGATCTT 6251 CCCCATTCTG GCCAGAGCAG GGCTGGAGGG AAGGAGCCAG GAGGGAGAGA 6301 GCCTCCCACC TTTCCCCTGC TGCGGATGCT GAGTGCTGGG GCGGGGAGCC 6351 TTCAGGAGCC CCGTGCGTCT GCCGCCACGT TGCAGAAAGA GCCAGCCAAG 6401 GAGACCCGGG GGAGGAACCG CAGTGTCCCC TGTCACCACA CGGAATAGTG 6451 AATGTGGAGT GTGGAGAGGA AGGAGGCAGA TTCATTTCTA AGACGCACTC 6501 TGGAGCCATG TAGCCTGGAG TCAACCCATT TTCCACGGTC TTTTCTGCAA 6551 GTGGGCAGGC CCCTCCTCGG GGTCTGTGTC CTTGAGACTT GGAGCCCTGC 6601 CTCTGAGCCT GGACGGGAAG TGTGGCCTGT TGTGTGTGTG CGTTCTGAGC 6651 GTGTTGGCCA GTGGCTGTGG AGGGGACCAC CTGCCACCCA CGGTCACCAC 6701 TCCCTTGTGG CAGCTTTCTC TTCAAATAGG AAGAACGCAC AGAGGGCAGG 6751 AGCCTCCTGT TTGCAGACGT TGGCGGGCCC CGAGGCTCCC AGAGCAGCCT 6801 CTGTCACCGC TTCTGTGTAG CAAACATTAA CGATGACAGG GGTAGAAATT 6851 CTTCGGTGCC GTTCAGCTTA CAAGGATCAG CCATGTGCCT CTGTACTATG 6901 TCCACTTTGC AATATTTACC GACAGCCGTC TTTTGTTCTT TCTTTCCTGT 6951 TTTCCATTTT TAAACTAGTA ACAGCAGGCC TTTTGCGTTT ACAATGGAAC 7001 ACAATCACCA AGAAATTAGT CAGGGCGAAA AGAAAAAAAT AATACTATTA 7051 ATAAGAAACC AACAAACAAG AACCTCTCTT TCTAGGGATT TCTAAATATA 7101 TAAAATGACT GTTCCTTAGA ATGTTTAACT TAAGAATTAT TTCAGTTTGT 7151 CTGGGCCACA CTGGGGCAGA GGGGGGAGGG AGGGATACAG AGATGGATGC 7201 CACTTACCTC AGATCTTTTA AAGTGGAAAT CCAAATTGAA TTTTCATTTG 7251 GACTTTCAGG ATAATTTTCT ATGTTGGTCA ACTTTTCGTT TTCCCTAACT 7301 CACCCAGTTT AGTTTGGGAT GATTTGATTT CTGTTGTTGT TGATCCCATT 7351 TCTAACTTGG AATTGTGAGC CTCTATGTTT TCTGTTAGGT GAGTGTGTTG 7401 GGTTTTTTCC CCCCACCAGG AAGTGGCAGC ATCCCTCCTT CTCCCCTAAA 7451 GGGACTCTGC 
GGAACCTTTC ACACCTCTTT CTCAGGGACG GGGCAGGTGT 7501 GTGTGTGGTA CACTGACGTG TCCAGAAGCA GCACTTTGAC TGCTCTGGAG 7551 TAGGGTTGTA CAATTTCAAG GAATGTTTGG ATTTCCTGCA TCTTGTGGAT 7601 TACTCCTTAG ATACCGCATA GATTGCAATA TAATGCTGCA TGTTCAAGAT 7651 GAACAGTAGC TCCTAGTAAT CATAAAATCC ACTCTTTGCA CAGTTTGATC 7701 TTTACTGAAA TATGTTGCCA AAATTTATTT TTGTTGTTGT AGCTCTGGAT 7751 TTTGTTTTGT TTTGTTTTTT AAGGAAACGA TTGACAATAC CCTTTAACAT 7801 CTGTGACTAC TAAGGAAACC TATTTCTTTC ATAGAGAGAA AAATCTCCAA 7851 TGCTTTTGAA GACACTAATA CCGTGCTATT TCAGATATGG GTGAGGAAGC 7901 AGAGCTCTCG GTACCGAAGG CCGGGCTTCT TGAGCTGTGT TGGTTGTCAT 7951 GGCTACTGTT TCATGAACCA CAAGCAGCTC AACAGACTGG TCTGTTGCCT 8001 TCTGAAACCC TTTGCACTTC AATTTGCACC AGGTGAAAAC AGGGCCAGCA 8051 GACTCCATGG CCCAATTCGG TTTCTTCGGT GGTGATGTGA AAGGAGAGAA 8101 TTACACTTTT TTTTTTTTTA AGTGGCGTGG AGGCCTTTGC TTCCACATTT 8151 GTTTTTAACC CAGAATTTCT GAAATAGAGA ATTTAAGAAC ACATCAAGTA 8201 ATAAATATAC AGAGAATATA CTTTTTTATA AAGCACATGC ATCTGCTATT 8251 GTGTTGGGTT GGTTTCCTCT CTTTTCCACG GACAGTGTTG TGTTTCTGGC 8301 ATAGGGAAAC TCCAAACAAC TTGCACACCT CTACTCCGGA GCTGAGATTT 8351 CTTTTACATA GATGACCTCG CTTCAAATAC GTTACCTTAC TGATGATAGG 8401 ATCTTTTCTT GTAGCACTAT ACCTTGTGGG AATTTTTTTT TAAATGTACA 8451 CCTGATTTGA GAAGCTGAAG AAAACAAAAT TTTGAAGCAC TCACTTTGAG 8501 GAGTACAGGT AATGTTTTAA AAAATTGCAC AAAAGAAAAA TGAATGTCGA 8551 AATGATTCAT TCAGTGTTTG AAAGATATGG CTCTGTTGAA ACAATGAGTT 8601 TCATACTTTG TTTGTAAAAA AAAAAAGCAG AGAAGGGTTG AAAGTTACAT 8651 GTTTTTTTGT ATATAGAAAT TTGTCATGTC TAAATGATCA GATTTGTATG 8701 GTTATGGCCT GGAAGAATTA CTACGTAAAA GGCTCTTAAA CTATACCTAT 8751 GCTTATTGTT ATTTTTGTTA CATATAGCCC TCGTCTGAGG GAGGGGAACT 8801 CGGTATTCTG CGATTTGAGA ATACTGTTCA TTCCTATGCT GAAAGTACTT 8851 CTCTGAGCTC CCTTCTTAGT CTAAACTCTT AAGCCATTGC AACTTCTTTT 8901 TCTTCAGAGA TGATGTTTGA CATTTTCAGC ACTTCCTGTT CCTATAAACC 8951 CAAAGAATAT AATCTTGAAC ACGAAGTGTT TGTAACAAGG GATCCAGGCT 9001 ACCAATCAAA CAGGACTCAT TATGGGGACA AAAAAAAAAA AAATTATTTC 9051 ACCTTCTTTC CCCCCACACC TCATTTAAAT GGGGGGAGTA AAAACATGAT 9101 TTCAATGTAA ATGCCTCATT 
TTATTTTAGT TTTATTTTGA TTTTTATTTA 9151 ATATAAAGAG GCCAGAATAA ATACGGAGCA TCTTCTCAGA ATAGTATTCC 9201 TGTCCAAAAA TCAAGCCGGA CAGTGGAAAC TGGACAGCTG TGGGGATATT 9251 AAGCACCCCC ACTTACAATT CTTAAATTCA GAATCTCGTC CCCTCCCTTC 9301 TCGTTGAAGG CAACTGTTCT GGTAGCTAAC TTTCTCCTGT GTAATGGCGG 9351 GAGGGAACAC CGGCTTCAGT TTTTCATGTC CCCATGACTT GCATACAAAT 9401 GGTTCAACTG TATTAAAATT AAGTGCATTT GGCCAATAGG TAGTATCTAT 9451 ACAATAACAA CAATCTCTAA GAATTTCCAT AACTTTTCTT ATCTGAAAGG 9501 ACTCAAGTCT TCCACTGCAG ATACATTGGA GGCTTCACCC ACGTTTTCTT 9551 TCCCTTTAGT TTGTTTGCTG TCTGGATGGC CAATGAGCCT GTCTCCTTTT 9601 CTGTGGCCAA TCTGAAGGCC TTCGTTGGAA GTGTTGTTCA CAGTAATCCT 9651 TACCAAGATA ACATACTGTC CTCCAGAATA CCAAGTATTA GGTGACACTA 9701 GCTCAAGCTG TTGTCTTCAG AGCAGTTACC AAGAAGCTCG GTGCACAGGT 9751 TTTCTCTGGT TCTTACAGGA ACCACCTACT CTTTCAGTTT TCTGGCCCAG 9801 GAGTGGGGTA AATCCTTTAG TTAGTGCATT TGAACTTGGT ACCTGTGCAT 9851 TCAGTTCTGT GAATACTGCC CTTTTTGGCG GGGTTTCCTC ATCTCCCCAG 9901 CCTGAACTGC TCAACTCTAA ACCCAAATTA GTGTCAGCCG AAAGGAGGTT 9951 TCAAGATAGT CCTGTCAGTA TTTGTGGTGA CCTTCAGATT AGACAGTCTT 10001 CATTTCCAGC CAGTGGAGTC CTGGCTCCAG AGCCATCTCT GAGACTCCGT 10051 ACTACTGGAT GTTTTAATAT CAGATCATTA CCCACCATAT GCCTCCCACA 10101 GGCCAAGGGA AAACAGACAC CAGAACTTGG GTTGAGGGCA CTACCAGACT 10151 GACATGGCCA GTACAGAGGA GAACTAGGGA AGGAATGATG TTTTGCACCT 10201 TATTGAAAAG AAAATTTTAA GTGCATACAT AATAGTTAAG AGCTTTTATT 10251 GTGACAGGAG AACTTTTTTC CATATGCGTG CATACTCTCT GTAATTCCAG 10301 TGTAAAATAT TGTACTTGCA CTAGCTTTTT TAAACAAATA TTAAAAAATG 10351 GAAGAATTCA TATTCTATTT TCTAATCGTG GTGTGTCTAT TTGTAGGATA 10401 CACTCGAGTC TGTTTATTGA ATTTTATGGT CCCTTTCTTT GATGGTGCTT 10451 GCAGGTTTTC TAGGTAGAAA TTATTTCATT ATTATAATAA AACAATGTTT 10501 GATTCAAAAT TTGAACAAAA TTGTTTTAAA TAAATTGTCT GTATACCAGT 10551 ACAAGTTTAT TGTTTCAGTA TACTCGTACT AATAAAATAA CAGTGCCAAT 10601 TGCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 10651 AAAAAAAAAA // LOCUS HUMPLCE 4565 bp mRNA PRI 09-FEB-1999 DEFINITION Human mRNA for phospholipase C, complete cds. 
ACCESSION D42108 NID g780121 VERSION D42108.1 GI:780121 KEYWORDS PLC-L (PLC-epsilon); phospholipase C. SOURCE Homo sapiens cDNA to mRNA, clone_lib:adult brain, Hela, fetal heart clone:HOP. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4565) AUTHORS Kohno,T. TITLE Direct Submission JOURNAL Submitted (14-NOV-1994) to the DDBJ/EMBL/GenBank databases. Takashi Kohno, National Cancer Center Research Institute, Biology Division; 5-1-1, Tsukiji, Chuo-ku, Tokyo 104, Japan (E-mail:tkohno@gan1.ncc.go.jp, Tel:03-3542-2511(ex.4651), Fax:03-3542-0807) REFERENCE 2 (bases 1 to 3370; 3235 to 4565) AUTHORS Kohno,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 3371 to 3534) AUTHORS Otsuka,T., Kohno,T., Mori,M., Noguchi,M., Hirohashi,S. and Yokota,J. TITLE Deletion mapping of chromosome 2 in human lung carcinoma JOURNAL Genes Chromosomes Cancer 16 (2), 113-119 (1996) MEDLINE 96415751 REFERENCE 4 (sites) AUTHORS Kohno,T., Otsuka,T., Takano,H., Yamamoto,T., Hamaguchi,M., Terada,M. and Yokota,J. TITLE Identification of a novel phospholipase C family gene at chromosome 2q33 that is homozygously deleted in human small cell lung carcinoma JOURNAL Hum. Mol. Genet. 4 (4), 667-674 (1995) MEDLINE 95359973 FEATURES Location/Qualifiers source 1. .4565 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /clone="HOP" /clone_lib="adult brain, Hela, fetal heart" 5'UTR 1. .203 gene 204. .3197 /gene="PLC-L (PLC-epsilon)" CDS 204. 
.3197 /gene="PLC-L (PLC-epsilon)" /codon_start=1 /product="Phospholipase C" /protein_id="BAA07688.1" /db_xref="PID:d1008271" /db_xref="PID:g780122" /db_xref="GI:780122" /translation="MPSEKKISSANDCISFMQAGCELKKVRPNSRIYNRFFTLDTDLQ ALRWEPSKKDLEKAKLDISAIKEIRLGKNTETFTNNGLADQICEDCAFSILHGENYES LDLVANSADVANIWVSGLRYLVSRSKQPLDFMEGNQNTPRFMWLKTVFEAADVDGNGI MLEDTSVELIKQLNPTLKEAKIRLKFKEIQKSKEKLTTRVTEEEFCEAFCELCTRPEV YFLLVQISKNKEYLDANDLMLFLEAEQGVTHITEDICLDIIRRYELSEEGRQKGFLAI DGFTQYLLSSECDIFDPEQKKVAQDMTQPLSHYYINASHNTYLIEDQFRGPADINGYI RALKMGCRSVELDVSDGSDNEPILCNRNNMTTHVSFRSVIEVINKFAFVASEYPLILC LGNHCSLPQQKVMAQQMKKVFGNKLYTEAPLPSESYLPSPEKLKRMIIVKGKKLPSDP DVLEGEVTDEDEEAQMSRRMSVDYNGEQKQIRLCRELSDLVSICKSVQYRDFELSMKS QNYWEMCSFSETEASRIANEYPEDFVNYNKKFLSRIYPSAMRIDSSNLNPQDFWNCGC QIVAMNFQTPGPMMDLHTGWFLQNGGCGYVLRPSIMRDEVSYFSANTKGILPGVSPLA LHIKIISGQNFPKPKGACAKGDVIDPYVCIEIHGIPADCSEQRTKTVQQNSDNPIFDE TFEFQVNLPELAMIRFVVLDDDYIGDEFIGQYTIPFECLQPGYRHVPLRSFVGDIMEH VTLFVHIAITNRSGGGKAQKRSLSVRMGKKVREYTMLRNIGLKTIDDIFKIAVHPLRE AIDMRENMQNAIVSIKELCGLPPIASLKQCLLTLSSRLITSDNTPSVSLVMKDSFPYL EPLGAIPDVQKKMLTAYDLMIQESRFLIEMADTVQEKIVQCQKAGMEFHEELHNLGAK EGLKGRKLNKATESFAWNITVLKGQGDLLKNAKNEAIENMKQIQLACLSCGLSKAPSS SAEAKSKRSLEAIEEKESSEENGKL" 3'UTR 3198. 
.4565 BASE COUNT 1439 a 815 c 992 g 1319 t ORIGIN 1 GAATTCCGGG CAGCATCATC AAGGCACATA GATTCTCTCT TTGAACAATG 51 CAGCTGCTGC AGGGGATGGG GAAGGGATGG CACTGGCAAT TCAAGACTGC 101 CTTTCCTACT TCTTCAGTGC CTCTTTCACC AATATGAAGT TAAAACCAGG 151 ATCCTTCAAA CCAAAAATGT GGTGGAAGAA AGAAAACCGT GTCTTTCAGC 201 AGCATGCCAT CGGAAAAGAA AATTAGCAGT GCAAATGACT GCATCAGCTT 251 CATGCAAGCT GGCTGTGAGT TGAAGAAAGT CCGGCCAAAT TCTCGCATTT 301 ACAACCGTTT TTTCACTCTG GACACAGACC TTCAAGCTCT TCGCTGGGAA 351 CCTTCAAAGA AAGACCTCGA GAAAGCCAAG CTTGATATTT CTGCCATAAA 401 AGAGATCAGA CTGGGGAAAA ACACGGAAAC ATTTACAAAC AATGGCCTTG 451 CTGACCAGAT CTGTGAGGAC TGTGCCTTTT CCATACTCCA CGGGGAAAAC 501 TATGAGTCTC TGGACCTAGT TGCCAATTCA GCAGATGTGG CAAACATCTG 551 GGTGTCTGGG TTACGGTACC TGGTTTCTCG AAGTAAGCAG CCTCTTGATT 601 TTATGGAGGG CAACCAGAAC ACACCACGGT TCATGTGGTT GAAAACAGTG 651 TTTGAAGCAG CAGATGTTGA TGGGAATGGG ATTATGTTGG AAGACACCTC 701 TGTAGAGTTA ATAAAACAAC TCAACCCTAC TCTGAAGGAA GCCAAGATCA 751 GGTTAAAGTT TAAAGAAATC CAGAAGAGCA AGGAAAAACT AACCACCCGC 801 GTGACCGAAG AGGAATTTTG TGAAGCTTTT TGTGAACTTT GCACCAGGCC 851 AGAAGTGTAT TTCTTACTTG TACAGATATC TAAAAACAAA GAATATTTGG 901 ATGCCAATGA TCTCATGCTC TTTTTAGAAG CTGAGCAAGG AGTCACCCAT 951 ATCACCGAGG ATATATGCTT AGACATCATA AGGAGATACG AACTTTCTGA 1001 AGAGGGACGT CAAAAAGGGT TTCTTGCAAT TGATGGCTTT ACCCAGTATT 1051 TATTGTCATC AGAATGTGAC ATTTTTGATC CTGAGCAAAA GAAGGTTGCC 1101 CAAGATATGA CCCAGCCATT ATCTCACTAC TATATCAATG CCTCTCATAA 1151 CACCTATCTA ATAGAAGACC AGTTCAGGGG GCCAGCTGAC ATCAATGGGT 1201 ACATTAGAGC TTTGAAAATG GGCTGTCGAA GCGTTGAACT CGATGTAAGT 1251 GATGGTTCAG ATAATGAACC AATCCTTTGT AATCGAAATA ACATGACAAC 1301 CCATGTTTCC TTTCGAAGTG TCATAGAGGT AATAAATAAA TTTGCCTTTG 1351 TTGCTTCTGA ATACCCACTC ATTCTTTGCT TGGGAAATCA CTGCTCCTTG 1401 CCGCAGCAGA AGGTAATGGC TCAACAGATG AAAAAGGTCT TTGGCAATAA 1451 ACTCTATACT GAAGCACCTT TGCCCTCAGA ATCCTACCTC CCATCACCAG 1501 AAAAATTAAA AAGAATGATC ATTGTGAAAG GAAAGAAGTT GCCTTCTGAT 1551 CCAGATGTGT TAGAAGGAGA AGTAACAGAT GAAGATGAAG AAGCTCAAAT 1601 GTCTCGAAGG ATGTCGGTAG ATTACAATGG TGAGCAGAAG 
CAAATCCGAC 1651 TCTGTAGGGA GCTCTCTGAT TTGGTGTCTA TTTGTAAATC TGTTCAATAC 1701 AGGGATTTTG AACTATCTAT GAAAAGCCAA AACTATTGGG AAATGTGTTC 1751 ATTTAGTGAA ACAGAGGCCA GCCGCATTGC AAATGAGTAC CCAGAGGATT 1801 TTGTTAATTA TAATAAGAAG TTCTTATCAA GAATCTATCC AAGTGCCATG 1851 AGGATCGATT CCAGTAACTT GAATCCACAG GACTTTTGGA ATTGTGGCTG 1901 TCAGATTGTA GCAATGAATT TTCAGACTCC GGGTCCAATG ATGGACCTTC 1951 ACACGGGCTG GTTTCTTCAA AACGGGGGAT GTGGTTATGT TCTAAGGCCG 2001 TCTATAATGC GAGATGAAGT TTCTTACTTC AGCGCAAATA CAAAGGGCAT 2051 TCTACCTGGG GTGTCTCCTC TAGCTCTTCA TATCAAGATC ATCAGTGGTC 2101 AGAATTTCCC AAAGCCCAAG GGAGCTTGTG CCAAAGGGGA TGTCATAGAT 2151 CCCTATGTTT GTATAGAGAT ACACGGAATT CCAGCGGATT GTTCGGAACA 2201 AAGAACTAAA ACTGTACAGC AAAACAGTGA TAATCCTATT TTTGATGAAA 2251 CTTTTGAGTT CCAAGTAAAC CTACCTGAGC TGGCCATGAT CCGTTTTGTT 2301 GTTCTGGATG ATGACTACAT TGGGGATGAG TTTATAGGGC AATATACGAT 2351 ACCATTTGAA TGTTTGCAGC CTGGATATCG GCATGTTCCC CTGCGTTCTT 2401 TTGTGGGTGA CATCATGGAG CACGTAACCC TTTTTGTCCA CATAGCAATA 2451 ACTAATCGAA GTGGAGGAGG AAAGGCACAG AAGCGCAGTC TTTCAGTGAG 2501 AATGGGGAAG AAAGTTCGGG AATATACCAT GCTCAGGAAT ATCGGTCTTA 2551 AAACCATTGA TGACATCTTT AAAATAGCGG TTCATCCATT ACGAGAAGCC 2601 ATAGATATGA GAGAAAATAT GCAGAATGCA ATCGTGTCTA TTAAGGAACT 2651 ATGTGGACTC CCTCCAATTG CCAGTCTGAA GCAGTGCCTG TTAACTCTGT 2701 CATCTCGGCT CATCACCAGT GACAATACTC CTTCAGTCTC ACTTGTGATG 2751 AAAGACAGCT TTCCTTACCT GGAGCCTCTG GGTGCAATTC CAGATGTGCA 2801 GAAAAAGATG CTGACTGCTT ATGATCTGAT GATTCAAGAG AGCCGGTTTC 2851 TCATAGAAAT GGCGGACACA GTCCAGGAAA AGATTGTACA GTGTCAGAAA 2901 GCAGGGATGG AGTTCCATGA AGAACTTCAT AATTTGGGGG CAAAAGAAGG 2951 CTTGAAGGGA AGAAAACTCA ACAAAGCAAC TGAGAGCTTT GCTTGGAACA 3001 TTACAGTATT GAAGGGCCAA GGAGATCTGT TGAAGAATGC CAAGAATGAA 3051 GCTATAGAAA ACATGAAGCA GATCCAGCTG GCATGCCTGT CCTGTGGACT 3101 GAGTAAAGCC CCCAGCAGCA GTGCTGAGGC CAAGAGCAAG CGCAGCCTGG 3151 AAGCCATAGA GGAGAAGGAA AGTAGTGAGG AGAATGGGAA GCTGTGACTC 3201 TGGGCATTAT CGACACGTTC ACCCATCTTA TCAAGGACTC TGGTTTCTCA 3251 TTCTTGTTTT CTTTCTTTAA ATGTTTTATA AGTTCACAAA ATGGTGCCCT 3301 
ATATGGGGTA TTGGACATAG ATATTTTCAC AATGTCAGTA TTTCAGTGTA 3351 GTTAATTTAT CTAAATTAAA GCCTTTAGTA TCAGTGTTTT AAATTCTGAG 3401 ACATGTGTCA ACACCCCTGT GTGGATGCCT GTGGAAGAGT GTGTGTGTGT 3451 GTGTGTGTGT GTGTGTGTGT GTGTGTGGCA GAGAGAGAGA AAGAGAGAGA 3501 GAGAAATTCT GTTAAAATCT ATTCTGTGTT GCATTATTCA TTTAGTGAGT 3551 TATTCCTTGA TCATTTTGGG ACAATTGTTT TAATCTGAAA TTCTAAAGAG 3601 CACTTACTGT AACCTGTTGC TGTGTTTAAT TTGACTTCTC TGCCTTTGAC 3651 ATTTAATTTA GTGATCTTAG CATAGCTTAT TATTGAAGGA AGCCAAATTT 3701 ATCAAAGCAT ACATGTTTTG GTAGATTAAA TATAGATTAG AAAAATTCCT 3751 AAGAATCAGA GTAGAAATAA AAGTGAATGA AAGATTAAAC AGATGATGAG 3801 AATTTCTAAA AAGATTAGCA AGGTCATTTC TTCAGTCAGA AAACTTTAAA 3851 AAATATTTAT TAAATAAAAT CAATTTTTAG GAAGTTTTCT GTAGTCATTT 3901 ACTAAACATA TGATTTCACT AGAAAAGCTG ATCATAAGTG AATTTATACC 3951 TACCTGTGTG GCTACTCTGA AACACACTGA AAGCTCTGTT GCAATTAGGA 4001 TTTGATGTGA CATAATATTG TTGTATAATT TCGAGATTTG TAGGAAGGTC 4051 TCATTCTTCC AAGCTGAGAG TCTAGCACTC ATTTTCTATA ACAGATATGG 4101 CAGCTTAGAG GTGTTGGCTT TGTTTGGATG TAATTTTAGG GGTACTAAAA 4151 TTTAAAATTT AAAGATAATT GTTCAACAAT ATCATATCAT CACATTGAGC 4201 TGATATAAAT TCTGTGGGTC CGATAATATC TTTGTGATAA TTTAAGAGCT 4251 AACCAGTTAC CACACATCTA TGATATAACC CTAACACACA CAGAAAGCAT 4301 ACATGCAAAA AGAAATGACT AATTAGGGTA CATTTATAAT TGCATCTAGG 4351 TAATTTTTAC CCTAATGTCT TCATAAAGTA CTTGAGTGTA ATGTTTGTTA 4401 CCTCCAACAG AACTAAATGT TCTATGGTTA TGAAAGAATA TATTTATTTA 4451 AAGCATTGCT TTTATTTTGA AAAGCTTCTT AATTAATTTG ATTAACAAAT 4501 ATGCTAATTT GGGGAAACCT AGAGAAGATA ATTGTTGAAA TTTTGCAAAT 4551 ATAAACATCT CCTAT // LOCUS HSU11287 5969 bp mRNA PRI 16-AUG-1995 DEFINITION Human N-methyl-D-aspartate receptor subunit NR3 (hNR3) mRNA, complete cds. ACCESSION U11287 NID g560546 VERSION U11287.1 GI:560546 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5969) AUTHORS Adams,S.L., Foldes,R.L. and Kamboj,R.K. 
TITLE Human N-methyl-D-aspartate receptor modulatory subunit hNR3: cloning and sequencing of the cDNA and primary structure of the protein JOURNAL Biochim. Biophys. Acta 1260 (1), 105-108 (1995) MEDLINE 95092783 REFERENCE 2 (bases 1 to 5969) AUTHORS Foldes,R.L. TITLE Direct Submission JOURNAL Submitted (24-JUN-1994) Robert L. Foldes, Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario L4V 1V7, Canada FEATURES Location/Qualifiers source 1. .5969 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FB2C, FB6B, FB2B, FB17, FB19A, FB5, FB18, FB2A, FB10" /clone_lib="Stratagene Library No.936206" /tissue_type="brain" /dev_stage="fetus" 5'UTR <1. .210 /evidence=experimental gene 211. .4665 /gene="hNR3" CDS 211. .4665 /gene="hNR3" /codon_start=1 /evidence=experimental /product="N-methyl-D-aspartate receptor subunit NR3" /protein_id="AAB60368.1" /db_xref="PID:g560547" /db_xref="GI:560547" /translation="MKPRAECCSPKFWLVLAVLAVSGSRARSQKSPPSIGIAVILVGT SDEVAIKDAHEKDDFHHLSVVPRVELVAMNETDPKSIITRICDLMSDRKIQGVVFADD TDQEAIAQILDFISAQTLTPILGIHGGSSMIMADKDESSMFFQFGPSIEQQASVMLNI MEEYDWYIFSIVTTYFPGYQDFVNKIRSTIENSFVGWELEEVLLLDMSLDDGDSKIQN QLKKLQSPIILLYCTKEEATYIFEVANSVGLTGYGYTWIVPSLVAGDTDTVPAEFPTG LISVSYDEWDYGLPARVRDGIAIITTAASDMLSEHSFIPEPKSSCYNTHEKRIYQSNM LNRYLINVTFEGRNLSFSEDGYQMHPKLVIILLNKERKWERVGKWKDKSLQMKYYVWP RMCPETEEQEDDHLSIVTLEEAPFVIVESVDPLSGTCMRNTVPCQKRIVTENKTDEEP GYIKKCCKGFCIDILKKISKSVKFTYDLYLVTNGKHGKKINGTWNGMIGEVVMKRAYM AVGSLTINEERSEVVDFSVPFIETGISVMVSRSNGTVSPSAFLEPFSADVWVMMFVML LIVSAVAVFVFEYFSPVGYNRCLADGREPGGPSFTIGKAIWLLWGLVFNNSVPVQNPK GTTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEYVDQVSGLSDKKFQRPNDFSPPFR FGTVPNGSTERNIRNNYAEMHAYMGKFNQRGVDDALLSLKTGKLDAFIYDAAVLNYMA GRDEGCKLVTIGSGKVFASTGYGIAIQKDSGWKRQVDLAILQLFGDGEMEELEALWLT GICHNEKNEVMSSQLDIDNMAGVFYMLGAAMALSLITFICEHLFYWQFRHCFMGVCSG KPGMVFSISRGIYSCIHGVAIEERQSVMNSPTATMNNTHSNILRLLRTAKNMANLSGV NGSPQRPLDFIRRESSVYDISEHRRSFTHSDCKSYNNPPCEENLFSDYISEVERTFGN LQLKDSNVYQDHYHHHHRPHSIGSASSIDGLYDCDNPPFTTQSRSISKKPLDIGLPSS 
KHSQLSDLYGKFSFKSDRYSGHDDLIRSDVSDISTHTVTYGNIEGNAAKRRKQQYKDS LKKRPASAKSRREFDEIELAYRRRPPRSPDHKRYFRDKEGLRDFYLDQFRTKENSPHW EHVDLTDIYKERSDDFKRDSVSGGGPCTNRSHIKHGTGDKHGVVSGVPAPWEKNLTNV EWEDRSGGNFCRSCPSKLHNYSTTVTGQNSGRQACIRCEACKKAGNLYDISEDNSLQE LDQPAAPVAVTSNASTTKYPQSPTNSKAQKKNRNKLRRQHSYDTFVDLQKEEAALAPR SVSLKDKGRFMDGSPYAHMFEMSAGESTFANNKSSVPTAGHHHHNNPGGGYMLSKSLY PDRVTQNPFIPTFGDDQCLLHGSKSYFFRQPTVAGASKARPDFRALVTNKPVVSALHG AVPARFQKDICIGNQSNPCVPNNKNPRAFNGSSNGHVYEKLSSIESDV" misc_difference 1430 /gene="hNR3" /note="hNR3-2" /replace="a" misc_difference 2874 /gene="hNR3" /note="hNR3-3" /replace="t" 3'UTR 4666. .>5969 /evidence=experimental BASE COUNT 1504 a 1548 c 1559 g 1358 t ORIGIN 1 TTTGAATTTG CATCTCTTCA AGACACAAGA TTAAAACAAA ATTTACGCTA 51 AATTGGATTT TAAATTATCT TCCGTTCATT TATCCTTCGT CTTTCTTATG 101 TGGATATGCA AGCGAGAAGA AGGGACTGGA CATTCCCAAC ATGCTCACTC 151 CCTTAATCTG TCCGTCTAGA GGTTTGGCTT CTACAAACCA AGGGAGTCGA 201 CGAGTTGAAG ATGAAGCCCA GAGCGGAGTG CTGTTCTCCC AAGTTCTGGT 251 TGGTGTTGGC CGTCCTGGCC GTGTCAGGCA GCAGAGCTCG TTCTCAGAAG 301 AGCCCCCCCA GCATTGGCAT TGCTGTCATC CTCGTGGGCA CTTCCGACGA 351 GGTGGCCATC AAGGATGCCC ACGAGAAAGA TGATTTCCAC CATCTCTCCG 401 TGGTACCCCG GGTGGAACTG GTAGCCATGA ATGAGACCGA CCCAAAGAGC 451 ATCATCACCC GCATCTGTGA TCTCATGTCT GACCGGAAGA TCCAGGGGGT 501 GGTGTTTGCT GATGACACAG ACCAGGAAGC CATCGCCCAG ATCCTCGATT 551 TCATTTCAGC ACAGACTCTC ACCCCGATCC TGGGCATCCA CGGGGGCTCC 601 TCTATGATAA TGGCAGATAA GGATGAATCC TCCATGTTCT TCCAGTTTGG 651 CCCATCAATT GAACAGCAAG CTTCCGTAAT GCTCAACATC ATGGAAGAAT 701 ATGACTGGTA CATCTTTTCT ATCGTCACCA CCTATTTCCC TGGCTACCAG 751 GACTTTGTAA ACAAGATCCG CAGCACCATT GAGAATAGCT TTGTGGGCTG 801 GGAGCTAGAG GAGGTCCTCC TACTGGACAT GTCCCTGGAC GATGGAGATT 851 CTAAGATCCA GAATCAGCTC AAGAAACTTC AAAGCCCCAT CATTCTTCTT 901 TACTGTACCA AGGAAGAAGC CACCTACATC TTTGAAGTGG CCAACTCAGT 951 AGGGCTGACT GGCTATGGCT ACACGTGGAT CGTGCCCAGT CTGGTGGCAG 1001 GGGATACAGA CACAGTGCCT GCGGAGTTCC CCACTGGGCT CATCTCTGTA 1051 TCATATGATG AATGGGACTA TGGCCTCCCC GCCAGAGTGA GAGATGGAAT 1101 TGCCATAATC ACCACTGCTG 
CTTCTGACAT GCTGTCTGAG CACAGCTTCA 1151 TCCCTGAGCC CAAAAGCAGT TGTTACAACA CCCACGAGAA GAGAATCTAC 1201 CAGTCCAATA TGCTAAATAG GTATCTGATC AATGTCACTT TTGAGGGGAG 1251 GAATTTGTCC TTCAGTGAAG ATGGCTACCA GATGCACCCG AAACTGGTGA 1301 TAATTCTTCT GAACAAGGAG AGGAAGTGGG AAAGGGTGGG GAAGTGGAAA 1351 GACAAGTCCC TGCAGATGAA GTACTATGTG TGGCCCCGAA TGTGTCCAGA 1401 GACTGAAGAG CAGGAGGATG ACCATCTGAG CATTGTGACC CTGGAGGAGG 1451 CACCATTTGT CATTGTGGAA AGTGTGGACC CTCTGAGTGG AACCTGCATG 1501 AGGAACACAG TCCCCTGCCA AAAACGCATA GTCACTGAGA ATAAAACAGA 1551 CGAGGAGCCG GGTTACATCA AAAAATGCTG CAAGGGGTTC TGTATTGACA 1601 TCCTTAAGAA AATTTCTAAA TCTGTGAAGT TCACCTATGA CCTTTACCTG 1651 GTTACCAATG GCAAGCATGG GAAGAAAATC AATGGAACCT GGAATGGTAT 1701 GATTGGAGAG GTGGTCATGA AGAGGGCCTA CATGGCAGTG GGCTCACTCA 1751 CCATCAATGA GGAACGATCG GAGGTGGTCG ACTTCTCTGT GCCCTTCATA 1801 GAGACAGGCA TCAGTGTCAT GGTGTCACGC AGCAATGGGA CTGTCTCACC 1851 TTCTGCCTTC TTAGAGCCAT TCAGCGCTGA CGTATGGGTG ATGATGTTTG 1901 TGATGCTGCT CATCGTCTCA GCCGTGGCTG TCTTTGTCTT TGAGTACTTC 1951 AGCCCTGTGG GTTATAACAG GTGCCTCGCT GATGGCAGAG AGCCTGGTGG 2001 ACCCTCTTTC ACCATCGGCA AAGCTATTTG GTTGCTCTGG GGTCTGGTGT 2051 TTAACAACTC CGTACCTGTG CAGAACCCAA AGGGGACCAC CTCCAAGATC 2101 ATGGTGTCAG TGTGGGCCTT CTTTGCTGTC ATCTTCCTGG CCAGCTACAC 2151 TGCCAACTTA GCTGCCTTCA TGATCCAAGA GGAATATGTG GACCAGGTTT 2201 CTGGCCTGAG CGACAAAAAG TTCCAGAGAC CTAATGACTT CTCACCCCCT 2251 TTCCGCTTTG GGACCGTGCC CAACGGCAGC ACAGAGAGAA ATATTCGCAA 2301 TAACTATGCA GAAATGCATG CCTACATGGG AAAGTTCAAC CAGAGGGGTG 2351 TAGATGATGC ATTGCTCTCC CTGAAAACAG GGAAACTGGA TGCCTTCATC 2401 TATGATGCAG CAGTGCTGAA CTATATGGCA GGCAGAGATG AAGGCTGCAA 2451 GCTGGTGACC ATTGGCAGTG GGAAGGTCTT TGCTTCCACT GGCTATGGCA 2501 TTGCCATCCA AAAAGATTCT GGGTGGAAGC GCCAGGTGGA CCTTGCTATC 2551 CTGCAGCTCT TTGGAGATGG GGAGATGGAA GAACTGGAAG CTCTCTGGCT 2601 CACTGGCATT TGTCACAATG AGAAGAATGA GGTCATGAGC AGCCAGCTGG 2651 ACATTGACAA CATGGCAGGG GTCTTCTACA TGTTGGGGGC GGCCATGGCT 2701 CTCAGCCTCA TCACCTTCAT CTGCGAACAC CTTTTCTATT GGCAGTTCCG 2751 ACATTGCTTT ATGGGTGTCT GTTCTGGCAA 
GCCTGGCATG GTCTTCTCCA 2801 TCAGCAGAGG TATCTACAGC TGCATCCATG GGGTGGCGAT CGAGGAGCGC 2851 CAGTCTGTAA TGAACTCCCC CACCGCAACC ATGAACAACA CACACTCCAA 2901 CATCCTGCGC CTGCTGCGCA CGGCCAAGAA CATGGCTAAC CTGTCTGGTG 2951 TGAATGGCTC ACCGCAGAGG CCCCTGGACT TCATCCGACG GGAGTCATCC 3001 GTCTATGACA TCTCAGAGCA CCGCCGCAGC TTCACGCATT CTGACTGCAA 3051 ATCCTACAAC AACCCGCCCT GTGAGGAGAA CCTCTTCAGT GACTACATCA 3101 GTGAGGTAGA GAGAACGTTC GGGAACCTGC AGCTGAAGGA CAGCAACGTG 3151 TACCAAGATC ACTACCACCA TCACCACCGG CCCCATAGTA TTGGCAGTGC 3201 CAGCTCCATC GATGGGCTCT ACGACTGTGA CAACCCACCC TTCACCACCC 3251 AGTCCAGGTC CATCAGCAAG AAGCCCCTGG ACATCGGCCT CCCCTCCTCC 3301 AAGCACAGCC AGCTCAGTGA CCTGTACGGC AAATTCTCCT TCAAGAGCGA 3351 CCGCTACAGT GGCCACGACG ACTTGATCCG CTCCGATGTC TCTGACATCT 3401 CAACCCACAC CGTCACCTAT GGGAACATCG AGGGCAATGC CGCCAAGAGG 3451 CGTAAGCAGC AATATAAGGA CAGCCTGAAG AAGCGGCCTG CCTCGGCCAA 3501 GTCCCGCAGG GAGTTTGACG AGATCGAGCT GGCCTACCGT CGCCGACCGC 3551 CCCGCTCCCC TGACCACAAG CGCTACTTCA GGGACAAGGA AGGGCTACGG 3601 GACTTCTACC TGGACCAGTT CCGAACAAAG GAGAACTCAC CCCACTGGGA 3651 GCACGTAGAC CTGACCGACA TCTACAAGGA GCGGAGTGAT GACTTTAAGC 3701 GCGACTCCGT CAGCGGAGGA GGGCCCTGTA CCAACAGGTC TCACATCAAG 3751 CACGGGACGG GCGACAAACA CGGCGTGGTC AGCGGGGTAC CTGCACCTTG 3801 GGAGAAGAAC CTGACCAACG TGGAGTGGGA GGACCGGTCC GGGGGCAACT 3851 TCTGCCGCAG CTGTCCCTCC AAGCTGCACA ACTACTCCAC GACGGTGACG 3901 GGTCAGAACT CGGGCAGGCA GGCGTGCATC CGGTGTGAGG CTTGCAAGAA 3951 AGCAGGCAAC CTGTATGACA TCAGTGAGGA CAACTCCCTG CAGGAACTGG 4001 ACCAGCCGGC TGCCCCAGTG GCGGTGACGT CAAACGCCTC CACCACTAAG 4051 TACCCTCAGA GCCCGACTAA TTCCAAGGCC CAGAAGAAGA ACCGGAACAA 4101 ACTGCGCCGG CAGCACTCCT ACGACACCTT CGTGGACCTG CAGAAGGAAG 4151 AAGCCGCCCT GGCCCCGCGC AGCGTAAGCC TGAAAGACAA GGGCCGATTC 4201 ATGGATGGGA GCCCCTACGC CCACATGTTT GAGATGTCAG CTGGCGAGAG 4251 CACCTTTGCC AACAACAAGT CCTCAGTGCC CACTGCCGGA CATCACCACC 4301 ACAACAACCC CGGCGGCGGG TACATGCTCA GCAAGTCGCT CTACCCTGAC 4351 CGGGTCACGC AAAACCCTTT CATCCCCACT TTTGGGGACG ACCAGTGCTT 4401 GCTCCATGGC AGCAAATCCT ACTTCTTCAG GCAGCCCACG 
GTGGCGGGGG 4451 CGTCGAAAGC CAGGCCGGAC TTCCGGGCCC TTGTCACCAA CAAGCCGGTG 4501 GTCTCGGCCC TTCATGGGGC CGTGCCAGCC CGTTTCCAGA AGGACATCTG 4551 TATAGGGAAC CAGTCCAACC CCTGTGTGCC TAACAACAAA AACCCCAGGG 4601 CTTTCAATGG CTCCAGCAAT GGGCATGTTT ATGAGAAACT TTCTAGTATT 4651 GAGTCTGATG TCTGAGTGAG GGAACAGAGA GGTTAAGGTG GGTACGGGAG 4701 GGTAAGGCTG TGGGTCGCGT GATGCGCATG TCACGGAGGG TGACGGGGGT 4751 GAACTTGGTT CCCATTTGCT CCTTTCTTGT TTTAATTTAT TTATGGGGAT 4801 CCTGGAGTTC TGGTTCCTAC TGGGGGCAAC CCTGGTGACC AGCACCATCT 4851 CTCCTCCTTT TCACAGTTCT CTCCTTCTTC CCCCCGCTGT CAGCCATTCC 4901 TGTTCCCATG AGATGATGCC ATGGGTCTCA GCAGGGGAGG GTAGAGCGGA 4951 GAAAGGAAGG GCAGCATGCG GGCTTCCTCC TGGTGTGGAA GAGCTCCTTG 5001 ATATCCTCTT TGAGTGAAGC TGGGAGAACC AAAAAGAGGC TATGTGAGCA 5051 CAAAGGTAGC TTTTCCCAAA CTGATCTTTT CATTTAGGTG AGGAAGCAAA 5101 AGCATCTATG TGAGACCATT TAGCACACTG CTTGTGAAAG GAAAGAGGCT 5151 CTGGCTAAAT TCATGCTGCT TAGATGACAT CTGTCTAGGA ATCATGTGCC 5201 AAGCAGAGGT TGGGAGGCCA TTTGTGTTTA TATATAAGCC AAAAAATGCT 5251 TGCTTCAACC CCATGAGACT CGATAGTGGT GGTGAACAGA ACAAAAGGTC 5301 ATTGGTGGCA GAGTGGATTC TTGAACAAAC TGGAAAGTAC GTTATGATAG 5351 TGTCCCACGG TGCCTTGGGG ACAAGAGCAG GTGGATTGTG CGTGCATGTG 5401 TGTTCATGCA CACTTGCACC CATGTGTAGT CAGGTGCCTC AAGAGAAGGC 5451 AACCTTGACT CTTTCTATTG TTTCTTTCAA TATCCCCAAG CAGTGTGATT 5501 GTTTGGCTTA TATACAGACA GAGATGGCCA TGTATTACCT GAATTTTGGC 5551 TGTGTCTCCC TTCATCCTTC TGGAATAAGG AGAATGAAAA TTCTTGATAA 5601 AGAAGATTCT GTGGTCTAAA CAAAAAAAGG CGGTGAGCAA TCCTGCAAGA 5651 GCAAGGTACA TAAACAAGTC CTCAGTGGTT GGCAACTGTT TCAACCTGTT 5701 TGAACCAAGA ACCTTCCAGG AAGGCTAAAG GGAAACCGAA TTTCACAGCC 5751 ATGATTCTTT TGCCCACACT TGGGAGCAAA AGATTCTACA AAGCTCTTTT 5801 GAGCATTTAG ACTCTCGACT GGCCAAGGTT TGGGGAAGAA CGAAGCCACC 5851 TTTGAAGAAG TAAGGAGTCG TGTATGGTAG GGTAAGTGAG AGAGGGGGAT 5901 GTTTCCAATG CTTTGATCCC TTCTTACTTA ACCTGAAGCT AGACGAGCAG 5951 GCTTCTTCCC CCCAAAACT // LOCUS HS46KDA 2434 bp mRNA PRI 11-MAR-1997 DEFINITION H.sapiens mRNA for 46 kDa coxsackievirus and adenovirus receptor (CAR) protein. 
ACCESSION Y07593 NID g1881446 VERSION Y07593.1 GI:1881446 KEYWORDS 46 kDa receptor protein; coxsackie and adenovirus receptor protein. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2434) AUTHORS Bergelson,J.M., Cunningham,J.A., Droguett,G., Kurt-Jones,E.A., Krithivas,A., Hong,J.S., Horwitz,M.S., Crowell,R.L. and Finberg,R.W. TITLE Isolation of a common receptor for Coxsackie B viruses and adenoviruses 2 and 5 JOURNAL Science 275 (5304), 1320-1323 (1997) MEDLINE 97190109 REFERENCE 2 (bases 1 to 2434) AUTHORS Bergelson,J.M. TITLE Direct Submission JOURNAL Submitted (20-AUG-1996) J.M. Bergelson, Dana Farber Cancer Institute, Lab of Infectious Disease, 44 Binney Street, Boston Ma 02115, MA, 02115, USA FEATURES Location/Qualifiers source 1. .2434 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="cervical carcinoma" gene 60. .1157 /gene="CAR" CDS 60. .1157 /gene="CAR" /codon_start=1 /product="coxsackie and adenovirus receptor protein" /protein_id="CAA68868.1" /db_xref="PID:e284081" /db_xref="PID:g1881447" /db_xref="GI:1881447" /db_xref="SPTREMBL:P78310" /translation="MALLLCFVLLCGVVDFARSLSITTPEEMIEKAKGETAYLPCKFT LSPEDQGPLDIEWLISPADNQKVDQVIILYSGDKIYDDYYPDLKGRVHFTSNDLKSGD ASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVLVKPSGARCYVDGSEEIGSDFKI KCEPKEGSLPLQYEWQKLSDSQKMPTSWLAEMTSSVISVKNASSEYSGTYSCTVRNRV GSDQCLLRLNVVPPSNKAGLIAGAIIGTLLALALIGLIIFCCRKKRREEKYEKEVHHD IREDVPPPKSRTSTARSYIGSNHSSLGSMSPSNMEGYSKTQYNQVPSEDFERTPQSPT LPPAKVAAPNLSRMGAIPVMIPAQSKDGSIV" BASE COUNT 743 a 443 c 487 g 761 t ORIGIN 1 GAATTCCCAG GAGCGAGAGC CGCCTACCTG CAGCCGCCGC CCACGGCACG 51 GCAGCCACCA TGGCGCTCCT GCTGTGCTTC GTGCTCCTGT GCGGAGTAGT 101 GGATTTCGCC AGAAGTTTGA GTATCACTAC TCCTGAAGAG ATGATTGAAA 151 AAGCCAAAGG GGAAACTGCC TATCTGCCGT GCAAATTTAC GCTTAGTCCC 201 GAAGACCAGG GACCGCTGGA CATCGAGTGG CTGATATCAC CAGCTGATAA 251 TCAGAAGGTG GATCAAGTGA TTATTTTATA TTCTGGAGAC AAAATTTATG 301 
ATGACTACTA TCCAGATCTG AAAGGCCGAG TACATTTTAC GAGTAATGAT 351 CTCAAATCTG GTGATGCATC AATAAATGTA ACGAATTTAC AACTGTCAGA 401 TATTGGCACA TATCAGTGCA AAGTGAAAAA AGCTCCTGGT GTTGCAAATA 451 AGAAGATTCA TCTGGTAGTT CTTGTTAAGC CTTCAGGTGC GAGATGTTAC 501 GTTGATGGAT CTGAAGAAAT TGGAAGTGAC TTTAAGATAA AATGTGAACC 551 AAAAGAAGGT TCACTTCCAT TACAGTATGA GTGGCAAAAA TTGTCTGACT 601 CACAGAAAAT GCCCACTTCA TGGTTAGCAG AAATGACTTC ATCTGTTATA 651 TCTGTAAAAA ATGCCTCTTC TGAGTACTCT GGGACATACA GCTGTACAGT 701 CAGAAACAGA GTGGGCTCTG ATCAGTGCCT GTTGCGTCTA AACGTTGTCC 751 CTCCTTCAAA TAAAGCTGGA CTAATTGCAG GAGCCATTAT AGGAACTTTG 801 CTTGCTCTAG CGCTCATTGG TCTTATCATC TTTTGCTGTC GTAAAAAGCG 851 CAGAGAAGAA AAATATGAAA AGGAAGTTCA TCACGATATC AGGGAAGATG 901 TGCCACCTCC AAAGAGCCGT ACGTCCACTG CCAGAAGCTA CATCGGCAGT 951 AATCATTCAT CCCTGGGGTC CATGTCTCCT TCCAACATGG AAGGATATTC 1001 CAAGACTCAG TATAACCAAG TACCAAGTGA AGACTTTGAA CGCACTCCTC 1051 AGAGTCCGAC TCTCCCACCT GCTAAGGTAG CTGCCCCTAA TCTAAGTCGA 1101 ATGGGTGCGA TTCCTGTGAT GATTCCAGCA CAGAGCAAGG ATGGGTCTAT 1151 AGTATAGAGC CTCCATATGT CTCATCTGTG CTCTCCGTGT TCCTTTCCTT 1201 TTTTTGATAT ATGAAAACCT ATTCTGGTCT AAATTGTGTT ACTAGCCTCA 1251 AAATACATCA AAAAATAAGT TAATCAGGAA CTGTACGGAA TATATTTTTA 1301 AAAATTTTTG TTTGGTTATA TCGAAATAGT TACAGGCACT AAAGTTAGTA 1351 AAGAAAAGTT TACCATCTGA AAAAGCTGGA TTTTCTTTAA GAGGTTGATT 1401 ATAAAGTTTT CTAAATTTAT CAGTACCTAA GTAAGATGTA GCGCTTTGAA 1451 TATGAAATCA TAGGTGAAGA CATGGGTGAA CTTACTTGCA TACCAAGTTG 1501 ATACTTGAAT AACCATCTGA AAGTGGTACT TGATCATTTT TACCATTATT 1551 TTTAGGATGT GTATTTCATT TATTTATGGC CCACCAGTCT CCCCCAAATT 1601 AGTACAGAAA TATCCATGAC AAAATTACTT ACGTATGTTT GTACTTGGTT 1651 TTACAGCTCC TTTGAAAACT CTGTGTTTGG AATATCTCTA AAAACATAGA 1701 AAACACTACA GTGGTTTAGA AATTACTAAT TTTACTTCTA AGTCATTCAT 1751 AAACCTTGTC TATGAAATGA CTTCTTAAAT ATTTAGTTGA TAGACTGCTA 1801 CAGGTAATAG GGACTTAGCA AGCTCTTTTA TATGCTAAAG GAGCATCTAT 1851 CAGATTAAGT TAGAACATTT GCTGTCAGCC ACATATTGAG ATGACACTAG 1901 GTGCAATAGC AGGGATAGAT TTTGTTGGTG AGTAGTCTCA TGCCTTGAGA 1951 TCTGTGGTGG TCTTCAAAAT GGTGGCCAGC 
CAGATCAAGG ATGTAGTATC 2001 TCATAGTTCC CAGGTGATAT TTTTCTTATT AGAAAAATAT TATAACTCAT 2051 TTGTTGTTTG ACACTTATAG ATTGAAATTT CCTAATTTAT TCTAAATTTT 2101 AAGTGGTTCT TTGGTTCCAG TGCTTTATGT TGTTGTTGTT TTTGGATGGT 2151 GTTACATATT ATATGTTCTA GAAACATGTA ATCCTAAATT TACCCTCTTG 2201 AATATAATCC CTGGATGATA TTTTTTATCA TAAATGCAGA ATAATCAAAT 2251 ACATTTTAAG CAAGTTAAGT GTCCTCCATC AATTCTGTAT TCCAGACTTG 2301 GGAGGATGTA CAGTTGCTGT TGTGTGATCA AACATGTCTC TGTGTAGTTC 2351 CAGCAAATCA AGCTGAGCTT TGAAAAAGTT TGTCTTAGTT TTGTGAAGGT 2401 GATTTATTCT TAGAAAAAAA AAAAAAAAAA AAAA // LOCUS AF148213 4193 bp mRNA PRI 30-JUN-1999 DEFINITION Homo sapiens aggrecanase-1 mRNA, complete cds. ACCESSION AF148213 NID g5281380 VERSION AF148213.1 GI:5281380 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4193) AUTHORS Tortorella,M.D., Burn,T.C., Pratta,M.A., Abbaszade,I., Hollis,J.M., Liu,R., Rosenfeld,S.A., Copeland,R.A., Decicco,C.P., Wynn,R., Rockwell,A., Yang,F., Duke,J.L., Solomon,K., George,H., Bruckner,R., Nagase,H., Itoh,Y., Ellis,D.M., Ross,H., Wiswall,B.H., Murphy,K., Hillman,M.C. Jr., Hollis,G.F., Newton,R.C., Magolda,R.L., Trzaskos,J.M. and Arner,E.C. TITLE Purification and cloning of aggrecanase-1: A member of the ADAMTS family of proteins JOURNAL Science 284 (5420), 1664-1666 (1999) MEDLINE 99286303 REFERENCE 2 (bases 1 to 4193) AUTHORS Tortorella,M.D., Burn,T.C., Pratta,M.A., Abbaszade,I., Hollis,J.M., Liu,R.-Q., Rosenfeld,S.A., Copeland,R.A., Decicco,C.P., Wynn,R., Rockewell,A., Yang,F., Duke,J.L., Solomon,K., George,H.J., Bruchner,R., Nagase,H., Ito,Y., Ellis,D.M., Ross,O.H., Wiswall,B.H., Murphy,K., Hillman,M.C. Jr., Hollis,G.F., Newton,R.C., Magolda,R.L., Trzaskos,J.M. and Arner,E.C. 
TITLE Direct Submission JOURNAL Submitted (03-MAY-1999) Applied Biotechnology, DuPont Pharmaceuticals Company, Experimental Station, E336/237B, Wilmington, DE 19880, USA FEATURES Location/Qualifiers source 1. .4193 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 407. .2920 /note="aggrecan degrading metalloproteinase-1; disintegrin metalloproteinase; ADMP-1; ADAMTS-4" /codon_start=1 /product="aggrecanase-1" /protein_id="AAD41494.1" /db_xref="PID:g5281381" /db_xref="GI:5281381" /translation="MSQTGSHPGRGLAGRWLWGAQPCLLLPIVPLSWLVWLLLLLLAS LLPSARLASPLPREEEIVFPEKLNGSVLPGSGAPARLLCRLQAFGETLLLELEQDSGV QVEGLTVQYLGQAPELLGGAEPGTYLTGTINGDPESVASLHWDGGALLGVLQYRGAEL HLQPLEGGTPNSAGGPGAHILRRKSPASGQGPMCNVKAPLGSPSPRPRRAKRFASLSR FVETLVVADDKMAAFHGAGLKRYLLTVMAAAAKAFKHPSIRNPVSLVVTRLVILGSGE EGPQVGPSAAQTLRSFCAWQRGLNTPEDSDPDHFDTAILFTRQDLCGVSTCDTLGMAD VGTVCDPARSCAIVEDDGLQSAFTAAHELGHVFNMLHDNSKPCISLNGPLSTSRHVMA PVMAHVDPEEPWSPCSARFITDFLDNGYGHCLLDKPEAPLHLPVTFPGKDYDADRQCQ LTFGPDSRHCPQLPPPCAALWCSGHLNGHAMCQTKHSPWADGTPCGPAQACMGGRCLH MDQLQDFNIPQAGGWGPWGPWGDCSRTCGGGVQFSSRDCTRPVPRNGGKYCEGRRTRF RSCNTEDCPTGSALTFREEQCAAYNHRTDLFKSFPGPMDWVPRYTGVAPQDQCKLTCQ ARALGYYYVLEPRVVDGTPCSPDSSSVCVQGRCIHAGCDRIIGSKKKFDKCMVCGGDG SGCSKQSGSFRKFRYGYNNVVTIPAGATHILVRQQGNPGHRSIYLALKLPDGSYALNG EYTLMPSPTDVVLPGAVSLRYSGATAASETLSGHGPLAQPLTLQVLVAGNPQDTRLRY SFFVPRPTPSTPRPTPQDWLHRRAQILEILRRRPWAGRK" BASE COUNT 807 a 1249 c 1220 g 917 t ORIGIN 1 CACAGACACA TATGCACGAG AGAGACAGAG GAGGAAAGAG ACAGAGACAA 51 AGGCACAGCG GAAGAAGGCA GAGACAGGGC AGGCACAGAA GCGGCCCAGA 101 CAGAGTCCTA CAGAGGGAGA GGCCAGAGAA GCTGCAGAAG ACACAGGCAG 151 GGAGAGACAA AGATCCAGGA AAGGAGGGCT CAGGAGGAGA GTTTGGAGAA 201 GCCAGACCCC TGGGCACCTC TCCCAAGCCC AAGGACTAAG TTTTCTCCAT 251 TTCCTTTAAC GGTCCTCAGC CCTTCTGAAA ACTTTGCCTC TGACCTTGGC 301 AGGAGTCCAA GCCCCCAGGC TACAGAGAGG AGCTTTCCAA AGCTAGGGTG 351 TGGAGGACTT GGTGCCCTAG ACGGCCTCAG TCCCTCCCAG CTGCAGTACC 401 AGTGCCATGT CCCAGACAGG CTCGCATCCC GGGAGGGGCT TGGCAGGGCG 451 CTGGCTGTGG GGAGCCCAAC CCTGCCTCCT GCTCCCCATT GTGCCGCTCT 501 CCTGGCTGGT 
GTGGCTGCTT CTGCTACTGC TGGCCTCTCT CCTGCCCTCA 551 GCCCGGCTGG CCAGCCCCCT CCCCCGGGAG GAGGAGATCG TGTTTCCAGA 601 GAAGCTCAAC GGCAGCGTCC TGCCTGGCTC GGGCGCCCCT GCCAGGCTGT 651 TGTGCCGCTT GCAGGCCTTT GGGGAGACGC TGCTACTAGA GCTGGAGCAG 701 GACTCCGGTG TGCAGGTCGA GGGGCTGACA GTGCAGTACC TGGGCCAGGC 751 GCCTGAGCTG CTGGGTGGAG CAGAGCCTGG CACCTACCTG ACTGGCACCA 801 TCAATGGAGA TCCGGAGTCG GTGGCATCTC TGCACTGGGA TGGGGGAGCC 851 CTGTTAGGCG TGTTACAATA TCGGGGGGCT GAACTCCACC TCCAGCCCCT 901 GGAGGGAGGC ACCCCTAACT CTGCTGGGGG ACCTGGGGCT CACATCCTAC 951 GCCGGAAGAG TCCTGCCAGC GGTCAAGGTC CCATGTGCAA CGTCAAGGCT 1001 CCTCTTGGAA GCCCCAGCCC CAGACCCCGA AGAGCCAAGC GCTTTGCTTC 1051 ACTGAGTAGA TTTGTGGAGA CACTGGTGGT GGCAGATGAC AAGATGGCCG 1101 CATTCCACGG TGCGGGGCTA AAGCGCTACC TGCTAACAGT GATGGCAGCA 1151 GCAGCCAAGG CCTTCAAGCA CCCAAGCATC CGCAATCCTG TCAGCTTGGT 1201 GGTGACTCGG CTAGTGATCC TGGGGTCAGG CGAGGAGGGG CCCCAAGTGG 1251 GGCCCAGTGC TGCCCAGACC CTGCGCAGCT TCTGTGCCTG GCAGCGGGGC 1301 CTCAACACCC CTGAGGACTC GGACCCTGAC CACTTTGACA CAGCCATTCT 1351 GTTTACCCGT CAGGACCTGT GTGGAGTCTC CACTTGCGAC ACGCTGGGTA 1401 TGGCTGATGT GGGCACCGTC TGTGACCCGG CTCGGAGCTG TGCCATTGTG 1451 GAGGATGATG GGCTCCAGTC AGCCTTCACT GCTGCTCATG AACTGGGTCA 1501 TGTCTTCAAC ATGCTCCATG ACAACTCCAA GCCATGCATC AGTTTGAATG 1551 GGCCTTTGAG CACCTCTCGC CATGTCATGG CCCCTGTGAT GGCTCATGTG 1601 GATCCTGAGG AGCCCTGGTC CCCCTGCAGT GCCCGCTTCA TCACTGACTT 1651 CCTGGACAAT GGCTATGGGC ACTGTCTCTT AGACAAACCA GAGGCTCCAT 1701 TGCATCTGCC TGTGACTTTC CCTGGCAAGG ACTATGATGC TGACCGCCAG 1751 TGCCAGCTGA CCTTCGGGCC CGACTCACGC CATTGTCCAC AGCTGCCGCC 1801 GCCCTGTGCT GCCCTCTGGT GCTCTGGCCA CCTCAATGGC CATGCCATGT 1851 GCCAGACCAA ACACTCGCCC TGGGCCGATG GCACACCCTG CGGGCCCGCA 1901 CAGGCCTGCA TGGGTGGTCG CTGCCTCCAC ATGGACCAGC TCCAGGACTT 1951 CAATATTCCA CAGGCTGGTG GCTGGGGTCC TTGGGGACCA TGGGGTGACT 2001 GCTCTCGGAC CTGTGGGGGT GGTGTCCAGT TCTCCTCCCG AGACTGCACG 2051 AGGCCTGTCC CCCGGAATGG TGGCAAGTAC TGTGAGGGCC GCCGTACCCG 2101 CTTCCGCTCC TGCAACACTG AGGACTGCCC AACTGGCTCA GCCCTGACCT 2151 TCCGCGAGGA GCAGTGTGCT GCCTACAACC 
ACCGCACCGA CCTCTTCAAG 2201 AGCTTCCCAG GGCCCATGGA CTGGGTTCCT CGCTACACAG GCGTGGCCCC 2251 CCAGGACCAG TGCAAACTCA CCTGCCAGGC CCGGGCACTG GGCTACTACT 2301 ATGTGCTGGA GCCACGGGTG GTAGATGGGA CCCCCTGTTC CCCGGACAGC 2351 TCCTCGGTCT GTGTCCAGGG CCGATGCATC CATGCTGGCT GTGATCGCAT 2401 CATTGGCTCC AAGAAGAAGT TTGACAAGTG CATGGTGTGC GGAGGGGACG 2451 GTTCTGGTTG CAGCAAGCAG TCAGGCTCCT TCAGGAAATT CAGGTACGGA 2501 TACAACAATG TGGTCACTAT CCCCGCGGGG GCCACCCACA TTCTTGTCCG 2551 GCAGCAGGGA AACCCTGGCC ACCGGAGCAT CTACTTGGCC CTGAAGCTGC 2601 CAGATGGCTC CTATGCCCTC AATGGTGAAT ACACGCTGAT GCCCTCCCCC 2651 ACAGATGTGG TACTGCCTGG GGCAGTCAGC TTGCGCTACA GCGGGGCCAC 2701 TGCAGCCTCA GAGACACTGT CAGGCCATGG GCCACTGGCC CAGCCTTTGA 2751 CACTGCAAGT CCTAGTGGCT GGCAACCCCC AGGACACACG CCTCCGATAC 2801 AGCTTCTTCG TGCCCCGGCC GACCCCTTCA ACGCCACGCC CCACTCCCCA 2851 GGACTGGCTG CACCGAAGAG CACAGATTCT GGAGATCCTT CGGCGGCGCC 2901 CCTGGGCGGG CAGGAAATAA CCTCACTATC CCGGCTGCCC TTTCTGGGCA 2951 CCGGGGCCTC GGACTTAGCT GGGAGAAAGA GAGAGCTTCT GTTGCTGCCT 3001 CATGCTAAGA CTCAGTGGGG AGGGGCTGTG GGCGTGAGAC CTGCCCCTCC 3051 TCTCTGCCCT AATGCGCAGG CTGGCCCTGC CCTGGTTTCC TGCCCTGGGA 3101 GGCAGTGATG GGTTAGTGGA TGGAAGGGGC TGACAGACAG CCCTCCATCT 3151 AAACTGCCCC CTCTGCCCTG CGGGTCACAG GAGGGAGGGG GAAGGCAGGG 3201 AGGGCCTGGG CCCCAGTTGT ATTTATTTAG TATTTATTCA CTTTTATTTA 3251 GCACCAGGGA AGGGGACAAG GACTAGGGTC CTGGGGAACC TGACCCCTGA 3301 CCCCTCATAG CCCTCACCCT GGGGCTAGGA AATCCAGGGT GGTGGTGATA 3351 GGTATAAGTG GTGTGTGTAT GCGTGTGTGT GTGTGTGTGA AAATGTGTGT 3401 GTGCTTATGT ATGAGGTACA ACCTGTTCTG CTTTCCTCTT CCTGAATTTT 3451 ATTTTTTGGG AAAAGAAAAG TCAAGGGTAG GGTGGGCCTT CAGGGAGTGA 3501 GGGATTATCC TTTTTTTTTT CTTTCTTTCT TTCTTTTTTT TTTTGAGACA 3551 GAATCTCGCT CTGTCGCCCA GGCTGGAGTG CAATGGCACA ATCTCGGCTC 3601 ACTGCATCCT CCGCCTCCCG GGTTCAAGTG ATTCTCATGC CTCAGCCTCC 3651 TGAGTAGCTG GGATTACAGG CTCCTGCCAC CACGCCCGGC TAATTTTTGT 3701 TTTGTTTTGT TTGGAGACAG AGTCTCGCTA TTGTCACCAG GGCTGGAATG 3751 ATTTCAGCTC ACTGCAACCT TCGCCACCTG GGTTCCAGCA ATTCTCCTGC 3801 CTCAGCCTCC CGAGTAGCTG AGATTATAGG CACCTACCAC 
CACGCCCGGC 3851 TAATTTTTGT ATTTTTAGTA GAGACGGGGT TTCACCATGT TGGCCAGGCT 3901 GGTCTCGAAC TCCTGACCTT AGGTGATCCA CTCGCCTTCA TCTCCCAAAG 3951 TGCTGGGATT ACAGGCGTGA GCCACCGTGC CTGGCCACGC CCAACTAATT 4001 TTTGTATTTT TAGTAGAGAC AGGGTTTCAC CATGTTGGCC AGGCTGCTCT 4051 TGAACTCCTG ACCTCAGGTA ATCGACCTGC CTCGGCCTCC CAAAGTGCTG 4101 GGATTACAGG TGTGAGCCAC CACGCCCGGT ACATATTTTT TAAATTGAAT 4151 TCTACTATTT ATGTGATCCT TTTGGAGTCA GACAGATGTG GGT // LOCUS HUMMONAP 1639 bp mRNA PRI 27-APR-1993 DEFINITION Human monocyte-derived neutrophil-activating protein (MONAP) mRNA, complete cds. ACCESSION M26383 NID g188627 VERSION M26383.1 GI:188627 KEYWORDS cytokine; monocyte-derived neutrophil-activating protein; monokine. SOURCE Human ATCC promyelocyte cell line HL60, cDNA to mRNA, clone b4. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1639) AUTHORS Kowalski,J. and Denhardt,D.T. TITLE Regulation of the mRNA for monocyte-derived neutrophil-activating peptide in differentiating HL60 promyelocytes JOURNAL Mol. Cell. Biol. 9, 1946-1957 (1989) MEDLINE 89313739 COMMENT Computer readable copy of sequence [1] kindly submitted by J. Kowalski, 20-JUL-1989. FEATURES Location/Qualifiers source 1. .1639 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 75. .134 /note="monocyte-derived neutrophil-activating protein signal peptide" CDS 75. .374 /note="monocyte-derived neutrophil-activating protein precursor" /codon_start=1 /protein_id="AAA36323.1" /db_xref="PID:g188628" /db_xref="GI:188628" /translation="MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPF HPKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS" mat_peptide 135. .371 /note="monocyte-derived neutrophil-activating protein" misc_feature 215. .219 /note="ATTTA box (associated with mRNA stability)" misc_feature 586. .590 /note="ATTTA box (associated with mRNA stability)" misc_feature 797. 
.801 /note="ATTTA box (associated with mRNA stability)" misc_feature 1023. .1027 /note="ATTTA box (associated with mRNA stability)" misc_feature 1030. .1034 /note="ATTTA box (associated with mRNA stability)" misc_feature 1038. .1042 /note="ATTTA box (associated with mRNA stability)" misc_feature 1042. .1046 /note="ATTTA box (associated with mRNA stability)" misc_feature 1126. .1130 /note="ATTTA box (associated with mRNA stability)" misc_feature 1175. .1179 /note="ATTTA box (associated with mRNA stability)" misc_feature 1598. .1602 /note="ATTTA box (associated with mRNA stability)" BASE COUNT 561 a 244 c 290 g 544 t ORIGIN 1 AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA 51 GGAACCATCT CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT 101 CTTGGCAGCC TTCCTGATTT CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC 151 CAAGGAGTGC TAAAGAACTT AGATGTCAGT GCATAAAGAC ATACTCCAAA 201 CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG AGAGTGGACC 251 ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 301 TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT 351 TTGAAGAGGG CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA 401 GAATCAGTGA AGATGCCAGT GAAACTTCAA GCAAATCTAC TTCAACACTT 451 CATGTATTGT GTGGGTCTGT TGTAGGGTTG CCAGATGCAA TACAAGATTC 501 CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT CATTGTACCA 551 TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 601 CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA 651 CGGGAGAATA TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC 701 GAACTTTAAT TTCAGGAATT GAATGGGTTT GCTAGAATGT GATATTTGAA 751 GCATCACATA AAAATGATGG GACAATAAAT TTTGCCATAA AGTCAAATTT 801 AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC CCTAGTCTGC 851 TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 901 TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG 951 TGAGGACATG TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT 1001 TTCAAGTGTA ACTTATTAAC CTATTTATTA TTTATGTATT TATTTAAGCA 1051 TCAAATATTT GTGCAAGAAT TTGGAAAAAT AGAAGATGAA TCATTGATTG 1101 AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA GATATTAAAT 1151 
GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1201 ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT 1251 TTTTAGTATA AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC 1301 AATCCTAGTT TGATACTCCC AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT 1351 GCTGTGTTGA ATTACGGAAT AATGAGTTAG AACTATTAAA ACAGCCAAAA 1401 CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT TGTTTATTAT 1451 GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1501 AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT 1551 TTTTTACTGT TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT 1601 TAAAATATAA TTTGTTGTCA AAGTAAAAAA AAAAAAAAA // LOCUS HSY15228 1750 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens mRNA for leukemia associated gene 2. ACCESSION Y15228 NID g2664280 VERSION Y15228.1 GI:2664280 KEYWORDS LEU2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1750) AUTHORS Liu,Y., Corcoran,M., Rasool,O., Ivanova,G., Ibbotson,R., Grander,D., Iyengar,A., Baranova,A., Kashuba,V., Merup,M., Wu,X., Gardiner,A., Mullenbach,R., Poltaraus,A., Hulstrom,A.L., Juliusson,G., Chapman,R., Tiller,M., Cotter,F., Gahrton,G., Yankovsky,N., Zabarovsky,E., Einhorn,S. and Oscier,D. TITLE Cloning of two candidate tumor suppressor genes within a 10 kb region on chromosome 13q14, frequently deleted in chronic lymphocytic leukemia JOURNAL Oncogene 15 (20), 2463-2473 (1997) MEDLINE 98055620 REFERENCE 2 (bases 1 to 1750) AUTHORS Ivanova,G.M. TITLE Direct Submission JOURNAL Submitted (24-OCT-1997) G.M. Ivanova, Radiumhemmet, Karolinska Hospital, Stockholm, S-17176, SWEDEN FEATURES Location/Qualifiers source 1. .1750 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="q14.3" exon 1. .156 /number=1 exon 157. .253 /number=2 gene 241. .495 /gene="Leu2" CDS 241. 
.495 /gene="Leu2" /codon_start=1 /protein_id="CAA75516.1" /db_xref="PID:e1202445" /db_xref="PID:g2664281" /db_xref="GI:2664281" /db_xref="SPTREMBL:O43262" /translation="MRLRFNNDRMKTTIKETTILSSAILTFLTYLMKMSFERCTARNK MFVNSPFYPRVDNYCTSSWKKFYLKCYFSLNTIKKEKKMT" exon 254. .386 /gene="Leu2" /number=3 exon 387. .1735 /number=4 polyA_signal 1717. .1721 polyA_site 1735 BASE COUNT 582 a 264 c 279 g 625 t ORIGIN 1 GATGCCTGAT CTCATCAATC TAGCGGGAGA GACAGGATAA CCTGTCCGAG 51 AGTATAGCGC CACTTATGAC TCCGCCGGAA AAATTACTTT AAAAATCGCC 101 AAAAATTACT TGGAGCAAAG GGCAGTCCGG CGGCGTTCGC CAAGGTGGCG 151 CAGTCGGTTT TGACCTGTAG CAGAGAACCA ATTCTGGAGA ACAGCCTCAC 201 TTCTTTGATT GAATACTTAC ATAATGCATT GGAACATGAC ATGAGATTAA 251 GGTTTAATAA TGATAGAATG AAGACCACAA TAAAAGAGAC CACAATCCTT 301 AGCTCAGCAA TTCTTACCTT TCTTACCTAT TTGATGAAGA TGTCTTTTGA 351 AAGGTGTACT GCAAGGAACA AAATGTTTGT AAATTCTCCC TTTTACCCAA 401 GGGTGGATAA TTACTGTACC TCCTCATGGA AAAAGTTTTA TTTAAAGTGT 451 TATTTCTCAT TGAATACTAT CAAAAAGGAA AAAAAAATGA CCTAAACTTT 501 TGAGATAGAT TTGGCTCTAG TAAGTATTTA GTTATATCAC TTGCATATCT 551 GGGAGAAGAA ATAAGAGACT ATCATCAGTA CATTCCCATC TACTAAAAAA 601 ATTTATTTTA CACATGTCAA GGGATTACTT ATAACTTCCA TTTTATTACT 651 AATAGCTTGA ACCCTTTTAA TGAAGACCTA ACTCCTCCAC CAGAAATTTA 701 AGTTTATGTT CTTACTTTGT TTACTTATAA AATACATCTC AGGTATTTCG 751 GATGTCTTTT TTTTTTCTAA GCCTATATGA AATGAAAAAT ATATTGGCAA 801 AGTAAATGTT TAAACCTTTT ACGTTAAAAT TACTTTGAAA GATGAAAAGT 851 TAGTGCTGTT TTTGTCACGT TATACTGAAA TTAAATGTTT ATAATTTATA 901 TTTTGGGTTT ATGTATAAAT CATGGAATTT ATGCAAAAAT ATGAGTAGTA 951 CAGATTCTCC TCTAATTCTG TAGGACTTTG AATAATGTGA TATTTTTCTT 1001 ATAATTGGAC CCTTGTGTTT TGAAGAAATG CCAACTGCTT GAAGAATCTC 1051 CTTGTTATTT GTATTATTTG CTATAGGGTT AGATGTTGAG AAATTCTGCT 1101 GACAAAAAAT TTTAAGCCAG TTTTACACTA AATGTTCCTC AGTCTGATTA 1151 ATTTGTTATT GGATGTATTC TGTATCTTTC TTTTGTAATT TGTGACTTTT 1201 ATCCACTTAG CACGAATGAT TCTATTAAAG AAAATCATTA GGAAGTGGTA 1251 GAAACTTTAA ATCGCCCCAG AGTTTGCCTG TTTCCATATT TTATTATCTT 1301 ATAATCTTCG GGAGTGCTTA CACTTATGGA GCTAACATTT TCAGAGATAC 
1351 AGCTTCTTAT AGTAACACTA AAACTTTCTT CCTCTTTGGA CTGAATACCT 1401 ATAATTATAA CTATATGGTA GTTTAAGTTT CCTTGTGATT AGTCAAAAAT 1451 ACCATTTTAG TATGAAGCAA TGAAGTCTAT TATTTGTTGT CCCATAATTG 1501 AGAAAGCTTA AATACACCTT TTATGTAAGA GTTTAGTAAG ATTCTAGCTT 1551 AGTCTACACA GATTTTTATA TCAATTTGTT TATATTTTTA TTAATGTCAT 1601 TTCTGGAAGT GTGAAAATGT TAATGTTCAA CAAGCAACAT TAAAAATAGA 1651 TTTGAAACAT TTATATATAG AGAGGTACAC ATTTATTTAC TGTTTAGGTA 1701 CTGAAGATTA TCACTTAATA AAAAATATAT ATCCCAAAAA AAAAAAAAAA // LOCUS AF151869 2056 bp mRNA PRI 01-JUN-1999 DEFINITION Homo sapiens CGI-111 protein mRNA, complete cds. ACCESSION AF151869 NID g4929690 VERSION AF151869.1 GI:4929690 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2056) AUTHORS Lin,W.-C. TITLE Comparative gene cloning: Identification of novel human genes with Caenorhabditis elegans proteome as template JOURNAL Unpublished REFERENCE 2 (bases 1 to 2056) AUTHORS Lin,W.-C. TITLE Direct Submission JOURNAL Submitted (17-MAY-1999) Institute of Biomedical Sciences, Academia Sinica, No. 128, Sec. II, Academia Road, Taipei 115, Taiwan FEATURES Location/Qualifiers source 1. .2056 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" CDS 207. 
.806 /codon_start=1 /product="CGI-111 protein" /protein_id="AAD34106.1" /db_xref="PID:g4929691" /db_xref="GI:4929691" /translation="MHRKFVVQLFAEEWGQYVDLPKGFAVSERCKVRLVPLQIQLTTL GNLTPSSTVFFCCDMQERFRPAIKYFGDIISVGQRLLQGARILGIPVIVTEQYPKGLG STVQEIDLTGVKLVLPKTKFSMVLPEVEAALAEIPGVRSVVLFGVETHVCIQQTALEL VGRGVEVHIVADATSSRSMMDRMFARLTSRSNGDHSDHE" BASE COUNT 548 a 401 c 460 g 647 t ORIGIN 1 TCGCAGACAG CTCGGGGGAA CAGTGGCGGC TTCGGCCGGC GGTCCTTGCG 51 CTCCCCAACA GCGGCGCGGG CGGGTGAGCC GTCGGGGCAC AGTCCCGGTG 101 CTCTTCTGTT TCTCAGTCTT CGCGCGACCC TCGTGCGGTG CCACACGGGG 151 CGGGCTACGA GCTGCTCATC CAGAAGTTCC TCAGCCTGTA CGGCGACCAG 201 ATCGACATGC ACCGCAAATT CGTGGTGCAG CTGTTCGCCG AGGAGTGGGG 251 CCAGTACGTG GACTTGCCCA AGGGTTTCGC GGTGAGCGAG CGCTGCAAGG 301 TGCGCCTCGT GCCGCTGCAG ATCCAGCTCA CTACCCTGGG AAATCTTACA 351 CCTTCAAGCA CTGTGTTTTT CTGCTGTGAT ATGCAGGAAA GGTTCAGACC 401 AGCCATCAAG TATTTTGGGG ATATTATTAG CGTGGGACAG AGATTGTTGC 451 AAGGGGCCCG GATTTTAGGA ATTCCTGTTA TTGTAACAGA ACAATACCCT 501 AAAGGTCTTG GGAGCACGGT TCAAGAAATT GATTTAACAG GTGTAAAACT 551 GGTACTTCCA AAGACCAAGT TTTCAATGGT ATTACCAGAA GTAGAAGCGG 601 CATTAGCAGA GATTCCCGGA GTCAGGAGTG TTGTATTATT TGGAGTAGAA 651 ACTCATGTGT GCATCCAACA AACTGCCCTG GAGCTAGTTG GCCGAGGAGT 701 CGAGGTTCAC ATTGTTGCTG ATGCCACCTC ATCAAGAAGC ATGATGGACA 751 GGATGTTTGC ACGCCTCACG TCTCGCTCGA ACGGGGATCA TAGTGACCAC 801 GAGTGAGGCT GTTCTGCGTT CAGCTGGTAG CTGATAAGGA CGCATCGCAA 851 AATTCAAGGA AATTCAGAAT CTAATTAAGG ACGAGTGCTC CAGAGTCGGG 901 TCTGCTTTCC AAAGTATAGG ACATTTGAAG AACTGGTATG CTACTCACTG 951 GTGAAGGACA GTCAGGTGAA GGACTGTAAG CCCACACAAG CTCTTCTTAT 1001 CTCTACTAGA ATTAAAATGT TAAGTCAAAA ACGGCTCCTT TTTTGCGCCT 1051 CCTAGTGAAA CTTAACCAGC TAGACCATTT GAGTACCAGC ATTTAGTTAC 1101 AAACGTCAAA GGCTTCCGGT GCTGCTTACC TTCCTTTTTT GTTAATGTGC 1151 TTTTATTTAT TAAAAAAAAT TACAATGAAG ATGCCTGTTT TGTCTCTACT 1201 GTGTACTCTG ATCGTATCTT TCCAAAGTGC AGACTCTTGT GAAGTTTTCT 1251 TAAATTGTTC ACTTTAAAGA AAATGACGTA CCAACAATGA TTTGGCTTTT 1301 ATATTACTGT AAGATGTTAT AATGTTAATG TGGATGTAGT GCTTTTACTT 1351 TACAGATTGA 
TTGGAATAAG ATTATTGCAT ATGAATTTAC CCACAGGACT 1401 CTGAATCATG TTACCCACTC CCCTCACAAT GTTGTCCACT TAGTGAGTTG 1451 CATTGATCTA TCCGTACCAA ATGATGTTGA ATAATTACAT ATCTTTCTTG 1501 ACTATACTGA TTTCTTATTT TGGTCACTAT TACTAAATCT CTGTTAATAT 1551 TCTCTCTTTT AACTGAAAAG GGATGGGATA GAAGGGTTTG CAATGCCATA 1601 TTATTGGTGG AGGGCTGTTT TAACATCTTT GAAGTATGGC TTGCTGAATA 1651 TCTTTACCAA CATCTTGAAT ATATATTCTA GTGTCCACAA GATTTAGCAA 1701 AAAGATAAAG CTTGGGTGGA ATATCATTTT AAAATGTTCA TGTTCTGTTC 1751 TATATTTTCT TCACCTACTC TCCAAATATT GTAATGCAAA AAGTCTCAGT 1801 AATGATTTGG TAGTATTAAT TTTGTGGTCA TTGTTTCTCT TCGATAAATT 1851 TATTTTCATT AAATACTTGT TAGAGGGTTT TGAAATGTTT TTCAAATATG 1901 TGAAATGTGA AACTGCTGTC TTTTATATTA AAGTAATTAA AGAAAATGTA 1951 TTGTGATTGA AATTATTTTG GCCTCCACAA GATGGCTCTA TGAGTATTCT 2001 TCCAGGGATT CTAATATTTA TTTAAGGTAA TAAAATCTTG ACATTTATAA 2051 TCTTTC // LOCUS AF076844 2124 bp mRNA PRI 16-DEC-1998 DEFINITION Homo sapiens Hus1-like protein (HUS1) mRNA, complete cds. ACCESSION AF076844 NID g4019216 VERSION AF076844.1 GI:4019216 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2124) AUTHORS Dean,F.B., Lian,L. and O'Donnell,M. TITLE cDNA cloning and gene mapping of human homologs for Schizosaccharomyces pombe rad17, rad1, and hus1 and cloning of homologs from mouse, Caenorhabditis elegans, and Drosophila melanogaster JOURNAL Genomics 54 (3), 424-436 (1998) MEDLINE 99097342 REFERENCE 2 (bases 1 to 2124) AUTHORS Dean,F.B. and O'Donnell,M. TITLE Direct Submission JOURNAL Submitted (09-JUL-1998) Laboratory of DNA Replication, Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1. .2124 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7p13-p12" gene 1. .2124 /gene="HUS1" CDS 64. 
.906 /gene="HUS1" /note="similar to Schizosaccharomyces pombe cell cycle checkpoint protein Hus1p" /codon_start=1 /product="Hus1-like protein" /protein_id="AAC95526.1" /db_xref="PID:g4019217" /db_xref="GI:4019217" /translation="MKFRAKIVDGACLNHFTRISNMIAKLAKTCTLRISPDKLNFILC DKLANGGVSMWCELEQENFFNEFQMEGVSAENNEIYLELTSENLSRALKTAQNARALK IKLTNKHFPCLTVSVELLSMSSSSRIVTHDIPIKVIPRKLWKDLQEPVVPDPDVSIYL PVLKTMKSVVEKMKNISNHLVIEANLDGELNLKIETELVCVTTHFKDLGNPPLASEST HEDRNVEHMAEVHIDIRKLLQFLAGQQVNPTKALCNIVNNKMVHFDLLHEDVSLQYFI PALS" BASE COUNT 548 a 511 c 462 g 603 t ORIGIN 1 GCTCAGGGCG CGACGCTTTT CTGTTACCAA CAGAGGCCCG CCGCGGCTGC 51 GCCATCCGCG GCCATGAAGT TTCGGGCCAA GATCGTGGAC GGGGCCTGTC 101 TGAACCACTT CACACGAATC AGTAACATGA TAGCCAAGCT TGCCAAAACC 151 TGCACCCTCC GCATCAGCCC TGATAAGCTT AACTTCATCC TTTGTGACAA 201 GCTGGCTAAT GGAGGAGTGA GCATGTGGTG TGAGCTGGAA CAGGAGAACT 251 TCTTCAACGA ATTTCAAATG GAGGGTGTCT CTGCAGAAAA CAATGAGATT 301 TATTTAGAGC TAACATCGGA AAACTTATCT CGAGCCTTGA AGACTGCCCA 351 GAATGCCAGG GCTTTGAAAA TCAAACTGAC TAATAAACAC TTTCCCTGCC 401 TCACGGTCTC CGTGGAGCTG TTATCTATGT CAAGCAGTAG CCGCATTGTG 451 ACCCATGACA TCCCCATAAA GGTGATTCCT AGGAAATTGT GGAAGGACTT 501 ACAAGAACCG GTGGTCCCAG ATCCTGATGT TAGTATTTAT TTACCAGTCT 551 TGAAGACTAT GAAGAGTGTT GTGGAAAAAA TGAAAAACAT CAGCAATCAC 601 CTTGTTATTG AAGCAAACCT AGATGGAGAA TTGAATTTGA AAATAGAAAC 651 TGAATTAGTA TGTGTTACAA CTCATTTTAA AGATCTTGGA AATCCTCCAT 701 TAGCCTCTGA AAGCACCCAT GAGGACAGAA ACGTGGAACA CATGGCTGAA 751 GTGCACATAG ATATTAGGAA GCTCCTACAG TTTCTTGCTG GACAACAAGT 801 AAATCCCACA AAGGCCTTAT GCAATATTGT GAATAACAAG ATGGTGCATT 851 TTGATCTGCT TCATGAAGAC GTGTCCCTTC AGTATTTCAT CCCTGCGCTG 901 TCCTAGCACC CTGTCGCTGG AGTTGGCATG CAGAGACTTT GTCAGGATGG 951 GAGAGCCGCA GGTGTTGTGT TCTGATCACT GGTCTGTGCC CTCACAGCAC 1001 CGCACATCGA CACACTGTAC TTATTTGTCC CTCTCTAACA TTTTAACTAA 1051 AAGTTGATTC AACAACACAC AGTTGGATAA ACATATCACT TCATGTTGCT 1101 CATGTCTGTT TTGCTTTGTT TTTAAGACAC TGAAAAGAAA AGCTAGAATT 1151 TATTTATTCA GACTTTAAAG AACAATTTCT CATTGATGTT GTGAAAATCG 1201 TCATGTATTT 
AGACTTGGTG TAGTAGCCAG AATTCGTAAA GCTGTTGCCT 1251 GGGAGCTTGG TACTTTCCCT CCAGGCAGAG GCTCTAGCTC AGCACGGCCT 1301 GTAGCGCACA GTCAGTCTTG CATTTCAGTG TGTTCACCCC GCTGCTCCTG 1351 CCCCTTGGAG CCCAGTGACA GAAAGAACAG CCTCTGTCAC CCCGCCGCCA 1401 CTGCCTTGGT TACTCAGAGC ACTGTGGGGT GTCACAGCTG CAGCATTTGG 1451 AGTCTCTCTC TTGCTGAGGA CTCAAGCCCA CCTGAGTCCA CTCCCCTCTT 1501 GATGCCTAGA GAGCTGGCCC AGCCAACACA GCTCTTAGCT GGGAGCTCCT 1551 TCTGCCATTC CAACTAGTTT CTTCCTGGGG CCAGTTTTGG GTTTAGGTTG 1601 TAATTCCTTA TATTTCTTTC TTCCACAGTG TATCGGATCT GTCGTTCTGG 1651 AAAGAAGACC CTTCTATTTA GAGTAGAAAC AAACGAAACT TCTAAGGTAT 1701 CATCTGTGTT AAGTGATGAG ACCATATTTC TTTGATGTTT CTGAACATCA 1751 AAGCTGATTC AGTACTGGTA GATGTGCTCA TTCTCCCTGA AACATACCCA 1801 TCATATTTCC TATTATAATT ACATCTCATT GTCCTGTGGA GGTGGACATG 1851 ATAAACATTA TCTTTTGTTT TCTTGTTTTG TTTTGTTTGA GACGGTCTCA 1901 TTCTGTCACC CAGACTGGAG TGCAGTGCCA CAATCATGGC TCACCGCATT 1951 GACCTCCTTG GCTCAAGGCA TCCTCCCACC TCAGCTTCCT GACTAGCTGG 2001 GACTACTGGT GTGCACCACC ACACCCAGCT AATTTTCAAT TTTTCATAGA 2051 GACAGGGTCT CACTGTGTTG TCCAGGAGTT GGAGACCAGC CCGGGCAACA 2101 AAGTGAGACC CCCGTCTCTA CTTT // LOCUS HSU26424 2820 bp mRNA PRI 26-FEB-1996 DEFINITION Human Ste20-like kinase (MST2) mRNA, complete cds. ACCESSION U26424 NID g1203795 VERSION U26424.1 GI:1203795 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2820) AUTHORS Creasy,C.L. and Chernoff,J. TITLE Cloning and characterization of a member of the MST subfamily of Ste20-like kinases JOURNAL Gene 167 (1-2), 303-306 (1995) MEDLINE 96144292 REFERENCE 2 (bases 1 to 2820) AUTHORS Creasy,C.L. TITLE Direct Submission JOURNAL Submitted (05-MAY-1995) Caretha L. Creasy, Fox Chase Cancer Center, 7701 Burholme Avenue, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1. 
.2820 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS-4-11" /clone_lib="lambda-ZAP HeLa cDNA library, Stratagene" /cell_line="HeLa" gene 139. .1614 /gene="MST2" CDS 139. .1614 /gene="MST2" /note="Ste20-like kinase" /codon_start=1 /product="MST2" /protein_id="AAC50386.1" /db_xref="PID:g1203796" /db_xref="GI:1203796" /translation="MEQPPAPKSKLKKLSEDSLTKQPEEVFDVLEKLGEGSYGSVFKA IHKESGQVVAIKQVPVESDLQEIIKEISIMQQCDSPYVVKYYGSYFKNTDLWIVMEYC GAGSVSDIIRLRNKTLIEDEIATILKSTLKGLEYLHFMRKIHRDIKAGNILLNTEGHA KLADFGVAGQLTDTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGITSIEMAEGK PPYADIHPMRAIFMIPTNPPPTFRKPELWSDDFTDFVKKCLVKNPEQRATATQLLQHP FIKNAKPVSILRDLITEAMEIKAKRHDEQQRELEEEEENSDEDELDSHTMVKTSVGEC GTMRATSTMSEGAQTMIEHNSTMLESDLGTMVINSEDEEEEDGTMKRNATSPQVQRPS FMDYFDKQDFKNKSHENCNQNMHEPFPMSKNVFPDNWKVPQDGDFDFLKNLSLEELQM RLKALDPMMEREIEELRQRYTAKRQPILDAMDAKKRRQQNF" BASE COUNT 925 a 498 c 579 g 818 t ORIGIN 1 CCGCGGAGTT ACGGGAAAGT TGGTCCGAGT TCCCAGAGTT TCCCTCTGTG 51 GTGCCCTAGG CTTCGGCCCG GTGCCCCGGC TCCTTTCCTC CTTTCGGCCT 101 TCGCCGTCCA CCAGGTCCCT CTCTCTGTCC CGGCCGCCAT GGAGCAGCCG 151 CCGGCGCCTA AGAGTAAACT AAAAAAGCTG AGTGAAGACA GTTTGACTAA 201 GCAGCCTGAA GAAGTTTTTG ATGTATTAGA GAAGCTTGGA GAAGGGTCTT 251 ATGGAAGTGT ATTTAAAGCA ATACACAAGG AATCCGGTCA AGTTGTCGCA 301 ATTAAACAAG TACCTGTTGA ATCAGATCTT CAGGAAATAA TCAAAGAAAT 351 TTCCATAATG CAGCAATGTG ACAGCCCATA TGTTGTAAAG TACTATGGCA 401 GTTATTTTAA GAATACAGAC CTCTGGATTG TTATGGAGTA CTGTGGCGCT 451 GGCTCTGTCT CAGACATAAT TAGATTACGA AACAAGACAT TAATAGAAGA 501 TGAAATTGCA ACCATTCTTA AATCTACATT GAAAGGACTA GAATATTTGC 551 ACTTTATGAG AAAAATACAC AGAGATATAA AAGCTGGAAA TATTCTCCTC 601 AATACAGAAG GACATGCAAA ATTGGCAGAT TTTGGAGTGG CTGGTCAGTT 651 AACAGATACA ATGGCAAAAC GCAATACTGT AATAGGAACT CCATTTTGGA 701 TGGCTCCTGA GGTGATTCAA GAAATAGGCT ATAACTGTGT GGCCGACATC 751 TGGTCCCTTG GCATTACTTC TATAGAAATG GCTGAAGGAA AACCTCCTTA 801 TGCTGATATA CATCCAATGA GGGCTATTTT TATGATTCCC ACAAATCCAC 851 CACCAACATT CAGAAAGCCA GAACTTTGGT CCGATGATTT CACCGATTTT 901 GTTAAAAAGT GTTTGGTGAA GAATCCTGAG CAGAGAGCTA 
CTGCAACACA 951 ACTTTTACAG CATCCTTTTA TCAAGAATGC CAAACCTGTA TCAATATTAA 1001 GAGACCTGAT CACAGAAGCT ATGGAGATCA AAGCTAAAAG ACATGACGAA 1051 CAGCAACGAG AATTGGAAGA GGAAGAAGAA AATTCGGATG AAGATGAGCT 1101 GGATTCCCAC ACCATGGTGA AGACTAGTGT GGGAGAGTGT GGCACCATGC 1151 GGGCCACAAG CACGATGAGT GAAGGGGCCC AGACCATGAT TGAACATAAT 1201 AGCACGATGT TGGAATCCGA CTTGGGGACC ATGGTGATAA ACAGTGAGGA 1251 TGAGGAAGAA GAAGATGGAA CTATGAAAAG AAATGCAACC TCACCACAAG 1301 TACAAAGACC ATCTTTCATG GACTACTTTG ATAAGCAAGA CTTCAAGAAT 1351 AAGAGTCACG AAAACTGTAA TCAGAACATG CATGAACCCT TCCCTATGTC 1401 CAAAAACGTT TTTCCTGATA ACTGGAAAGT TCCTCAAGAT GGAGACTTTG 1451 ACTTTTTGAA AAATCTAAGT TTAGAAGAAC TACAGATGCG GTTAAAAGCA 1501 CTGGACCCCA TGATGGAACG GGAGATAGAA GAACTTCGTC AGAGATACAC 1551 TGCGAAAAGA CAGCCCATTC TGGATGCGAT GGATGCAAAG AAAAGAAGGC 1601 AGCAAAACTT TTGAGTCTAA TTTCCTCTCT GTTTTTAACT ATTCTGGAGA 1651 CCAAGAAACC ACTAGGAATT GAAGGAATAT TTGGATATTT TTAATCCTAA 1701 GATTTTGCCC TACAATTAGG CAGAGGTCAA AAAGTGACAA TGGTACATGC 1751 CCAGGTAAAT TCCCAAAAGG CAGAATTGAC AGTTGTATCT GCTGTGCATT 1801 CACTCTAAGA TGAGGAGAAC AAAAGAAGTG TATTCTCTTG TTCTGTCAGC 1851 TGCATACCAG TAATAAAACT GTTATGAAAT GGATTTTCAA GGTCTCTAAA 1901 CCTTGAAAAT CCAAAGGCTA TTGTTGCATT GTACAGCACT GAAAGGGCTT 1951 TATGTTACAA TATTCTTTAT TCCTATCTAG TATACTAGGC TATTTATTGT 2001 CCCCTTAGGT AAACTTATTT ATTTATGCTA TTTTGGCTTT GTTTCATTTT 2051 TTAAGGACAA GATCAGGATA GCTTTGGTGA AGGTAGGGTC ATATTAATAT 2101 GATGATAATG TGCAACCAAT TTATACTTTC TGCAGGGAGC TATGGGGTAC 2151 ATTCCTTGAT TTCCAGGATA GTTTTTCAAA TAGGAAAGCA ATAATGGCAG 2201 TAGTTCTCAA ATGGGCTAGG CCTTTTTTAT ATTGAAGCAA TAATTCCATT 2251 TTTACCCTTT GAAATTTTGT TTTTTTGATT TTTGATGTTT GGTACAAATA 2301 GAACTATATA TATTTAGGTA AAATAGATCT ATCGTGTTTA AAACCAAAGA 2351 AATCAATGGA ACCCTTGCAC AAAAAAGTGT GATAAATATT TTTAAATAAA 2401 AACTTAATAC AAATGTAATT TGTTAATATT GTTTCATGTT TTATGTGTAG 2451 ATCTAATAGC TGAACTGATT CAAACTGTAA TAAGCTCATC AATTTCATTT 2501 CTATGAAAAT GTGCTCTGTT GTCACAGGAT GTTTCTGTTG ATTTTATTCA 2551 TTTCCTGGGA ATTGGTAAAC ATCATGTTCC TGATGATAAC CCAGTAGCAA 2601 
AAACATTTGT ACTGAGTGGT ACAAGCCTTG GGGACTGAAA AAAAAAAAAG 2651 ATTAAAACCA TTAAAAAGAA ACTCATTTTT ACGCTGAATG AACATTTATA 2701 TGATTGCATT GGGACCAGTC ATTTCCTAAG CTACATATGG CCATCTTGAC 2751 AGTGTTTTTT CTTTTGTGTG TTTAATTATT ATGTGTAAAT CATAAAGACA 2801 AATAAATTTC ACTGTGCCAC // LOCUS AF151522 2103 bp mRNA PRI 02-AUG-1999 DEFINITION Homo sapiens hairy and enhancer of split related-1 (HESR-1) mRNA, complete cds. ACCESSION AF151522 NID g5059322 VERSION AF151522.1 GI:5059322 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2103) AUTHORS Kokubo,H., Lun,Y. and Johnson,R.L. TITLE Identification and expression of a novel family of bHLH cDNAs related to Drosophila hairy and enhancer of split JOURNAL Biochem. Biophys. Res. Commun. 260 (2), 459-465 (1999) MEDLINE 99333699 REFERENCE 2 (bases 1 to 2103) AUTHORS Kokubo,H. and Johnson,R.L. TITLE Direct Submission JOURNAL Submitted (14-MAY-1999) Biochemistry and Molecular Biology, University of Texas M.D. Anderson Cancer Center, 1515 Holcombe Blvd., Houston, Texas 77030, USA FEATURES Location/Qualifiers source 1. .2103 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 1. .2103 /gene="HESR-1" /note="similar to GenBank Accession Number AI272343" CDS 25. 
.939 /gene="HESR-1" /note="basic helix loop helix factor" /codon_start=1 /product="hairy and enhancer of split related-1" /protein_id="AAD38967.1" /db_xref="PID:g5059323" /db_xref="GI:5059323" /translation="MKRAHPEYSSSDSELDETIEVEKESADENGNLSSALGSMSPTTS SQILARKRRRGIIEKRRRDRINNSLSELRRLVPSAFEKQGSAKLEKAEILQMTVDHLK MLHTAGGKGYFDAHALAMDYRSLGFRECLAEVARYLSIIEGLDASDPLRVRLVSHLNN YASQREAASGAHAGLGHIPWGTVFGHHPHIAHPLLLPQNGHGNAGTTASPTEPHHQGR LGSAHPEAPALRAPPSGSLGPVLPVVTSASKLSPPLLSSVASLSAFPFSFGSFHLLSP NALSPSAPTQAANLGKPYRPWGTEIGAF" BASE COUNT 540 a 463 c 504 g 596 t ORIGIN 1 CCCAGGAACC CCCAGGGAGC CAGCATGAAG CGAGCTCACC CCGAGTACAG 51 CTCCTCGGAC AGCGAGCTGG ACGAGACCAT CGAGGTGGAG AAGGAGAGTG 101 CGGACGAGAA TGGAAACTTG AGTTCGGCTC TAGGTTCCAT GTCCCCAACT 151 ACATCTTCCC AGATTTTGGC CAGAAAAAGA CGGAGAGGAA TAATTGAGAA 201 GCGCCGACGA GACCGGATCA ATAACAGTTT GTCTGAGCTG AGAAGGCTGG 251 TACCCAGTGC TTTTGAGAAG CAGGGATCTG CTAAGCTAGA AAAAGCCGAG 301 ATCCTGCAGA TGACCGTGGA TCACCTGAAA ATGCTGCATA CGGCAGGAGG 351 GAAAGGTTAC TTTGACGCGC ACGCCCTTGC TATGGACTAT CGGAGTTTGG 401 GATTTCGGGA ATGCCTGGCA GAAGTTGCGC GTTATCTGAG CATCATTGAA 451 GGACTAGATG CCTCTGACCC GCTTCGAGTT CGACTGGTTT CGCATCTCAA 501 CAACTACGCT TCCCAGCGGG AAGCCGCGAG CGGCGCCCAC GCGGGCCTCG 551 GACACATTCC CTGGGGGACC GTCTTCGGAC ATCACCCGCA CATCGCGCAC 601 CCGCTGTTGC TGCCCCAGAA CGGCCACGGG AACGCGGGCA CCACGGCCTC 651 ACCCACGGAA CCGCACCACC AGGGCAGGCT GGGCTCGGCA CATCCGGAGG 701 CGCCTGCTTT GCGAGCGCCC CCTAGCGGCA GCCTCGGACC GGTGCTCCCT 751 GTGGTCACCT CCGCCTCCAA ACTGTCGCCG CCTCTGCTCT CCTCAGTGGC 801 CTCCCTGTCG GCCTTCCCCT TCTCTTTCGG CTCCTTCCAC TTACTGTCTC 851 CCAATGCACT GAGCCCTTCA GCACCCACGC AGGCTGCAAA CCTTGGCAAG 901 CCCTATAGAC CTTGGGGGAC GGAGATCGGA GCTTTTTAAA GAACTGATGT 951 AGAATGAGGG AGGGGAAAGT TTAAAATCCC AGCTGGGCTG GACTGTTGCC 1001 AACATCACCT TAAAGTCGTC AGTAAAAGTA AAAAGGAAAA AGGTACACTT 1051 TCAGATAATT TTTTTTTTAA AGACTAAAGG TTTGTTGGTT TACTTTTATC 1101 TTTTTTAATG TTTTTTTCAT CATGTCATGT ATTAGCAGTT TTTAAAAACT 1151 AGTTGTTAAA TTTTGTTCAA GACATTAAAT TGAAATAGTG AGTATAAGCC 1201 AACACTTTGT 
GATAGGTTTG TACTGTGCCT AATTTACTTT GTAAACCAGA 1251 ATGATTCCGT TTTTGCCTCA AAATTTGGGG AATCTTAACA TTTAGTATTT 1301 TTGGTCTGTT TTTCTCCTTG TATAGTTATG GTCTGTTTTT AGAATTAATT 1351 TTCCAAACCA CTATGCTTAA TGTTAACATG ATTCTGTTTG TTAATATTTT 1401 GACAGATTAA GGTGTTGTAT AAATAATATT CTTTTGGGGG GAGGGGAACT 1451 ATATTGAATT TTATATTTCT GAGCAAAGCG TTGACAAATC AGATGATCAG 1501 CTTTATCCAA GAAAGAAGAC TAGTAAATTG TCTGCCTCCT ATAGCAGAAA 1551 GGTGAATGTA CAAACTGTTG GTGGCCCTGA ATCCATCTGA CCAGCTGCTG 1601 GTATCTGCCA GGACTGGCAG TTCTGATTTA GTTAGGAGAG AGCCGCTGAT 1651 AGGTTAGGTC TCATTTGGAG TGTTGGTGGA AAGGAAACTG AAGGTAATTG 1701 AATAGAATAC GCCTGCATTT ACCAGCCCCA GCAACACAAA GAATTTTTAA 1751 TCACACGGAT CTCAAATTCA CAAATGTTAA CATGGATAAG TGATCATGGT 1801 GTGCGAGTGG TCAATTGAGT AGTACAGTGG AAACTGTTAA ATGCATAACC 1851 TAATTTTCCT GGGACTGCCA TATTTTCTTT TAACTGGAAA TTTTTATGTG 1901 AGTTTTCCTT TTGGTGCATG GAACTGTGGT TGCCAAGGTA TTTAAAAGGG 1951 CTTTCCTGCC TCCTTCTCTT TGATTTATTT AATTTGATTT GGGCTATAAA 2001 ATATCATTTT TCAGGTTTAT TCTTTTAGCA GGTGTAGTTA AACGACCTCC 2051 ACTGAACTGG GTTTGACCTC TGTTGTACTG ATGTGTTGTG ACTAAATAAA 2101 AAA // LOCUS HSMDNCF 1560 bp mRNA PRI 31-MAR-1995 DEFINITION Human mRNA for MDNCF (monocyte-derived neutrophil chemotactic factor). ACCESSION Y00787 NID g34518 VERSION Y00787.1 GI:34518 KEYWORDS cytokine. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Matsushima,K. TITLE Direct Submission JOURNAL Submitted (03-MAY-1988) Matsushima K., National Cancer Institute,, Bldg 560, Rm 31-19, Frederick, MD 21701 REFERENCE 2 (bases 1 to 1560) AUTHORS Matsushima,K., Morishita,K., Yoshimura,T., Lavu,S., Kobayashi,Y., Lew,W., Appella,E., Kung,H.F., Leonard,E.J. and Oppenheim,J.J. TITLE Molecular cloning of a human monocyte-derived neutrophil chemotactic factor (MDNCF) and the induction of MDNCF mRNA by interleukin 1 and tumor necrosis factor JOURNAL J. Exp. Med. 
167 (6), 1883-1893 (1988) MEDLINE 88258376 COMMENT for overlapping sequence see M17016 - M17017. FEATURES Location/Qualifiers source 1. .1560 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /clone_lib="lambda gt10" sig_peptide 102. .182 /product="signal peptide (AA -27 to -1)" CDS 102. .401 /codon_start=1 /product="MDNCF precursor (AA -27 to 72)" /protein_id="CAA68742.1" /db_xref="PID:g34519" /db_xref="GI:34519" /db_xref="SWISS-PROT:P10145" /translation="MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPF HPKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS" mat_peptide 183. .398 /product="mat. MDNCF (AA 1 - 72)" BASE COUNT 526 a 247 c 281 g 506 t ORIGIN 1 CTCCATAAGG CACAAACTTT CAGAGACAGC AGAGCACACA AGCTTCTAGG 51 ACAAGAGCCA GGAAGAAACC ACCGGAAGGA ACCATCTCAC TGTGTGTAAA 101 CATGACTTCC AAGCTGGCCG TGGCTCTCTT GGCAGCCTTC CTGATTTCTG 151 CAGCTCTGTG TGAAGGTGCA GTTTTGCCAA GGAGTGCTAA AGAACTTAGA 201 TGTCAGTGCA TAAAGACATA CTCCAAACCT TTCCACCCCA AATTTATCAA 251 AGAACTGAGA GTGATTGAGA GTGGACCACA CTGCGCCAAC ACAGAAATTA 301 TTGTAAAGCT TTCTGATGGA AGAGAGCTCT GTCTGGACCC CAAGGAAAAC 351 TGGGTGCAGA GGGTTGTGGA GAAGTTTTTG AAGAGGGCTG AGAATTCATA 401 AAAAAATTCA TTCTCTGTGG TATCCAAGAA TCAGTGAAGA TGCCAGTGAA 451 ACTTCAAGCA AATCTACTTC AACACTTCAT GTATTGTGTG GGTCTGTTGT 501 AGGGTTGCCA GATGCAATAC AAGATTCCTG GTTAAATTTG AATTTCAGTA 551 AACAATGAAT AGTTTTTCAT TGTACCATGA AATATCCAGA ACATACTTAT 601 ATGTAAAGTA TTATTTATTT GAATCTACAA AAAACAACAA ATAATTTTTA 651 AATATAAGGA TTTTCCTAGA TATTGCACGG GAGAATATAC AAATAGCAAA 701 ATTGGGCCAA GGGCCAAGAG AATATCCGAA CTTTAATTTC AGGAATTGAA 751 TGGGTTTGCT AGAATGTGAT ATTTGAAGCA TCACATAAAA ATGATGGGAC 801 AATAAATTTT GCCATAAAGT CAAATTTAGC TGGAAATCCT GGATTTTTTT 851 CTGTTAAATC TGGCAACCCT AGTCTGCTAG CCAGGATCCA CAAGTCCTTG 901 TTCCACTGTG CCTTGGTTTC TCCTTTATTT CTAAGTGGAA AAAGTATTAG 951 CCACCATCTT ACCTCACAGT GATGTTGTGA GGACATGTGG AAGCACTTTA 1001 AGTTTTTTCA TCATAACATA AATTATTTTC AAGTGTAACT TATTAACCTA 1051 TTTATTATTT ATGTATTTAT TTAAGCATCA AATATTTGTG CAAGAATTTG 1101 GAAAAATAGA 
AGATGAATCA TTGATTGAAT AGTTATAAAG ATGTTATAGT 1151 AAATTTATTT TATTTTAGAT ATTAAATGAT GTTTTATTAG ATAAATTTCA 1201 ATCAGGGTTT TTAGATTAAA CAAACAAACA ATTGGGTACC CAGTTAAATT 1251 TTCATTTCAG ATATACAACA AATAATTTTT TAGTATAAGT ACATTATTGT 1301 TTATCTGAAA TTTTAATTGA ACTAACAATC CTAGTTTGAT ACTCCCAGTC 1351 TTGTCATTGC CAGCTGTGTT GGTAGTGCTG TGTTGAATTA CGGAATAATG 1401 AGTTAGAACT ATTAAAACAG CCAAAACTCC ACAGTCAATA TTAGTAATTT 1451 CTTGCTGGTT GAAACTTGTT TATTATGTAC AAATAGATTC TTATAATATT 1501 ATTTAAATGA CTGCATTTTT AAATACAAGG CTTTATATTT TTAACTTTAA 1551 AAAAAACCGG // LOCUS HSU12535 3832 bp mRNA PRI 18-FEB-1995 DEFINITION Human epidermal growth factor receptor kinase substrate (Eps8) mRNA, complete cds. ACCESSION U12535 NID g530822 VERSION U12535.1 GI:530822 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3832) AUTHORS Wong,W.T., Carlomagno,F., Druck,T., Barletta,C., Croce,C.M., Huebner,K., Kraus,M.H. and Di Fiore,P.P. TITLE Evolutionary conservation of the EPS8 gene and its mapping to human chromosome 12q23-q24 JOURNAL Oncogene 9 (10), 3057-3061 (1994) MEDLINE 94366758 REFERENCE 2 (bases 1 to 3832) AUTHORS Di Fiore,P. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Pier Paolo Di Fiore, Lab. Cellular & Molecular Biology, National Cancer Institute, Building 37, Room 1D23, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1. .3832 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q23-q24" gene 210. .2678 /gene="Eps8" CDS 210. 
.2678 /gene="Eps8" /codon_start=1 /product="epidermal growth factor receptor kinase substrate" /protein_id="AAA62280.1" /db_xref="PID:g530823" /db_xref="GI:530823" /translation="MNGHISNHPSSFGMYPSQMNGYGSSPTFSQTDREHGSKTSAKAL YEQRKNYARDSVSSVSDISQYRVEHLTTFVLDRKDAMITVDDGIRKLKLLDAKGKVWT QDMILQVDDRAVSLIDLESKNELENFPLNTIQHCQAVMHSCSYDSVLALVCKEPTQNK PDLHLFQCDEVKANLISEDIESAISDSKGGKQKRRPDALRMISNADPSIPPPPRAPAP APPGTVTQVDVRSRVAAWSAWAADQGDFEKPRQYHEQEETPEMMAARIDRDVQILNHI LDDIEFFITKLQKAAEAFSELSKRKKNKKGKRKGPGEGVLTLRAKPPPPDEFLDCFQK FKHGFNLLAKLKSHIQNPSAADLVHFLFTPLNMVVQATGGPELASSVLSPLLNKDTID FLNYTVNGDERQLWMSLGGTWMKARAEWPKEQFIPPYVPRFRNGWEPPMLNFMGATME QDLYQLAESVANVAEHQRKQEIKRLSTEHSSVSEYHPADGYAFSSNIYTRGSHLDQGE AAVAFKPTSNRHIDRNYEPLKTQPKKYAKSKYDFVARNNSELSVLKDDILEILDDRKQ WWKVRNASGDSGFVPNNILDIVRPPESGLGRADPPYTHTIQKQRMEYGPRPADTPPAP SPPPTPAPVPVPLPPSTPAPVPVSKVPANITRQNSSSSDSGGSIVRDSQRHKQLPVDR RKSQMEEVQDELIHRLTIGRSAAQKKFHVPRQNVPVINITYDSTPEDVKTWLQSKGFN PVTVNSLGVLNGAQLFSLNKDELRTVCPEGARVYSQITVQKAALEDSSGSSELQEIMR RRQEKISAAASDSGVESFDEGSSH" BASE COUNT 1171 a 781 c 840 g 1040 t ORIGIN 1 CGCGAGGCCG GCGCTGTGCT CGCCTCGGAG ATCGCTGCTC TTTAGCTGGG 51 TGCAGAAGGC GGCTCCGCGG CTCGCGGACG ACTGGCTGGG CGCGAATCAG 101 ATTGGGGGGC TTTCTCCCGG TCCCCTCCCA CCTCGTCTGG GCTCGCGGCG 151 TCTCCGGGGA AAGCCGTGGC CCCGAGGGCG GATCCGAGAA CACACAAGTG 201 AAAGACACAA TGAATGGTCA TATTTCTAAT CATCCCAGTA GTTTTGGAAT 251 GTACCCATCT CAGATGAATG GCTACGGATC ATCACCTACC TTTTCCCAGA 301 CGGACAGAGA ACATGGTTCA AAAACAAGTG CAAAGGCCCT TTATGAACAA 351 AGGAAGAATT ATGCACGGGA CAGTGTCAGC AGTGTGTCAG ATATATCTCA 401 ATACCGTGTT GAACACTTGA CTACCTTTGT CCTGGATCGG AAAGATGCTA 451 TGATCACTGT TGATGATGGA ATAAGGAAAT TGAAATTGCT TGATGCCAAG 501 GGCAAAGTGT GGACTCAAGA TATGATTCTT CAAGTGGATG ACAGAGCTGT 551 GAGCCTGATT GATTTAGAAT CAAAGAATGA ACTGGAGAAT TTTCCTTTAA 601 ACACAATCCA GCACTGCCAA GCTGTGATGC ATTCATGCAG CTATGATTCA 651 GTTCTTGCAC TGGTGTGCAA AGAGCCAACC CAGAACAAGC CAGATCTTCA 701 TCTCTTCCAG TGTGATGAGG TTAAGGCAAA CCTAATTAGT GAAGATATTG 751 AAAGTGCAAT CAGTGACAGT AAAGGAGGGA AACAGAAGAG 
GCGGCCCGAC 801 GCCCTGAGGA TGATTTCCAA TGCAGACCCT AGTATACCGC CTCCACCCAG 851 AGCTCCTGCC CCTGCGCCCC CTGGGACCGT CACCCAGGTG GATGTTAGAA 901 GTCGAGTGGC AGCCTGGTCT GCATGGGCAG CCGACCAAGG GGACTTTGAG 951 AAACCAAGGC AGTATCATGA GCAGGAAGAA ACACCTGAGA TGATGGCAGC 1001 CCGCATTGAC AGAGATGTGC AAATCTTAAA CCACATTTTG GATGACATTG 1051 AATTTTTTAT CACAAAACTC CAAAAAGCAG CAGAAGCATT TTCTGAGCTT 1101 TCTAAAAGGA AGAAAAACAA GAAAGGTAAA AGGAAAGGAC CAGGAGAGGG 1151 TGTTTTAACG CTGCGGGCAA AACCTCCACC TCCTGATGAA TTTCTTGACT 1201 GTTTCCAAAA GTTTAAACAC GGATTTAACC TTCTGGCCAA ACTGAAGTCT 1251 CATATTCAGA ATCCTAGTGC TGCAGATTTG GTTCACTTTT TGTTTACTCC 1301 ATTAAATATG GTGGTGCAGG CAACAGGAGG TCCTGAACTA GCCAGTTCAG 1351 TACTTAGTCC CCTATTGAAT AAGGACACAA TTGATTTCTT AAATTATACT 1401 GTCAATGGTG ATGAACGGCA GCTGTGGATG TCATTGGGAG GAACTTGGAT 1451 GAAAGCCAGA GCAGAGTGGC CAAAAGAACA GTTTATTCCA CCATATGTTC 1501 CACGATTCCG CAATGGCTGG GAGCCCCCAA TGCTGAACTT TATGGGAGCC 1551 ACAATGGAAC AAGATCTTTA TCAACTGGCA GAATCTGTGG CAAATGTAGC 1601 AGAACATCAG CGCAAACAGG AAATAAAAAG ATTATCCACA GAGCATTCCA 1651 GTGTATCAGA GTATCATCCA GCCGATGGCT ATGCGTTCAG TAGCAACATT 1701 TACACAAGAG GATCCCACCT GGACCAAGGG GAAGCTGCTG TTGCTTTTAA 1751 GCCAACTTCT AATCGCCATA TAGATAGAAA TTATGAACCA CTCAAAACAC 1801 AACCCAAGAA ATATGCCAAA TCCAAGTATG ACTTTGTAGC AAGGAACAAC 1851 AGTGAGCTCT CGGTTCTAAA GGATGATATT TTAGAGATAC TTGATGATCG 1901 GAAGCAATGG TGGAAAGTTC GAAATGCAAG TGGAGACTCT GGATTTGTGC 1951 CAAATAACAT TTTGGATATT GTGAGACCTC CAGAATCTGG ATTGGGGCGT 2001 GCTGATCCAC CTTATACTCA TACTATACAG AAACAAAGGA TGGAGTATGG 2051 CCCAAGACCA GCTGATACTC CCCCTGCTCC ATCACCTCCT CCAACACCAG 2101 CTCCTGTTCC TGTTCCCCTT CCCCCTTCCA CTCCAGCACC TGTTCCTGTG 2151 TCAAAGGTCC CAGCAAATAT AACACGTCAA AACAGCAGCT CCAGTGACAG 2201 TGGTGGCAGT ATCGTGCGAG ACAGCCAGAG ACACAAACAA CTTCCGGTGG 2251 ACCGAAGGAA ATCTCAGATG GAGGAAGTGC AAGATGAACT CATCCACAGA 2301 CTGACCATTG GTCGGAGTGC CGCTCAGAAG AAATTCCATG TGCCACGGCA 2351 GAACGTGCCA GTTATCAATA TCACTTACGA CTCCACACCA GAGGATGTGA 2401 AGACGTGGTT ACAGTCAAAG GGATTCAACC CTGTGACTGT CAATAGTCTT 2451 
GGAGTATTAA ATGGTGCACA ACTTTTCTCT CTCAATAAGG ATGAACTGAG 2501 GACAGTCTGC CCTGAAGGGG CGAGAGTCTA TAGCCAAATC ACTGTACAAA 2551 AAGCTGCATT GGAGGATAGC AGTGGCAGCT CCGAGTTACA AGAAATTATG 2601 AGAAGACGAC AGGAAAAAAT CAGTGCTGCC GCTAGTGATT CAGGAGTGGA 2651 ATCTTTTGAT GAAGGAAGCA GTCACTAATT TGTTTGTTTG TATTTAAACT 2701 CCATTGTTTT TGGCATTATT CCAACATGCT TTGTTTTAAG AAGCCTTGAA 2751 GGGAATGTCA GATTCATTTT TCTTGATGTA ATTTATCACC ATAAAAAAAA 2801 AACCCATGCA AACCTGAGTG AGCACAGGAT TTGCTTCTAG GCCCATTATT 2851 TTTATTAAAA CTGAAAAAAT TTAAACTGAA TTTTTTGACC TTGGAAAATA 2901 TTTTTCTTAC TTTACCAAGG TGAAGTTTCC TTAATTAGAC TAATTATTTT 2951 ATCCCCATCC CAGGGTATAA ACAGGAATTG TTTTGATAGT GGTGGAGTTA 3001 TTCACTGCAA CAAAGCAACA ATGTTGTCCA TGATTCAAAA TCTAAGCAGT 3051 TTCGATTTTG CCTGTGAATA TGGTGTCTGT CATTCAGGGC ATAGCTCACT 3101 GTAGGCTAGC CTCTGCTTAC TTAAGTCTCT TCTCTGACAT ACTCAATGGA 3151 AGAATATTTA GATTTATTTA AAGTTCTTAA TGCCAACAGT TTAAAAAAAA 3201 ATTAAAACAT TTGAATGAAC TGTAAAGTAC AGCCATACCT TGGACATGCA 3251 AATATAAATC TATGGAGCAT TCTCAAGACA GTTTGTCATG GCTCTGTTGA 3301 TTGCAACTCC TTGTATAGCT TGTATTTTGA TTTAGTTTAT ATTCTGCTTA 3351 TTATGTATAC TGTGTTCTTA TATATGAGAA AGCACAAATG CGAAAGAGGT 3401 CATGTCTTCT CAAAATCTAG CAAAGGAAGT AGTCTGCATT GGTGTGCATT 3451 ACAGTATTTT GCTTAATGAA AGCCTCAGTT CTGAATGTTG ATATGAGTAG 3501 TTAAAAGGAA GTGGGGCCAT TTTATGTGTT TATCTGTGTC AAGTATTTCT 3551 GGTAATAAGA AGCACTTAAT TTACACATAT TTTAATCCTG TGAAAGATTC 3601 CACATAGAGA AAAGAAAGAT ACCTAACCTT CAACAAATGT TATTTTTGGA 3651 AACACAATTT TTGTCATTAA ATGTTATATT ATTTCACATA TATAAAACAG 3701 ATGTTATGTA AGAATGTTGT ATATTTTAAC ATAAATCATT TAGAGAAATT 3751 ATCTAGATTC ATTAATTTTC ATAGTGCCTT TTTCACATGA GTCAGCTGGA 3801 AAGTCTGCAA TAAACAGTAT TTGCTGTCTG TT // LOCUS D84454 2620 bp mRNA PRI 06-FEB-1999 DEFINITION Human mRNA for UDP-galactose translocator, complete cds. ACCESSION D84454 NID g1526437 VERSION D84454.1 GI:1526437 KEYWORDS UDP-galactose translocator; UGT. SOURCE Homo sapiens normal fibroblast cell_line:TIG-1 cDNA to mRNA. 
ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2620) AUTHORS Ishida,N. TITLE Direct Submission JOURNAL Submitted (18-APR-1996) to the DDBJ/EMBL/GenBank databases. Nobuhiro Ishida, The Tokyo Metropolitan Institute of Medical Science, The Physiological Chemistry; 18-22, Honkomagome 3-chome, Bunkyo-ku, Tokyo 113-8613, Japan (E-mail:ishidan@rinshoken.or.jp, Tel:81-3-3823-2101, Fax:81-3-3823-2965) REFERENCE 2 (bases 1 to 2620) AUTHORS Miura,N., Ishida,N., Hoshino,M., Yamauchi,M., Hara,T., Ayusawa,D. and Kawakita,M. TITLE Human UDP-galactose translocator: molecular cloning of a complementary DNA that complements the genetic defect of a mutant cell line deficient in UDP-galactose translocator JOURNAL J. Biochem. 120 (2), 236-241 (1996) MEDLINE 97044734 REFERENCE 3 (sites) AUTHORS Miura,N., Ishida,N., Mamauchi,M., Hara,T., Ayusawa,D., Hoshino,M. and Kawakita,M. TITLE Molecular cloning and expression of human UDP-galactose translocator JOURNAL Unpublished (1996) REFERENCE 4 (sites) AUTHORS Hara,T., Yamauchi,M., Takahashi,E., Hoshino,M., Aoki,K., Ayusawa,D. and Kawakita,M. TITLE The UDP-galactose translocator gene is mapped to band Xp11.23-p11.22 containing the Wiskott-Aldrich syndrome locus JOURNAL Somat. Cell Mol. Genet. 19 (6), 571-575 (1993) MEDLINE 94174379 FEATURES Location/Qualifiers source 1. .2620 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TIG-1" /cell_type="normal fibroblast" /chromosome="X" /map="Xp11.22-p11.23" CDS 324. 
.1505 /codon_start=1 /product="UDP-galactose translocator" /protein_id="BAA12673.1" /db_xref="PID:d1013353" /db_xref="PID:g1526438" /db_xref="GI:1526438" /translation="MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLV VQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLH EAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLML NRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGV YFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVL NQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVDPLFALGAGLVIGA VYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDLITEPFLPKSVLV K" BASE COUNT 466 a 727 c 761 g 666 t ORIGIN 1 CATCGGGGGA TGATCTGGAA AGCGCGATCA GTGAAGCGGA CGAACGGCAG 51 GATAAGGCGG GTCTAGTGAC AGGAATGGGC CGATGAAGCT CTGTAGGGAT 101 GGTGGGTAGG CCGATCGGGC CGTGTCCCCG GCCTCCCGAT CCGACGGAAT 151 TTGGAAATCC CGGGGCTATT CATACATTGA GCTTTTAGGA GCGGAGGAGA 201 AAAGCCACCA CCCTGACGAT CCCGGCTCTC GCTCCACCTT CACTCAGGTG 251 GCCCGGCAGC GGAAGTGACG AACGCGGAAG TGGTTTTTCT GTTGCCGAGG 301 GGACGGGCCG GGCAGATGCC AACATGGCAG CGGTTGGGGC TGGTGGTTCC 351 ACCGCGGCGC CCGGGCCAGG GGCGGTTTCC GCGGGTGCAT TGGAGCCGGG 401 GACCGCCAGT GCGGCTCACA GGCGCCTGAA GTACATATCC CTAGCTGTGC 451 TGGTGGTCCA GAATGCCTCC CTCATCCTCA GCATCCGCTA CGCCCGCACG 501 TTGCCAGGGG ACCGCTTCTT TGCCACCACT GCTGTGGTCA TGGCGGAAGT 551 GCTCAAAGGT CTCACCTGCC TGCTGCTGCT CTTCGCACAG AAGAGGGGTA 601 ACGTGAAGCA CCTGGTTCTC TTCCTCCATG AGGCTGTCCT GGTGCAGTAT 651 GTGGACACGC TCAAGCTCGC AGTGCCCTCT CTCATCTACA CCTTGCAGAA 701 TAACCTCCAG TATGTTGCCA TCTCTAACCT ACCAGCTGCC ACTTTCCAGG 751 TGACATACCA GCTGAAGATC CTGACCACAG CGCTGTTCTC CGTGCTCATG 801 CTGAATCGCA GCCTTTCCCG GCTGCAGTGG GCCTCCCTGC TGCTCCTCTT 851 CACTGGCGTC GCCATTGTCC AGGCACAGCA AGCCGGTGGG GGAGGCCCAC 901 GGCCACTGGA TCAGAACCCT GGGGCAGGCC TGGCAGCCGT CGTGGCCTCC 951 TGTCTCTCCT CCGGCTTCGC AGGTGTCTAC TTTGAGAAGA TCCTCAAAGG 1001 CAGCTCAGGC TCCGTGTGGC TGCGCAACCT GCAACTGGGC CTCTTCGGCA 1051 CAGCACTGGG CCTGGTGGGG CTCTGGTGGG CTGAGGGTAC CGCCGTGGCC 1101 ACCCGTGGTT TCTTTTTTGG GTACACACCT GCTGTCTGGG GCGTGGTGCT 1151 CAACCAGGCC TTCGGCGGGC 
TACTGGTGGC TGTGGTTGTC AAGTACGCTG 1201 ACAATATCCT CAAGGGCTTT GCCACCTCCC TGTCCATTGT GCTGTCCACT 1251 GTTGCCTCCA TTCGCCTCTT TGGCTTCCAC GTGGACCCAT TATTTGCCCT 1301 TGGCGCTGGA CTCGTCATTG GTGCTGTCTA CCTCTACAGC CTTCCCCGAG 1351 GTGCAGCCAA AGCCATAGCC TCTGCCTCTG CCTCCGCCTC CGGGCCCTGC 1401 GTTCACCAGC AGCCTCCCGG GCAGCCACCA CCACCGCAGC TGTCTTCCCA 1451 CCGTGGAGAC CTCATCACGG AGCCCTTTCT GCCAAAGTCA GTGCTGGTGA 1501 AGTGAGGGCT GGCAGCAATG GGGGGACACA AGGGAGGGGG ACTGGGGTGG 1551 AGGGTGTTGG GCATCTGCAG GACCCAAGTC GCCACCCTCC GGGGCCTGGC 1601 TCCTCTGGGT TTGGGAGATG GTCTTTTCTC CCAGGTCACT GAGACTTCTG 1651 GAGGGGTGTG GGACTAGAGC TGGGTGTCAC GTGAACCCTT CCTGGTAGGG 1701 TGACCCCCTT CCCCTGGAGG GGGTTTTAGA GCTGCCGCCT CTGCTCCCTC 1751 TAACCTCTTT GGAGGCAGGG TTGGGGGTAT TGTCATTCAA GGCCTTTTTT 1801 TTGTCTGCTC CCTCCCCGAC CCTGTGCCCT CTTCTGGAGG TTTCTCGTCT 1851 GGGAGAGTCC CTCCCAGCAG TCCCTCCACC TCCATAAGGA CACACTGGAC 1901 AAAACTCCCG CAGCTCTTCA GGAATGACCG ATGCCTACCT GTGGGGTTCA 1951 GTTGCCCATA GTTTGAGGCC TTCTCTCCTC CCTTACCACC GCTCTGGATC 2001 ATGTTACTAG TTCCGTCTTT TGTGTGGCCT TGGGCCAGCT TCCTTGATAC 2051 CTTGAAGATG GGCTTCTTGT GAGTCCCCAG GGAGAAAGGG ACAAGAGCTA 2101 AGATTTTTGC ATCAGCCCTT CTGGCAGAAG GTGTGGTAGG GGCCATTTGT 2151 TTTTTTTAGT GGACTTGGGA TTTGTGGTGT AATCATATCA TTAATGATCC 2201 AGGGTGTGGG AAAAATGGAG GTCCTTGAAG TGGCTGAATC TCATTGTATT 2251 TAAGACACTG TCAGTTGCCA GATGTAGGCT TATTTTTGGA GATGTCTAGG 2301 AGAGGAAAAA GCTACCAATC ATACTCTTGA TATCCGTCTG GCTGTGTGAG 2351 GCACCCCTAC CTCATGGGGG TGTCTTGGGA TTGATGAACT GTGGAACCTG 2401 CCTCCTGCGC TCCCCAAAGC TTATTAACCC CTTAACTGTA TCGGGGCGGG 2451 GTGTGTGTGT GCATGGAAGA TGCCTGGGCT GTCTTTGCTA TATGTAAATA 2501 GAGCCATTGG ATCTTTATTT TTGATTAATT TGTTCTGATT TTTTGGTTTG 2551 TTTTTTAAGG AACTGTAATG AACAAATGTC AGGATATCCA ATGCCAAATA 2601 AAGATGTTGT ATTTATTTAG // LOCUS HSU90543 2904 bp mRNA PRI 02-MAY-1997 DEFINITION Human butyrophilin (BTF1) mRNA, complete cds. ACCESSION U90543 NID g2062687 VERSION U90543.1 GI:2062687 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2904) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE A 1.1 megabase transcript map of the human hereditary hemochromatosis locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 2904) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Sequencing, Mercator Genetics, 4040 Campbell Avenue, Menlo Park, CA 94025, USA FEATURES Location/Qualifiers source 1. .2904 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" gene 211. .1794 /gene="BTF1" CDS 211. 
.1794 /gene="BTF1" /codon_start=1 /product="butyrophilin" /protein_id="AAB53421.1" /db_xref="PID:g2062688" /db_xref="GI:2062688" /translation="MESAAALHFSRPASLLLLLLSLCALVSAQFIVVGPTDPILATVG ENTTLRCHLSPEKNAEDMEVRWFRSQFSPAVFVYKGGRERTEEQMEEYRGRTTFVSKD ISRGSVALVIHNITAQENGTYRCYFQEGRSYDEAILHLVVAGLGSKPLISMRGHEDGG IRLECISRGWYPKPLTVWRDPYGGVAPALKEVSMPDADGLFMVTTAVIIRDKSVRNMS CSINNTLLGQKKESVIFIPESFMPSVSPCAVALPIIVVILMIPIAVCIYWINKLQKEK KILSGEKEFERETREIALKELEKERVQKEEELQVKEKLQEELRWRRTFLHAVDVVLDP DTAHPDLFLSEDRRSVRRCPFRHLGESVPDNPERFDSQPCVLGRESFASGKHYWEVEV ENVIEWTVGVCRDSVERKGEVLLIPQNGFWTLEMHKGQYRAVSSPDRILPLKESLCRV GVFLDYEAGDVSFYNMRDRSHIYTCPRSAFSVPVRPFFRLGCEDSPIFICPALTGANG VTVPEEGLTLHRVGTHQSL" BASE COUNT 784 a 678 c 768 g 674 t ORIGIN 1 CGACCCACGC GTCCGAACAT GGCGACCTAG GAGAAAGGGA AGAACAATTT 51 TTTCTCCTCT TTTGGGAAGG TTTGCGTCTA GTAGTGCCTG TGCCCCTGGG 101 CAGATTGGAG AGAAGAGGGA CGACTGGAGA ATCGTCGAGA ACCAGCGGAG 151 AAAAGAAAAA GCAACGTTTA ATTCTAGAAG GCCTCCTGTC CCTGCCTGCT 201 CTGGGTGCTC ATGGAATCAG CTGCTGCCCT GCACTTCTCC CGGCCAGCCT 251 CCCTCCTCCT CCTCCTCCTC AGCCTGTGTG CACTGGTCTC AGCCCAGTTT 301 ATTGTCGTGG GGCCCACTGA TCCCATCTTG GCCACGGTTG GAGAAAACAC 351 TACGTTACGC TGCCATCTGT CACCCGAGAA AAATGCTGAG GACATGGAGG 401 TGCGGTGGTT CCGGTCTCAG TTCTCCCCCG CAGTGTTTGT GTATAAAGGT 451 GGCAGAGAGA GAACAGAGGA GCAGATGGAG GAGTACCGAG GAAGAACCAC 501 CTTTGTGAGC AAAGACATCA GCAGGGGCAG CGTGGCCCTG GTCATACACA 551 ACATCACAGC CCAGGAAAAC GGCACCTACC GCTGTTACTT CCAAGAAGGC 601 AGGTCCTACG ATGAGGCCAT CCTGCACCTC GTAGTGGCAG GACTAGGCTC 651 TAAGCCCCTC ATTTCAATGA GGGGCCATGA AGACGGGGGC ATCCGGCTGG 701 AGTGCATATC TAGAGGGTGG TACCCAAAGC CCCTCACAGT GTGGAGGGAC 751 CCCTACGGTG GGGTTGCGCC TGCCCTGAAA GAGGTCTCCA TGCCTGATGC 801 AGACGGCCTC TTCATGGTCA CCACGGCTGT GATCATCAGA GACAAGTCTG 851 TGAGGAACAT GTCCTGCTCT ATCAACAACA CCCTGCTCGG CCAGAAGAAA 901 GAAAGTGTCA TTTTTATTCC AGAATCCTTT ATGCCCAGTG TGTCTCCCTG 951 TGCAGTGGCC CTGCCTATCA TTGTGGTTAT TCTGATGATA CCCATTGCCG 1001 TATGCATCTA TTGGATCAAC AAACTCCAAA AGGAAAAAAA GATTCTGTCA 1051 GGGGAAAAGG AGTTTGAACG GGAAACAAGA 
GAAATTGCTC TAAAGGAACT 1101 GGAGAAAGAA CGTGTGCAAA AAGAGGAAGA ACTTCAAGTA AAAGAGAAAC 1151 TTCAAGAAGA ATTGCGATGG AGAAGAACAT TCTTACATGC TGTTGATGTG 1201 GTCCTGGATC CAGACACCGC TCATCCCGAT CTCTTCCTGT CAGAGGACCG 1251 GAGAAGTGTG AGAAGGTGCC CCTTCAGGCA CCTAGGGGAG AGCGTGCCTG 1301 ACAACCCAGA GAGATTCGAC AGTCAGCCTT GTGTCCTAGG CCGGGAGAGC 1351 TTCGCTTCAG GGAAACATTA CTGGGAGGTG GAGGTGGAAA ACGTGATTGA 1401 GTGGACTGTG GGGGTCTGTA GAGACAGTGT TGAGAGGAAA GGGGAGGTCC 1451 TGCTGATTCC TCAGAATGGC TTCTGGACCT TGGAGATGCA TAAAGGGCAA 1501 TACCGGGCCG TGTCCTCCCC TGATAGGATT CTCCCTTTGA AGGAGTCCCT 1551 TTGCCGGGTG GGCGTCTTCC TGGACTATGA AGCTGGAGAT GTCTCCTTCT 1601 ACAACATGAG GGACAGATCG CACATCTACA CATGTCCCCG TTCAGCCTTT 1651 TCCGTGCCTG TGAGGCCCTT CTTCAGGTTG GGGTGTGAGG ACAGCCCCAT 1701 CTTCATCTGC CCTGCACTCA CAGGAGCCAA TGGGGTCACG GTGCCTGAAG 1751 AGGGCCTGAC ACTTCACAGA GTGGGGACCC ACCAGAGCCT ATAGAATCAA 1801 TTCCTTGGTC TCACAGCCAT GTAGACAAGC CCTGGTCATC TCAGCAGCCA 1851 CCGCACAACA CCCCTGGTGG AAGACACGCC CTCCTCCCCT CTGGTCACAC 1901 AAGAGAACAT CTTCCAGCTG CCTCTTTCAC ACCCACTACA GACCTCAGCC 1951 CCAGTTTTCT CCTCCTCACT AGGCTGTGTT TTTAGTAGTT CCTTTGCTTG 2001 TAACTATGGG ATGGGATCCA GGCATAGGGA ACTAGTTGTT ACACAGCTCC 2051 CAGCCAAGAA GAAAGTGTGA GAAGTTGATG GGCAGCAAAC CTGCTGTTTA 2101 ACATCAGGGT GACCACATTA AGCCCAGTAT TCCAGTTGGC ACCAGAAGAT 2151 ATGGACTTGG AATGAGGCCT ACAGGGTTCA CCAGGATGTA AGAGGAGAGA 2201 GGAATCCACA GGACCACCAG AGAGGAGAGG GAACCAGATA TGCAGATCAG 2251 AGATAGAGGA AGTGGAACCA GAGAGCTGGG AGGGACCAAG GTTGTAAGGG 2301 TGGCTAAGTC CCACCATAAC AGCTAAGGGG ACCTGGGAGA TGATGGCTCA 2351 TTTCCACCCA GCCCCAGGAT TTCCAGAGCG CACATCCACA GGCCTGGACC 2401 TGGGATGAAG ATGAATGAAG AACATGGATG CACGTGGATG TAGTTTGGCT 2451 CAGGTGTCCC TGCAGTTGGC AAGGAGTCAG TACTCAGTCC CTGAGTGTGG 2501 CTGAAATTTG AGGTCCTGGC TGAGCCAAGG AGTAATGGAC CAGATCTACC 2551 TCAGTATTCA AGTTCAGTGG GGACACCAGT GGCTTCAAAC TTCCTGGTTT 2601 CATGATATCT TGAGACGCCT TACAAATGAT GGAGGATTCC AAAGAGTTTT 2651 TGTTTATTTG GGTTAATATT TGTTGGTATT TATGGCATTT GAGATTGAAA 2701 CTAAGAAATG TTTTAATTTA TTACCTTTAC AACATTTATT 
TACATTACAT 2751 ACATACATTT ACAACATTTA TTAATTTATA TTAAAATAGC ATGAATAAGC 2801 CAATTATAGG TTAATATAAG TAGAATGTTT GTGAAAAATA AGTATGGTAT 2851 CCAAAGCAAA ATAAATTTTA TTGTGAAGTG TGAAAAAAAA AAAAAAAAAA 2901 AAAA // LOCUS HUMGFB 6757 bp mRNA PRI 08-NOV-1994 DEFINITION Human basic fibroblast growth factor (bFGF) 22.5 kd, 21 kd and 18 kd protein mRNA, complete cds. ACCESSION J04513 NID g183083 VERSION J04513.1 GI:183083 KEYWORDS 18 kDa basic fibroblast growth factor; 21 kDa basic fibroblast growth factor; 22.5 kDa basic fibroblast growth factor; basic fibroblast growth factor; growth factor; heparin binding protein. SOURCE Human hepatoma SK-HEP-1 cell line, cDNA to mRNA, clone pUC-SK1. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6757) AUTHORS Prats,H., Kaghad,M., Prats,A.C., Klagsbrun,M., Lelias,J.M., Liauzun,P., Chalon,P., Tauber,J.P., Amalric,F., Smith,J.A. and Caput,D. TITLE High molecular mass forms of basic fibroblast growth factor are initiated by alternative CUG codons JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (6), 1836-1840 (1989) MEDLINE 89184522 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Caput, 03-MAR-1989. FEATURES Location/Qualifiers source 1. .6757 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q25-q27" mRNA 1. .6757 /note="bFGF mRNA" gene 302. .934 /gene="FGF2" CDS 302. .934 /gene="FGF2" /note="basic fibroblast growth factor (ctg start codon); putative" /codon_start=1 /transl_except=(pos:302. .304,aa:Met) /db_xref="GDB:G00-119-910" /protein_id="AAA52531.1" /db_xref="PID:g459811" /db_xref="GI:459811" /translation="MGDRGRGRALPGGRLGGRGRGRAPERVGGRGRGRGTAAPRAAPA ARGSRPGPAGTMAAGSITTLPALPEDGGSGAFPPGHFKDPKRLYCKNGGFFLRIHPDG RVDGVREKSDPHIKLQLQAEERGVVSIKGVCANRYLAMKEDGRLLASKCVTDECFFFE RLESNNYNTYRSRKYTSWYVALKRTGQYKLGSKTGPGQKAILFLPMSAKS" CDS 344. 
.934 /gene="FGF2" /note="21 kd basic fibroblast growth factor (ctg start codon; put.); putative" /codon_start=1 /transl_except=(pos:344. .346,aa:Met) /db_xref="GDB:G00-119-910" /protein_id="AAA52532.1" /db_xref="PID:g183084" /db_xref="GI:183084" /translation="MGGRGRGRAPERVGGRGRGRGTAAPRAAPAARGSRPGPAGTMAA GSITTLPALPEDGGSGAFPPGHFKDPKRLYCKNGGFFLRIHPDGRVDGVREKSDPHIK LQLQAEERGVVSIKGVCANRYLAMKEDGRLLASKCVTDECFFFERLESNNYNTYRSRK YTSWYVALKRTGQYKLGSKTGPGQKAILFLPMSAKS" CDS 467. .934 /gene="FGF2" /note="18 kd basic fibroblast growth factor" /codon_start=1 /db_xref="GDB:G00-119-910" /protein_id="AAA52533.1" /db_xref="PID:g183085" /db_xref="GI:183085" /translation="MAAGSITTLPALPEDGGSGAFPPGHFKDPKRLYCKNGGFFLRIH PDGRVDGVREKSDPHIKLQLQAEERGVVSIKGVCANRYLAMKEDGRLLASKCVTDECF FFERLESNNYNTYRSRKYTSWYVALKRTGQYKLGSKTGPGQKAILFLPMSAKS" BASE COUNT 2038 a 1201 c 1333 g 2185 t ORIGIN 1 CGGCCCCAGA AAACCCGAGC GAGTAGGGGG CGGCGCGCAG GAGGGAGGAG 51 AACTGGGGGC GCGGGAGGCT GGTGGGTGTC GGGGGTGGAG ATGTAGAAGA 101 TGTGACGCCG CGGCCCGGCG GGTGCCAGAT TAGCGGACGG CTGCCCGCGG 151 TTGCAACGGG ATCCCGGGCG CTGCAGCTTG GGAGGCGGCT CTCCCCAGGC 201 GGCGTCCGCG GAGACACCCA TCCGTGAACC CCAGGTCCCG GGCCGCCGGC 251 TCGCCGCGCA CCAGGGGCCG GCGGACAGAA GAGCGGCCGA GCGGCTCGAG 301 GCTGGGGGAC CGCGGGCGCG GCCGCGCGCT GCCGGGCGGG AGGCTGGGGG 351 GCCGGGGCCG GGGCCGTGCC CCGGAGCGGG TCGGAGGCCG GGGCCGGGGC 401 CGGGGGACGG CGGCTCCCCG CGCGGCTCCA GCGGCTCGGG GATCCCGGCC 451 GGGCCCCGCA GGGACCATGG CAGCCGGGAG CATCACCACG CTGCCCGCCT 501 TGCCCGAGGA TGGCGGCAGC GGCGCCTTCC CGCCCGGCCA CTTCAAGGAC 551 CCCAAGCGGC TGTACTGCAA AAACGGGGGC TTCTTCCTGC GCATCCACCC 601 CGACGGCCGA GTTGACGGGG TCCGGGAGAA GAGCGACCCT CACATCAAGC 651 TACAACTTCA AGCAGAAGAG AGAGGAGTTG TGTCTATCAA AGGAGTGTGT 701 GCTAACCGTT ACCTGGCTAT GAAGGAAGAT GGAAGATTAC TGGCTTCTAA 751 ATGTGTTACG GATGAGTGTT TCTTTTTTGA ACGATTGGAA TCTAATAACT 801 ACAATACTTA CCGGTCAAGG AAATACACCA GTTGGTATGT GGCACTGAAA 851 CGAACTGGGC AGTATAAACT TGGATCCAAA ACAGGACCTG GGCAGAAAGC 901 TATACTTTTT CTTCCAATGT CTGCTAAGAG CTGATTTTAA TGGCCACATC 951 TAATCTCATT 
TCACATGAAA GAAGAAGTAT ATTTTAGAAA TTTGTTAATG 1001 AGAGTAAAAG AAAATAAATG TGTATAGCTC AGTTTGGATA ATTGGTCAAA 1051 CAATTTTTTA TCCAGTAGTA AAATATGTAA CCATTGTCCC AGTAAAGAAA 1101 AATAACAAAA GTTGTAAAAT GTATATTCTC CCTTTTATAT TGCATCTGCT 1151 GTTACCCAGT GAAGCTTACC TAGAGCAATG ATCTTTTTCA CGCATTTGCT 1201 TTATTCGAAA AGAGGCTTTT AAAATGTGCA TGTTTAGAAA CAAAATTTCT 1251 TCATGGAAAT CATATACATT AGAAAATCAC AGTCAGATGT TTAATCAATC 1301 CAAAATGTCC ACTATTTCTT ATGTCATTCG TTAGTCTACA TGTTTCTAAA 1351 CATATAAATG TGAATTTAAT CAATTCCTTT CATAGTTTTA TAATTCTCTG 1401 GCAGTTCCTT ATGATAGAGT TTATAAAACA GTCCTGTGTA AACTGCTGGA 1451 AGTTCTTCCA CAGTCAGGTC AATTTTGTCA AACCCTTCTC TGTACCCATA 1501 CAGCAGCAGC CTAGCAACTC TGCTGGTGAT GGGAGTTGTA TTTTCAGTCT 1551 TCGCCAGGTC ATTGAGATCC ATCCACTCAC ATCTTAAGCA TTCTTCCTGG 1601 CAAAAATTTA TGGTGAATGA ATATGGCTTT AGGCGGCAGA TGATATACAT 1651 ATCTGACTTC CCAAAAGCTC CAGGATTTGT GTGCTGTTGC CGAATACTCA 1701 GGACGGACCT GAATTCTGAT TTTATACCAG TCTCTTCAAA AACTTCTCGA 1751 ACCGCTGTGT CTCCTACGTA AAAAAAGAGA TGTACAAATC AATAATAATT 1801 ACACTTTTAG AAACTGTATC ATCAAAGATT TTCAGTTAAA GTAGCATTAT 1851 GTAAAGGCTC AAAACATTAC CCTAACAAAG TAAAGTTTTC AATACAAATT 1901 CTTTGCCTTG TGGATATCAA GAAATCCCAA AATATTTTCT TACCACTGTA 1951 AATTCAAGAA GCTTTTGAAA TGCTGAATAT TTCTTTGGCT GCTACTTGGA 2001 GGCTTATCTA CCTGTACATT TTTGGGGTCA GCTCTTTTTA ACTTCTTGCT 2051 GCTCTTTTTC CCAAAAGGTA AAAATATAGA TTGAAAAGTT AAAACATTTT 2101 GCATGGCTGC AGTTCCTTTG TTTCTTGAGA TAAGATTCCA AAGAACTTAG 2151 ATTCATTTCT TCAACACCGA AATGCTGGAG GTGTTTGATC AGTTTTCAAG 2201 AAACTTGGAA TATAAATAAT TTTATAATTC AACAAAGGTT TTCACATTTT 2251 ATAAGGTTGA TTTTTCAATT AAATGCAAAT TTGTGTGGCA GGATTTTTAT 2301 TGCCATTAAC ATATTTTTGT GGCTGCTTTT TCTACACATC CAGATGGTCC 2351 CTCTAACTGG GCTTTCTCTA ATTTTGTGAT GTTCTGTCAT TGTCTCCCAA 2401 AGTATTTAGG AGAAGCCCTT TAAAAAGCTG CCTTCCTCTA CCACTTTGCT 2451 GGAAAGCTTC ACAATTGTCA CAGACAAAGA TTTTTGTTCC AATACTCGTT 2501 TTGCCTCTAT TTTTCTTGTT TGTCAAATAG TAAATGATAT TTGCCCTTGC 2551 AGTAATTCTA CTGGTGAAAA ACATGCAAAG AAGAGGAAGT CACAGAAACA 2601 TGTCTCAATT CCCATGTGCT 
GTGACTGTAG ACTGTCTTAC CATAGACTGT 2651 CTTACCCATC CCCTGGATAT GCTCTTGTTT TTTCCCTCTA ATAGCTATGG 2701 AAAGATGCAT AGAAAGAGTA TAATGTTTTA AAACATAAGG CATTCATCTG 2751 CCATTTTTCA ATTACATGCT GACTTCCCTT ACAATTGAGA TTTGCCCATA 2801 GGTTAAACAT GGTTAGAAAC AACTGAAAGC ATAAAAGAAA AATCTAGGCC 2851 GGGTGCAGTG GCTCATGCCT ATATTCCCTG CACTTTGGGA GGCCAAAGCA 2901 GGAGGATCGC TTGAGCCCAG GAGTTCAAGA CCAACCTGGT GAAACCCCGT 2951 CTCTACAAAA AAACACAAAA AATAGCCAGG CATGGTGGCG TGTACATGTG 3001 GTCTCAGATA CTTGGGAGGC TGAGGTGGGA GGGTTGATCA CTTGAGGCTG 3051 AGAGGTCAAG GTTGCAGTGA GCCATAATCG TGCCACTGCA GTCCAGCCTA 3101 GGCAACAGAG TGAGACTTTG TCTCAAAAAA AGAGAAATTT TCCTTAATAA 3151 GAAAAGTAAT TTTTACTCTG ATGTGCAATA CATTTGTTAT TAAATTTATT 3201 ATTTAAGATG GTAGCACTAG TCTTAAATTG TATAAAATAT CCCCTAACAT 3251 GTTTAAATGT CCATTTTTAT TCATTATGCT TTGAAAAATA ATTATGGGGA 3301 AATACATGTT TGTTATTAAA TTTATTATTA AAGATAGTAG CACTAGTCTT 3351 AAATTTGATA TAACATCTCC TAACTTGTTT AAATGTCCAT TTTTATTCTT 3401 TATGCTTGAA AATAAATTAT GGGGATCCTA TTTAGCTCTT AGTACCACTA 3451 ATCAAAAGTT CGGCATGTAG CTCATGATCT ATGCTGTTTC TATGTCGTGG 3501 AAGCACCGGA TGGGGGTAGT GAGCAAATCT GCCCTGCTCA GCAGTCACCA 3551 TAGCAGCTGA CTGAAAATCA GCACTGCCTG AGTAGTTTTG ATCAGTTTAA 3601 CTTGAATCAC TAACTGACTG AAAATTGAAT GGGCAAATAA GTGCTTTTGT 3651 CTCCAGAGTA TGCGGGAGAC CCTTCCACCT CAAGATGGAT ATTTCTTCCC 3701 CAAGGATTTC AAGATGAATT GAAATTTTTA ATCAAGATAG TGTGCTTTAT 3751 TCTGTTGTAT TTTTTATTAT TTTAATATAC TGTAAGCCAA ACTGAAATAA 3801 CATTTGCTGT TTTATAGGTT TGAAGAACAT AGGAAAAACT AAGAGGTTTT 3851 GTTTTTATTT TTGCTGATGA AGAGATATGT TTAAATATGT TGTATTGTTT 3901 TGTTTAGTTA CAGGACAATA ATGAAATGGA GTTTATATTT GTTATTTCTA 3951 TTTTGTTATA TTTAATAATA GAATTAGATT GAAATAAAAT ATAATGGGAA 4001 ATAATCTGCA GAATGTGGGT TTCCTGGTGT TTCCTCTGAC TCTAGTGCAC 4051 TGATGATCTC TGATAAGGCT CAGCTGCTTT ATAGTTCTCT GGCTAATGCA 4101 GCAGATACTC TTCCTGCCAG TGGTAATACG ATTTTTTAAG AAGGCAGTTT 4151 GTCAATTTTA ATCTTGTGGA TACCTTTATA CTCTTAGGGT ATTATTTTAT 4201 ACAAAAGCCT TGAGGATTGC ATTCTATTTT CTATATGACC CTCTTGATAT 4251 TTAAAAAACA CTATGGATAA CAATTCTTCA 
TTTACCTAGT ATTATGAAAG 4301 AATGAAGGAG TTCAAACAAA TGTGTTTCCC AGTTAACTAG GGTTTACTGT 4351 TTGAGCCAAT ATAAATGTTT AACTGTTTGT GATGGCAGTA TTCCTAAAGT 4401 ACATTGCATG TTTTCCTAAA TACAGAGTTT AAATAATTTC AGTAATTCTT 4451 AGATGATTCA GCTTCATCAT TAAGAATATC TTTTGTTTTA TGTTGAGTTA 4501 GAAATGCCTT CATATAGACA TAGTCTTTCA GACCTCTACT GTCAGTTTTC 4551 ATTTCTAGCT GCTTTCAGGG TTTTATGAAT TTTCAGGCAA AGCTTTAATT 4601 TATACTAAGC TTAGGAAGTA TGGCTAATGC CAACGGCAGT TTTTTTCTTC 4651 TTAATTCCAC ATGACTGAGG CATATATGAT CTCTGGGTAG GTGAGTTGTT 4701 GTGACAACCA CAAGCACTTT TTTTTTTTTT AAAGAAAAAA AGGTAGTGAA 4751 TTTTTAATCA TCTGGACTTT AAGAAGGATT CTGGAGTATA CTTAGGCCTG 4801 AAATTATATA TATTTGGCTT GGAAATGTGT TTTTCTTCAA TTACATCTAC 4851 AAGTAAGTAC AGCTGAAATT CAGAGGACCC ATAAGAGTTC ACATGAAAAA 4901 AATCAATTCA TTTGAAAAGG CAAGATGCAG GAGAGAGGAA GCCTTGCAAA 4951 CCTGCAGACT GCTTTTTGCC CAATATAGAT TGGGTAAGGC TGCAAAACAT 5001 AAGCTTAATT AGCTCACATG CTCTGCTCTC ACGTGGCACC AGTGGATAGT 5051 GTGAGAGAAT TAGGCTGTAG AACAAATGGC CTTCTCTTTC AGCATTCACA 5101 CCACTACAAA ATCATCTTTT ATATCAACAG AAGAATAAGC ATAAACTAAG 5151 CAAAAGGTCA ATAAGTACCT GAAACCAAGA TTGGCTAGAG ATATATCTTA 5201 ATGCAATCCA TTTTCTGATG GATTGTTACG AGTTGGCTAT ATAATGTATG 5251 TATGGTATTT TGATTTGTGT AAAAGTTTTA AAAATCAAGC TTTAAGTACA 5301 TGGACATTTT TAAATAAAAT ATTTAAAGAC AATTTAGAAA ATTGCCTTAA 5351 TATCATTGTT GGCTAAATAG AATAGGGGAC ATGCATATTA AGGAAAAGGT 5401 CATGGAGAAA TAATATTGGT ATCAAACAAA TACATTGATT TGTCATGATA 5451 CACATTGAAT TTGATCCAAT AGTTTAAGGA ATAGGTAGGA AAATTTGGTT 5501 TCTATTTTTC GATTTCCTGT AAATCAGTGA CATAAATAAT TCTTAGCTTA 5551 TTTTATATTT CCTTGTCTTA AATACTGAGC TCAGTAAGTT GTGTTAGGGG 5601 ATTATTTCTC AGTTGAGACT TTCTTATATG ACATTTTACT ATGTTTTGAC 5651 TTCCTGACTA TTAAAAATAA ATAGTAGAAA CAATTTTCAT AAAGTGAAGA 5701 ATTATATAAT CACTGCTTTA TAACTGACTT TATTATATTT ATTTCAAAGT 5751 TCATTTAAAG GCTACTATTC ATCCTCTGTG ATGGAATGGT CAGGAATTTG 5801 TTTTCTCATA GTTTAATTCC AACAACAATA TTAGTCGTAT CCAAAATAAC 5851 CTTTAATGCT AAACTTTACT GATGTATATC CAAAGCTTCT CCTTTTCAGA 5901 CAGATTAATC CAGAAGCAGT CATAAACAGA AGAATAGGTG 
GTATGTTCCT 5951 AATGATATTA TTTCTACTAA TGGAATAAAC TGTAATATTA GAAATTATGC 6001 TGCTAATTAT ATCAGCTCTG AGGTAATTTC TGAAATGTTC AGACTCAGTC 6051 GGAACAAATT GGAAAATTTA AATTTTTATT CTTAGCTATA AAGCAAGAAA 6101 GTAAACACAT TAATTTCCTC AACATTTTTA AGCCAATTAA AAATATAAAA 6151 GATACACACC AATATCTTCT TCAGGCTCTG ACAGGCCTCC TGGAAACTTC 6201 CACATATTTT TCAACTGCAG TATAAAGTCA GAAAATAAAG TTAACATAAC 6251 TTTCACTAAC ACACACATAT GTAGATTTCA CAAAATCCAC CTATAATTGG 6301 TCAAAGTGGT TGAGAATATA TTTTTTAGTA ATTGCATGCA AAATTTTTCT 6351 AGCTTCCATC CTTTCTCCCT CGTTTCTTCT TTTTTTGGGG GAGCTGGTAA 6401 CTGATGAAAT CTTTTCCCAC CTTTTCTCTT CAGGAAATAT AAGTGGTTTT 6451 GTTTGGTTAA CGTGATACAT TCTGTATGAA TGAAACATTG GAGGGAAACA 6501 TCTACTGAAT TTCTGTAATT TAAAATATTT TGCTGCTAGT TAACTATGAA 6551 CAGATAGAAG AATCTTACAG ATGCTGCTAT AAATAAGTAG AAAATATAAA 6601 TTTCATCACT AAAATATGCT ATTTTAAAAT CTATTTCCTA TATTGTATTT 6651 CTAATCAGAT GTATTACTCT TATTATTTCT ATTGTATGTG TTAATGATTT 6701 TATGTAAAAA TGTAATTGCT TTTCATGAGT AGTATGAATA AAATTGATTA 6751 GTTTGTG // LOCUS HUMIL10 1601 bp mRNA PRI 07-MAR-1995 DEFINITION Human interleukin 10 (IL10) mRNA, complete cds. ACCESSION M57627 NID g186270 VERSION M57627.1 GI:186270 KEYWORDS cytokine synthesis inhibitory factor; interleukin 10. SOURCE Human T cell line B21, cDNA to mRNA, clone H15C. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1601) AUTHORS Vieira,P., de Waal-Malefyt,R., Dang,M.N., Johnson,K.E., Kastelein,R., Fiorentino,D.F., deVries,J.E., Roncarolo,M.G., Mosmann,T.R. and Moore,K.W. TITLE Isolation and expression of human cytokine synthesis inhibitory factor cDNA clones: homology to Epstein-Barr virus open reading frame BCRFI JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (4), 1172-1176 (1991) MEDLINE 91142134 FEATURES Location/Qualifiers source 1. .1601 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H15C" /cell_line="B21" /cell_type="T cell" /map="Unassigned" sig_peptide 31. 
.84 /gene="IL10" /note="G00-128-636" CDS 31. .567 /gene="IL10" /codon_start=1 /db_xref="GDB:G00-128-636" /evidence=experimental /product="interleukin 10" /protein_id="AAA63207.1" /db_xref="PID:g186271" /db_xref="GI:186271" /translation="MHSSALLCCLVLLTGVRASPGQGTQSENSCTHFPGNLPNMLRDL RDAFSRVKTFFQMKDQLDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQAENQD PDIKAHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQVKNAFNKLQEKGIYKAMSE FDIFINYIEAYMTMKIRN" gene 31. .567 /gene="IL10" mat_peptide 85. .564 /gene="IL10" /note="G00-128-636" /product="interleukin 10" BASE COUNT 445 a 368 c 356 g 432 t ORIGIN 1 AAACCACAAG ACAGACTTGC AAAAGAAGGC ATGCACAGCT CAGCACTGCT 51 CTGTTGCCTG GTCCTCCTGA CTGGGGTGAG GGCCAGCCCA GGCCAGGGCA 101 CCCAGTCTGA GAACAGCTGC ACCCACTTCC CAGGCAACCT GCCTAACATG 151 CTTCGAGATC TCCGAGATGC CTTCAGCAGA GTGAAGACTT TCTTTCAAAT 201 GAAGGATCAG CTGGACAACT TGTTGTTAAA GGAGTCCTTG CTGGAGGACT 251 TTAAGGGTTA CCTGGGTTGC CAAGCCTTGT CTGAGATGAT CCAGTTTTAC 301 CTGGAGGAGG TGATGCCCCA AGCTGAGAAC CAAGACCCAG ACATCAAGGC 351 GCATGTGAAC TCCCTGGGGG AGAACCTGAA GACCCTCAGG CTGAGGCTAC 401 GGCGCTGTCA TCGATTTCTT CCCTGTGAAA ACAAGAGCAA GGCCGTGGAG 451 CAGGTGAAGA ATGCCTTTAA TAAGCTCCAA GAGAAAGGCA TCTACAAAGC 501 CATGAGTGAG TTTGACATCT TCATCAACTA CATAGAAGCC TACATGACAA 551 TGAAGATACG AAACTGAGAC ATCAGGGTGG CGACTCTATA GACTCTAGGA 601 CATAAATTAG AGGTCTCCAA AATCGGATCT GGGGCTCTGG GATAGCTGAC 651 CCAGCCCCTT GAGAAACCTT ATTGTACCTC TCTTATAGAA TATTTATTAC 701 CTCTGATACC TCAACCCCCA TTTCTATTTA TTTACTGAGC TTCTCTGTGA 751 ACGATTTAGA AAGAAGCCCA ATATTATAAT TTTTTTCAAT ATTTATTATT 801 TTCACCTGTT TTTAAGCTGT TTCCATAGGG TGACACACTA TGGTATTTGA 851 GTGTTTTAAG ATAAATTATA AGTTACATAA GGGAGGAAAA AAAATGTTCT 901 TTGGGGAGCC AACAGAAGCT TCCATTCCAA GCCTGACCAC GCTTTCTAGC 951 TGTTGAGCTG TTTTCCCTGA CCTCCCTCTA ATTTATCTTG TCTCTGGGCT 1001 TGGGGCTTCC TAACTGCTAC AAATACTCTT AGGAAGAGAA ACCAGGGAGC 1051 CCCTTTGATG ATTAATTCAC CTTCCAGTGT CTCGGAGGGA TTCCCCTAAC 1101 CTCATTCCCC AACCACTTCA TTCTTGAAAG CTGTGGCCAG CTTGTTATTT 1151 ATAACAACCT AAATTTGGTT CTAGGCCGGG CGCGGTGGCT CACGCCTGTA 1201 
ATCCCAGCAC TTTGGGAGGC TGAGGCGGGT GGATCACTTG AGGTCAGGAG 1251 TTCCTAACCA GCCTGGTCAA CATGGTGAAA CCCCGTCTCT ACTAAAAATA 1301 CAAAAATTAG CCGGGCATGG TGGCGCGCAC CTGTAATCCC AGCTACTTGG 1351 GAGGCTGAGG CAAGAGAATT GCTTGAACCC AGGAGATGGA AGTTGCAGTG 1401 AGCTGATATC ATGCCCCTGT ACTCCAGCCT GGGTGACAGA GCAAGACTCT 1451 GTCTCAAAAA AATAAAAATA AAAATAAATT TGGTTCTAAT AGAACTCAGT 1501 TTTAACTAGA ATTTATTCAA TTCCTCTGGG AATGTTACAT TGTTTGTCTG 1551 TCTTCATAGC AGATTTTAAT TTTGAATAAA TAAATGTATC TTATTCACAT 1601 C // LOCUS HSTRE213 8201 bp mRNA PRI 22-DEC-1993 DEFINITION H.sapiens mRNA for tre oncogene (clone 213). ACCESSION X63547 NID g37332 VERSION X63547.1 GI:37332 KEYWORDS oncogene; transforming capacity; tre gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8201) AUTHORS Hillova,J. TITLE Direct Submission JOURNAL Submitted (16-DEC-1991) J. Hillova, C.N.R.S. - SDI I6204, I.C.I.G., Dept of Molecular and Cellular Biology, 14, avenue Paul-Vaillant Couturier, 94804 Villejuif Cedex, FRANCE REFERENCE 2 (bases 1 to 8201) AUTHORS Nakamura,T., Hillova,J., Mariage-Samson,R., Onno,M., Huebner,K., Cannizzaro,L.A., Boghosian-Sell,L., Croce,C.M. and Hill,M. TITLE A novel transcriptional unit of the tre oncogene widely expressed in human cancer cells JOURNAL Oncogene 7 (4), 733-741 (1992) MEDLINE 92228503 COMMENT See also X63547, X63596 & X71366-79 The tre-2 genetic element identified in tre-transfectants was renamed oncRTE17 because of its origin from the repetoire of hypervariable TRE17 genes (see X63586). FEATURES Location/Qualifiers source 1. .8201 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17q" /tissue_type="Ewings' sarcoma" /cell_line="transfected NIH 3T3 cell" /clone="213" 5'UTR 1. .1696 gene 1697. .6128 /gene="tre" CDS 1697. 
.2827 /gene="tre" /codon_start=1 /product="oncogene" /protein_id="CAA45110.1" /db_xref="PID:g37333" /db_xref="GI:37333" /db_xref="SPTREMBL:Q15635" /translation="MDMVENADSLQAQERKDILMKYDKGHRAGLPEDKGPEPVGINSS IDRFGILHETELPPVTAREAKKIRREMTRTSKWMEMLGEWETYKHSSKLIDRVYKGIP MNIRGPVWSVLLNIQEIKLKNPGRYQIMKERGKRSSEHIHHIDLDVRTTLRNHVFFRD RYGAKQRELFYILLAYSEYNPEVGYCRDLSHITALFLLYLPEEDAFWALVQLLASERH SLPGFHSPNGGTVQGLQDQQEHVVPKSQPKTMWHQDKEGLCGQCASLGCLLRNLIDGI SLGLTLRLWDVYLVEGEQVLMPITSIALKVQQKRLMKTSRCGLWARLRNQFFDTWAMN DDTVLKHLRASTKKLTRKQGDLPPPGPTALGRRCVAGSPQPV" CDS 2859. .6128 /gene="tre" /codon_start=1 /product="oncogene" /protein_id="CAA45111.1" /db_xref="PID:g37334" /db_xref="GI:37334" /db_xref="SWISS-PROT:P35125" /translation="MPQRLPHARQHTPLPLGSADYRRVVSVRPQGPHRDPKDSRDAAK REQGSLAPRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRFSTPCPGGA VREDTYPVGTQGVPSLALAQGGPQGSWRFLEWKSMPRLPTDLDIGGPWFPHYDFERSC WVRAISQEDQLATCWQAEHCGEVHNKDMSWPEEMSFTANSSKIDRQKVPTEKGATGLS NLGNTCFMNSSIQCVSNTQPLTQYFISGRHLYELNRTNPIGMKGHMAKCYGDLVQELW SGTQKSVAPLKLRRTIAKYAPKFDGFQQQDSQELLAFLLDGLHEDLNRVHEKPYVELK DSDGRPDWEVAAEAWDNHLRRNRSIIVDLFHGQLRSQVKCKTCGHISVRFDPFNFLSL PLPMDSYMDLEITVIKLDGTTPVRYGLRLNMDEKYTGLKKQLRDLCGLNSEQILLAEV HDSNIKNFPQDNQKVQLSVSGFLCAFEIPVPSSPISASSPTQIDFSSSPSTNGMFTLT TNGDLPKPIFIPNGMPNTVVPCGTEKNFTNGMVNGHMPSLPDSPFTGYIIAVHRKMMR TELYFLSPQENRPSLFGMPLIVPCTVHTQKKDLYDAVWIQVSWLARPLPPQEASIHAQ DRDNCMGYQYPFTLRVVQKDGISCAWCPQYRFCRGCKIDCGEDRAFIGNAYIAVDWHP TALHLRYQTSQERVVDKHESVEQSRRAQAEPINLDSCLRAFTSEEELGESEMYYCSKC KTHCLATKKLDLWRLPPFLIIHLKRFQFVNDQWIKSQKIVRFLRESFDPSAFLVPRDP ALCQHKPLTPQGDELSKPRILAREVKKVDAQSSAGKEDMLLSKSPSSLSANISSSPKG SPSSSRKSGTSCPSSKNSSPNSSPRTLGRSKGRLRLPQIGSKNKPSSSKKNLDASKEN GAGQICELADALSRGHMRGGSQPELVTPQDHEVALANGFLYEHEACGNGCGDGYSNGQ LGNHSEEDSTDDQREDTHIKPIYNLYAISCHSGILSGGHYITYAKNPNCKWYCYNDSS CEELHPDEIDTDSAYILFYEQQGIDYAQFLPKIDGKKMADTSSTDEDSESDYEKYSML Q" 3'UTR 6129. 
.8201 polyA_site 8179 BASE COUNT 2258 a 1887 c 2003 g 2053 t ORIGIN 1 CTAAAAATAC CATTAAGTAA TAGTATTAGC TTTTGTATTC TGAGATTCAA 51 CAGCAGCAGT CACTTCCCTC CACTCCTATG TGTATCCCAG GACCACCCTG 101 GGCGGGGAGG GCTGAGGTCA GGGAGGTCTG AAGCTGGTCC TGGGCTCCGG 151 GGGTGACAGT GATGAGGAAC TGGGTGCACA CATGAGTGGG GCAGCCGGGC 201 CTGGCCAGAG AAGCAACACA CACGTGCACA GACATGTTTA TCCACATACA 251 CATGTGCACG CATGTGCACA AACACATTGC AGGCAGGCAT GTTGACGCCT 301 CAGGCAGCGG AGGACCCTGA CTCTGGGCCC TGCTGACCCG GGCAAGGCCC 351 ATTGTGATGC GTGCCATGAC CTCAGAATGT CACTGGTGCT TAGCACCTAT 401 CCGCTCTCCA GACTGCGTCT GTGTTCTACG GCAGTTACAC ACACGCAGTG 451 GTATTCACAA GCGGTTTTGT GGACTCAAAG GTTTTCTCCC TGAGAGGCAT 501 AACCCAGGCC AGCTGATTCA TCAGAATCAG GTGAGTGTGA CCTGCTCTCT 551 TCCCTCCAGG CTGACTTGGG GACAGTGGCT ATGGTATGGG CGGTGTTGGC 601 CTCTGGGCAG CTACAGAGGA GGGTCATCCC TGAGCACTCA CCGGGCGCCC 651 GTTCTACACT GCCCATGTAG ACGATTTTCT CTTTCGTCTT CATGGTGGCT 701 TCGTAGAGTG GGTGCTGTTC CCAAATGTAC CCATTCGACA GGTGAGCCGT 751 CTGGGGTCAG AGAGGCAGTA ACTGGCCTGG GAATCCAGAC AAGACCCTGG 801 GTTTTGCTCT CAGCCCTGCT GTGTGCCATG CTAGACTTCA GGCCTCAACC 851 CTGAGACCTC CCTGCTCTAG ATCCCAAATC TGCCCAGATT TCCGATCCAA 901 TGGGCAGAGC CTGGCCCTGG CAGAGACACT GGGATGGATC CACTGTGGGT 951 GGGGAGGAGG GAAGGGTCCT CAGAACACAC CTGGGGCCTA AGCTGGGTCT 1001 TGATGGTCAC TGTGGGACCC ACTGGACACA CACAGTCCCT TGTCTGGGAG 1051 TGGCATGGGG AGCCTTCTGC CCTTGGGCAG TTGTGGAAAG TGAAGGAGCC 1101 CTGGAGAGCT GGCTGAGGGG AGACTATCTT CCCTTGTGTT CAAAGGGGTC 1151 CAGGCACTGG GGCTCTCCCC AAGTATTTCT TATTCTGTCT GGCCTCGCTT 1201 TCCTTTTGCC CTGAGTATTC TCAGGAGGGA CGGTCCATCT AGATGTCCTC 1251 CAGGAGCAAG GACCCACTGT TCTTCATCAG TGACCCAGGA AAATGAAGCC 1301 CCCTCCTGTG GGGACAGCTC AGAATGGTGG AGTCCACAGT CCCTCCCTGA 1351 GAGACATGGT TTCCATGAGC ACAGTGGCTG CTTTGGAGAC AGTAATCATT 1401 TTCATCCCCA AAACCAAACA CACTCCTGCT CAAATGGTGT TATTGCTAAA 1451 GCAGCTTCAC TGGTTAGACT GAAGGGCCAT GGTAGCCCAA GTGATGAGCG 1501 GGGTAGAATG GAGCAGTCAG GAGAGATCTT GTTCCCCGTA GGAAACTGGG 1551 CATCTCTGTG GCCCTGAACA TCCCAGGAGG CCGATCGTAC AGAGACCTCT 1601 GGTGCCTGAC CGCAGTTCAC 
ATCCACATCC CTGGAATAGA CCATCACAGG 1651 CTCTTCACCC TTGGCAGGTG GACACCATTC AACCTGCCGG GGCAGGATGG 1701 ACATGGTAGA GAATGCAGAT AGTTTGCAGG CACAGGAGCG GAAGGACATA 1751 CTTATGAAGT ATGACAAGGG ACACCGAGCT GGGCTGCCAG AGGACAAGGG 1801 GCCTGAGCCC GTTGGAATCA ACAGCAGCAT TGATCGTTTT GGCATTTTGC 1851 ATGAGACGGA GCTGCCTCCT GTGACTGCAC GGGAGGCGAA GAAAATTCGG 1901 CGGGAGATGA CACGAACGAG CAAGTGGATG GAAATGCTGG GAGAATGGGA 1951 GACATATAAG CACAGTAGCA AACTCATAGA TCGAGTGTAC AAGGGAATTC 2001 CCATGAACAT CCGGGGCCCG GTGTGGTCAG TCCTCCTGAA CATTCAGGAA 2051 ATCAAGTTGA AAAACCCCGG AAGATACCAG ATCATGAAGG AGAGGGGCAA 2101 GAGGTCATCT GAACACATCC ACCACATCGA CCTGGACGTG AGGACGACTC 2151 TCCGGAACCA TGTCTTCTTT AGGGATCGAT ATGGAGCCAA GCAGAGGGAA 2201 CTATTCTACA TCCTCCTGGC CTATTCGGAG TATAACCCGG AGGTGGGCTA 2251 CTGCAGGGAC CTGAGCCACA TCACCGCCTT GTTCCTCCTT TATCTGCCTG 2301 AGGAGGACGC ATTCTGGGCA CTGGTGCAGC TGCTGGCCAG TGAGAGGCAC 2351 TCCCTGCCAG GATTCCACAG CCCAAATGGT GGGACAGTCC AGGGGCTCCA 2401 AGACCAACAG GAGCATGTGG TACCCAAGTC ACAACCCAAG ACCATGTGGC 2451 ATCAGGACAA GGAAGGTCTA TGCGGGCAGT GTGCCTCGTT AGGCTGCCTT 2501 CTCCGGAACC TGATTGACGG GATCTCTCTC GGGCTCACCC TGCGCCTGTG 2551 GGACGTGTAT TTGGTGGAAG GAGAACAGGT GTTGATGCCA ATAACCAGCA 2601 TTGCTCTTAA GGTTCAGCAG AAGCGCCTCA TGAAGACATC CAGGTGTGGC 2651 CTGTGGGCAC GTCTGCGGAA CCAATTCTTC GATACCTGGG CCATGAACGA 2701 TGACACCGTG CTCAAGCATC TTAGGGCCTC TACGAAGAAA CTAACAAGGA 2751 AGCAAGGGGA CCTGCCACCC CCAGGCCCAA CAGCCCTGGG ACGAAGGTGT 2801 GTGGCAGGAA GCCCCCAGCC AGTCTGAACC CTGGGGGCAG TCCCAGGAGC 2851 CACCCACCAT GCCCCAACGG CTTCCCCATG CCAGGCAGCA CACACCCCTC 2901 CCTCTGGGAT CAGCAGACTA CAGGCGTGTC GTCAGTGTCA GACCACAGGG 2951 GCCACACAGA GACCCCAAGG ACTCCAGAGA TGCAGCCAAA CGCGAGCAAG 3001 GGTCCTTGGC ACCCAGGCCT GTGCCGGCTT CACGTGGTGG GAAGACCCTC 3051 TGCAAGGGGT ATAGGCAGGC CCCTCCAGGC CCACCAGCCC AGTTCCAGCG 3101 GCCCATTTGC TCAGCTTCCC CGCCATGGGC ATCTCGTTTT TCCACGCCCT 3151 GTCCTGGTGG GGCTGTCCGG GAAGACACGT ACCCTGTGGG CACTCAGGGT 3201 GTGCCCAGCC TGGCCCTGGC TCAGGGAGGA CCTCAGGGTT CCTGGAGATT 3251 CCTGGAGTGG AAGTCAATGC CCCGGCTCCC 
AACGGACCTG GATATAGGGG 3301 GCCCTTGGTT CCCCCATTAT GATTTTGAAC GGAGCTGCTG GGTCCGTGCC 3351 ATATCCCAGG AGGACCAGCT GGCCACCTGC TGGCAGGCTG AACACTGCGG 3401 AGAGGTTCAC AACAAAGATA TGAGTTGGCC TGAGGAGATG TCTTTTACAG 3451 CAAATAGTAG TAAAATAGAT AGACAAAAGG TTCCCACAGA AAAGGGAGCC 3501 ACAGGTCTAA GCAACCTGGG AAACACATGC TTCATGAACT CAAGCATCCA 3551 GTGCGTTAGT AACACACAGC CACTGACACA GTATTTTATC TCAGGGAGAC 3601 ATCTTTATGA ACTCAACAGG ACAAATCCCA TTGGTATGAA GGGGCATATG 3651 GCTAAATGCT ATGGTGATTT AGTGCAGGAA CTCTGGAGTG GAACTCAGAA 3701 GAGTGTTGCC CCATTAAAGC TTCGGCGGAC CATAGCAAAA TATGCTCCCA 3751 AGTTTGATGG GTTTCAGCAA CAAGACTCCC AAGAACTTCT GGCTTTTCTC 3801 TTGGATGGTC TTCATGAAGA TCTCAACCGA GTCCATGAAA AGCCATATGT 3851 GGAACTGAAG GACAGTGATG GCCGACCAGA CTGGGAAGTA GCTGCAGAGG 3901 CCTGGGACAA CCATCTAAGA AGAAATAGAT CAATTATTGT GGATTTGTTC 3951 CATGGGCAGC TAAGATCTCA AGTCAAATGC AAGACATGTG GGCATATAAG 4001 TGTCCGATTT GACCCTTTCA ATTTTTTGTC TTTGCCACTA CCAATGGACA 4051 GTTACATGGA CTTAGAAATA ACAGTGATTA AGTTAGATGG TACTACCCCT 4101 GTACGGTATG GACTAAGACT GAATATGGAT GAAAAGTACA CAGGTTTAAA 4151 AAAACAGCTG AGGGATCTCT GTGGACTTAA TTCAGAACAA ATCCTACTAG 4201 CAGAAGTACA TGATTCCAAC ATAAAGAACT TTCCTCAGGA TAACCAAAAA 4251 GTACAACTCT CAGTGAGCGG ATTTTTGTGT GCATTTGAAA TTCCTGTCCC 4301 TTCATCTCCA ATTTCAGCTT CTAGTCCAAC ACAAATAGAT TTCTCCTCTT 4351 CACCATCTAC AAATGGAATG TTCACCCTAA CTACCAATGG GGACCTACCC 4401 AAACCAATAT TCATCCCCAA TGGAATGCCA AACACTGTTG TGCCATGTGG 4451 AACTGAGAAG AACTTCACAA ATGGAATGGT TAATGGTCAC ATGCCATCTC 4501 TTCCTGACAG CCCCTTTACA GGTTACATCA TTGCAGTCCA CCGAAAAATG 4551 ATGAGGACAG AACTGTATTT CCTGTCACCT CAGGAGAATC GCCCCAGCCT 4601 CTTTGGAATG CCATTGATTG TTCCATGCAC TGTGCATACC CAGAAGAAAG 4651 ACCTATATGA TGCGGTTTGG ATTCAAGTAT CCTGGTTAGC AAGACCACTC 4701 CCACCTCAGG AAGCTAGTAT TCATGCCCAG GATCGTGATA ACTGTATGGG 4751 CTATCAATAT CCATTCACTC TACGAGTTGT GCAGAAAGAT GGGATCTCCT 4801 GTGCTTGGTG CCCACAGTAT AGATTTTGCA GAGGCTGTAA AATTGATTGT 4851 GGGGAAGACA GAGCTTTCAT TGGAAATGCC TATATTGCTG TGGATTGGCA 4901 CCCCACAGCC CTTCACCTTC GCTATCAAAC ATCCCAGGAA 
AGGGTTGTAG 4951 ATAAGCATGA GAGTGTGGAG CAGAGTCGGC GAGCGCAAGC CGAGCCCATC 5001 AACCTGGACA GCTGTCTCCG TGCTTTCACC AGTGAGGAAG AGCTAGGGGA 5051 AAGTGAGATG TACTACTGTT CCAAGTGTAA GACCCACTGC TTAGCAACAA 5101 AGAAGCTGGA TCTCTGGAGG CTTCCACCCT TCCTGATTAT TCACCTTAAG 5151 CGATTTCAAT TTGTAAATGA TCAGTGGATA AAATCACAGA AAATTGTCAG 5201 ATTTCTTCGG GAAAGTTTTG ATCCGAGTGC TTTTTTGGTA CCACGAGACC 5251 CGGCCCTCTG CCAGCATAAA CCACTCACAC CCCAGGGGGA TGAGCTCTCC 5301 AAGCCCAGGA TTCTGGCAAG AGAGGTGAAG AAAGTGGATG CGCAGAGTTC 5351 GGCTGGAAAA GAGGACATGC TCCTAAGCAA AAGCCCATCT TCACTCAGCG 5401 CTAACATCAG CAGCAGCCCA AAAGGTTCTC CTTCTTCATC AAGAAAAAGT 5451 GGAACCAGCT GTCCCTCCAG CAAAAACAGC AGCCCTAATA GCAGCCCACG 5501 GACTTTGGGG AGGAGCAAAG GGAGGCTCCG GCTGCCCCAG ATTGGCAGCA 5551 AAAATAAGCC GTCAAGTAGT AAGAAGAACT TGGATGCCAG CAAAGAGAAT 5601 GGGGCTGGGC AGATCTGTGA GCTGGCTGAC GCCTTGAGCC GAGGGCATAT 5651 GCGGGGGGGC AGCCAACCAG AGCTGGTCAC TCCTCAGGAC CATGAGGTAG 5701 CTTTGGCCAA TGGATTCCTT TATGAGCATG AAGCATGTGG CAATGGCTGT 5751 GGCGATGGCT ACAGCAATGG TCAGCTTGGA AACCACAGTG AAGAAGACAG 5801 CACTGATGAC CAAAGAGAAG ACACTCATAT TAAGCCTATT TATAATCTAT 5851 ATGCAATTTC ATGCCATTCA GGAATTCTGA GTGGGGGCCA TTACATCACT 5901 TATGCCAAAA ACCCAAACTG CAAGTGGTAC TGTTATAATG ACAGCAGCTG 5951 TGAGGAACTT CACCCTGATG AAATTGACAC CGACTCTGCC TACATTCTTT 6001 TCTATGAGCA GCAGGGGATA GACTACGCAC AATTTCTGCC AAAGATTGAT 6051 GGCAAAAAGA TGGCAGACAC AAGCAGTACG GATGAAGACT CTGAGTCTGA 6101 TTACGAAAAG TACTCTATGT TACAGTAAAG CTACCACTCT GGCTGCTAGA 6151 CAGCTTGGTG GCGAGGGAGA TGACTCCTTG TAGCTGATAC TTGGCAAAAG 6201 TGTCACTGAA AGACAAGCTA AATGTAGTTA TTTTATCCTG TTAGAACAAA 6251 AATTCTAATT AAAATAGTTA ACTTGAAGAG TAGAAACAAT TGTATTTTGA 6301 AGTCTCATAC AAGCTGTCTG ATAGAGAACT TTCAGGCAGA TCCCACCATT 6351 AGCCTGTAAA CAAAAGGTGT GGCACCAGCC ACCTGGGACC AAATAAGAAT 6401 TGAATTGTGC TTGTCCAGAT ATGAACAAAT ATGTAGTGAG TATAGAGTTT 6451 ACCAATAATC ATAACAAATA TTAAAGATTT CCTTGGAGTC AGAGGAAAAA 6501 ACAAACAATT ATAATGTTGT CTAGGGACGA CATGATACGC TACCTCCTTT 6551 TTCCTGAAGT TTTATTCCAT TATATTGACA AGATGGAGAA AGCAAGATCA 6601 
TGAAGGTGTG CAAATGATTC TTACGGCATG GACAAGGATT TTTCAATTTA 6651 TTTTTTAAAC TGTTTCCATA CCCTTTCTTT TTCTTGCTTT TTGTTTTTGC 6701 CATTGTGTTT ACGTTTGAGA CACAACCAGT CATTGGTGGC AGGGGCATAG 6751 AGTGGTCAGT CTGAAAGGGA GGCTCTCTTA AGAGCTATGT GCCTTCCAAC 6801 CAGAGGGAGA CCCAGTAGAA AGAAAAACAT CCTGGGAAAT CCAGCTACCA 6851 GGGCCCTCCC AGTGGAGGCA TCTTACATTT AGGCTACTTC AAGTATCCTC 6901 AGAAATGTAT TCTGCACCCC CGGCCCCGCC CATGCTGAGG GAAGGGGAGC 6951 AGTTGCCAAT ATTTGCACCA TCTTCACATG CACATGTTGC AACAAGAGCT 7001 TCTGGGAAGG TAAGCGGCAT CGGAGCTAGA TC