LOCUS HUMBCL2C 6030 bp mRNA PRI 27-APR-1993 DEFINITION Human bcl-2 mRNA. ACCESSION M14745 NID g179370 VERSION M14745.1 GI:179370 KEYWORDS c-myc proto-oncogene. SOURCE Human common acute lymphoblastic leukemia cell line cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6030) AUTHORS Cleary,M.L., Smith,S.D. and Sklar,J. TITLE Cloning and structural analysis of cDNAs for bcl-2 and a hybrid bcl-2/immunoglobulin transcript resulting from the t(14;18) translocation JOURNAL Cell 47, 19-28 (1986) MEDLINE 87002488 FEATURES Location/Qualifiers source 1. .6030 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 32. .751 /note="bcl-2 protein" /codon_start=1 /protein_id="AAA35591.1" /db_xref="PID:g179371" /db_xref="GI:179371" /translation="MAHAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAP APGIFSSQPGHTPHTAASRDPVARTSPLQTPAAPGAAAGPALSPVPPVVHLTLRQAGD DFSRRYRRDFAEMSRQLHLTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVE SVNREMSPLVDNIALWMTEYLNRHLHTWIQDNGGWDAFVELYGPSMRPLFDFSWLSLK TLLSLALVGACITLGAYLGHK" BASE COUNT 1668 a 1247 c 1386 g 1729 t ORIGIN 1 GTTGGCCCCC GTTACTTTTC CTCTGGGAAA TATGGCGCAC GCTGGGAGAA 51 CAGGGTACGA TAACCGGGAG ATAGTGATGA AGTACATCCA TTATAAGCTG 101 TCGCAGAGGG GCTACGAGTG GGATGCGGGA GATGTGGGCG CCGCGCCCCC 151 GGGGGCCGCC CCCGCGCCGG GCATCTTCTC CTCGCAGCCC GGGCACACGC 201 CCCATACAGC CGCATCCCGG GACCCGGTCG CCAGGACCTC GCCGCTGCAG 251 ACCCCGGCTG CCCCCGGCGC CGCCGCGGGG CCTGCGCTCA GCCCGGTGCC 301 ACCTGTGGTC CACCTGACCC TCCGCCAGGC CGGCGACGAC TTCTCCCGCC 351 GCTACCGCCG CGACTTCGCC GAGATGTCCA GGCAGCTGCA CCTGACGCCC 401 TTCACCGCGC GGGGACGCTT TGCCACGGTG GTGGAGGAGC TCTTCAGGGA 451 CGGGGTGAAC TGGGGGAGGA TTGTGGCCTT CTTTGAGTTC GGTGGGGTCA 501 TGTGTGTGGA GAGCGTCAAC CGGGAGATGT CGCCCCTGGT GGACAACATC 551 GCCCTGTGGA TGACTGAGTA CCTGAACCGG CACCTGCACA CCTGGATCCA 601 GGATAACGGA GGCTGGGATG CCTTTGTGGA ACTGTACGGC CCCAGCATGC 651 GGCCTCTGTT TGATTTCTCC TGGCTGTCTC TGAAGACTCT GCTCAGTTTG 701 GCCCTGGTGG GAGCTTGCAT CACCCTGGGT GCCTATCTGG GCCACAAGTG 751 AAGTCAACAT GCCTGCCCCA AACAAATATG CAAAAGGTTC ACTAAAGCAG 801 TAGAAATAAT ATGCATTGTC AGTGATGTTC CATGAAACAA AGCTGCAGGC 851 TGTTTAAGAA AAAATAACAC ACATATAAAC ATCACACACA CAGACAGACA 901 CACACACACA CAACAATTAA CAGTCTTCAG GCAAAACGTC GAATCAGCTA 951 TTTACTGCCA AAGGGAAATA TCATTTATTT TTTACATTAT TAAGAAAAAA 1001 AGATTTATTT ATTTAAGACA GTCCCATCAA AACTCCTGTC TTTGGAAATC 1051 CGACCACTAA TTGCCAAGCA CCGCTTCGTG TGGCTCCACC TGGATGTTCT 1101 GTGCCTGTAA ACATAGATTC GCTTTCCATG TTGTTGGCCG GATCACCATC 1151 TGAAGAGCAG ACGGATGGAA AAAGGACCTG ATCATTGGGG AAGCTGGCTT 1201 TCTGGCTGCT GGAGGCTGGG GAGAAGGTGT TCATTCACTT GCATTTCTTT 1251 GCCCTGGGGG CTGTGATATT AACAGAGGGA GGGTTCCTGT GGGGGGAAGT 1301 CCATGCCTCC CTGGCCTGAA GAAGAGACTC TTTGCATATG ACTCACATGA 1351 TGCATACCTG GTGGGAGGAA AAGAGTTGGG AACTTCAGAT GGACCTAGTA 1401 CCCACTGAGA TTTCCACGCC GAAGGACAGC GATGGGAAAA ATGCCCTTAA 1451 ATCATAGGAA AGTATTTTTT TAAGCTACCA ATTGTGCCGA GAAAAGCATT 1501 TTAGCAATTT ATACAATATC ATCCAGTACC TTAAGCCCTG ATTGTGTATA 1551 TTCATATATT TTGGATACGC ACCCCCCAAC TCCCAATACT GGCTCTGTCT 1601 GAGTAAGAAA CAGAATCCTC TGGAACTTGA GGAAGTGAAC ATTTCGGTGA 1651 CTTCCGCATC AGGAAGGCTA GAGTTACCCA GAGCATCAGG CCGCCACAAG 1701 TGCCTGCTTT TAGGAGACCG AAGTCCGCAG AACCTGCCTG TGTCCCAGCT 1751 TGGAGGCCTG GTCCTGGAAC TGAGCCGGGG CCCTCACTGG CCTCCTCCAG 1801 GGATGATCAA CAGGGCAGTG TGGTCTCCGA ATGTCTGGAA GCTGATGGAG 1851 CTCAGAATTC CACTGTCAAG AAAGAGCAGT AGAGGGGTGT GGCTGGGCCT 1901 GTCACCCTGG GGCCCTCCAG GTAGGCCCGT TTTCACGTGG AGCATGGGAG 1951 CCACGACCCT TCTTAAGACA TGTATCACTG TAGAGGGAAG GAACAGAGGC 2001 CCTGGGCCCT TCCTATCAGA AGGACATGGT GAAGGCTGGG AACGTGAGGA 2051 GAGGCAATGG CCACGGCCCA TTTTGGCTGT AGCACATGGC ACGTTGGCTG 2101 TGTGGCCTTG GCCCACCTGT GAGTTTAAAG CAAGGCTTTA AATGACTTTG 2151 GAGAGGGTCA CAAATCCTAA AAGAAGCATT GAAGTGAGGT GTCATGGATT 2201 AATTGACCCC TGTCTATGGA ATTACATGTA AAACATTATC TTGTCACTGT 2251 AGTTTGGTTT TATTTGAAAA CCTGACAAAA AAAAAGTTCC AGGTGTGGAA 2301 TATGGGGGTT ATCTGTACAT CCTGGGGCAT TAAAAAAAAA ATCAATGGTG 2351 GGGAACTATA AAGAAGTAAC AAAAGAAGTG ACATCTTCAG CAAATAAACT 2401 AGGAAATTTT TTTTTCTTCC AGTTTAGAAT CAGCCTTGAA ACATTGATGG 2451 AATAACTCTG TGGCATTATT GCATTATATA CCATTTATCT GTATTAACTT 2501 TGGAATGTAC TCTGTTCAAT GTTTAATGCT GTGGTTGATA TTTCGAAAGC 2551 TGCTTTAAAA AAATACATGC ATCTCAGCGT TTTTTTGTTT TTAATTGTAT 2601 TTAGTTATGG CCTATACACT ATTTGTGAGC AAAGGTGATC GTTTTCTGTT 2651 TGAGATTTTT ATCTCTTGAT TCTTCAAAAG CATTCTGAGA AGGTGAGATA 2701 AGCCCTGAGT CTCAGCTACC TAAGAAAAAC CTGGATGTCA CTGGCCACTG 2751 AGGAGCTTTG TTTCAACCAA GTCATGTGCA TTTCCACGTC AACAGAATTG 2801 TTTATTGTGA CAGTTATATC TGTTGTCCCT TTGACCTTGT TTCTTGAAGG 2851 TTTCCTCGTC CCTGGGCAAT TCCGCATTTA ATTCATGGTA TTCAGGATTA 2901 CATGCATGTT TGGTTAAACC CATGAGATTC ATTCAGTTAA AAATCCAGAT 2951 GGCAAATGAC CAGCAGATTC AAATCTATGG TGGTTTGACC TTTAGAGAGT 3001 TGCTTTACGT GGCCTGTTTC AACACAGACC CACCCAGAGC CCTCCTGCCC 3051 TCCTTCCGCG GGGGCTTTCT CATGGCTGTC CTTCAGGGTC TTCCTGAAAT 3101 GCAGTGGTGC TTACGCTCCA CCAAGAAAGC AGGAAACCTG TGGTATGAAG 3151 CCAGACCTCC CCGGCGGGCC TCAGGGAACA GAATGATCAG ACCTTTGAAT 3201 GATTCTAATT TTTAAGCAAA ATATTATTTT ATGAAAGGTT TACATTGTCA 3251 AAGTGATGAA TATGGAATAT CCAATCCTGT GCTGCTATCC TGCCAAAATC 3301 ATTTTAATGG AGTCAGTTTG CAGTATGCTC CACGTGGTAA GATCCTCCAA 3351 GCTGCTTTAG AAGTAACAAT GAAGAACGTG GACGCTTTTA ATATAAAGCC 3401 TGTTTTGTCT TCTGTTGTTG TTCAAACGGG ATTCACAGAG TATTTGAAAA 3451 ATGTATATAT ATTAAGAGGT CACGGGGGCT AATTGCTGGC TGGCTGCCTT 3501 TTGCTGTGGG GTTTTGTTAC CTGGTTTTAA TAACAGTAAA TGTGCCCAGC 3551 CTCTTGGCCC CAGAACTGTA CAGTATTGTG GCTGCACTTG CTCTAAGAGT 3601 AGTTGATGTT GCATTTTCCT TATTGTTAAA AACATGTTAG AAGCAATGAA 3651 TGTATATAAA AGCCTCAACT AGTCATTTTT TTCTCCTCTT CTTTTTTTTC 3701 ATTATATCTA ATTATTTTGC AGTTGGGCAA CAGAGAACCA TCCCTATTTT 3751 GTATTGAAGA GGGATTCACA TCTGCATCTT AACTGCTCTT TATGAATGAA 3801 AAAACAGTCC TCTGTATGTA CTCCTCTTTA CACTGGCCAG GGTCAGAGTT 3851 AAATAGAGTA TATGCACTTT CCAAATTGGG GACAAGGGCT CTAAAAAAAG 3901 CCCCAAAAGG AGAAGAACAT CTGAGAACCT CCTCGGCCCT CCCAGTCCCT 3951 CGCTGCACAA ATACTCCGCA AGAGAGGCCA GAATGACAGC TGACAGGGTC 4001 TATGGCCATC GGGTCGTCTC CGAAGATTTG GCAGGGGCAG AAAACTCTGG 4051 CAGGCTTAAG ATTTGGAATA AAGTCACAGA ATCAAGGAAG CACCTCAATT 4101 TAGTTCAAAC AAGACGCCAA CATTCTCTCC ACAGCTCACT TACCTCTCTG 4151 TGTTCAGATG TGGCCTTCCA TTTATATGTG ATCTTTGTTT TATTAGTAAA 4201 TGCTTATCAT CTAAAGATGT AGCTCTGGCC CAGTGGGAAA AATTAGGAAG 4251 TGATTATAAA TCGAGAGGAG TTATAATAAT CAAGATTAAA TGTAAATAAT 4301 CAGGGCAATC CCAACACATG TCTAGCTTTC ACCTCCAGGA TCTATTGAGT 4351 GAACAGAATT GCAAATAGTC TCTATTTGTA ATTGAACTTA TCCTAAAACA 4401 AATAGTTTAT AAATGTGAAC TTAAACTCTA ATTAATTCCA ACTGTACTTT 4451 TAAGGCAGTG GCTGTTTTTA GACTTTCTTA TCACTTATAG TTAGTAATGT 4501 ACACCTACTC TATCAGAGAA AAACAGGAAA GGCTCGAAAT ACAAGCCATT 4551 CTAAGGAAAT TAGGGAGTCA GTTGAAATTC TATTCTGATC TTATTCTGTG 4601 GTGTCTTTTG CAGCCCAGAC AAATGTGGTT ACACACTTTT TAAGAAATAC 4651 AATTCTACAT TGTCAAGCTT ATGAAGGTTC CAATCAGATC TTTATTGTTA 4701 TTCAATTTGG ATCTTTCAGG GATTTTTTTT TTAAATTATT ATGGGACAAA 4751 GGACATTTGT TGGAGGGGTG GGAGGGAGGA ACAATTTTTA AATATAAAAC 4801 ATTCCCAAGT TTGGATCAGG GAGTTGGAAG TTTTCAGAAT AACCAGAACT 4851 AAGGGTATGA AGGACCTGTA TTGGGGTCGA TGTGATGCCT CTGCGAAGAA 4901 CCTTGTGTGA CAAATGAGAA ACATTTTGAA GTTTGTGGTA CGACCTTTAG 4951 ATTCCAGAGA CATCAGCATG GCTCAAAGTG CAGCTCCGTT TGGCAGTGCA 5001 ATGGTATAAA TTTCAAGCTG GATATGTCTA ATGGGTATTT AAACAATAAA 5051 TGTGCAGTTT TAACTAACAG GATATTTAAT GACAACCTTC TGGTTGGTAG 5101 GGACATCTGT TTCTAAATGT TTATTATGTA CAATACAGAA AAAAATTTTA 5151 TAAAATTAAG CAATGTGAAA CTGAATTGGA GAGTGATAAT ACAAGTCCTT 5201 TAGTCTTACC CAGTGAATCA TTCTGTTCCA TGTCTTTGGA CAACCATGAC 5251 CTTGGACAAT CATGAAATAT GCATCTCACT GGATGCAAAG AAAATCAGAT 5301 GGAGCATGAA TGGTACTGTA CCGGTTCATC TGGACTGCCC CAGAAAAATA 5351 ACTTCAAGCA AACATCCTAT CAACAACAAG GTTGTTCTGC ATACCAAGCT 5401 GAGCACAGAA GATGGGAACA CTGGTGGAGG ATGGAAAGGC TCGCTCAATC 5451 AAGAAAATTC TGAGACTATT AATAAATAAG ACTGTAGTGT AGATACTGAG 5501 TAAATCCATG CACCTAAACC TTTTGGAAAA TCTGCCGTGG GCCCTCCAGA 5551 TAGCTCATTT CATTAAGTTT TTCCCTCCAA GGTAGAATTT GCAAGAGTGA 5601 CAGTGGATTG CATTTCTTTT GGGGAAGCTT TCTTTTGGTG GTTTTGTTTA 5651 TTATACCTTC TTAAGTTTTC AACCAAGGTT TGCTTTTGTT TTGAGTTACT 5701 GGGGTTATTT TTGTTTTAAA TAAAAATAAG TGTACAATAA GTGTTTTTGT 5751 ATTGAAAGCT TTTGTTATCA AGATTTTCAT ACTTTTACCT TCCATGGCTC 5801 TTTTTAAGAT TGATACTTTT AAGAGGTGGC TGATATTCTG CAACACTGTA 5851 CACATAAAAA ATACGGTAAG GATACTTTAC ATGGTTAAGG TAAAGTAAGT 5901 CTCCAGTTGG CCACCATTAG CTATAATGGC ACTTTGTTTG TGTTGTTGGA 5951 AAAAGTCACA TTGCCATTAA ACTTTCCTTG TCTGTCTAGT TAATATTGTG 6001 AAGAAAAATA AAGTACAGTG TGAGATACTG // LOCUS HUMBCL2A 5086 bp mRNA PRI 31-OCT-1994 DEFINITION Human B-cell leukemia/lymphoma 2 (bcl-2) proto-oncogene mRNA encoding bcl-2-alpha protein, complete cds. ACCESSION M13994 NID g179366 VERSION M13994.1 GI:179366 KEYWORDS alternative splicing; bcl-2-alpha protein; proto-oncogene. SOURCE Human pre-B-cell leukemia cell line 380, cDNA to mRNA, clones B[3,4,10]; and DNA, clone lambda-18-27. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5086) AUTHORS Tsujimoto,Y. and Croce,C.M. TITLE Analysis of the structure, transcripts, and protein products of bcl-2, the gene involved in human follicular lymphoma JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (14), 5214-5218 (1986) MEDLINE 86259760 COMMENT Clean copy sequence for [1] kindly provided by Y.Tsujimoto, 10-FEB-1987. The bcl-2 gene is transcribed by alternative splicing into three mRNAs of different sizes. It consists of at least two exons and encodes two proteins which only differ at their carboxy-terminal ends, and it is activated by translocation into poximity with the Ig heavy chain locus. Both the normal and rearranged bcl-2 gene products are expressed in the B-cell leukemia/lymphoma 2 cells. Genomic clone lambda-18-27 contained all the DNA sequences on the 5' of the splice site (position 2044). FEATURES Location/Qualifiers source 1. .5086 /organism="Homo sapiens" /db_xref="taxon:9606" /map="18q21.3" mRNA 1. .5086 /note="bcl2a mRNA" gene 1459. .2178 /gene="BCL2" CDS 1459. .2178 /gene="BCL2" /note="bcl2-alpha protein" /codon_start=1 /db_xref="GDB:G00-119-031" /protein_id="AAA51813.1" /db_xref="PID:g179367" /db_xref="GI:179367" /translation="MAHAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAP APGIFSSQPGHTPHPAASRDPVARTSPLQTPAAPGAAAGPALSPVPPVVHLALRQAGD DFSRRYRGDFAEMSSQLHLTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVE SVNREMSPLVDNIALWMTEYLNRHLHTWIQDNGGWDAFVELYGPSMRPLFDFSWLSLK TLLSLALVGACITLGAYLSHK" BASE COUNT 1262 a 1224 c 1287 g 1313 t ORIGIN 1 GCGCCCGCCC CTCCGCGCCG CCTGCCCGCC CGCCCGCCGC GCTCCCGCCC 51 GCCGCTCTCC GTGGCCCCGC CGCGCTGCCG CCGCCGCCGC TGCCAGCGAA 101 GGTGCCGGGG CTCCGGGCCC TCCCTGCCGG CGGCCGTCAG CGCTCGGAGC 151 GAACTGCGCG ACGGGAGGTC CGGGAGGCGA CCGTAGTCGC GCCGCCGCGC 201 AGGACCAGGA GGAGGAGAAA GGGTGCGCAG CCCGGAGGCG GGGTGCGCCG 251 GTGGGGTGCA GCGGAAGAGG GGGTCCAGGG GGGAGAACTT CGTAGCAGTC 301 ATCCTTTTTA GGAAAAGAGG GAAAAAATAA AACCCTCCCC CACCACCTCC 351 TTCTCCCCAC CCCTCGCCGC ACCACACACA GCGCGGGCTT CTAGCGCTCG 401 GCACCGGCGG GCCAGGCGCG TCCTGCCTTC ATTTATCCAG CAGCTTTTCG 451 GAAAATGCAT TTGCTGTTCG GAGTTTAATC AGAAGACGAT TCCTGCCTCC 501 GTCCCCGGCT CCTTCATCGT CCCATCTCCC CTGTCTCTCT CCTGGGGAGG 551 CGTGAAGCGG TCCCGTGGAT AGAGATTCAT GCCTGTGTCC GCGCGTGTGT 601 GCGCGCGTAT AAATTGCCGA GAAGGGGAAA ACATCACAGG ACTTCTGCGA 651 ATACCGGACT GAAAATTGTA ATTCATCTGC CGCCGCCGCT GCCAAAAAAA 701 AACTCGAGCT CTTGAGATCT CCGGTTGGGA TTCCTGCGGA TTGACATTTC 751 TGTGAAGCAG AAGTCTGGGA ATCGATCTGG AAATCCTCCT AATTTTTACT 801 CCCTCTCCCC CCGACTCCTG ATTCATTGGG AAGTTTCAAA TCAGCTATAA 851 CTGGAGAGTG CTGAAGATTG ATGGGATCGT TGCCTTATGC ATTTGTTTTG 901 GTTTTACAAA AAGGAAACTT GACAGAGGAT CATGCTGTAC TTAAAAAATA 951 CAAGTAAGTC TCGCACAGGA AATTGGTTTA ATGTAACTTT CAATGGAAAC 1001 CTTTGAGATT TTTTACTTAA AGTGCATTCG AGTAAATTTA ATTTCCAGGC 1051 AGCTTAATAC ATTGTTTTTA GCCGTGTTAC TTGTAGTGTG TATGCCCTGC 1101 TTTCACTCAG TGTGTACAGG GAAACGCACC TGATTTTTTA CTTATTAGTT 1151 TGTTTTTTCT TTAACCTTTC AGCATCACAG AGGAAGTAGA CTGATATTAA 1201 CAATACTTAC TAATAATAAC GTGCCTCATG AAATAAAGAT CCGAAAGGAA 1251 TTGGAATAAA AATTTCCTGC GTCTCATGCC AAGAGGGAAA CACCAGAATC 1301 AAGTGTTCCG CGTGATTGAA GACACCCCCT CGTCCAAGAA TGCAAAGCAC 1351 ATCCAATAAA ATAGCTGGAT TATAACTCCT CTTCTTTCTC TGGGGGCCGT 1401 GGGGTGGGAG CTGGGGCGAG AGGTGCCGTT GGCCCCCGTT GCTTTTCCTC 1451 TGGGAAGGAT GGCGCACGCT GGGAGAACGG GGTACGACAA CCGGGAGATA 1501 GTGATGAAGT ACATCCATTA TAAGCTGTCG CAGAGGGGCT ACGAGTGGGA 1551 TGCGGGAGAT GTGGGCGCCG CGCCCCCGGG GGCCGCCCCC GCACCGGGCA 1601 TCTTCTCCTC CCAGCCCGGG CACACGCCCC ATCCAGCCGC ATCCCGCGAC 1651 CCGGTCGCCA GGACCTCGCC GCTGCAGACC CCGGCTGCCC CCGGCGCCGC 1701 CGCGGGGCCT GCGCTCAGCC CGGTGCCACC TGTGGTCCAC CTGGCCCTCC 1751 GCCAAGCCGG CGACGACTTC TCCCGCCGCT ACCGCGGCGA CTTCGCCGAG 1801 ATGTCCAGCC AGCTGCACCT GACGCCCTTC ACCGCGCGGG GACGCTTTGC 1851 CACGGTGGTG GAGGAGCTCT TCAGGGACGG GGTGAACTGG GGGAGGATTG 1901 TGGCCTTCTT TGAGTTCGGT GGGGTCATGT GTGTGGAGAG CGTCAACCGG 1951 GAGATGTCGC CCCTGGTGGA CAACATCGCC CTGTGGATGA CTGAGTACCT 2001 GAACCGGCAC CTGCACACCT GGATCCAGGA TAACGGAGGC TGGGATGCCT 2051 TTGTGGAACT GTACGGCCCC AGCATGCGGC CTCTGTTTGA TTTCTCCTGG 2101 CTGTCTCTGA AGACTCTGCT CAGTTTGGCC CTGGTGGGAG CTTGCATCAC 2151 CCTGGGTGCC TATCTGAGCC ACAAGTGAAG TCAACATGCC TGCCCCAAAC 2201 AAATATGCAA AAGGTTCACT AAAGCAGTAG AAATAATATG CATTGTCAGT 2251 GATGTACCAT GAAACAAAGC TGCAGGCTGT TTAAGAAAAA ATAACACACA 2301 TATAAACATC ACACACACAG ACAGACACAC ACACACACAA CAATTAACAG 2351 TCTTCAGGCA AAACGTCGAA TCAGCTATTT ACTGCCAAAG GGAAATATCA 2401 TTTATTTTTT ACATTATTAA GAAAAAAGAT TTATTTATTT AAGACAGTCC 2451 CATCAAAACT CCGTCTTTGG AAATCCGACC ACTAATTGCC AAACACCGCT 2501 TCGTGTGGCT CCACCTGGAT GTTCTGTGCC TGTAAACATA GATTCGCTTT 2551 CCATGTTGTT GGCCGGATCA CCATCTGAAG AGCAGACGGA TGGAAAAAGG 2601 ACCTGATCAT TGGGGAAGCT GGCTTTCTGG CTGCTGGAGG CTGGGGAGAA 2651 GGTGTTCATT CACTTGCATT TCTTTGCCCT GGGGGCGTGA TATTAACAGA 2701 GGGAGGGTTC CCGTGGGGGG AAGTCCATGC CTCCCTGGCC TGAAGAAGAG 2751 ACTCTTTGCA TATGACTCAC ATGATGCATA CCTGGTGGGA GGAAAAGAGT 2801 TGGGAACTTC AGATGGACCT AGTACCCACT GAGATTTCCA CGCCGAAGGA 2851 CAGCGATGGG AAAAATGCCC TTAAATCATA GGAAAGTATT TTTTTAAGCT 2901 ACCAATTGTG CCGAGAAAAG CATTTTAGCA ATTTATACAA TATCATCCAG 2951 TACCTTAAAC CCTGATTGTG TATATTCATA TATTTTGGAT ACGCACCCCC 3001 CAACTCCCAA TACTGGCTCT GTCTGAGTAA GAAACAGAAT CCTCTGGAAC 3051 TTGAGGAAGT GAACATTTCG GTGACTTCCG ATCAGGAAGG CTAGAGTTAC 3101 CCAGAGCATC AGGCCGCCAC AAGTGCCTGC TTTTAGGAGA CCGAAGTCCG 3151 CAGAACCTAC CTGTGTCCCA GCTTGGAGGC CTGGTCCTGG AACTGAGCCG 3201 GGCCCTCACT GGCCTCCTCC AGGGATGATC AACAGGGTAG TGTGGTCTCC 3251 GAATGTCTGG AAGCTGATGG ATGGAGCTCA GAATTCCACT GTCAAGAAAG 3301 AGCAGTAGAG GGGTGTGGCT GGGCCTGTCA CCCTGGGGCC CTCCAGGTAG 3351 GCCCGTTTTC ACGTGGAGCA TAGGAGCCAC GACCCTTCTT AAGACATGTA 3401 TCACTGTAGA GGGAAGGAAC AGAGGCCCTG GGCCTTCCTA TCAGAAGGAC 3451 ATGGTGAAGG CTGGGAACGT GAGGAGAGGC AATGGCCACG GCCCATTTTG 3501 GCTGTAGCAC ATGGCACGTT GGCTGTGTGG CCTTGGCCAC CTGTGAGTTT 3551 AAAGCAAGGC TTTAAATGAC TTTGGAGAGG GTCACAAATC CTAAAAGAAG 3601 CATTGAAGTG AGGTGTCATG GATTAATTGA CCCCTGTCTA TGGAATTACA 3651 TGTAAAACAT TATCTTGTCA CTGTAGTTTG GTTTTATTTG AAAACCTGAC 3701 AAAAAAAAAG TTCCAGGTGT GGAATATGGG GGTTATCTGT ACATCCTGGG 3751 GCATTAAAAA AAAATCAATG GTGGGGAACT ATAAAGAAGT AACAAAAGAA 3801 GTGACATCTT CAGCAAATAA ACTAGGAAAT TTTTTTTTCT TCCAGTTTAG 3851 AATCAGCCTT GAAACATTGA TGGAATAACT CTGTGGCATT ATTGCATTAT 3901 ATACCATTTA TCTGTATTAA CTTTGGAATG TACTCTGTTC AATGTTTAAT 3951 GCTGTGGTTG ATATTTCGAA AGCTGCTTTA AAAAAATACA TGCATCTCAG 4001 CGTTTTTTTG TTTTTAATTG TATTTAGTTA TGGCCTATAC ACTATTTGTG 4051 AGCAAAGGTG ATCGTTTTCT GTTTGAGATT TTTATCTCTT GATTCTTCAA 4101 AAGCATTCTG AGAAGGTGAG ATAAGCCCTG AGTCTCAGCT ACCTAAGAAA 4151 AACCTGGATG TCACTGGCCA CTGAGGAGCT TTGTTTCAAC CAAGTCATGT 4201 GCATTTCCAC GTCAACAGAA TTGTTTATTG TGACAGTTAT ATCTGTTGTC 4251 CCTTTGACCT TGTTTCTTGA AGGTTTCCTC GTCCCTGGGC AATTCCGCAT 4301 TTAATTCATG GTATTCAGGA TTACATGCAT GTTTGGTTAA ACCCATGAGA 4351 TTCATTCAGT TAAAAATCCA GATGGCGAAT GACCAGCAGA TTCAAATCTA 4401 TGGTGGTTTG ACCTTTAGAG AGTTGCTTTA CGTGGCCTGT TTCAACACAG 4451 ACCCACCCAG AGCCCTCCTG CCCTCCTTCC GCGGGGGCTT TCTCATGGCT 4501 GTCCTTCAGG GTCTTCCTGA AATGCAGTGG TCGTTACGCT CCACCAAGAA 4551 AGCAGGAAAC CTGTGGTATG AAGCCAGACC TCCCCGGCGG GCCTCAGGGA 4601 ACAGAATGAT CAGACCTTTG AATGATTCTA ATTTTTAAGC AAAATATTAT 4651 TTTATGAAAG GTTTACATTG TCAAAGTGAT GAATATGGAA TATCCAATCC 4701 TGTGCTGCTA TCCTGCCAAA ATCATTTTAA TGGAGTCAGT TTGCAGTATG 4751 CTCCACGTGG TAAGATCCTC CAAGCTGCTT TAGAAGTAAC AATGAAGAAC 4801 GTGGACGTTT TTAATATAAA GCCTGTTTTG TCTTTTGTTG TTGTTCAAAC 4851 GGGATTCACA GAGTATTTGA AAAATGTATA TATATTAAGA GGTCACGGGG 4901 GCTAATTGCT AGCTGGCTGC CTTTTGCTGT GGGGTTTTGT TACCTGGTTT 4951 TAATAACAGT AAATGTGCCC AGCCTCTTGG CCCCAGAACT GTACAGTATT 5001 GTGGCTGCAC TTGCTCTAAG AGTAGTTGAT GTTGCATTTT CCTTATTGTT 5051 AAAAACATGT TAGAAGCAAT GAATGTATAT AAAAGC // LOCUS AB017915 6961 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for condoroitin 6-sulfotransferase, complete cds. ACCESSION AB017915 NID g4115403 VERSION AB017915.1 GI:4115403 KEYWORDS condoroitin 6-sulfotransferase. SOURCE Homo sapiens adult female placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6961) AUTHORS kitagawa,h., Shimakawa,H. and Sugahara,K. TITLE Direct Submission JOURNAL Submitted (25-SEP-1998) to the DDBJ/EMBL/GenBank databases. Hiroshi Kitagawa, Kobe Pharmaceutical Univ., Dept. of Biochemistry; 4-19-1 Motoyamakita-machi, Higashinada-ku, Kobe, Hyogo 658-8558, Japan (E-mail:kitagawa@kobepharma-u.ac.jp, Tel:81-78-441-7570, Fax:81-78-441-7571) REFERENCE 2 (sites) AUTHORS Tsutsumi,K., Shimakawa,H., Kitagawa,H. and Sugahara,K. TITLE Functional expression and genomic structure of human chondroitin 6-sulfotransferase JOURNAL FEBS Lett. 441 (2), 235-241 (1998) MEDLINE 99098360 COMMENT Sequence updated (09-Oct-1998). FEATURES Location/Qualifiers source 1. .6961 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="female" /tissue_type="placenta" CDS 441. .1880 /codon_start=1 /product="condoroitin 6-sulfotransferase" /protein_id="BAA36348.1" /db_xref="PID:d1037334" /db_xref="PID:g4115404" /db_xref="GI:4115404" /translation="MEKGLTLPQDCRDFVHSLKMRSKYALFLVFVVIVFVFIEKENKI ISRVSDKLKQIPQALADANSTDPALILAENASLLSLSELDSAFSQLQSRLRNLSLQLG VEPAMEAAGEEEEEQRKEEEPPRPAVAGPRRHVLLMATTRTGSSFVGEFFNQQGNIFY LFEPLWHIERTVSFEPGGANAAGSALVYRDVLKQLFLCDLYVLEHFITPLPEDHLTQF MFRRGSSRSLCEDPVCTPFVKKVFEKYHCKNRRCGPLNVTLAAEACRRKEHMALKAVR IRQLEFLQPLAEDPRLDLRVIQLVRDPRAVLASRMVAFAGKYKTWKKWLDDEGQDGLR EEEVQRLRGNCESIRLSAELGLRQPAWLRGRYMLVRYEDVARGPLQKAREMYRFAGIP LTPQVEDWIQKNTQAAHDGSGIYSTQKNSSEQFEKWRFSMPFKLAQVVQAACGPAMRL FGYKLARDAAALTNRSVSLLEERGTFWVT" polyA_site 6961 /note="31 a nucleotides" BASE COUNT 1486 a 1974 c 2002 g 1499 t ORIGIN 1 GACGCCTTGG ACGCGGTTCA GGAATCCGCC GCCGCTAGCC GGCTCGGGCC 51 TGAGCGGGGA GGGCGCCAGG CAGGACCTCT CGCAGGCTCG CTGCGCAGGA 101 CGGCGCCCGC CTGGCGCCGC TTCCCTTCCA GCGTGCCGAC CGGCCCCGCA 151 GCGCCTCCAT CCCTCCGGCC CGCCCCGGAG AAGACGCACA GCTCGGGCCG 201 CGCGGGCGCC GGGGCCGCGG AACCGCTTCT GCCGGGATCT TCCAGGAGGA 251 AAGCGAAGTT GCGAGCGGAT GCTGCCCGCG CCGGACCCCA GCCGCGGAGG 301 GTCGGGGCCG CCGGTGGAGT CTCGGCGGCC GGGGACAAGG GTGTCCCCCA 351 CCTGAAGACG GCAAGCTGGG TCCTGAGTGA TGCCCCTCAG CTGAGTGTCC 401 AAGGCTGGCC CGAGGAGCCC CCACGGCCCC ACCTTTCCCC ATGGAGAAAG 451 GACTCACTTT GCCCCAGGAC TGCCGGGACT TTGTGCACAG CCTGAAGATG 501 AGAAGCAAAT ACGCCCTTTT CTTGGTTTTT GTGGTGATAG TTTTTGTCTT 551 CATCGAAAAG GAAAATAAAA TCATATCAAG GGTCTCAGAC AAGCTGAAGC 601 AGATTCCCCA AGCTCTAGCA GATGCCAACA GCACCGACCC AGCCCTGATC 651 TTAGCTGAGA ACGCATCTCT CTTGTCCCTG AGCGAGCTCG ATTCAGCCTT 701 CTCCCAGCTT CAGAGCCGTC TCCGCAACCT CAGCTTGCAG CTGGGCGTGG 751 AGCCAGCCAT GGAGGCCGCA GGGGAGGAAG AGGAAGAGCA GAGAAAGGAG 801 GAGGAGCCGC CCAGACCGGC CGTGGCGGGG CCCCGGCGCC ACGTGCTGCT 851 CATGGCCACC ACGCGCACCG GCTCCTCGTT CGTGGGCGAG TTCTTCAACC 901 AGCAGGGCAA CATCTTCTAC CTCTTCGAGC CGCTGTGGCA CATCGAGCGC 951 ACAGTGTCCT TCGAGCCGGG GGGCGCCAAC GCCGCGGGCT CGGCCCTGGT 1001 GTACCGCGAC GTGCTCAAGC AGCTCTTCCT GTGCGACCTG TACGTGCTGG 1051 AGCACTTCAT CACGCCGCTG CCCGAGGACC ACCTGACTCA GTTCATGTTC 1101 CGCCGGGGCT CCAGCCGCTC CCTGTGCGAG GACCCCGTCT GTACGCCCTT 1151 CGTCAAGAAG GTCTTCGAGA AGTACCACTG CAAGAACCGC CGCTGCGGCC 1201 CCCTCAACGT GACGCTGGCC GCAGAGGCCT GCCGCCGCAA GGAGCACATG 1251 GCCCTCAAGG CGGTGCGCAT CCGGCAGCTG GAGTTCCTGC AGCCGCTGGC 1301 CGAGGACCCC CGCCTGGACC TGCGCGTCAT CCAGCTGGTG CGCGACCCCC 1351 GGGCCGTGCT GGCCTCGCGC ATGGTGGCCT TCGCCGGCAA GTATAAGACC 1401 TGGAAGAAGT GGCTGGACGA CGAGGGCCAG GACGGCCTGA GGGAAGAGGA 1451 GGTGCAGCGG CTGCGGGGCA ACTGCGAGAG CATCCGCCTG TCCGCGGAGC 1501 TGGGGCTGCG GCAGCCCGCC TGGCTGCGGG GCCGCTACAT GCTGGTGCGC 1551 TACGAGGACG TGGCACGCGG GCCGCTGCAG AAGGCCCGCG AGATGTACCG 1601 CTTCGCCGGC ATCCCCCTGA CCCCGCAGGT GGAAGACTGG ATCCAAAAGA 1651 ACACGCAGGC GGCCCACGAC GGCAGCGGCA TCTACTCCAC GCAGAAGAAC 1701 TCCTCGGAGC AGTTCGAGAA GTGGCGCTTC AGCATGCCCT TCAAGCTGGC 1751 CCAGGTGGTG CAGGCCGCCT GCGGCCCTGC CATGCGCCTC TTCGGCTACA 1801 AACTGGCGCG GGACGCCGCC GCCCTCACCA ACCGCTCAGT CAGCCTGCTG 1851 GAGGAGAGGG GCACCTTCTG GGTCACGTAG GGGGGCCGGG GCCCCGTATG 1901 CCCCTCCTCG TGAAAGGCCT GCCCCGTCTT TCTGCCGCAG CCCTCGCAGA 1951 GGGCGGGTGC ACAGCGCCAT GAGCGGGCAG CGCCTCCTGT AGCAGTAGGG 2001 CCCCCAGCCA GCGCTCCAGC CAAAGCGGCG GCCCCAGGGT TAATTGCGGA 2051 GAACAGGACA GTGCCCGGTC CCCTTGAGGG CCATCACACC CAGACCCAAC 2101 GGGTTGCAGC CTCCTGAGCA GGCCTAGGCA GGCCCGGGCC TGTTGGCAAG 2151 CTTCGATCTC ACACACACAG AAACATACAT TCGTGCCTGG AGACCCTGCA 2201 GGCCAGAGTC CAAATATTTA ACAATCAGAA GGGGCAAGGC TCTGACCAGT 2251 GACAGTCAGA CCTCCTGCTT TATTTGGTGT TAACGTTTCT TGTCTGGATG 2301 GTGAAGTCTG GAATCTGGGT GGGTCCTTGG AGGAGGGGCT AGGACAGCCG 2351 TGGGTGTCAA AGGTGGCATT TGAGGCTCGT TTGAGGTGAC AGTGGCTGTT 2401 TACCAACTAG TACAGAGCGA TTCAGCGTTT TCATGAATGG GTTGGCTGTG 2451 CTGGTTACTG ATGAATATGG GCCCCTTATA GAGCTGCAAA ACACACACAT 2501 GCGTATAGAC ATACATACAT GTACACCATA CATAGACAAG CATCATAAAG 2551 GCACAAGTGC ACACACATCT ATGCAGACAA GCTTCCTCGC TGCTTATGCC 2601 CACAGGGTTT TTCTGTATGA CACACCTCAG AGGAGCCTGT GCTTAACATT 2651 TGTAGGATTA TTTCGAGGGC AGGGCAGGGG AAAGAAACGC GTTAAAGGGC 2701 CATGACATGA CACAGTTCCC TGGCCGGGTT TGGACTGAAC AGTGGATTCA 2751 GAACTGCAGC GTTCAAAGCC CCTGCCCGCC TTAGATTCTC ATGGCTACCT 2801 CCTCTGACCC ATCCACATCC TTGCGGCTGG CAGGCAGGAT GTTTTGAAAT 2851 GGTGCAGCCA AAGGCCCCGT CTAGCTTGGC TGGCTCCCGA ACATGTCCAT 2901 ATTTGAAGGC TGCCTCAGGC CTGGCAGGCA GAAGCAGCTG TGGTCCCTGG 2951 AGGCAAAAGG CAGCTCACGG GGCCTTCCAT TTCCAGCCAG GCCTCATCTG 3001 GGGTAGGGCC AAGAGGAAAG TACAGAGTGC CCGTGGGGGA GAAGTCCCAC 3051 CAGGATGCCC CCCCTCCCCT GAGAAGCCCG CCTTCTTCCC TCGTGGACAG 3101 GAGTCAGCCA GTTGGTTGCT GTGGTTACAG CCAGTGCCTG AATTCCTCCT 3151 TAGGGCCCTG GGAAGAGTAT TGCTTAACGC AGGATGTGCT GGGTGTTTTG 3201 TTTCGGGCTT TTATTTATGG CTTGGTGTCT TTCTTGTTTC ATGGCTGTGT 3251 TTTTGCTTTT GTTTCTGTAG GAGCGCCTCT TCCAAGCAGT GGAGGCCTGA 3301 GGGCTTGAGG GAGGCTGGGG AAAACCAGCA GGGAGCACTT GGCTCCTGGC 3351 AGAGGGAGAG GACAGGAGGA GGAGCCTTCC CCAGGAGGAA CGAGGAGAGG 3401 TGGCCACGTG GCAGGCCACA GTGCCCTATG GGCTTTTCTG TTTGAACCCC 3451 ATGTGGGATG AATCTCAAAG CCACCAGCGT TCTTGCCTAG TGTCCAGAGC 3501 AGCAGAGATT TGCAGCCCTC GCCCCTCCCT GAGAAGAAGC TGGATTTGAA 3551 AATCCCTCAC AGGCCCTAAC ATGTTTCCTG GGTGGGCAGG GGCCTCATGA 3601 CAAGCGACAA GTGTACCCAG AGTAGAAGGC CTTCCCCACT CCCACCTCAA 3651 ACCCAGGAGC AGAACTCAGT ATCCAGAGCT TACAAGGAGC TGCAAGGGTT 3701 TGCCGAGGGC TGCCCAGCTC TGCTTCTGGT TTCCTGGACA ATTTCTCTGT 3751 CAGATACGGC CCATTGTAAA CCCAGAGGGC TGCATTTTGG GTTGAGTAAT 3801 ATGGACACCA TGGAAGTACA GGATCTGAGT CCAACATTGC CATGGGGGAG 3851 GGAAGATGTG TCCTGGCAGG AAGCCACCAT TGGAGAAGGT GAGGCCAGTA 3901 ACCATCTCAT GGGATTCGCT ATAATCAGCC TATTTGAGCT GCTGGGTCTC 3951 TGTCACTGTG GACTCCAGTC ATTCAAGGAG GTCTCCACTC TTGGGTTTTT 4001 TTTATTTTCT TTTCTTTTGC TATTTTGCCT GAACACCCCA ACACTCTTGG 4051 TTTTAGAGTC CAAGAATAAG TTGTGCAGAA ACATGGTTCT AATTTGGAGT 4101 TCAAAGGAAC ACCAGCTCTG AAAACATGAC CGTGGGGCCA GGATGTTTCT 4151 GGCAGGCCCA GAAGTGCTGG CCGTTCTTCC CCAGCCTCCC CACTGCTCCC 4201 TCCTCTGTTC CCCATGTGGA GGTCGGGGGG GCTGGGACTG GGGAGGGGGC 4251 AGCCACCTCC ATCCAAGGCT CTTCCTAGGG GCCCTGCTAA TGTGGACAGC 4301 AGACTTTATC CCTCCTTCTT ACTCTGGGCC AAGACCTCCC ACTCCGCCCT 4351 CTGAGGATGG TACCAAGAAT GGGCCTTTGG GACTTCTCAG GACATTGACA 4401 ATGTGCACAC TGCAGGTTGA CTTAATTTAT TTATTTTTTG AAAAGAAGAG 4451 ACAGAAAATA CCTTATTATC TGCAGGGACT CAAAGCTGTA CCCAAATCAA 4501 GAAAGACAAA ATAACAATAG AAACCATTCA CACACTCCGT TCATGCATGG 4551 ACACCAGAAG ACGATTCAGA ACACAAGAGG AGAATGAGCT GGAGCGCCAG 4601 CACCACGATG GCCCCAGGAC GTTGTTTAAG ATCTGAGATC CCTTGGTTTC 4651 TCTCTTGTTG ATCTGAGATG CTACATCAGG GCCCTTGCTG CTGTATTACC 4701 TCTGCCCGGG TGTTTGAGAC AGGTCACAAG CGCGTGGATG TTTCCTGGAA 4751 GTGTATTCCC CTGGGGCTCA GCGATGCTCT GGGAACCCTT GGGGTTTGGG 4801 GAAGGACAGT GTGGTGTGAC TATTGGCTTG AAACGTTATG GATATGGTTG 4851 GAGTCACCCA TGTGGATATG ATTTTCAGCT TGGAGCTGGG ATGGAGAAGA 4901 TGGAAGCACT GGTGGCCGGG CCTCTTTTTC CAGCGAGGCC AGCCCTGGCT 4951 TCTCCTCTGT TCCAGCTTGT CTGCCAGAAG CCTTCCCCTG CAAGGTGCCC 5001 ACCTGCCCGG AGACAGAGGG AGCAGCTGAG GCCTCTCTCC ACTGGCCGCC 5051 AGCCTCCCTA GCACAAGGGG ACCATTCAGC AATGAGTGTT TACTCATGTT 5101 CTTTCTTGTT GATGAGTTCT GCGGCCCAGC ATATAGAAAT GGCTCAGAAC 5151 GTTCCAGGCC TTGGTGAGAC ACAGCAGCGT CCAATGCTCA CAGGCTGTCC 5201 CCACTCTGCC TGTTCCCATC TTCCCCTCGG GACCTGTTTA TAAATTGAGG 5251 AATGGATGAG GGCCCTGGAG GGTCCTTGGG AGTAGTTAGT GGAGGGCAGG 5301 ATTCTCAGGT CCCCATAGAG AGGGAAAGAG AAGACCAGAT GTGCATAGAA 5351 GCCAATCTCT GTCACATACA CCGCAGGTGG TCACCGAGTT ATTTTCCAAA 5401 AGCTGAGGAA CATAAAGCAA ATTTAGGCTT TTGTCCTTCT GCAATACATG 5451 CACTTGAAAA TAAACAGAAA AGAGATGTTT AATAACAAGT TTACTTCCGG 5501 TCTGCTAGCA CCCTCAGCCT GGGAGCTACT GCCAGAGGCT GGCGTAGTGG 5551 GGCAGCTGCT GACCAAACCC CACAGAACAG AGGGCACCAG GCATCACACG 5601 ACATCTTTCC CTCCCATCTC TGCCATCTGT CTGTCTGGCA ATGGAAGGAG 5651 CTTCAGCAGG AGATCCTTCC CAGAAGGTTG ATTCTTGGCC TGGGTGGTAG 5701 AGGAGATATC CTGACCCTAA AGTCTTCCAG CCCCGGGTCA TCCTCTCCTA 5751 TACTAGGGCC TATTGACCGT AGAGGCCAGG TGCGGTGGCT CATACCTGTA 5801 ATCCCAGCAT TTGGGGAGGC CAAGGTGGGA GGATTGCTTG AGCCCAAGAG 5851 TTGGAGACCA GCCTGGGCAA CATGGTGAGA CCAATTGACT GTAGGGCTTG 5901 TCCCTTGCTA AGACAGGAGC AGAAGACTGG ATGGCTGTGT CCTCAAGGCA 5951 GTCCCCTCCC ATCCCCATTC CCAAAGTCAG AAAGCGAAGC CAGATCTCAA 6001 GGGCTGATAC CTGAGGCAAG GAGAGCTAAG GGGAGAGAAA TTGGGGCTGA 6051 GTGGAAGCAG ATGCCTGCAC AAGCCAAGTG TGTCTTATTG ATCTGTACAA 6101 GCTGATAGAA GGACCATCTG CTGAAATCCA GGGCTCCTGA GTTGGTGGGT 6151 TTGCTTGTGA AGCTTCTAGG ACAAGGCGCA GCCAGATGGA AGGGAGAGGG 6201 TCCAGGGCTT CCTAAATGCA GCCGCATCAC CTGTCAACTC GCTGCAGTTT 6251 CAGAGACAGT GGAAGCTGCT CTTTACCTCA AAAGCACAAA GCAGAATTGG 6301 CAACTTCACA TGTCTCGAGA GCTCCAAGAT CCTTTGGTCT CGTGTCCTTC 6351 GGCACCCCGT AACTGGACTG GGGACAAATT TGTTACGTGT TTCCAAGGCT 6401 ACAGACATGG TGCCATCCTC ACAGGCCTAG CTCATGTCAT GGATCAGGTG 6451 CTTTGCTGGG GGGATACAAT GACCCATGTG GTCTGCGTCT TCTGCCATTG 6501 GCTTGAACTG GGCACCCCCT GAAGTCTCCA TCTCAGCTGT ATATTCTAGT 6551 CCAAATTTTC CTGGCTCCCC TGCTCCCTCC CAGGTCCTGC ACACTGTCTT 6601 TTTGCCAACA CCTCAGCCCT GTTTGCTATT TCATGTCTGC ATGGTACGAG 6651 ACACCCCTTC ACAGCATACA CTGCCATGGT ATGTACATAT GCATCCACGT 6701 GTGTGTATGT GTAGCATGTA GCTCATTCTG CTCAGCCTCT GGCGCCGTTC 6751 CACATTGCTC CCAGCCCCAT TGCTGTCACT GTCGTCCCAG ATGTCCTTGC 6801 CATGGCCACA GACAATCTGC CGTCTCCTGG AAACGCTGGG GCTGCCCTTC 6851 AGAGAGCTGG CAGCCCCAGC AGTCAGGGCC TGCTTTGCAG AATAGAATGT 6901 GAACCCAACT CCTGATGGCC TACTTGACTT ATTTAATTAA AGATGAAATC 6951 ATGCAATGAG C // LOCUS AF047033 7785 bp mRNA PRI 12-JUN-1999 DEFINITION Homo sapiens sodium bicarbonate cotransporter 3 (SLC4A7) mRNA, complete cds. ACCESSION AF047033 NID g5051627 VERSION AF047033.1 GI:5051627 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7785) AUTHORS Pushkin,A., Abuladze,N., Lee,I., Newman,D., Hwang,J. and Kurtz,I. TITLE Cloning, tissue distribution, genomic organization, and functional characterization of NBC3, a new member of the sodium bicarbonate cotransporter family JOURNAL J. Biol. Chem. 274 (23), 16569-16575 (1999) MEDLINE 99278433 REFERENCE 2 (bases 1 to 7785) AUTHORS Pushkin,A., Abuladze,N., Newman,D., Hwang,J. and Kurtz,I. TITLE Direct Submission JOURNAL Submitted (06-FEB-1998) Medicine, UCLA, 10833 Le Conte Ave, Los Angeles, CA 90095, USA FEATURES Location/Qualifiers source 1. .7785 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p22" /tissue_type="skeletal muscle" gene 1. .7785 /gene="SLC4A7" CDS 72. .3716 /gene="SLC4A7" /note="NBC3; solute carrier family 4, sodium bicarbonate cotransporter, member 7" /codon_start=1 /product="sodium bicarbonate cotransporter 3" /protein_id="AAD38322.1" /db_xref="PID:g5051628" /db_xref="GI:5051628" /translation="MERFRLEKKLPGPDEEAVVDLGKTSSTVNTKFEKEELESHRAVY IGVHVPFSKESRRRHRHRGHKHHHRRRKDKESDKEDGRESPSYDTPSQRVQFILGTED DDEEHIPHDLFTEMDELCYRDGEEYEWKETARWLKFEEDVEDGGDRWSKPYVATLSLH SLFELRSCILNGTVMLDMRASTLDEIADMVLDNMIASGQLDESIRENVREALLKRHHH QNEKRFTSRIPLVRSFADIGKKHSDPHLLERNGEGLSASRHSLRTGLSASNLSLRGES PLSLLLGHLLPSSRAGTPAGSRCTTPVPTPQNSPPSSPSISRLTSRSSQKSQRQAPEL LVSPASDDIPTVVIHPPEEDLEAALKGEEQKNEENVDLTPGILASPQSAPGNLDNSKS GEIKGNGSGGSRENSTVDFSKVDMNFMRKIPTGAEASNVLVGEVDFLERPIIAFVRLA PAVLLTGLTEVPVPTRFLFLLLGPAGKAPQYHEIGRSIATLMTDEIFHDVAYKAKDRN DLLSGIDEFLDQVTVLPPGEWDPSIRIEPPKSVPSQEKRKIPVFHNGSTPTLGETPKE AAHHAGPELQRTGRLFGGLILDIKRKAPFFLSDFKDALSLQCLASILFLYCACMSPVI TFGGLLGEATEGRISAIESLFGASLTGIAYSLFAGQPLTILGSTGPVLVFEKILYKFC RDYQLSYLSLRTSIGLWTSFLCIVLVATDASSLVCYITRFTEEAFAALICIIFIYEAL EKLFDLGETYAFNMHNNLDKLTSYSCVCTEPPNPSNETLAQWKKDNITAHNISWRNLT VSECKKLRGVFLGSACGHHGPYIPDVLFWCVILFFTTFFLSSFLKQFKTKRYFPTKVR STISDFAVFLTIVIMVTIDYLVGVPSPKLHVPEKFEPTHPERGWIISPLGDNPWWTLL IAAIPALLCTILIFMDQQITAVIINRKEHKLKKGAGYHLDLLMVGVMLGVCSVMGLPW FVAATVLSISHVNSLKVESECSAPGEQPKFLGIREQRVTGLMIFILMGLSVFMTSVLK FIPMPVLYGVFLYMGVSSLKGIQLFDRIKLFGMPAKHQPDLIYLRYVPLWKVHIFTVI QLTCLVLLWVIKVSAAAVVFPMMVLALVFVRKLMDLCFTKRELSWLDDLMPESKKKKE DDKKKKEKEEAERMLQDDDDTVHLPFEGGSLLQIPVKALKYSPDKPVSVKISFEDEPR KKYVDAETSL" BASE COUNT 2368 a 1309 c 1450 g 2658 t ORIGIN 1 CTCACTGTCC CTGGAATCTT CAAGGGAGTT ACTGCATTAC ATCATCAGAA 51 ACAAGGCATT TCTATATTAC TATGGAAAGA TTTCGTCTGG AGAAGAAGTT 101 ACCTGGTCCT GATGAAGAAG CTGTTGTGGA TCTTGGCAAA ACTAGCTCAA 151 CTGTGAACAC CAAGTTTGAA AAAGAAGAAC TAGAAAGTCA TAGAGCTGTA 201 TATATTGGTG TTCACGTCCC GTTTAGTAAA GAGAGTCGTC GGCGTCATAG 251 GCATCGCGGA CACAAACATC ACCACCGGAG GAGAAAAGAT AAAGAATCAG 301 ATAAAGAAGA TGGACGGGAA TCTCCTTCTT ATGATACACC ATCCCAGAGA 351 GTTCAGTTTA TCCTTGGTAC TGAAGATGAT GATGAAGAAC ATATTCCCCA 401 TGATCTCTTC ACGGAAATGG ATGAACTGTG TTACAGAGAT GGAGAAGAAT 451 ATGAATGGAA AGAAACTGCT AGATGGCTGA AATTTGAAGA GGATGTTGAA 501 GATGGCGGTG ACCGATGGAG TAAACCTTAT GTGGCAACTC TCTCTTTGCA 551 CAGTCTTTTT GAACTAAGGA GTTGCATCCT CAATGGAACA GTCATGCTGG 601 ATATGAGAGC AAGCACTCTA GATGAAATAG CAGATATGGT ATTAGACAAC 651 ATGATAGCTT CTGGCCAATT AGACGAGTCC ATACGAGAGA ATGTCAGAGA 701 AGCTCTTCTG AAGAGACATC ATCATCAGAA TGAGAAAAGA TTCACCAGTC 751 GGATTCCTCT TGTTCGATCT TTTGCAGATA TAGGCAAGAA ACATTCTGAC 801 CCTCACTTGC TTGAAAGGAA TGGGGAAGGC CTTTCAGCCT CCCGCCACTC 851 TTTGCGAACA GGTCTGTCTG CCTCAAACCT TTCCTTGAGA GGAGAATCAC 901 CTTTATCTCT TCTTCTCGGT CATCTTCTTC CTTCTTCAAG AGCTGGAACC 951 CCTGCAGGCT CAAGGTGTAC AACCCCAGTA CCCACCCCTC AAAACAGTCC 1001 TCCTTCTAGC CCTAGCATCA GCCGCCTGAC CTCCAGAAGT TCCCAAAAGA 1051 GTCAGCGTCA GGCCCCAGAA CTACTGGTTT CACCTGCCAG TGATGATATT 1101 CCCACAGTAG TAATTCATCC GCCTGAGGAA GACTTAGAAG CAGCGCTGAA 1151 AGGCGAGGAG CAGAAGAATG AGGAAAATGT TGACTTAACT CCAGGTATTT 1201 TGGCCTCTCC CCAGTCTGCT CCTGGAAACT TGGACAATAG TAAAAGTGGA 1251 GAAATTAAAG GTAATGGAAG TGGTGGAAGC AGAGAAAATA GTACTGTTGA 1301 CTTCAGCAAG GTTGATATGA ATTTCATGAG AAAAATTCCT ACGGGTGCTG 1351 AGGCATCCAA CGTCCTGGTG GGCGAAGTAG ACTTTTTGGA AAGGCCAATA 1401 ATTGCATTTG TGAGACTGGC TCCTGCTGTC CTCCTTACAG GGTTGACTGA 1451 GGTCCCTGTT CCAACCAGGT TTTTGTTTTT GTTATTGGGT CCAGCGGGCA 1501 AGGCACCACA GTACCATGAA ATTGGACGAT CAATAGCCAC TCTCATGACA 1551 GATGAGATTT TCCATGATGT AGCTTATAAA GCAAAAGACA GAAATGACCT 1601 CTTATCTGGA ATTGATGAAT TTTTAGATCA AGTAACTGTC CTACCTCCAG 1651 GAGAGTGGGA TCCTTCTATA CGCATAGAAC CACCAAAAAG TGTCCCTTCT 1701 CAGGAAAAGA GAAAGATTCC TGTGTTTCAC AATGGATCTA CCCCCACACT 1751 GGGTGAGACT CCTAAAGAGG CCGCTCATCA TGCTGGGCCT GAGCTACAGA 1801 GGACTGGACG GCTTTTTGGT GGTTTGATAC TTGACATCAA AAGGAAAGCA 1851 CCTTTTTTCT TGAGTGACTT CAAGGATGCA TTAAGCCTGC AGTGCCTGGC 1901 CTCGATTCTT TTCCTATACT GTGCCTGTAT GTCTCCTGTA ATCACTTTTG 1951 GAGGGCTGCT TGGAGAAGCT ACAGAAGGCA GAATAAGTGC AATAGAGTCT 2001 CTTTTTGGAG CATCATTAAC TGGGATTGCC TATTCATTGT TTGCTGGGCA 2051 ACCTCTAACA ATATTGGGGA GCACAGGTCC AGTTCTAGTG TTTGAAAAAA 2101 TTTTATATAA ATTCTGCAGA GATTATCAAC TTTCTTATCT GTCTTTAAGA 2151 ACCAGTATTG GTCTGTGGAC TTCTTTTTTG TGCATTGTTT TGGTTGCAAC 2201 AGATGCAAGC AGCCTTGTGT GTTATATTAC TCGATTTACA GAAGAGGCTT 2251 TTGCAGCCCT TATTTGCATC ATATTCATCT ACGAGGCTTT GGAGAAGCTC 2301 TTTGATTTAG GAGAAACATA TGCATTTAAT ATGCACAACA ACTTAGATAA 2351 ACTGACCAGC TACTCATGTG TATGTACTGA ACCTCCAAAT CCCAGCAATG 2401 AAACTCTAGC ACAATGGAAG AAAGATAATA TAACAGCACA CAATATTTCC 2451 TGGAGAAATC TTACTGTTTC TGAATGTAAA AAACTTCGTG GTGTATTCTT 2501 GGGGTCAGCT TGTGGTCATC ATGGACCTTA TATTCCAGAT GTGCTCTTTT 2551 GGTGTGTCAT CTTGTTTTTC ACAACATTTT TTCTGTCTTC ATTCCTCAAG 2601 CAATTTAAGA CCAAGCGTTA CTTTCCTACC AAGGTGCGAT CGACAATCAG 2651 TGATTTTGCT GTATTTCTCA CAATAGTAAT AATGGTTACA ATTGACTACC 2701 TTGTAGGAGT TCCATCTCCT AAACTTCATG TTCCTGAAAA ATTTGAGCCT 2751 ACTCATCCAG AGAGAGGGTG GATCATAAGC CCACTGGGAG ATAATCCTTG 2801 GTGGACCTTA TTAATAGCTG CTATTCCTGC TTTGCTTTGT ACCATTCTCA 2851 TCTTTATGGA TCAACAAATC ACAGCTGTAA TTATAAACAG AAAGGAACAC 2901 AAATTGAAGA AAGGAGCTGG CTATCACCTT GATTTGCTCA TGGTTGGCGT 2951 TATGTTGGGA GTTTGCTCTG TCATGGGACT TCCATGGTTT GTGGCTGCAA 3001 CAGTGTTGTC AATAAGTCAT GTCAACAGCT TAAAAGTTGA ATCTGAATGT 3051 TCTGCTCCAG GGGAACAACC CAAGTTTTTG GGAATTCGTG AACAGCGGGT 3101 TACAGGGCTA ATGATTTTTA TTCTAATGGG CCTCTCTGTG TTCATGACTT 3151 CAGTCCTAAA GTTTATTCCA ATGCCTGTTC TGTATGGTGT TTTCCTTTAT 3201 ATGGGAGTTT CCTCATTAAA AGGAATCCAG TTATTTGACC GTATAAAATT 3251 ATTTGGAATG CCTGCTAAGC ATCAGCCTGA TTTGATATAC CTCCGTTATG 3301 TGCCGCTCTG GAAGGTCCAT ATTTTCACAG TCATTCAGCT TACTTGTTTG 3351 GTCCTTTTAT GGGTGATAAA AGTTTCAGCT GCTGCAGTGG TTTTTCCCAT 3401 GATGGTTCTT GCATTAGTGT TTGTGCGCAA ACTCATGGAC CTGTGTTTCA 3451 CGAAGAGAGA ACTTAGTTGG CTTGATGATC TTATGCCAGA AAGTAAGAAA 3501 AAGAAAGAAG ATGACAAAAA GAAAAAAGAG AAAGAGGAAG CTGAACGGAT 3551 GCTTCAAGAC GATGATGATA CTGTGCACCT TCCATTTGAA GGGGGAAGTC 3601 TCTTGCAAAT TCCAGTCAAG GCCCTAAAAT ATAGTCCTGA TAAACCTGTG 3651 AGTGTGAAAA TAAGTTTTGA AGATGAACCA AGAAAGAAAT ACGTGGATGC 3701 TGAAACTTCA TTATAGAATT GAACCAAGAG GCATTATACA TATAGATATA 3751 TACATATGTA ATGTGTGCGT ATCATGTCAC TATATATAAG AATATTGTAT 3801 GTCATGCTGT TTATGTGTGA CTACCGGGTT TTTAAAAGTA GTGTCTGGAG 3851 TTTGTAATGA GCACCGTGGA GACTATGTAT TTAATGAAAT GCTCTCTTTG 3901 AAGTGAGGTA CATGGTTCTT AACTATTCAA ATATTTATTC TGTTAGAAAA 3951 AAAAAATTTT CTGTTTTGCA ATAGAAGGAT GTGGAGAAAT GCTTTCAGTC 4001 TACTTTTCTT AAATCTCTGT TCATCAGTGG CAATTCGTAA AAACCTTAAG 4051 TGATACTTTG TTTATATGTT TATAATTTTT AGGTGTTTCC TGAAATTTTC 4101 ACATATTATT TCACTTTTGT TAGTGCTTTA TGGGAAGAAT AGGGAGTCTA 4151 TACCAGTGCT GTGGGAAAAA TGGTAACATT TCAGGGCTTC TCTATTTGTG 4201 TTTCATTTCT GTAGATGTCC ATCGTGTTTC ACTAACTGGC GTTTTCTTAG 4251 CCATAGAGAT GACTGTAGAA CAATAGAAAC TTTAATAATG ATAGTTTTTA 4301 ACTTTTATGT TTAAATTTTT TTTAAATCTT AAAACCTTCA TATCTAGGTA 4351 TCCATTGTGA CAGACAAGTA AAATTGCAGG TGATTTGATA ATTAAGCACC 4401 CATACCATTT ATAACTTCTG AATTTAAAAA GTTATACAAT GCCAGTTTGC 4451 AATAGTTGAT TTTGATGCCT TTGTAGAATA TTTTTTCTGA ATCCTTATGC 4501 TCTTTTAAAG CCAATGATTC CCACTCTGTT CTCTGCCTTG TCTCTCTTTG 4551 TCTTAAAATG CTTTAGTTTC CATCAGGTTC AAGTTCTTGA CTATTATTCT 4601 CTTATAAGTA GTAGGCGTAA ATAATCAGGA GTTAGAATTC TCTCAGAAGG 4651 GTCTATGATC AGTATTACTT TATTAAGAAT TACCTTTCAT TTTCTCTTTA 4701 TGTTATTCTT TCACTTTTGT AGATTACATT TAATAGCTTG TTACCTGTGA 4751 TTTTATTTTA AAATATTTAT TTTGATATGA TGCTTAAATA TTATATAAAC 4801 ATTTGGAAAA GTAACAAATA GAGTAAATTG TTAATGTAAA TAGTTGGTGC 4851 TTACTTTCAT TTATGTTTAT TATCTTAAAG CAATTGATAG ATTTTACATC 4901 TTTGATATAA AGCACTGCCA TATTTATATT TTAAAAGGAA ATTAGACATT 4951 TTATATGTAG CTCAGATTAA TGACATTTTT ATTTTGTGTA TTAGTTTTTG 5001 CTTTTCTGAG CTTTTTAAAG TCTAACAAAT CTTTTTAGTC ATCTTTTATA 5051 TACTTTTAGT TCCACATGAA ATAAATGTTG TTAAGCCTGT AGGACTGGAT 5101 GAATGGGGTT GTGAAACTGA TTTGACAAGA GAATAATTTA CAAAAATCAA 5151 ATGAGTATTG AGAAGCCACA GAAATATCAA AAATGGTGAC ATCACTATTA 5201 GGATGAAAAT TTTATGAAAT TCCAATGCTC CCTTTAACCA TATAATTAAA 5251 TTACAAAGGT ATACATAATT AATCTAATAA AGGATCATGA GGAATGCCAA 5301 AAGCTGAAAT TTTAGCAAGT CTGGTGTTTT AAATTCATTA TAAACTATTT 5351 ATTAACTAAT GTAATGTCCT CCTCTAAGAC CATATTGTAA GAGTCTTGTT 5401 AAAATGATGG ATTCAACTTC TGCCTCGGAT GAGAGGTTGG AAATGTAGCT 5451 GCTTTTCTTT AGAAAATATG TAGCTGTCAT CATTTGGTAC TGCCTAAAAA 5501 GAGTTAGACT CTATTGGCAA TTGATAGAGT ATTACCCACG GTGCTTATTA 5551 TTGTTTATGA GCTTCCATTG TAATGATTCC TTTTTATTTG TAGCAGCATA 5601 ATTATTTCCA AAGACCCCAG TATTTGCTGC TATTTTTAAA ACTCCCCTGA 5651 TGTACCTGAA CAAGAGGATT TCCTCACATC ATTTCCTTGT GTCTGGACAT 5701 CAGGGGTAAC AACTGTACTT ACTTTACAGG AAGAAATTTT AAAACTGAAA 5751 ACTACCTGGG ATCATAGTGT TTCTGTGATT TTATTTAATT GTGTATAGTA 5801 ATTATCCAGT GCCAGAAAAA CCGTCACTTG CAAAATACTT TGCACTCAAA 5851 ATGTTTTTAC AATGCTTCTA AATGTTACTG GTTTCTGCTT TCTTTTGACT 5901 ACTTGACTGA CAAAATGATC TGACTACTCC ATTTAAGAGC AAAGGTAACT 5951 CATGTTTAAG TAATTAACTG CTTGTTTTAA GTGATTATAT TTCTTCCACT 6001 GTTTTTGAAA AATAATCAAA GATAGCATTC ATTGAGAGAC AGTGACAGAT 6051 ATAATTTACT ACATATATTG ATTTTTAAAT AAAGTTGCCT TAAATAAGTG 6101 TATGTAAGCA GTAGTAGTTG CTATGTACTG ATTTACCTCA AGGTGCAAAA 6151 TAATTAAACC TGTACATATT CCATTTACAA AATAAATTCA GCCCTGCACT 6201 TTCTTTAGAT GCCTTGATTT CCAGAATGGA GCTTAGTGCT ACTGAATACC 6251 CTGGCCACAG AGCCACCTCA GGATATTCTT TTCTCCACCC TAGTTTATTT 6301 ATTTATAGAT ATCTGTTTAC AAAGTCTGTA GTAAATCCTG ATGCTGACCA 6351 TCTGAAATGT ACTTTTTTTC TGAATGCTGT TTCAATCTAA AATAGCAGCT 6401 TTTGAGAAAA CAATGATGTA AATTCCTTAT GATAAAAGGA TGATTCTATA 6451 TATTCTTTAA TGATATTAAA TATGCCGAAG CCAAGCACAC AGTCTTTCTA 6501 AAGTGTGTGT ATGTTTGTGT GAATGTGAAT GATACTGATC TTATATCTGT 6551 TAAAAGTTGT TTTAAAAAGC TGTGGCATCC CATTGTTCAT ATTTGCCAAG 6601 TCTTCTGTAA AGATGTCTAG GACGAAATAT TTTATGTGCT AATGCATGTA 6651 TTTGTAAACC AGATTTGTTT ACCACTCAAA ATTAACTTGT TTTCTTCATC 6701 CAAAAAAGTT TATGTCTTCC ACGTACTTAA ATTTTCTGTG TGGGTATAAT 6751 ATAGCTTTCT AATTTTTTTC TTTCACAAAG GCAGGTTCAA AATTCTGTTG 6801 AAAGAAAAAT GCTTTCTGAA ACTGAGGTAT AACACCAGAG CTTGCTGTTT 6851 AAAGGATTAT ATGATGTACA TCAGTTCTAT AAATGTGCTC AGCAGTTTAA 6901 CATGTGAATC CTGTTTTAAA GTGCTCAGAT TTCAACTGTG TAAGCCATTG 6951 ATATAACGCT GTAATTAAAA ATGTTTATAT GAAATAACTT AATGTTTTAA 7001 ATTTATTTAT GTAGATCACA TCATTTTTAT CAGTATGCAG TGCAAATATG 7051 TGAAATGTCT TTTGGTTTAT TCCAACAATT ATTTATTTTA GAAAGTAAGT 7101 TTAAAGACTT TAAGGACATT CAAAGTTTAA AATAGTGTTC AAATTGCAAA 7151 ATTTGGCAAT CTTCATATAA ATTGGTTTCT TTTCTAACTT TTCAAAAACT 7201 AACATTAAAT GTCAATTATA GGAAAACATA GTTGGAAATG TAATCATCCC 7251 AAAGATCATT TTTAAAATGA AATTTAATTA GCACATATTG AACATTTGAC 7301 TTAATTGTTA AACCCCAGTT TTGTTTTGTT TTTTTAATCA GATTTTTGCA 7351 CACTGATTAG TTTTTGTGTT GTGGCTTTTG TTGCTTTATT ATTCAAGGTT 7401 TTTTTTTTTC TTTCCCCATG GGGGAGATTG TCTTCCAATG TTTAACTACG 7451 TTTAAATAAA TAAAAATTGA ATTTTATTGT TCATTTATAT AAAATCTGAT 7501 ACCTTGATGT AATTTCACAA TACAGTTCCA ATTTTTATGG CTTTTATAAT 7551 TACAATGATA TTTTCTTCTA TAATAAAATC CAAAGTAAAC ATTTAAATTG 7601 TAGAACTGAT ATTTTTCATT TATATGAAGT ATAAGCCTCT ACTGGGTCTA 7651 TATTGTGAAT CATCCTGCCT TTCAAATTTG TTTCATAATT GTTAGATGAA 7701 AACTATTTTT TTGGAGATGT TACTGAAGTT GATTGAGCAA TAAAAGTCTA 7751 CTTAATTAAA AAAAAAAAAA AAAAAAAAAA AAAAA // LOCUS HUMCYCLOX 3387 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens cyclooxygenase-2 (Cox-2) mRNA, complete cds. ACCESSION M90100 NID g181253 VERSION M90100.1 GI:181253 KEYWORDS cyclooxygenase-2; prostaglandin synthase. SOURCE Homo sapiens umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3387) AUTHORS Hla,T. and Neilson,K. TITLE Human cyclooxygenase-2 cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (16), 7384-7388 (1992) MEDLINE 92366465 FEATURES Location/Qualifiers source 1. .3387 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /tissue_type="umbilical vein" 5'UTR 1. .97 /gene="Cox-2" gene 1. .3387 /gene="Cox-2" CDS 98. .1912 /gene="Cox-2" /EC_number="1.14.99.1" /codon_start=1 /product="cyclooxygenase-2" /protein_id="AAA58433.1" /db_xref="PID:g181254" /db_xref="GI:181254" /translation="MLARALLLCAVLALSHTANPCCSHPCQNRGVCMSVGFDQYKCDC TRTGFYGENCSTPEFLTRIKLFLKPTPNTVHYILTHFKGFWNVVNNIPFLRNAIMSYV LTSRSHLIDSPPTYNADYGYKSWEAFSNLSYYTRALPPVPDDCPTPLGVKGKKQLPDS NEIVGKLLLRRKFIPDPQGSNMMFAFFAQHFTHQFFKTDHKRGPAFTNGLGHGVDLNH IYGETLARQRKLRLFKDGKMKYQIIDGEMYPPTVKDTQAEMIYPPQVPEHLRFAVGQE VFGLVPGLMMYATIWLREHNRVCDVLKQEHPEWGDEQLFQTSRLILIGETIKIVIEDY VQHLSGYHFKLKFDPELLFNKQFQYQNRIAAEFNTLYHWHPLLPDTFQIHDQKYNYQQ FIYNNSILLEHGITQFVESFTRQIAGRVAGGRNVPPAVQKVSQASIDQSRQMKYQSFN EYRKRFMLKPYESFEELTGEKEMSAELEALYGDIDAVELYPALLVEKPRPDAIFGETM VEVGAPFSLKGLMGNVICSPAYWKPSTFGGEVGFQIINTASIQSLICNNVKGCPFTSF SVPDPELIKTVTINASSSRSGLDDINPTVLLKERSTEL" sig_peptide 98. .148 /gene="Cox-2" mat_peptide 149. .1909 /gene="Cox-2" /EC_number="1.14.99.1" /product="cyclooxygenase-2" 3'UTR 1913. .3387 /gene="Cox-2" polyA_signal 3369. .3374 /gene="Cox-2" BASE COUNT 1010 a 712 c 633 g 1032 t ORIGIN 1 GTCCAGGAAC TCCTCAGCAG CGCCTCCTTC AGCTCCACAG CCAGACGCCC 51 TCAGACAGCA AAGCCTACCC CCGCGCCGCG CCCTGCCCGC CGCTGCGATG 101 CTCGCCCGCG CCCTGCTGCT GTGCGCGGTC CTGGCGCTCA GCCATACAGC 151 AAATCCTTGC TGTTCCCACC CATGTCAAAA CCGAGGTGTA TGTATGAGTG 201 TGGGATTTGA CCAGTATAAG TGCGATTGTA CCCGGACAGG ATTCTATGGA 251 GAAAACTGCT CAACACCGGA ATTTTTGACA AGAATAAAAT TATTTCTGAA 301 ACCCACTCCA AACACAGTGC ACTACATACT TACCCACTTC AAGGGATTTT 351 GGAACGTTGT GAATAACATT CCCTTCCTTC GAAATGCAAT TATGAGTTAT 401 GTGTTGACAT CCAGATCACA TTTGATTGAC AGTCCACCAA CTTACAATGC 451 TGACTATGGC TACAAAAGCT GGGAAGCCTT CTCTAACCTC TCCTATTATA 501 CTAGAGCCCT TCCTCCTGTG CCTGATGATT GCCCGACTCC CTTGGGTGTC 551 AAAGGTAAAA AGCAGCTTCC TGATTCAAAT GAGATTGTGG GAAAATTGCT 601 TCTAAGAAGA AAGTTCATCC CTGATCCCCA GGGCTCAAAC ATGATGTTTG 651 CATTCTTTGC CCAGCACTTC ACGCATCAGT TTTTCAAGAC AGATCATAAG 701 CGAGGGCCAG CTTTCACCAA CGGGCTGGGC CATGGGGTGG ACTTAAATCA 751 TATTTACGGT GAAACTCTGG CTAGACAGCG TAAACTGCGC CTTTTCAAGG 801 ATGGAAAAAT GAAATATCAG ATAATTGATG GAGAGATGTA TCCTCCCACA 851 GTCAAAGATA CTCAGGCAGA GATGATCTAC CCTCCTCAAG TCCCTGAGCA 901 TCTACGGTTT GCTGTGGGGC AGGAGGTCTT TGGTCTGGTG CCTGGTCTGA 951 TGATGTATGC CACAATCTGG CTGAGGGAAC ACAACAGAGT ATGCGATGTG 1001 CTTAAACAGG AGCATCCTGA ATGGGGTGAT GAGCAGTTGT TCCAGACAAG 1051 CAGGCTAATA CTGATAGGAG AGACTATTAA GATTGTGATT GAAGATTATG 1101 TGCAACACTT GAGTGGCTAT CACTTCAAAC TGAAATTTGA CCCAGAACTA 1151 CTTTTCAACA AACAATTCCA GTACCAAAAT CGTATTGCTG CTGAATTTAA 1201 CACCCTCTAT CACTGGCATC CCCTTCTGCC TGACACCTTT CAAATTCATG 1251 ACCAGAAATA CAACTATCAA CAGTTTATCT ACAACAACTC TATATTGCTG 1301 GAACATGGAA TTACCCAGTT TGTTGAATCA TTCACCAGGC AAATTGCTGG 1351 CAGGGTTGCT GGTGGTAGGA ATGTTCCACC CGCAGTACAG AAAGTATCAC 1401 AGGCTTCCAT TGACCAGAGC AGGCAGATGA AATACCAGTC TTTTAATGAG 1451 TACCGCAAAC GCTTTATGCT GAAGCCCTAT GAATCATTTG AAGAACTTAC 1501 AGGAGAAAAG GAAATGTCTG CAGAGTTGGA AGCACTCTAT GGTGACATCG 1551 ATGCTGTGGA GCTGTATCCT GCCCTTCTGG TAGAAAAGCC TCGGCCAGAT 1601 GCCATCTTTG GTGAAACCAT GGTAGAAGTT GGAGCACCAT TCTCCTTGAA 1651 AGGACTTATG GGTAATGTTA TATGTTCTCC TGCCTACTGG AAGCCAAGCA 1701 CTTTTGGTGG AGAAGTGGGT TTTCAAATCA TCAACACTGC CTCAATTCAG 1751 TCTCTCATCT GCAATAACGT GAAGGGCTGT CCCTTTACTT CATTCAGTGT 1801 TCCAGATCCA GAGCTCATTA AAACAGTCAC CATCAATGCA AGTTCTTCCC 1851 GCTCCGGACT AGATGATATC AATCCCACAG TACTACTAAA AGAACGTTCG 1901 ACTGAACTGT AGAAGTCTAA TGATCATATT TATTTATTTA TATGAACCAT 1951 GTCTATTAAT TTAATTATTT AATAATATTT ATATTAAACT CCTTATGTTA 2001 CTTAACATCT TCTGTAACAG AAGTCAGTAC TCCTGTTGCG GAGAAAGGAG 2051 TCATACTTGT GAAGACTTTT ATGTCACTAC TCTAAAGATT TTGCTGTTGC 2101 TGTTAAGTTT GGAAAACAGT TTTTATTCTG TTTTATAAAC CAGAGAGAAA 2151 TGAGTTTTGA CGTCTTTTTA CTTGAATTTC AACTTATATT ATAAGGACGA 2201 AAGTAAAGAT GTTTGAATAC TTAAACACTA TCACAAGATG CCAAAATGCT 2251 GAAAGTTTTT ACACTGTCGA TGTTTCCAAT GCATCTTCCA TGATGCATTA 2301 GAAGTAACTA ATGTTTGAAA TTTTAAAGTA CTTTTGGGTA TTTTTCTGTC 2351 ATCAAACAAA ACAGGTATCA GTGCATTATT AAATGAATAT TTAAATTAGA 2401 CATTACCAGT AATTTCATGT CTACTTTTTA AAATCAGCAA TGAAACAATA 2451 ATTTGAAATT TCTAAATTCA TAGGGTAGAA TCACCTGTAA AAGCTTGTTT 2501 GATTTCTTAA AGTTATTAAA CTTGTACATA TACCAAAAAG AAGCTGTCTT 2551 GGATTTAAAT CTGTAAAATC AGATGAAATT TTACTACAAT TGCTTGTTAA 2601 AATATTTTAT AAGTGATGTT CCTTTTTCAC CAAGAGTATA AACCTTTTTA 2651 GTGTGACTGT TAAAACTTCC TTTTAAATCA AAATGCCAAA TTTATTAAGG 2701 TGGTGGAGCC ACTGCAGTGT TATCTCAAAA TAAGAATATC CTGTTGAGAT 2751 ATTCCAGAAT CTGTTTATAT GGCTGGTAAC ATGTAAAAAC CCCATAACCC 2801 CGCCAAAAGG GGTCCTACCC TTGAACATAA AGCAATAACC AAAGGAGAAA 2851 AGCCCAAATT ATTGGTTCCA AATTTAGGGT TTAAACTTTT TGAAGCAAAC 2901 TTTTTTTTAG CCTTGTGCAC TGCAGACCTG GTACTCAGAT TTTGCTATGA 2951 GGTTAATGAA GTACCAAGCT GTGCTTGAAT AACGATATGT TTTCTCAGAT 3001 TTTCTGTTGT ACAGTTTAAT TTAGCAGTCC ATATCACATT GCAAAAGTAG 3051 CAATGACCTC ATAAAATACC TCTTCAAAAT GCTTAAATTC ATTTCACACA 3101 TTAATTTTAT CTCAGTCTTG AAGCCAATTC AGTAGGTGCA TTGGAATCAA 3151 GCCTGGCTAC CTGCATGCTG TTCCTTTTCT TTTCTTCTTT TAGCCATTTT 3201 GCTAAGAGAC ACAGTCTTCT CAAACACTTC GTTTCTCCTA TTTTGTTTTA 3251 CTAGTTTTAA GATCAGAGTT CACTTTCTTT GGACTCTGCC TATATTTTCT 3301 TACCTGAACT TTTGCAAGTT TTCAGGTAAA CCTCAGCTCA GGACTGCTAT 3351 TTAGCTCCTC TTAAGAAGAT TAAAAAAAAA AAAAAAG // LOCUS HUMENDOSYN 3362 bp mRNA PRI 12-JUN-1993 DEFINITION Human endoperoxide synthase type II mRNA, complete cds. ACCESSION L15326 NID g291987 VERSION L15326.1 GI:291987 KEYWORDS endoperoxide synthase type II. SOURCE Homo sapiens (library: lambda gt11) prostaglandin cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3362) AUTHORS Jones,D.A., Carlton,D.P., McIntyre,T.M., Zimmerman,G.A. and Prescott,S.M. TITLE Molecular cloning of human prostaglandin endoperoxide synthase type II and demonstration of expression in response to cytokines JOURNAL J. Biol. Chem. 268, 9049-9054 (1993) MEDLINE 93232069 FEATURES Location/Qualifiers source 1. .3362 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary" /cell_type="endothelial" /tissue_type="prostaglandin" /tissue_lib="lambda gt11" /map="Chromosome 1" CDS 93. .1907 /codon_start=1 /product="endoperoxide synthase type II" /protein_id="AAA35803.1" /db_xref="PID:g291988" /db_xref="GI:291988" /translation="MLARALLLCAVLALSHTANPCCSHPCQNRGVCMSVGFDQYKCDC TRTGFYGENCSTPEFLTRIKLFLKPTPNTVHYILTHFKGFWNVVNNIPFLRNAIMSYV LTSRSHLIDSPPTYNADYGYKSWEAFSNLSYYTRALPPVPDDCPTPLGVKGKKQLPDS NEIVEKLLLRRKFIPDPQGSNMMFAFFAQHFTHQFFKTDHKRGPAFTNGLGHGVDLNH IYGETLARQRKLRLFKDGKMKYQIIDGEMYPPTVKDTQAEMIYPPQVPEHLRFAVGQE VFGLVPGLMMYATIWLREHNRVCDVLKQEHPEWGDEQLFQTSRLILIGETIKIVIEDY VQHLSGYHFKLKFDPELLFNKQFQYQNRIAAEFNTLYHWHPLLPDTFQIHDQKYNYQQ FIYNNSILLEHGITQFVESFTRQIAGRVAGGRNVPPAVQKVSQASTDQSRQMKYQSFN EYRKRFMLKPYESFEELTGEKEMSAELEALYGDIDAVELYPALLVEKPRPDAIFGETM VEVGAPFSLKGLMGNVICSPAYWKPSTFGGEVGFQIINTASIQSLICNNVKGCPFTSF SVPDPELIKTVTINASSSRSGLDDINPTVLLKERSTEL" BASE COUNT 1000 a 697 c 626 g 1039 t ORIGIN 1 GGAACTCCTC AGCAGCGCCT CCTTCAGCTC CACAGCCAGA CGCCCTCAGA 51 CAGCAAAGCC TACCCCCGCG CCGCGCCCTG CCCGCCGCTG CGATGCTCGC 101 CCGCGCCCTG CTGCTGTGCG CGGTCCTGGC GCTCAGCCAT ACAGCAAATC 151 CTTGCTGTTC CCACCCATGT CAAAACCGAG GTGTATGTAT GAGTGTGGGA 201 TTTGACCAGT ATAAGTGCGA TTGTACCCGG ACAGGATTCT ATGGAGAAAA 251 CTGCTCAACA CCGGAATTTT TGACAAGAAT AAAATTATTT CTGAAACCCA 301 CTCCAAACAC AGTGCACTAC ATACTTACCC ACTTCAAGGG ATTTTGGAAC 351 GTTGTGAATA ACATTCCCTT CCTTCGAAAT GCAATTATGA GTTATGTGTT 401 GACATCCAGA TCACATTTGA TTGACAGTCC ACCAACTTAC AATGCTGACT 451 ATGGCTACAA AAGCTGGGAA GCCTTCTCTA ACCTCTCCTA TTATACTAGA 501 GCCCTTCCTC CTGTGCCTGA TGATTGCCCG ACTCCCTTGG GTGTCAAAGG 551 TAAAAAGCAG CTTCCTGATT CAAATGAGAT TGTGGAAAAA TTGCTTCTAA 601 GAAGAAAGTT CATCCCTGAT CCCCAGGGCT CAAACATGAT GTTTGCATTC 651 TTTGCCCAGC ACTTCACGCA TCAGTTTTTC AAGACAGATC ATAAGCGAGG 701 GCCAGCTTTC ACCAACGGGC TGGGCCATGG GGTGGACTTA AATCATATTT 751 ACGGTGAAAC TCTGGCTAGA CAGCGTAAAC TGCGCCTTTT CAAGGATGGA 801 AAAATGAAAT ATCAGATAAT TGATGGAGAG ATGTATCCTC CCACAGTCAA 851 AGATACTCAG GCAGAGATGA TCTACCCTCC TCAAGTCCCT GAGCATCTAC 901 GGTTTGCTGT GGGGCAGGAG GTCTTTGGTC TGGTGCCTGG TCTGATGATG 951 TATGCCACAA TCTGGCTGCG GGAACACAAC AGAGTATGCG ATGTGCTTAA 1001 ACAGGAGCAT CCTGAATGGG GTGATGAGCA GTTGTTCCAG ACAAGCAGGC 1051 TAATACTGAT AGGAGAGACT ATTAAGATTG TGATTGAAGA TTATGTGCAA 1101 CACTTGAGTG GCTATCACTT CAAACTGAAA TTTGACCCAG AACTACTTTT 1151 CAACAAACAA TTCCAGTACC AAAATCGTAT TGCTGCTGAA TTTAACACCC 1201 TCTATCACTG GCATCCCCTT CTGCCTGACA CCTTTCAAAT TCATGACCAG 1251 AAATACAACT ATCAACAGTT TATCTACAAC AACTCTATAT TGCTGGAACA 1301 TGGAATTACC CAGTTTGTTG AATCATTCAC CAGGCAAATT GCTGGCAGGG 1351 TTGCTGGTGG TAGGAATGTT CCACCCGCAG TACAGAAAGT ATCACAGGCT 1401 TCCACTGACC AGAGCAGGCA GATGAAATAC CAGTCTTTTA ATGAGTACCG 1451 CAAACGCTTT ATGCTGAAGC CCTATGAATC ATTTGAAGAA CTTACAGGAG 1501 AAAAGGAAAT GTCTGCAGAG TTGGAAGCAC TCTATGGTGA CATCGATGCT 1551 GTGGAGCTGT ATCCTGCCCT TCTGGTAGAA AAGCCTCGGC CAGATGCCAT 1601 CTTTGGTGAA ACCATGGTAG AAGTTGGAGC ACCATTCTCC TTGAAAGGAC 1651 TTATGGGTAA TGTTATATGT TCTCCTGCCT ACTGGAAGCC AAGCACTTTT 1701 GGTGGAGAAG TGGGTTTTCA AATCATCAAC ACTGCCTCAA TTCAGTCTCT 1751 CATCTGCAAT AACGTGAAGG GCTGTCCCTT TACTTCATTC AGTGTTCCAG 1801 ATCCAGAGCT CATTAAAACA GTCACCATCA ATGCAAGTTC TTCCCGCTCC 1851 GGACTAGATG ATATCAATCC CACAGTACTA CTAAAAGAAC GTTCGACTGA 1901 ACTGTAGAAG TCTAATGATC ATATTTATTT ATTTATATGA ACCATGTCTA 1951 TTAATTTAAT TATTTAATAA TATTTATATT AAACTCCTTA TGTTACTTAA 2001 CATCTTCTGT AACAGAAGTC AGTACTCCTG TTGCGGAGAA AGGAGTCATA 2051 CTTGTGAAGA CTTTATGTCA CCTACCTCTA AAGATTTTGC TGTTGCTGTT 2101 AAGTTTGGAA AACAGTTTTT ATTCTGTTTT ATAAACCAGA GAGAAATGAG 2151 TTTTGACGTC TTTTTACTTG AATTTCAACT TATATTATAA GAACGAAAGT 2201 AAAGATGTTT GAATACTTAA ACACTGTCAC AAGATGGCAA AATGCTGAAA 2251 GTTTTTACAC TGTCGATGTT TCCAATGCAT CTTCCATGAT GCATTAGAAG 2301 TAACTAATGT TTGAAATTTT AAAGTACTTT TGGTTATTTT TCTGTCATCA 2351 AACAAAAACA GGTATCAGTG CATTATTAAA TGAATATTTA AATTAGACAT 2401 TACCAGTAAT TTCATGTCTA CTTTTTAAAA TCAGCAATGA AACAATAATT 2451 TGAAATTTCT AAATTCATAG GGTAGAATCA CCTGTAAAAG CTTGATTTGA 2501 TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA GCTGTCTTGG 2551 ATTTAAATCT GTAAAATCAG TAGAAATTTT ACTACAATTG CTTGTTAAAA 2601 TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT 2651 GTGACTGTTA AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG 2701 GTGGAGCCAC TGCAGTGTTA TCTTAAAATA AGAATATTTT GTTGAGATAT 2751 TCCAGAATTT GTTTATATGG CTGGTAACAT GTAAAATCTA TATCAGCAAA 2801 AGGGTCTACC TTTAAAATAA GCAATAACAA AGAGGAAAAC CAAATTATTG 2851 TTCAAATTTA GGTTTAAACT TTTGAGGCAA ACTTTTTTTT ATCCTTGTGC 2901 ACTGCAGGCC TGGTACTCAG ATTTTGCCTA TGAGGTTAAT GAAGTACCAA 2951 GCTGTGCTTG AATAACGATA TGTTTTCTCA GATTTTCTGT TGTACAGTTT 3001 AATTTAGCAG TCCATATCAC ATTGCAAAAG TAGCAATGAC CTCATAAAAT 3051 ACCTCTTCAA AATGCTTAAA TTCATTTCAC ACATTAATTT TATCTCAGTC 3101 TTGAAGCCAA TTCAGTAGGT GCATTGGAAT CAAGCCTGGC TACCTGCATG 3151 CTGTTCCTTT TCTTTTCTTC TTTTAGCCAT TTTGCTAAGA GACACAGTCT 3201 TCTCATCACT TCGTTTCTCC TATTTTGTTT TACTAGTTTT AAGATCAGAG 3251 TTCACTTTCT TTGGACTCTG CCTATATTTT CTTACCTGAA CTTTTGCAAG 3301 TTTTCAGGTA AACCTCAGCT CAGGACTGCT ATTTAGCTCC TCTTAAGAAG 3351 ATTAAAAAAA AA // LOCUS AB002350 6170 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0352 gene, complete cds. ACCESSION AB002350 NID g2224644 VERSION AB002350.1 GI:2224644 KEYWORDS KIAA0352. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1642. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6170) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .6170 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1642" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 87. .2225 /gene="KIAA0352" CDS 87. .2225 /gene="KIAA0352" /codon_start=1 /protein_id="BAA20809.1" /db_xref="PID:d1021648" /db_xref="PID:g2224645" /db_xref="GI:2224645" /translation="MGMRIKLQSTNHPNNLLKELNKCRLSETMCDVTIVVGSRSFPAH KAVLACAAGYFQNLFLNTGLDAARTYVVDFITPANFEKVLSFVYTSELFTDLINVGVI YEVAERLGMEDLLQACHSTFPDLESTARAKPLTSTSESHSGTLSCPSAEPAHPLGELR GGGDYLGADRNYVLPSDAGGSYKEEEKNVASDANHSLHLPQPPPPPPKTEDHDTPAPF TSIPSMMTQPLLGTVSTGIQTSTSSCQPYKVQSNGDFSKNSFLTPDNAVDITTGTNSC LSNSEHSKDPGFGQMDELQLEDLGDDDLQFEDPAEDIGTTEEVIELSDDSEDELAFGE NDNRENKAMPCQVCKKVLEPNIQLIRQHARDHVDLLTGNCKVCETHFQDRNSRVTHVL SHIGIFLFSCDMCETKFFTQWQLTLHRRDGIFENNIIVHPNDPLPGKLGLFSGAASPE LKCAACGKVLAKDFHVVRGHILDHLNLKGQACSVCDQRHLNLCSLMWHTLSHLGISVF SCSVCANSFVDWHLLEKHMAVHQSLEDALFHCRLCSQSFKSEAAYRYHVSQHKCNSGL DARPGFGLQHPALQKRKLPAEEFLGEELALQGQPGNSKYSCKVCGKRFAHTSEFNYHR RIHTGEKPYQCKVCHKFFRGRSTIKCHLKTHSGALMYRCTVCGHYSSTLNLMSKHVGV HKGSLPPDFTIEQTFMYIIHSKEADKNPDS" BASE COUNT 1489 a 1540 c 1505 g 1636 t ORIGIN 1 GAAGCGGCGG CGGCGATGGT GCTCGGGGCG CCGCAGAGCC GGATTAACTG 51 TGCTGATAAG GAGGTAATTT CATAGGAGCT GCTAAGATGG GCATGAGGAT 101 CAAACTGCAA AGCACCAACC ACCCCAACAA CCTGCTGAAG GAACTCAACA 151 AGTGCCGGCT CTCAGAGACC ATGTGCGACG TCACCATTGT GGTGGGGAGC 201 CGCTCCTTCC CGGCCCACAA GGCTGTGCTG GCCTGTGCAG CTGGCTACTT 251 CCAGAACCTC TTCCTGAATA CTGGGCTTGA TGCTGCCAGG ACCTATGTGG 301 TGGACTTCAT CACCCCTGCC AACTTTGAGA AGGTTCTGAG CTTTGTCTAC 351 ACTTCAGAAC TCTTCACAGA CCTGATCAAT GTTGGGGTCA TCTACGAGGT 401 AGCTGAGCGT TTGGGTATGG AGGACCTCCT CCAGGCCTGT CACTCTACCT 451 TTCCTGATCT GGAGAGCACT GCCAGGGCCA AGCCCCTGAC CAGCACCAGT 501 GAGAGCCACT CTGGTACCCT GAGTTGTCCT TCGGCAGAAC CTGCCCATCC 551 CCTTGGAGAA CTCCGAGGTG GTGGGGACTA CCTTGGTGCT GATAGAAACT 601 ATGTGTTGCC CAGTGATGCT GGAGGGAGCT ATAAAGAGGA AGAGAAGAAT 651 GTTGCCAGTG ACGCTAACCA TAGCCTGCAT CTGCCGCAAC CGCCCCCACC 701 ACCGCCAAAG ACAGAAGACC ATGACACCCC TGCTCCCTTC ACGTCCATTC 751 CTAGCATGAT GACCCAGCCA CTCCTAGGCA CTGTCAGCAC GGGCATCCAG 801 ACCAGCACGA GCTCCTGCCA GCCATACAAA GTTCAAAGCA ATGGAGACTT 851 CAGTAAAAAC AGCTTCCTCA CCCCTGACAA TGCAGTAGAC ATTACCACTG 901 GGACCAACTC CTGTCTGAGC AATAGTGAGC ACTCCAAAGA TCCTGGCTTT 951 GGGCAGATGG ATGAGCTCCA GCTCGAGGAC CTGGGGGATG ATGACTTGCA 1001 GTTTGAAGAC CCTGCTGAGG ATATAGGCAC AACTGAGGAG GTGATTGAGC 1051 TGAGTGATGA CAGTGAGGAT GAGTTGGCTT TTGGAGAGAA TGACAATCGG 1101 GAGAATAAGG CCATGCCCTG CCAGGTGTGC AAGAAAGTTC TAGAGCCCAA 1151 CATTCAACTG ATCCGGCAGC ATGCTCGGGA CCATGTGGAC CTGCTGACGG 1201 GCAACTGCAA GGTCTGCGAG ACCCACTTCC AGGACCGAAA CTCCCGGGTA 1251 ACTCATGTCC TGTCCCACAT TGGTATTTTC CTTTTCTCCT GCGACATGTG 1301 TGAAACTAAG TTCTTTACCC AGTGGCAGCT GACCCTTCAC CGACGGGATG 1351 GAATATTTGA GAACAACATC ATTGTCCACC CCAACGATCC CCTGCCAGGG 1401 AAGCTGGGTC TCTTTTCAGG GGCAGCCTCC CCAGAGCTGA AATGCGCTGC 1451 CTGTGGGAAA GTATTGGCCA AAGATTTCCA TGTGGTCCGG GGCCACATCC 1501 TTGACCATCT AAACTTGAAG GGCCAGGCCT GCAGTGTCTG CGACCAGCGT 1551 CACCTTAACC TCTGCAGCCT CATGTGGCAC ACGCTGTCCC ATCTCGGCAT 1601 CTCAGTCTTC TCCTGTTCTG TCTGTGCGAA CAGCTTTGTG GACTGGCATC 1651 TTCTAGAGAA GCACATGGCT GTGCACCAAA GTCTGGAAGA CGCCCTCTTC 1701 CACTGCCGCT TGTGCAGCCA GAGCTTCAAG TCAGAGGCTG CCTATCGCTA 1751 CCACGTCAGC CAGCACAAAT GCAACAGTGG CCTTGATGCA CGGCCTGGTT 1801 TTGGGCTGCA GCACCCAGCT CTCCAGAAGC GGAAGCTGCC AGCAGAGGAG 1851 TTTCTGGGTG AAGAGCTGGC GCTGCAGGGC CAACCTGGGA ACAGCAAGTA 1901 TAGCTGCAAG GTCTGTGGCA AAAGATTTGC CCACACAAGC GAATTCAACT 1951 ACCACCGGCG GATCCACACG GGGGAGAAGC CATACCAATG TAAGGTGTGC 2001 CACAAGTTCT TTCGAGGCCG CTCGACCATC AAGTGCCACC TAAAGACACA 2051 CTCGGGGGCC CTCATGTACC GCTGCACAGT CTGTGGGCAC TACAGTTCCA 2101 CCCTTAACCT CATGAGCAAA CATGTTGGTG TGCACAAAGG CAGCCTCCCC 2151 CCTGACTTCA CCATCGAGCA GACCTTCATG TACATCATCC ATTCCAAAGA 2201 GGCGGATAAG AACCCGGACA GTTGACTGGG TCCCGGCAGA GCCACGGGGA 2251 GCTCCCAAGC AGCAGCCAGG ATGCTGATAT CTAAGAGGTG TTGGTCCCTC 2301 CCCAGCTGAA GTTATAATTT TGCCTTGGTA GGAATTCTGT TCTGTGTTGT 2351 GTTTAAAGAA GAAAAGAAGA AGAAATAGCA CATAAGCTGT TACTGTTGTT 2401 GAGAAGCAAC AGCCCTATCA CATTTACCTC CATACCTGTT CTTGCCCATG 2451 CAGGGCTATG TTTTTCATTC TTTTGAGGCT GGTTTTGGGA TCTAGTCAAG 2501 CAGTTGGTGT CCACTAGACC CCCTTCCCCA GCCTCTCTAG TTTTAGTTTA 2551 CTGATAGGTT TTATGCTGCT AAGAATCCAA CCAACAGCCT CACTTAACAG 2601 AGGAGGTAAA GGGAGGTTTT CACTGTGGGT GTTACTGCAG GCCTCCAACT 2651 GGGATGACCA GCAATGAGAA AGATTTTGGG AATGTGATCA TTCAGAAAAG 2701 ACAGGTCAGC AGGGCAGTCC CCTCAGGTTC CAGCCCTCAG CAGGGACAAG 2751 ACATCAGGGA TGTTGGTGTC TGCTTATCTA CAGCCCTAAT CTGCTGATTG 2801 AACAGTGAAA ATCTTTTGGC AGCTAGATCC ATACTAGGCA CAGAGCTTTC 2851 TATTTAGGTC AGAAAGCTTT GGGATGAACC CCTGCCAGCC AGAAGGGGTG 2901 TCCTGCAGTG CCACCAGAAG TGGCAGCCTG GATGGACAGG AGAGGTTTCT 2951 CTTTTCTCCT CATTTCCAAA GAAACAGGAT TTTATGGAGT GATGGCCCGG 3001 GACGTCCGGC CTTCTGTGGG GCAGAGAGGA TAGGGAACCA CTTTGATATA 3051 GTCATCTGTT TTGGCCACTT CTGTTGGCCA TGAGTGTCTT GGTGGAGAGG 3101 TGGGGATGTA TCTGACAGCA GCAGCCTTGC CTTAATTTAT ATCTGGTCTC 3151 CCGTCCAGAA GTGTTTGGCC CGTGGTGTAG ATGAGCTGAC TCCATGAAGT 3201 GGTGGAGTGG CAGACCGCGA GCCCTTCAGG ATTAAAGGGA CCTGAGTAAC 3251 TGGTGGTGTG TAGCAGGGTG TGCGTTCTGA CTGTCTGTCC TAGTCGGGTA 3301 ACCTGTTTAC TTTGTGCTAA CAGTGCCGGA GCTTTGTCAG CTCACCTTTG 3351 ACCTGCTGGA ATTTATCCTG ATCTGTCGTT GCCATCACCT CCAGGGGGCG 3401 CTATTGGATG GCAGCTGGTT CAGGCCCTCC GTGGGTGGCT GCAGAGTGTG 3451 CCGGCACAGC CCACGCAGCT GGTTAGCTCC ACTCTTACTT GGTTTTCTAA 3501 GGGGCTTTGC CCAAAGAAGT CTTGAGGGAT AGGGCCCTCG ATCTTGCATA 3551 CTTGTGAGGT GCCACCTCAG TAGCTCATAC TACCTCACCC TGCTCAGGTG 3601 AGCTCTGGGA GTCCCTGGCC TCAGCCCTGG CACTGCCCCT GGTGGGATTC 3651 AGCAACACCC CGGGCTGTTT CACAAGCAAG TGGTTTTCAT TAACTCACAA 3701 AGCCTTTTTG GACATTAATA TTTATTTATT TTTGTTTTTG TATACACATT 3751 ACCCAGCATC TCTTTTGTAT AAGAGACTTT AGGAAAATGA GTTTCTCCCC 3801 AGCAAGTAGC CATATTCCAG AGAACAGCCT TGACCAGAGA TGTGGAGAAC 3851 CAGGGGTATA ACTAAGGGAA GACATGTCAA GCCCTTAAGC AAATTCTTCT 3901 TCTCCAGATT GTCTCTAGCA TAATAAACCC AGGGAACACT TTAGGCCATG 3951 GGTGTATGTT CTATAAAGTT CGGGACAGTT AAATTCCAGG CCTTTCATCC 4001 CCCTTCCTTC CTGTGATGGA GTAGATTGGG GACAGGGTTG AGGGGAACAA 4051 GTGACATGAA TGACCTATTT GCACAGTTTG GAAGCCTCCT GTCTTTATTT 4101 ATATTGAGAT GTCAGACAAA CCAAAGCTCC ATCCTTGTTG GACCTGCTGC 4151 TTGTCCCCAG CCCTGAACTG ATAAAGCCTC AGAGTTGGAG TGCCTGGCTC 4201 TCTGGTGGGG TGACCATTAG ATGAAGGGAC TTGTACAGTG GCCAGTTTAA 4251 AGGTCCACCT TTGACCATCT AAACCCACCT TGTTCAGTGT CCTCTGAGGA 4301 CATCCTCATC AGGAAAGCAG TGTTGAGACT CTTCATTGCT GACTGGCTTC 4351 TCCCTTTCTT ACTCACACTG ACCATTAGAA TTTAAGAAGG AAATGTGTAA 4401 CAGACTACAG TCAAGTGTCT GCTACATTTT CAAGCATGAG CAATCCCTCC 4451 CAGACTGTTG GTGAGGACTG ATTTTTGAAA TGCTGGTGTG AGAGAGGTGG 4501 TAATTACAGG AACCAGCCGA GTGGCTGAGA GCAGATAAAT GTGCTGGAGA 4551 AACCTTTTTC CTTAACAGAG GGCATCATGG ATGCTGGGTG GTCTGTGTAC 4601 TTAACCTGAA ACTGTGAAGT TTTCCCTTTT TGCCAGTAAA CCAAAAAGCA 4651 GATCCTTGAA ACTTGGCCCT TGAAACACGA AACAGAAAAC TGCATCCCCT 4701 GATCCCCCGG GGGCTGATCA GATTGATCAG GGTGGCTCAG TTGGTCCCAG 4751 TCAGATACGT CATAGGATCA GTAGCTCATG AGATTTGTTG ACCAAACCCT 4801 CTCCCTGTTG GTGACCCCTG TATTCACACC TGAACTTCCC GTTTCCCCCC 4851 ACCCCACCAA AGTCATGTCT GCTTCCTCTG CCTCCAGCTC ACCCTCTTCT 4901 GAATGGTTTT GCTTGAAACA CTATATTGTG GCAGAGGGCA CCCTGGGATA 4951 CCTGGTAGAA GGTATTCATT TTATTTTGCA TTTTTAATTG TTTTGACTTT 5001 CTGTTCATTT GATTTTGTTG CTCCCTCTCT CCTGGAACCT AGTTTACTAC 5051 TCTCTTCTGT GTTACTCTGA AGTTCTGTTT AAGCCTTGAA CATCCTCTTC 5101 TCCCATTTTC TTGGTATGTA CTCAGGACCA GCTACATCAT TTGTGGAGCC 5151 CTCTTATTCA TAAATTATTA AAAATTTCAA GGTGGTGGTC ATAGAGCATT 5201 AAACCAAATA TGAGGCCATT CCCAACTTGT TTTCCGAGGG GAAAATGGTA 5251 ATACTTGTGT GGCACCCGGG GTTAAACAGC AGAGGCTCCA TGTGGCCAGA 5301 GGCAGAGATT AGTATCCTGG CACTCCAGTG ACCCACTGGG TGACTCACTG 5351 ATGCCACAGC ACCCGCTAGG AAGCTCTGCT GAACCTTAGT ATTTGGTCCT 5401 AAATTTTATG ACTCCATGGA GTTCCCGTAG TCCATGGCTA GTTAGGAAGA 5451 AAGGAGGTGG GATAAGGGTC AGGCCCAGGT GACCCCTAAG AACCAGGAGA 5501 TGGGTAAAAG TTTTTTTTTA TATTCTGCTT TTCTGATCTG TGAGTACCTG 5551 TTTGTCTCCA GGCCAAACCT TTGGGCTTAA ATATCTTTTT CCTAGACAGG 5601 TTTTTGCTAG TGTTGAATTT TCTTCTTCCT CTGGCCTCCT TCTGTGCCCC 5651 TTTCCCCAAG CCCAAGACTG CTTAACTTCC AAAGCAAATT CTAGATAGAC 5701 ACTGTATTTA TTGGTATGGG AGTGGGCTCT ATGGGGTGGT CTGCACCCAT 5751 CTGGGACTCT TTTCCCTAAA TCCTGCACCA AATGAGTCAG GAGGCAGGGT 5801 GCACAGCATT AGTTTCAATG TGGTTATGCA TCATAAGCTT AACATCAGAA 5851 TGAAAATGAA ACTCGATTTT GATGTTTCTT TAAAACCCTT CCCCTGTCCA 5901 ATCCACTCGC CGCCCCCACC TTGAATAGCT AAAGTCTCTT ATGAAACAGA 5951 GAAGAGTTGT TGACGTCTAA CTCCTTCCAT TAAATTAATA AGTACTGACC 6001 TCCTAATATT TAAGTGTTTA CTATCTATTG CTGTAAAGTT TTGTATATTT 6051 TGTAAACTTT TTTCCCCAAA TAGTAGATGT CTAAAATCAT TGTACATCTG 6101 ATTCTTTTAT ATTCCATTGT TCAGCACAAA GTGTGGTTTT TATTTAGAAT 6151 AAAAAAAGAA ATTTGAAATG // LOCUS AF029729 3634 bp mRNA PRI 05-JAN-1999 DEFINITION Homo sapiens neuralized mRNA, complete cds. ACCESSION AF029729 NID g4103927 VERSION AF029729.1 GI:4103927 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3634) AUTHORS Prinos,P., Kilpatrick,M.W. and Tsipouras,P. TITLE Direct Submission JOURNAL Submitted (10-OCT-1997) Pediatrics, UCONN Health Center, 263 Farmington Ave., Farmington, CT 06030, USA FEATURES Location/Qualifiers source 1. .3634 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q24-q25" CDS 62. .1786 /codon_start=1 /product="neuralized" /protein_id="AAD01887.1" /db_xref="PID:g4103928" /db_xref="GI:4103928" /translation="MGNNFSSIPSLPRGNPSRAPRGHPQNLKDSIGGPFPVTSHRCHH KQKHCPAVLPSGGLPATPLLFHPHTKGSQILMDLSHKAVKRQASFCNAITFSNRPVLI YEQVRLKITKKQCCWSGALRLGFTSKDPSRIHPDSLPKYACPDLVSQSGFWAKALPEE FANEGNIIAFWVDKKGRVFHRINDSAVMLFFSGVRTADPLWALVDVYGLTRGVQLLDS ELVLPDCLRPRSFTALRRPSLRREADDARLSVSLCDLNVPGADGDEAAPAAGCPIPQN SLNSQHSRALPAQLDGDLRFHALRAGAHVRILDEQTVARVEHGRDERALVFTSRPVRV AETIFVKVTRSGGARPGALSFGVTTCDPGTLRPADLPFSPEALVDRKEFWAVCRVPGP LHSGDILGLVVNADGELHLSHNGAAAGMQLCVDASQPLWMLFGLHGTITQIRILGSTI LAERGIPSLPCSPASTPTSPSALGSRLSDPLLSTCSSGPLGSSAGGTAPNSPVSLPES PVTPGLGQWSDECTICYEHAVDTVIYTCGHMCLCYACGLRLKKALHACCPICRRPIKD IIKTYRSS" BASE COUNT 614 a 1192 c 1076 g 752 t ORIGIN 1 GGCAGCTCCT GCCCGGCCTC GCCCCCACCC GCGAGCGCCG AACCTCCTGG 51 GGCCGGATGC CATGGGTAAC AACTTCTCCA GTATCCCCTC GCTGCCCCGA 101 GGAAACCCGA GCCGCGCGCC GCGGGGTCAC CCCCAGAACC TCAAAGACTC 151 TATCGGGGGC CCCTTCCCCG TCACTTCTCA CCGATGCCAC CACAAGCAGA 201 AGCACTGTCC GGCAGTGCTG CCCAGCGGGG GGCTCCCAGC CACGCCGCTG 251 CTCTTCCACC CGCACACCAA GGGCTCCCAG ATCCTCATGG ACCTCAGCCA 301 CAAGGCTGTC AAGAGGCAGG CCAGCTTCTG CAACGCCATC ACCTTCAGCA 351 ACCGCCCGGT CCTCATCTAC GAGCAAGTCA GGCTGAAGAT CACCAAGAAG 401 CAGTGCTGCT GGAGCGGGGC CCTGCGGCTG GGCTTCACCA GCAAGGACCC 451 GTCCCGCATC CACCCTGACT CGCTGCCCAA GTACGCCTGC CCCGACCTGG 501 TGTCCCAGAG TGGCTTCTGG GCCAAGGCGC TGCCTGAGGA GTTTGCCAAT 551 GAGGGCAACA TCATCGCATT CTGGGTGGAC AAGAAGGGCC GTGTCTTCCA 601 CCGCATCAAC GACTCGGCTG TTATGCTGTT CTTCAGCGGG GTCCGCACGG 651 CCGACCCGCT CTGGGCCCTG GTGGACGTCT ACGGCCTCAC GCGGGGCGTC 701 CAGCTGCTTG ATAGCGAGCT GGTGCTCCCG GACTGTCTGC GGCCGCGCTC 751 CTTCACCGCC CTGCGGCGGC CGTCGCTGCG GCGCGAGGCG GACGACGCGC 801 GCCTCTCGGT GAGCCTATGC GACCTCAACG TGCCGGGCGC GGACGGCGAC 851 GAGGCCGCGC CGGCCGCCGG CTGCCCCATC CCGCAGAACT CACTCAACTC 901 GCAGCACAGC CGCGCGCTGC CGGCGCAGCT CGACGGCGAC CTGCGTTTCC 951 ACGCCCTGCG CGCCGGCGCG CACGTCCGCA TCCTCGACGA GCAGACGGTG 1001 GCGCGCGTGG AGCACGGGCG CGACGAGCGC GCGCTCGTCT TCACCAGCCG 1051 GCCCGTGCGC GTGGCCGAGA CCATCTTCGT CAAGGTCACG CGCTCGGGTG 1101 GCGCGCGGCC CGGCGCGCTG TCGTTCGGCG TCACCACGTG CGACCCCGGC 1151 ACGCTGCGGC CGGCCGACCT GCCCTTCAGC CCTGAGGCCC TGGTGGACCG 1201 CAAGGAATTC TGGGCCGTGT GCCGCGTGCC CGGGCCCCTG CACAGCGGCG 1251 ACATCCTGGG CCTGGTGGTC AACGCCGACG GCGAGCTGCA CCTCAGCCAC 1301 AATGGCGCGG CCGCCGGCAT GCAGCTGTGC GTGGACGCCT CGCAGCCGCT 1351 TTGGATGCTC TTCGGCCTGC ACGGGACCAT CACGCAGATC CGCATCCTCG 1401 GCTCCACTAT CCTGGCCGAG CGGGGTATCC CATCACTCCC CTGCTCCCCT 1451 GCCTCCACGC CAACCTCGCC CAGTGCCCTG GGCAGCCGCC TGTCTGACCC 1501 CTTGCTCAGC ACGTGCAGCT CTGGCCCTCT GGGTAGCTCT GCTGGTGGGA 1551 CAGCCCCCAA TTCGCCAGTG AGCCTGCCCG AGTCGCCAGT GACCCCAGGT 1601 CTGGGCCAGT GGAGCGATGA GTGCACCATT TGCTATGAAC ACGCGGTGGA 1651 CACGGTCATC TACACATGTG GCCACATGTG CCTCTGCTAC GCCTGTGGCC 1701 TGCGCCTCAA GAAGGCTCTG CACGCCTGCT GCCCCATCTG CCGCCGCCCC 1751 ATCAAGGACA TCATCAAGAC CTACCGCAGC TCCTAGCCCG TTGCGGTGGC 1801 CCATCCCGCA TACCCATCTT CTCGGGCTTC AGCCCAGTCC CAGCTGAGGA 1851 ACAAGCCAGT GGGGCCCCTT CTCTTCCTCA TTTTGGAAAC TTTTCCTCCT 1901 CTATTAAACA TGGGAAACTG AAGCCCTTGA AGGTTTGGGG AGAGGGGGGT 1951 ATCAGGCAGG GAGGGGGCAG AGGCAAATCA CCGGGCAGAG GGAGGGGAGG 2001 AGAGGAGGCC GCACTCTCCC TGTCTCTCCC GTCTCTGCAC CCAGCTCCTC 2051 TCTGCATGCT GAGGGCTAAA TTGGGATCTC AGCCTGCCCT AATCTTTCCC 2101 CATCTGAGGC AGGTTTCTAG GAGGTGTCTG TAGTCCATGT GGCACCTTTG 2151 TGAGAATTAG AAAACATGTA CCTTCCTCTG GGCAGCTGCA GCCCTGAGCC 2201 AGGACATGTG GCCTGGCTAG TGCAGCATGG AGTGGATTTT AGGCTCATTT 2251 GCCCTCTCAG CCAACTGCCC TTGAGCAGTG TACAGCCTGC TCAACAACCG 2301 CCCTGGGCAG GCCGCTCCTG AGCTGCTGTC TCCCTCTCTG ACCTATGCCT 2351 ACCTTCTCTT TCTTCGTTCC CTGCCCACTG GGCATGGCAG CTGGGGGTGA 2401 GAGGCTGGTG GTTGCCTCCC GTCCTGGGCA GGCTGCAGCC TCATGCCATG 2451 TCTCTCTCCC ACTACCTGAT GGGCACATGG ACAGGCTGCA GAGGGCTCCA 2501 GGTTCTGGGC CCCTGGCTGA GAAGGGGAGG ATCCTGTCCT GGCTGTAGCC 2551 TTCCTGCCCC AATCATCTCG CTGGATGCCA TTGCCCAGGG GTAGCCCTTC 2601 GGTATGGCTG GGGAGAAGGG TTGGTCTTCT GCCAGGGTCC CTAAGAACAG 2651 GTTTGCAGCT ACCCTCTCTG ACCTTTTCCC CATATCTTGT GCTGTCCAGG 2701 GCCTAGGCTG ATAGTCAGAG CCTGTGGCAC ACCTGGGATG GGCCAGGGCC 2751 CTGGGTGGGG AGGTGTGGGT TCCCTGGGCC AAGACCAGCC TTTTTCCTCA 2801 GTTTAATTAA TTTATTTATT TGGTTTGTGG TTTTGGGTTT TTTTTGTGTG 2851 TGTGTGTGTA GGTGAGTTCC CATCCTCTGG GCCCCTTAGA AATGCTGCCT 2901 TTGTGTGTGG GGTACCTGGG AAAGGGTGTG GCTGAGGCAG TCATAATGCA 2951 GTATCTTCCC CTGACTCCAG CCACAAAGCT CATACCCAGC CCTACCAATC 3001 ATGAATTGGA ACTCCATAAG GAGGGGCCAA ATTGGGAGAC CTTTTGGCAA 3051 ACAGTGTCTA TTTTACAGAG AGGCAAACCA AGCCTCCTTA GTGCCCTGGG 3101 GCTGATGGTC ACCCAGAGCC ATAGAGCACC CTCTCCTGGG CCAGAGCTGG 3151 GTTATCATGG TTGGCTTAGG TACCTCAGGT GGCCCAGGAG GTCCCCCCTC 3201 CATAAGGGCG TCAAGCTTCC AGGAGGGCCA GTGTTCCTAG TGTGGGCCCA 3251 GCGCACAGAC AGGGAGACCC TGTGATGGCG GGGCAGGGAG CCCGTTCGTA 3301 GGGAAGATGA GGAGGCAAGG GCTGCCTCTC ACACCTCCAG GTTTTTTAAG 3351 TTCTGGGGAG GGAGCGCCCC ACCTGCTGAG GTCGGTCCTT TTAGCCCTGC 3401 CTGCCCTGGC CTGGGCTTTC CCAGCCTCCC AGCCCTTTGC CCCCTTAGAA 3451 GGGTTTTTGT TTTTGTTTTT TTTTGCATCC ATCAGAGAAT GCACCTTTGT 3501 GTGGCAGGCA GGGCATGGGT TTTAGTCCTG GCCAATGACC AGCTGTGTGG 3551 CCCTGGCCGA GTTGTAGCCC CCTGGAGCCC AGGTTCCATT TTTATAAAAT 3601 GGTTGTTTTG GGAGAGAAAA AAAAAAAAAA AAAA // LOCUS HUMGRO 1050 bp mRNA PRI 11-JUN-1993 DEFINITION Human gro (growth regulated) gene. ACCESSION J03561 NID g183622 VERSION J03561.1 GI:183622 KEYWORDS gro gene; tumor cell. SOURCE Human bladder tumor cell (T24) cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1050) AUTHORS Anisowicz,A., Bardwell,L. and Sager,R. TITLE Constitutive overexpression of a growth-regulated gene in transformed Chinese hamster and human cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7188-7192 (1987) MEDLINE 88041072 COMMENT Draft entry and computer-readable sequence kindly submitted by R.Sager (20-NOV-1987). FEATURES Location/Qualifiers source 1. .1050 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 54. .140 /note="signal peptide (put.); putative" CDS 54. .377 /note="gro protein" /codon_start=1 /protein_id="AAA35933.1" /db_xref="PID:g306806" /db_xref="GI:306806" /translation="MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQ CLQTLQGIHPKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLN SDKSN" mat_peptide 141. .374 /note="gro mature protein (put.); putative" BASE COUNT 270 a 246 c 239 g 295 t ORIGIN 1 CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC 51 CCCATGGCCC GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT 101 GCGAGTGGCA CTGCTGCTCC TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG 151 CAGGAGCGTC CGTGGCCACT GAACTGCGCT GCCAGTGCTT GCAGACCCTG 201 CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAACGTGA AGTCCCCCGG 251 ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 301 AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG 351 ATGCTGAACA GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC 401 TGGTGGCTGT TCCTGAAGGA GGCCCTGCCC TTATAGGAAC AGAAGAGGAA 451 AGAGAGACAC AGCTGCAGAG GCCACCTGGA TTGTGCCTAA TGTGTTTGAG 501 CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT TAGTTTTGAA 551 GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 601 CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC 651 AAGTTAGTTC AATCTGGATT CATATTTAAT TTGAAGGTAG AATGTTTTCA 701 AATGTTCTCC AGTCATTATG TTAATATTTC TGAGGAGCCT GCAACATGCC 751 AGCCACTGTG ATAGAGGCTG GCGGATCCAA GCAAATGGCC AATGAGATCA 801 TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA ACTGTTTAGA 851 TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 901 TTTCTCATGT TGAAACTTTA AGAACTAAAA TGTTCTAAAT ATCCCTTGGA 951 CATTTTATGT CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT 1001 ACAGTGTTTC TGGCTTAGAA CAAAGGGGCT TAATTATTGA TGTTTTCGGA // LOCUS D86967 6072 bp mRNA PRI 07-FEB-1999 DEFINITION Human mRNA for KIAA0212 gene, complete cds. ACCESSION D86967 NID g1504007 VERSION D86967.1 GI:1504007 KEYWORDS KIAA0212. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2602. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6072) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6072) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1. .6072 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 3" /clone="HA2602" /sex="male" /tissue_type="bone marrow" 5'UTR 1. .58 gene 59. .2032 /gene="KIAA0212" CDS 59. .2032 /gene="KIAA0212" /note="Containing ATP/GTP-binding site motif A(P-loop): Similar to C.elegans protein(P1:CEC47E128);Similar to Mouse alpha-mannosidase(P1:B54407)" /citation=[2] /codon_start=1 /protein_id="BAA13203.1" /db_xref="PID:d1013892" /db_xref="PID:g1504008" /db_xref="GI:1504008" /translation="MQWRALVLGLVLLRLGLHGVLWLVFGLGPSMGFYQRFPLSFGFQ RLRSPDGPASPTSGPVGRPGGVSGPSWLQPPGTGAAQSPRKAPRRPGPGMCGPANWGY VLGGRGRGPDEYEKRYSGAFPPQLRAQMRDLARGMFVFGYDNYMAHAFPQDELNPIHC RGRGPDRGDPSNLNINDVLGNYSLTLVDALDTLAIMGNSSEFQKAVKLVINTVSFDKD STVQVFEATIRVLGSLLSAHRIITDSKQPFGDMTIKDYDNELLYMAHDLAVRLLPAFE NTKTGIPYPRVNLKTGVPPDTNNETCTAGAGSLLVEFGILSRLLGDSTFEWVARRAVK ALWNLRSNDTGLLGNVVNIQTGHWVGKQSGLGAGLDSFYEYLLKSYILFGEKEDLEMF NAAYQSIQNYLRRGREACNEGEGDPPLYVNVNMFSGQLMNTWIDSLQAFFPGLQVLIG DVEDAICLHAFYYAIWKRYGALPERYNWQLQAPDVLFYPLRPELVESTYLLYQATKNP FYLHVGMDILQSLEKYTKVKCGYATLHHVIDKSTEDRMESFFLSETCKYLYLLFDEDN PVHKSGTRYMFTTEGHIVSVDEHLRELPWKEFFSEEGGQDQGGKSVHRPKPHELKVIN SSSNCNRVPDERRYSLPLKSIYMRQIDQMVGLI" 3'UTR 2033. .6072 BASE COUNT 1518 a 1317 c 1410 g 1827 t ORIGIN 1 GGTGGTCGGC GGGGAGGCCC CCGCGCTTTA AAATAATGCC CGCGGCGCCC 51 GCGCGACCAT GCAATGGCGA GCGCTCGTCC TGGGGCTGGT GCTCCTCCGG 101 CTTGGCCTCC ATGGAGTATT GTGGCTCGTC TTCGGGCTGG GGCCCAGCAT 151 GGGCTTCTAC CAGCGCTTTC CGCTCAGCTT CGGCTTCCAG CGTCTGAGGA 201 GCCCCGACGG CCCCGCGTCG CCCACCTCGG GGCCCGTGGG CCGGCCTGGG 251 GGGGTATCCG GGCCGTCGTG GCTGCAGCCG CCGGGGACCG GGGCAGCGCA 301 GAGCCCGCGC AAGGCTCCGC GGCGTCCTGG GCCGGGGATG TGCGGCCCAG 351 CCAACTGGGG CTACGTGCTG GGCGGCCGGG GCCGCGGCCC GGACGAGTAC 401 GAGAAGCGCT ACAGCGGCGC CTTCCCTCCG CAGCTGCGTG CCCAGATGCG 451 CGACCTGGCA CGGGGCATGT TCGTCTTTGG CTACGACAAC TACATGGCTC 501 ACGCCTTCCC CCAGGACGAG CTCAACCCCA TCCACTGCCG CGGCCGTGGG 551 CCCGACCGCG GGGACCCTTC AAATCTGAAC ATCAATGATG TACTAGGGAA 601 CTACTCATTG ACTCTTGTTG ATGCATTGGA TACACTTGCA ATAATGGGAA 651 ATTCATCCGA GTTCCAGAAA GCAGTCAAGT TAGTGATCAA CACAGTTTCA 701 TTTGACAAAG ATTCCACCGT CCAAGTCTTT GAGGCCACGA TAAGGGTCCT 751 GGGAAGCCTC CTTTCTGCTC ACAGAATAAT AACTGACTCC AAGCAGCCCT 801 TTGGTGACAT GACAATTAAG GACTATGATA ATGAGTTGTT ATACATGGCC 851 CATGACCTGG CGGTGCGGCT CCTCCCTGCT TTTGAAAACA CCAAGACAGG 901 GATTCCATAT CCTCGGGTGA ATCTAAAGAC AGGAGTTCCT CCTGACACCA 951 ATAATGAGAC ATGCACAGCG GGAGCCGGTT CCCTCCTGGT GGAATTTGGG 1001 ATTCTGAGTC GACTCCTGGG GGACTCCACA TTTGAGTGGG TGGCCAGACG 1051 AGCAGTGAAA GCCCTTTGGA ACCTCCGGAG CAATGATACA GGATTACTAG 1101 GCAATGTCGT GAACATTCAG ACGGGCCACT GGGTTGGAAA GCAGAGTGGC 1151 CTGGGTGCCG GGCTGGACTC CTTCTATGAA TACCTCTTGA AATCTTACAT 1201 TCTCTTTGGA GAAAAAGAAG ACCTAGAAAT GTTTAATGCT GCATATCAGA 1251 GTATTCAGAA CTACTTAAGA AGAGGGCGGG AAGCCTGCAA TGAAGGAGAA 1301 GGAGACCCTC CACTCTATGT CAACGTGAAC ATGTTCAGTG GGCAGCTGAT 1351 GAACACCTGG ATTGACTCTC TGCAGGCCTT TTTCCCTGGA CTGCAGGTGC 1401 TGATAGGAGA TGTGGAAGAT GCCATCTGCC TTCATGCCTT CTACTATGCC 1451 ATATGGAAAC GATATGGTGC CCTCCCTGAG AGATATAACT GGCAGCTGCA 1501 GGCCCCTGAC GTTCTCTTCT ACCCACTGAG ACCAGAGTTA GTGGAATCCA 1551 CATATCTCCT CTACCAGGCA ACCAAGAATC CCTTCTACCT CCATGTAGGA 1601 ATGGATATTC TGCAGAGTCT GGAAAAGTAC ACAAAAGTCA AGTGTGGGTA 1651 CGCCACGCTG CATCACGTCA TTGACAAGTC CACAGAAGAC CGGATGGAGA 1701 GCTTCTTTCT CAGTGAGACC TGTAAATATT TGTATCTGCT GTTTGATGAA 1751 GACAATCCAG TACACAAGTC TGGAACCAGA TACATGTTCA CAACAGAGGG 1801 ACACATTGTA TCTGTGGATG AGCATCTTCG GGAATTGCCA TGGAAGGAAT 1851 TCTTCTCTGA AGAGGGAGGG CAGGACCAAG GGGGAAAGTC TGTGCACAGG 1901 CCGAAACCTC ATGAGTTAAA AGTCATCAAC TCCAGCTCCA ACTGCAATCG 1951 TGTACCTGAT GAGAGGAGGT ACTCCCTGCC CTTAAAGAGC ATCTACATGC 2001 GACAGATTGA CCAGATGGTT GGTTTGATTT GATCTGCTCT CTGTGAGGCC 2051 TCATCTTGAA CCAGACCTTA ACGACCAAAC CCAGACCATG CCAAAGTCCA 2101 GTCTGAAATG AAAGGGGACA GAAGTCTTGC TGTCCATGGT GGTGTAGGAA 2151 TTTCTGTGCA ACACCTCACC ACGTCTGGTT AATCCTTGCA CACTTCAGTG 2201 TTTCTCTCCT GTTCAATAAA ATGCCCTGTT AAGGATATAA TTTGAAGTGA 2251 GAAGATACAT GGAAATTGCC CTCTTATGAC ATGTTGATGT TATAAGCACA 2301 ATAGATGGGG CATCTTTGGA TTGATGTTCA CAGCTTTATA CTTCAGAACC 2351 TAAGTCTCTT CACTTTGCTG GCACCTGCTA TACTGGAGTA TTGCTATGTC 2401 TTTAAAAAAT TTTTTTTTAT TATATTTTAT TTTTTTGAGA CAGGGTCTTG 2451 ATATTTTTTT GGGACAGGGT TACCTGGGCT CAAGTGATCC TTCTGCCTCA 2501 GCCTCCCGAG TAGCTGGGAT TACAGGTGAG CACCACTGTA CCTGGCTAGC 2551 TACTTCTTTG TTAGAGGATT GAGAATGAAA TTTCTGCAAA AGGGCCCATG 2601 GTTCATTTGG TATCCCTATT TAATTGCATT GAAAATGTCA TCCTTTCTGT 2651 TGTTAGATAA TTGGGGTCTT CCCCTGATAT CCAACCGTGA TTTTGGATCA 2701 CATGGGAGAA AAAGTCATCC AGTTTTTCAT GTTTGCCTCA AGTAATCTTT 2751 ACAGTGTTAC AAATTATTTG CTTAAGAAGA ATGGTCTTAA CCAGAATTCT 2801 TAACAGATAG TCTCTTAGGT TATTATGTTA TGGTCTAAGA GGTTAACTGA 2851 CATCTTTTGG ATGGTATTTT GCATTTTGAA TATGAACTTA CCTGAGGAAC 2901 TCCCATAGTT CCAGAATCAG GTGCCTTTTA GGGAGAGAAC AATACCTAAG 2951 ATTGTCTGAG CTTCCATCTT TCTCATATTT CCTAAGCAAG GATTCTCACT 3001 TATGACCATA TTTGGGTTAG AGTTCTGTTT TGTTTCTGTT TTCTGTGTCT 3051 AGTGCCAATT AGCTAAATCA GGGAGAAAGA AATGATCACA TGACTTTTAG 3101 CATCCTTGAG CCATTTCTCT GTGTAATACA GGCTTTAGAT TAGTGCCTTA 3151 TATTGGTTTT GGTTTGGGGC ACTGGATGTC GCAGCTACTG CTATGGTTTC 3201 AGGAGGCCTG TTTAGCCACA TGGTGAGACC GTGGTGAAAG GGGGATGGAA 3251 ATTGCTTGGC CAGTCTTTGC CTTTCATCCT GTAAAAGTAA GCATGTAGAA 3301 GGAGGAAGTT GTGCTAAAAT GCCTTTGTTT TTTTGTTATT ATTTTCTTAG 3351 CCAGAACATC TCTCTTTGAA CTCACACTGA TACACACCTG CTACTCTTAC 3401 ACAGTGCAGC AGGGCTGACT CTTAGTCTGG CTTCCATGAA GCGTCATGGG 3451 TGGAAACGCA TTCTAGTAAA AAAGGTAGGA AATCCCTAAA ACTTCCAGCC 3501 TCACATAGCA CGGTTCTCAC CTGTCACTGT TTTCCCACCT CTAAGGATTT 3551 CATGTACATC TTTTCAAAGC TAGAAATAAG CACTGTCTAA GTTTATGTTG 3601 CATTTTTAGT CAAAAGGGAG AAATCTTATT CCTTCTTGAA AATTTTAAGT 3651 GTTATGGTTT TATATAGTTC AGTTCTTTGA GATTTTTGAA AAGAGTATTT 3701 TCAGTAATAA ACGTGCCATC TCTATCTCTT AAACATTTAT TACAACAATT 3751 GTTTTAAAAT AGAAAAAATA AAATGCTTCT ATTTTACCTT TTTTCATTTC 3801 AGAAGCATTA TTCTGTTTAT TAACAGTGTC CCATCTACTG AATAGAAAAC 3851 TTTGAGAATA ATATATATAT ATATTTTAAA TGTTTTCACT GACTCATTGA 3901 AAATGTTAAT TACACACACA TGCATGCATG CACACACGAG CATACTTGTA 3951 CCTTTGTCTC TGGGCAAACA GGTGGGACTG TTAGTGACCC ATTTGGGAAA 4001 ATAGAGCATC TCAGAGAAGG AGGTGAGTTC TTCCTGCCTG TGATTTCTCT 4051 TGGCGCTCCC CTCCTCTCCC GCTCTGGCTT CTGTGGCGGC AGTGGTGGGT 4101 AAGCACTCCA GTGTTCTCTT AATGAGGCAC TTTGCCTGTC ACTCGAGCAA 4151 GCCTGGGTGT TCCTTCCTCC TCATGCTCCT GGAATAGGGA ATAGGGATCT 4201 CATGCTTGCA AACTACACAA TGCTGCAGGT GCTTCCCAGG GGCCACAGGC 4251 TGTCAGGAAA CGTGTTTTAT GTTAAGTCAC AAACCCACTT GACTTCTGGG 4301 TACTGGAATT AATACCAGTG GGTGAGACTG AGGGTGAGTG AGTTAGTACA 4351 TATTAATCCT GGTTGTTGAG CTTCCAGACT ACCCCGTCCA AAGTTTGATG 4401 CTATGTAGTC AGTGGTTTGT GGGGCTGGAT GCCAGAAGGT TCTTTGAGCC 4451 AGTTTCAAAG GTTACTTGTT TTTTTTTTTT TTTTTTTAAG TCAGAATGTT 4501 AACAGCTGTG ATATATCCTG CAGGGCTTTT GCAGTTTCTT CTGTTCTGTG 4551 TTCTGAAATC CTGGGTAGAG AATGGCTGAG GAGGAGATTA CCAGAGAAGT 4601 TGCTTTGCTC AGTGCTTTGC CCCAGGATTG CCTCAAATCT GAGTGGACTT 4651 CATCCTTTGC GGCGGCTCTG AGCCTGGCCC ATCTTCCTAT TCCCACGTGT 4701 AGCTAGTGTC TAGTGTCAGC TTTGCTCAAT GTGGTGGAAA CATTTTGCAG 4751 AACTGTTGTA GAAAGCTGCC TTATAGTTGG CTTGACAAAG CATAATTCTC 4801 TCATAACAAA CTTTCAAATC ATTACAGTAG CTTAGCTACT TTAGTTGATG 4851 TGACCGAGGA ATCCCTTCTA GAATCATAGG TGGCAAGGGA GGGTTTGCTA 4901 GCTCTCCATT TGCACTGGCC ATTGTGAAAA ACCAGCTTCT GTATTCAAAT 4951 CTTTCCTTCA TTTTTTTAAA TTTTTTTTTT GGCAGCGCTT GTGCTGGAAC 5001 TTACTCATTG TAACTGAATC CTCAGGGCTT TTCTTGTTTT AGATCATGGA 5051 CTGTGCACGT GACACTTAAA TAATTTTCTA TGTATTTAAA GAAAAATGCA 5101 CCAGGATGGT GTCTGTGCAC GTGACTATTA GAGGAGCGTC TGTAGAAGTA 5151 CCTGGTTTGG TCAGTGCAGT TGTGCAATCT GAGGGCCTTG TTTCCTCCTC 5201 CCCTTTCCCC TTCTCCCCAC CAAAGGAAAA TATCCCTCTT AATGATTTCG 5251 TAGTTCAGTT TACTGAATGA TTACCACCTG TAATTCCTCT TTGGATTGTG 5301 TAGACTCAAC ATGAGACATT CCTTTCTGCT TTCTGGAGGG CACCAGGGGC 5351 CTTTCTCTTT GATAAATTTT TTTTGTCTGT TGACAAAAAC AAAAATCTTT 5401 TTTCAAATGT AGTGCTGGTG AAAAGGTAGG GCTGAGTGAT TACCTTAGCC 5451 ACAGGGTGGC TGAGCAGGAA CTTTAGAAGA AAATCCTGAG CTTTCCTGTC 5501 CATTCCCAGC ATCCAGCTCC TATTCTAGTG CCTCTTCCCT GCAGGGCAGG 5551 GACCCCTTGG GAAATCGAGG AGGTGGGACG GGCTGGGCCC TGTGTCCCAG 5601 GTTTCACAGG GCTCAGGGTT ATGCTCCCGC TTGAATCTGG ACGTGAATCT 5651 GGTAAAAATA TCAAGTACCT GTGGAACTCC CTGATTCTAT ACCCTCTTCC 5701 TTCTTTCTGC AAGGCAGAGG AATAATATTT TTAAAGGTTA TTTTGTTTTA 5751 GTTTTAAATA GCAAAACACA AGCTGCATTT TTATTTATTT TGCATAAGAA 5801 AGGTAAATCT TTTTACAAAA AAAAGTATAG AGTTGGAAAC TCTGGGAAAA 5851 CTTACGGAAA TACACAAATG CTTCTCTGTA ATGTGCAATA TGCTTTGCAA 5901 CTGTAGATGA TATTTTATGT TTAATCTGTA AATAAGAAAT GTATTTAAAT 5951 TAAAAGGGAT CTTTTTGTAA AAGGACCAAA TGTTCTTTTA TAAATGTAAT 6001 AAGGAATATC TTGCTCTTTA AAATTTATTA GGATTTTTAT GAGTAATTTT 6051 TATTAAAAGA TTTCTTTTTT TG // LOCUS D30783 4627 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens mRNA for epiregulin, complete cds. ACCESSION D30783 NID g2381480 VERSION D30783.1 GI:2381480 KEYWORDS growth regulator; epiregulin. SOURCE Homo sapiens colorectal adenocarcinoma epithelial cell_line:HCT-15 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4627) AUTHORS Toyoda,H. TITLE Direct Submission JOURNAL Submitted (24-MAY-1994) to the DDBJ/EMBL/GenBank databases. Hitoshi Toyoda, Research Center, Taisho Pharmaceutical Co., Ltd.; No.403 Yoshino-cho, 1-chome, Ohmiya, Saitama 330, Japan (Tel:048-663-1111(ex.3611), Fax:048-652-7254) REFERENCE 2 (sites) AUTHORS Toyoda,H., Komurasaki,T., Uchida,D. and Morimoto,S. TITLE Distribution of mRNA for human epiregulin, a differentially expressed member of the epidermal growth factor family JOURNAL Biochem. J. 326 (Pt 1), 69-75 (1997) MEDLINE 97479200 FEATURES Location/Qualifiers source 1. .4627 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HCT-15" /cell_type="epithelial" /tissue_type="colorectal adenocarcinoma" CDS 167. .676 /note="EGF-related peptide" /codon_start=1 /product="epiregulin" /protein_id="BAA22146.1" /db_xref="PID:d1023005" /db_xref="PID:g2381481" /db_xref="GI:2381481" /translation="MTAGRRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGES SDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVR CEHFFLTVHQPLSKEYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVT SGDPELPQV" mat_peptide 353. .490 /product="epiregulin" misc_feature 521. .574 /note="transmembrane domain" polyA_signal 4580. .4585 /note="early mRNA polyadenylation signal" polyA_signal 4584. .4589 /note="early mRNA polyadenylation signal" BASE COUNT 1426 a 836 c 836 g 1529 t ORIGIN 1 TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA ACGTGGGGTC 51 CCTTCTAGGC TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCCGTCTGCT 101 CCCGCCCTGC CCGTGCACTC TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG 151 CCCGCTCCCA TCGCCGATGA CCGCGGGGAG GAGGATGGAG ATGCTCTGTG 201 CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC TGGGTTTCCA TCTTCTACAG 251 GCAGTCCTCA GTACAACTGT GATTCCATCA TGTATCCCAG GAGAGTCCAG 301 TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC 351 AAGTGTCAAT AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT 401 GGACAGTGCA TCTATCTGGT GGACATGAGT CAAAACTACT GCAGGTGTGA 451 AGTGGGTTAT ACTGGTGTCC GATGTGAACA CTTCTTTTTA ACCGTCCACC 501 AACCTTTAAG CAAAGAGTAT GTGGCTTTGA CCGTGATTCT TATTATTTTG 551 TTTCTTATCA CAGTCGTCGG TTCCACATAT TATTTCTGCA GATGGTACAG 601 AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG 651 GGGATCCAGA GTTGCCGCAA GTCTGAATGG CGCCATCAAA CTTATGGGCA 701 GGGATAACAG TGTGCCTGGT TAATATTAAT ATTCCATTTT ATTAATAATA 751 TTTATGTTGG GTCAAGTGTT AGGTCAATAA CACTGTATTT TAATGTACTT 801 GAAAAATGTT TTTATTTTTG TTTTATTTTT GACAGACTAT TTGCTAATGT 851 ATAATGTGCA GAAAATATTT AATATCAAAA GAAAATTGAT ATTTTTATAC 901 AAGTAATTTC CTGAGCTAAA TGCTTCATTG AAAGCTTCAA AGTTTATATG 951 CCTGGTGCAC AGTGCTTAGA AGTAAGCAAT TCCCAGGTCA TAGCTCAAGA 1001 ATTGTTAGCA AATGACAGAT TTCTGTAAGC CTATATATAT AGTCAAATCG 1051 ATTTAGTAAG TATGTTTTTT ATGTTCCTCA AATCAGTGAT AATTGGTTTG 1101 ACTGTACCAT GGTTTGATAT GTAGTTGGCA CCATGGTATC ATATATTAAA 1151 ACAATAATGC AATTAGAATT TGGGAGAAGC AAATATAGGT CCTGTGTTAA 1201 ACACTACACA TTTGAAACAA GCTAACCCTG GGGAGTCTAT GGTCTCTTCA 1251 CTCAGGTCTC AGCTATAATT CTGTTATATG AGGGGCAGTG GACAGTTCCC 1301 TATGCCAACT CACGACTCCT ACAGGTACTA GTCACTCATC TACCAGATTC 1351 TGCCTATGTA AAATGAATTG AAAAACAATT TTCTGTAATC TTTTATTTAA 1401 GTAGTGGGCA TTTCATAGCT TCACAATGTT CCTTTTTTGT ATATTACAAC 1451 ATTTATGTGA GGTAATTATT GCTCAACAGA CAATTAGAAA AAAGTCCACA 1501 CTTGAAGCCT AAATTTGTGC TTTTTAAGAA TATTTTTAGA CTATTTCTTT 1551 TTATAGGGGC TTTGCTGAAT TCTAACATTA AATCACAGCC CAAAATTTGA 1601 TGGACTAATT ATTATTTTAA AATATATGAA GACAATAATT CTACATGTTG 1651 TCTTAAGATG GAAATACAGT TATTTCATCT TTTATTCAAG GAAGTTTTAA 1701 CTTTAATACA GCTCAGTAAA TGGCTTCTTC TAGAATGTAA AGTTATGTAT 1751 TTAAAGTTGT ATCTTGACAC AGGAAATGGG AAAAAACTTA AAAATTAATA 1801 TGGTGTATTT TTCCAAATGA AAAATCTCAA TTGAAAGCTT TTAAAATGTA 1851 GAAACTTAAA CACACCTTCC TGTGGAGGCT GAGATGAAAA CTAGGGCTCA 1901 TTTTCCTGAC ATTTGTTTAT TTTTTGGAAG AGACAAAGAT TTCTTCTGCA 1951 CTCTGAGCCC ATAGGTCTCA GAGAGTTAAT AGGAGTATTT TTGGGCTATT 2001 GCATAAGGAG CCACTGCTGC CACCACTTTT GGATTTTATG GGAGGCTCCT 2051 TCATCGAATG CTAAACCTTT GAGTAGAGTC TCCCTGGATC ACATACCAGG 2101 TCAGGGAGGA TCTGTTCTTC CTCTACGTTT ATCCTGGCAT GTGCTAGGGT 2151 AAACGAAGGC ATAATAAGCC ATGGCTGACC TCTGGAGCAC CAGGTGCCAG 2201 GACTTGTCTC CATGTGTATC CATGCATTAT ATACCCTGGT GCAATCACAC 2251 GACTGTCATC TAAAGTCCTG GCCCTGGCCC TTACTATTAG GAAAATAAAC 2301 AGACAAAAAC AAGTAAATAT ATATGGTCCT ATACATATTG TATATATATT 2351 CATATACAAA CATGTATGTA TACATGACCT TAATGGATCA TAGAATTGCA 2401 GTCATTTGGT GCTCTGCTAA CCATTTATAT AAAACTTAAA AACAAGAGAA 2451 AAGAAAAATC AATTAGATCT AAACAGTTAT TTCTGTTTCC TATTTAATAT 2501 AGCTGAAGTC AAAATATGTA AGAACACATT TTAAATACTC TACTTACAGT 2551 TGGCCCTCTG TGGTTAGTTC CACATCTGTG GATTCAACCA ACCAAGGACG 2601 GAAAATGCTT AAAAAATAAT ACAACAACAA CAAAAAATAC ATTATAACAA 2651 CTATTTACTT TTTTTTTTTT CTTTTTGAGA TGGAGTCTCG CTCTGTTGCC 2701 CAGGTTGGAG TGCAGTGGCA CGATCTCGGC TCACTGCAAC CTCACCTCCC 2751 GGGTTCAAGA GATCCTCCTG CCTCAGCCTC CTGAGCAGCT GGGACTACAG 2801 GCGCATGCCA CCATGCCCAG CTAATTTTTG TATTTTTAGT AGAGGCGGGG 2851 TTTCACCATG TTGGCCAGGA TGGTCTCAAT CTCCTAACCT TGAGATCCAC 2901 CCTCCACAGC CTCCCAAACT GCTGGGATTA CAGGCGTGAG CCACCGCACG 2951 TAGCATTTAC ATTAGGTATT ACAAGTAATG TAAAGATGAT TTAAGTATAC 3001 AGGAGGATGT GAATAGGTTA TATGCAAGCA CTATGCCCTT TTATATAAGT 3051 GACTTGAACA TCTGTGCCCG ATTTTAGTAT GTGCAGGGGG GCGATCTGGG 3101 AATCAGTCCC CTGTGGATAC CAAGGTACAA CTGTATTTAT TAACGCTTAC 3151 TAGATGTGAG GAGAGTCTGA ATATTTTCAG TGATCTTGGC TGTTTCAAAA 3201 AAATCTATTG ACTTTTCAAT AAATCAGCTG CAATCCATTT ATTTCATTTA 3251 CAAAAGATTT ATTGTAAGCC TCTCAATCTT GGTTTTTCAG TTGATCTTAA 3301 GCATGTCAAT TCATAAAAAC AAGTCATTTT TGTATTTTTC ATCTTTAAGA 3351 ATGCTTAAAA AAGCTAATCC CTAAAATAGT TAGATCTTTG TAAATGCATA 3401 TTAAATAATA AAGTATGACC CACATTACTT TTTATGGGTG AAAATAAGAC 3451 AAAAATAATA GTTTTAGTGA GGATGGTGCT GAGTAAACAT AAAAACTGAT 3501 TTGCTCTCAG CTGATGTGTC CTGTACACAG TGGGAAGATT TTAGTTCACA 3551 CTTAGTCTAA CTCCCCCATT TTACAGATTT CTCACTATAT ATATTTCTAG 3601 AAGGGGCTAT GCATATTCAA TGTATTGAGA ACCAAAGCAA CCACAAATGC 3651 ATAAATGCAT AATTTATGGT CTTCAACCAA GGCCACATAA TAACCCAGTT 3701 AACTTACTCT TTAACCAGGA ATATTAAGTT CTATAACTAG TACTCAAGGT 3751 TTAACCTTAA AATTAAGATT TCCTTAACCT TAACCTTAAA ATTGATATTA 3801 TATTAAACAT ACATAATACA ATGTAACTCC ACTGTTCTCC TGAATATTTT 3851 TTGCTCTAAT CTCTCTGCCG AAAGTCAAAG TGATGGGAGA ATTGGTATAC 3901 TGGTATGACT ACGTCTTAAG TCAGATTTTT ATTTATGAGT CTTTGAGACT 3951 AAATTCAATC ACCACCAGGT ATCAAATCAA CTTTTATGCA GCAAATATAT 4001 GATTCTAGTG TCTGACTTTT GTTAAATTCA GTAATGCAGT TTTTAAAAAC 4051 CTGTATCTGA CCCACTTTGT AATTTTTGCT CCAATATCCA TTCTGTAGAC 4101 TTTTGAAAAA AAAGTTTTTA ATTTGATGCC CAATATATTC TGACCGTTAA 4151 AAAATTCTTG TTCATATGGG AGAAGGGGGA GTAATGACTT GTACAAACAG 4201 TATTTCTGGT GTATATTTTA ATGTTTTTAA AAAGAGTAAT TTCATTTAAA 4251 TATCTGTTAT TCAAATTTGA TGATGTTAAA TGTAATATAA TGTATTTTCT 4301 TTTTATTTTG CACTCTGTAA TTGCACTTTT TAAGTTTGAA GAGCCATTTT 4351 GGTAAACGGT TTTTATTAAA GATGCTATGG AACATAAAGT TGTATTGCAT 4401 GCAATTTAAA GTAACTTATT TGACTATGAA TATTATCGGA TTACTGAATT 4451 GTATCAATTT GTTTGTGTTC AATATCAGCT TTGATAATTG TGTACCTTAA 4501 GATATTGAAG GAGAAAATAG ATAATTTACA AGATATTATT AATTTTTATT 4551 TATTTTTCTT GGGAATTGAA AAAAATTGAA ATAAATAAAA ATGCATTGAA 4601 CATCTTGCAT TCAAAATCTT CACTGAC // LOCUS AF073519 1912 bp mRNA PRI 12-JUL-1999 DEFINITION Homo sapiens small EDRK-rich factor 1, long isoform (SERF1) mRNA, complete cds. ACCESSION AF073519 NID g3641543 VERSION AF073519.1 GI:3641543 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1912) AUTHORS Scharf,J.M., Endrizzi,M.G., Wetter,A., Huang,S., Thompson,T.G., Zerres,K., Dietrich,W.F., Wirth,B. and Kunkel,L.M. TITLE Identification of a candidate modifying gene for spinal muscular atrophy by comparative genomics JOURNAL Nat. Genet. 20 (1), 83-86 (1998) MEDLINE 98400264 REFERENCE 2 (bases 1 to 1912) AUTHORS Scharf,J.M., Endrizzi,M.G., Wetter,A., Huang,S., Thompson,T.G., Wirth,B., Dietrich,W.F. and Kunkel,L.M. TITLE Direct Submission JOURNAL Submitted (23-JUN-1998) HHMI/Genetics, Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1. .1912 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q13" /note="located 5' to SMN" gene 1. .1912 /gene="SERF1" /note="4F5L; H4F5L" CDS 184. .516 /gene="SERF1" /note="includes exon 3b; similar to small EDRK-rich factor 2" /codon_start=1 /product="small EDRK-rich factor 1, long isoform" /protein_id="AAC63518.1" /db_xref="PID:g3641544" /db_xref="GI:3641544" /translation="MARGNQRELARQKNMKKTQEISKGKRKEDSLTASQRKQSSGGQK SESKMSAGPHLPLKAPRENPCFPLPAAGGSRYYLAYGSITPISAFVFVVFFSVFFPSF YEDFCCWI" BASE COUNT 481 a 448 c 478 g 505 t ORIGIN 1 CCGGCGGCTG TTGTCGGGCC TCCAGCGGGC GGGGCCGTTG GCGGAGCAGA 51 GCGGAGGCGC AGCCGGGCGG AGGGCCCACG AGGGCTCAGC CTTCCCGGTC 101 AGCGGTGGTG ACGGTATCCC AGAGTGCCAG AGAACCGTTG CTTTTCCGAG 151 TTGCTCTTCT TCCAGGCTCC GTTGGTGGTC GGCATGGCCC GTGGAAATCA 201 ACGAGAACTT GCCCGCCAGA AAAACATGAA GAAAACCCAG GAAATTAGCA 251 AGGGAAAGAG GAAAGAGGAT AGCTTGACTG CCTCTCAGAG AAAGCAGAGT 301 TCTGGAGGCC AGAAATCTGA GAGCAAGATG TCAGCTGGGC CACACCTCCC 351 TCTGAAGGCT CCAAGGGAGA ATCCTTGCTT TCCTCTTCCA GCTGCTGGTG 401 GCTCCAGGTA TTACTTGGCT TATGGCAGCA TAACTCCTAT CTCTGCCTTT 451 GTCTTTGTGG TCTTCTTTTC TGTCTTCTTC CCTTCTTTTT ATGAGGACTT 501 TTGCTGTTGG ATTTAGGTTC CATTCTAACC TAGGATGATC TCATTTGGAA 551 ATCCTTAATT TCATCTACAA AAACTGTTTT CCCAAATAGG TCACATTCAC 601 GCATATCAGA TGGACAGATG TATCATTTTG GGGTCCACCA TTCAACCCAC 651 TACAAGGAGT TTTTTAAACA AAAATAGGAA ACTTAGATGT AACTTAGCAC 701 TTTTTTTTTT TTTTTTTGAG ATGGAGTCTC ACTCTGTCAC CAGACTGGAG 751 TGCAGTGGCG CCATCTCAGC TCCATGCAAC CTCTGCCTCC TGGGTTCAAG 801 CAGTTCTCTT GCCTCAGCCT CCTGGGTAGC TGGGATTACA GGCACGCGCT 851 GCCACACCCA GGTAATTTAT TTATTTTTTT TTTGAGACAG AGTCTCGCAC 901 TGTTGCCCAG GCTGGACTGC AGTGGCGTGA TCTCTGCTCA CTGCAACCTC 951 CGCCTCCCGG GTTCAAGCGA TTCTCCAGCC TCAGCTTCCT GAGTAGATGG 1001 GATTACAGGC GCCTGCCACC ACGCCCAGCT AATTTTTTTG TATTCTTAGT 1051 AGAGATGGGG TTTCACCATG TTGGCCAGGC TGGTCTCCAT CTCCTGACCT 1101 CGTGATTCAC CCGCCTCGGC CTCCCAAAGT GCTGGGATTA CAGGCGTGAG 1151 TCACAGCCCC CGGCCATAAT TTAGCACTTT AAAAAATAAT AGCCATGTTG 1201 GGCCAGGCGT GGTGGCTCAT GCCTGTAATC TGAGCACTTT GGGAGACCAA 1251 GGCGGGTAGA TCCCTTGTGC CCAGGAGTTC AAGACCAGCC TGGGCAACAT 1301 GGCGAAACCC CATTTCTACT AAAAATACAA AAATTAGCTG GGGCGAGGGG 1351 ATAGGCCGAG TTCCGGGTGT AAGGGGGCCA TTAGGGAGAG CAGAGCGAGG 1401 CAGCTGATCT TCCGGATTGG GGGCCTTGCC CGGAAGCTGG ACCTCACGGA 1451 GATGAAACGG AAGATGCACG AGGATATGAT CTCCATACAG AACTTTCTCA 1501 TCTACGTGGC CCTGCTGCGA GTCACTCCAT TTATCTTAAA GAAATTGGAC 1551 AGCATATGAA GATTGGACAT CACATGTGAA TGCATGATAT GAAGAGCCTG 1601 GTTACAGTTT CTACTGTTCT CTGCAAGTAA ATAGGCCCAG AAAGGTATAA 1651 GAGACTCTTT GAATGGACAT AAAAATTCTG CTTGTTAAGA ACAAGTTGAG 1701 CTCTGGTAAC TGATCTTAAT AGCTAAAATA TAAAAATATT TGGGAAGTCT 1751 GAAATGAGGT CTCCTGGCCC TGGTGTGCCC TTAATGCCTG TGACAGTTGG 1801 CCTCTGTGAA TATTGGTATA ATTGTAAATA ATGTCAAACT CCATTTTCTA 1851 GCAAGTATTA ATAATTAAGG GAAGTATGTC TGAAATGGCA AAAAAAAAAA 1901 AAAAAAAAAA AA // LOCUS AB002389 5677 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0391 gene, complete cds. ACCESSION AB002389 NID g2224722 VERSION AB002389.1 GI:2224722 KEYWORDS KIAA0391. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0118. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5677) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .5677 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0118" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 360. .2063 /gene="KIAA0391" CDS 360. .2063 /gene="KIAA0391" /codon_start=1 /protein_id="BAA20845.1" /db_xref="PID:d1021687" /db_xref="PID:g2224723" /db_xref="GI:2224723" /translation="MTFYLFGIRSFPKLWKSPYLGLGPGHSYVSLFLADRCGIRNQQR LFSLKTMSPQNTKATNLIAKARYLRKDEGSNKQVYSVPHFFLAGAAKERSQMNSQTED HALAPVRNTIQLPTQPLNSEEWDKLKEDLKENTGKTSFESWIISQMAGCHSSIDVAKS LLAWVAAKNNGIVSYDLLVKYLYLCVFHMQTSEVIDVFEIMKARYKTLEPRGYSLLIR GLIHSDRWREALLLLEDIKKVITPSKKNYNDCIQGALLHQDVNTAWNLYQELLGHDIV PMLETLKAFFDFGKDIKDDNYSNKLLDILSYLRNNQLYPGESFAHSIKTWFESGQCSG CGKTIESIQLSPEEYECLKGKIMRDVIDGGDQYRKTTPQELKRFENFIKSRPPFDVVI DGLNVAKMFPKVRESQLLLNVVSQLAKRNLRLLVLGRKHMLRRSSQWSRDEMEEVQKQ ASCFFADDISEDDPFLLYATLHSGNHCRFITRDLMRDHKACLPDAKTQRLFFKWQQGH QLAIVNRFPGSKLTFQRILSYDTVVQTTGDSWHIPYDEDLVERCSCEVPTKWLCLHQK T" BASE COUNT 1691 a 1137 c 1200 g 1649 t ORIGIN 1 GCCGCGTTAA GTCTGAGTGC CGCTTTGAGT TGTTGAATGA AGTGAACTTC 51 ATTTGTCAGC GTTCGGTTCA TGAACTGGAA TGTAAGAGGC ACCAGAGGAT 101 TCCTGCTCTG TCCCCTGGTT TGCGGCTTGC GACGTTGGAC ATCCCCGGAT 151 TGTTGTTTAA TAGAGAAAAC TCACCTGCCT TCTTGCTTTT AAGTAGCCCC 201 AAAAGCAGAA CCTTGATTTG TCTGTAAGGA AGAAACACAA ACCTTTTAAA 251 AAGTTCCACT TCGACTCTGC ACCGCCGACC CCCAATCTCT TTAATTTTGC 301 CATAGAAGAG GGGTTTTTTC AACATCTCTC TCACTATCTG GTGCTGATCT 351 CACTGCATAA TGACTTTCTA TTTGTTTGGT ATTCGAAGCT TTCCGAAGCT 401 TTGGAAGAGC CCATACCTTG GGCTAGGCCC AGGGCACTCT TATGTCTCGC 451 TGTTTCTGGC AGACCGCTGT GGCATCAGGA ACCAGCAGAG GTTGTTTTCT 501 CTTAAAACAA TGTCTCCACA GAATACCAAA GCAACGAATC TGATTGCCAA 551 GGCCAGATAT CTCAGGAAAG ATGAGGGCAG TAATAAGCAA GTTTATTCTG 601 TTCCTCATTT TTTTTTAGCT GGAGCAGCTA AGGAGAGATC ACAGATGAAT 651 TCTCAAACTG AAGATCATGC CTTGGCACCT GTGAGGAACA CTATTCAACT 701 CCCAACACAA CCTTTGAATT CAGAGGAGTG GGATAAACTT AAGGAAGATT 751 TAAAAGAAAA CACCGGAAAG ACCAGTTTCG AAAGTTGGAT CATTTCACAG 801 ATGGCTGGCT GTCATAGCTC TATAGATGTG GCTAAATCTC TGCTGGCATG 851 GGTAGCAGCC AAAAATAATG GTATTGTAAG TTACGATTTA CTGGTCAAGT 901 ATTTGTATCT CTGTGTCTTT CATATGCAGA CATCTGAAGT TATTGATGTC 951 TTTGAAATTA TGAAAGCCAG ATATAAGACT TTAGAACCTA GAGGTTACAG 1001 TCTTCTCATC CGGGGATTGA TCCATTCAGA CAGATGGAGA GAAGCATTGT 1051 TGCTGTTAGA GGACATCAAA AAAGTTATAA CTCCTTCAAA AAAGAACTAT 1101 AATGACTGTA TCCAGGGAGC TCTCCTTCAT CAAGATGTAA ACACAGCTTG 1151 GAATTTATAT CAGGAATTGC TAGGTCATGA TATTGTTCCT ATGTTGGAAA 1201 CTTTAAAAGC TTTCTTTGAT TTTGGAAAAG ACATAAAGGA TGATAACTAT 1251 TCAAATAAAC TACTAGATAT TCTTTCATAT CTAAGAAATA ATCAGCTGTA 1301 TCCAGGGGAG TCATTTGCAC ACAGTATAAA AACATGGTTT GAGAGTGGCC 1351 AGTGTTCGGG CTGTGGAAAA ACCATAGAGT CTATTCAGCT GAGTCCAGAA 1401 GAATATGAAT GTCTTAAGGG AAAAATCATG AGGGATGTGA TAGATGGAGG 1451 TGACCAGTAC AGAAAGACAA CACCTCAGGA ACTTAAGAGA TTTGAGAACT 1501 TCATAAAATC TCGTCCTCCT TTTGATGTTG TCATTGATGG TCTCAATGTT 1551 GCCAAAATGT TTCCTAAAGT TCGTGAATCT CAACTTCTCT TGAATGTCGT 1601 CTCTCAACTA GCCAAACGGA ATCTGCGACT GCTGGTCCTA GGCCGGAAGC 1651 ACATGCTAAG ACGGAGTTCC CAGTGGAGTC GGGATGAGAT GGAAGAGGTG 1701 CAAAAGCAAG CCAGCTGTTT TTTTGCTGAT GACATCTCGG AGGATGATCC 1751 ATTCCTTCTG TATGCCACAC TGCACTCCGG GAATCACTGC AGGTTTATCA 1801 CAAGAGACCT GATGCGGGAC CACAAGGCCT GTCTGCCTGA TGCCAAGACC 1851 CAACGCCTGT TTTTTAAGTG GCAGCAGGGA CATCAGCTGG CAATTGTAAA 1901 TAGGTTTCCA GGATCAAAAC TAACCTTTCA GCGTATTCTC AGCTATGACA 1951 CAGTGGTGCA AACAACTGGA GACTCGTGGC ACATACCATA TGATGAAGAC 2001 TTGGTAGAAA GATGTTCCTG TGAAGTACCA ACCAAATGGC TTTGCCTCCA 2051 CCAAAAGACA TAGAGATTCT TACCTCTATG CTAAGTTTGT GTTTGGGTAC 2101 CCTCTAGGTT GGCATCAGAG GCTCTTGAGC TGGTGTTTGT TTAGGGCATT 2151 GCCTCTGTCC TGAAGATAAA AGGATTCTAT TAACAGCATT GACATTGATT 2201 TTTTAATGAA ATGAGATATA TCTTTTCATA ACCAGCTGCG TTTTTTTCCC 2251 CTAACATTTG TTTTTGGAGG CTTATCAAGA GTTGGAGAAC TTAGTGTAGA 2301 GCAAAACCTG CATTTCTCCT ACTGGGCCAG CTATTCCACT TAGCTTGGGT 2351 GACTAATAGT GCTTTTGGTA TCCATTTTTT GCTACTTCTG ACCTTGCCTT 2401 CCAGGCCTAC CAATAGCAGA ATCAATCCAT CTGTCCCTGA GATACTCATG 2451 TTGTTTCAAA TGCCTCCTCC CATTTCTGGC ATAGTCTCAT TCTCTGTATG 2501 TTATGCCCTA TCCACATGGA ATCATTTATC GTCCTCTGTA ATAAACTGGC 2551 CAAGATACTA AAGGCTTACT ATTCATAGCA GTTTTTAATT ACTTATCATC 2601 CAATTATTTG GATTGGAGAA GAGGGGGCAT TCACTCCTCT TTTTCTTATT 2651 TTTTTTGGAA ATAGAGTCTC AACTCACTCT AGCCTGGGTG ACAGAGCGAG 2701 ACCTTGTCTC AAAACAAAAC AAAGTGCTGG AATTGCAGAC TTGAGCCACA 2751 GTGCCCAGCC TCACTTCTCT AGACTATGAT GGTTTTTTCT TCATTCTATA 2801 ATCTCTTTTC CAAATTGGTT CAACATTTTG TGAACACTAT TAATTTCATC 2851 ATTCAGTATA TGTGGGCTTT CTAAAATATG CCAATTTTTT TCCACTTAAT 2901 CAAGTTTGAC TTAATTTAAC AAAGTGATTA TATTTTAATA GTTACATTTC 2951 TGTTTTTTCC ACTCACTAGC CAGCTTACAG TTTATTAGCC CTTGATTTCA 3001 GCTGAAAATA TTCATGTCTG CACCCCTTCA TGATAGTTCT TTCTTTACGT 3051 ATACATACTG TATTCAATAT GCAAGAACAG GCAAAAACTA CTCTATTGTG 3101 ATAAAAATCA GAATAGTAAT TGCCTGAGGA AAGGGATATG AGAGAACTTG 3151 AGAGAACTTT CTCGGGGTGA TGGAAAGTTT CTTATATTGA TTTGGGTAAT 3201 AGTAACATAG CTATATGTAT ATATTAGTTA AAATTCCTCA CACTAAACAT 3251 TTTAAATTGA TATATTTACA TTTATGACAA TATACCTCAA AGTAAGTTAG 3301 GGTAAGAAAA GATAATTACT TACATGAAAT AAACAATGAC CTCTTTTATA 3351 TCAAACCATA TACATGTGTA TAATTAGGGT ATGTATTATG CACAGAGAAA 3401 CAATTTAGGA AAATCCAAGG AGGGGACGTT TTATCTTCTT ACCTATTTAC 3451 TGAATGCAAC ATTACTGCAC ACCAAGACAA AAGAGCTCTC CAGGAAAACA 3501 TTGGATATAT TGAGAGCATT AAAAGATACT GCAAAAGCTC TAATAAATTC 3551 AGTCTGCTTA TTTTCCAAAT TTCATAAACT ACATACTTAG GAAACTGTGC 3601 TTTCAGTGAG CTAAACTTCT TTTTTTAAGT AACTATCATA GTTTTAAGAA 3651 AAACATTTTA AGAAGACAAA AAGTATTTAT TAAGCCCATC TAAAAGGCTA 3701 ATGCAAATTC CCAAAAAAGG AGCACATAGA GATAGAGGAG GAGGCCGAAG 3751 TGGTGGCTCA TACCTGTTAA TTCCAGCACT TTGGGAGGCC AAGACAGGAG 3801 GATCACTTTA GGCCCAGAGT TGGAGACCAA CCTGGGCAAC ATAGCAAGAC 3851 CCTGTCTCTT AAAAAAAAAA AAAAAGACGG GAGAAGCTAC AAGAAGAAAA 3901 CTAGAACTTT AGAGCAGGAG TAAACCTTAG AGCATGTAAA GTCCATTTTG 3951 GAGATGAGGA ACAGACCCAG GAAGATGACC TGGCTTCCCT GAATCCCACG 4001 GCTAGTTAGT GCAGACATTT CAGCCATAAC CCAGCTCTTC TAATTCCCAA 4051 ATACTCTTTC TTCTACTGGC ACATAGAGAT GGGGGAGGAG TCAGGGCATG 4101 GTGGCCCACA CCTACAGTTC CAGCACTTTG GGAGGCCAAA TGGGAGAATT 4151 GCTTGAAGCC AGGAGTTGGA GACCAGCCTA GGCAACACAG GGAGACCCGT 4201 GTCGACAAAA AATTTAAAAA TTAGCTGGGC ATGGTAGCAC ATGCCTGTGG 4251 TCCCAGCTAC TCAGAGGGCT GAGGTGGGAG GATCACTTGA GCCCCAGAGG 4301 TCAAGGCTGC AGTGAGCTGT GATCATGCTA CTGCACTCCA GCCTAGGTGA 4351 CAGAGTGAGA CCCTGTCTCA AAGGGAGGGA GGTAAGAATG AGAAGAAGGA 4401 ACAGGGGTGT ACCTCTTTTA AGGGCCCAAG TATCCTGAAT GGCTCAGCAG 4451 TATAGAACAT TGTGGTAGAG AAATTACATT TTAAAATAAC TCTAATACCG 4501 TTTAGAAACA AAACCCTAAC TTCTGCTTGA GATAAACTGA AGTGCATCTG 4551 TCCCTTGTCC AGGAGTGGGG AACCATTGTA GGGTTGCTCA GCATAAGTCA 4601 TACTGCCACG GTGACCTTGA GGAGTGCAGG GATTCCCTGA AGGAAGCAGC 4651 TGGTACCAGA CACTTAGGCT GCCCATTTGT GTTCTGATCA TTTGAGTGAA 4701 AAAAAGGTAC CTGTCAAGCA AGCTCCTGGA CACCACAAGA AGGAGGAATT 4751 ATTTTAAAAG CTGTACTCTT AAATTGTTAG TATCTTTAAA ATCAGTTGTG 4801 AACAATGAAG GATTTGAAAG AGCATTGACT TTGCCACTTA AAAGTATTTT 4851 TAAAATACTT TGTGCTTCCC CCTTGCATTC TGAATTTATA CACTTTTCCT 4901 CCTGCTGTTC TCAGACCCAG TGGAAAGAAA ATCTCAAGGA AGAAGGCTGA 4951 GTTTATTCTC TCAGGGCTCT GTTGGGTCTA CCTCATCTGA GGTGGCTTAT 5001 TCTTCATAGG AAATTAATTT TTCTTCTCAA GTATGCACTT AAATATAATT 5051 ACTGCTTCCT TGGTCCTCTA GCAGATTTCT CACTTTTATT TATTTTTTTT 5101 TTTGAGACAG AGTCTTGATC TTTTTTCATC TAGGCTGGAG TGCAATGGTT 5151 TGATCTCAGT TCACTGCAAC CTCTGCCTCC TGGGTTCAAG CAATTCTCAT 5201 GCCTCTGCCT CTCGGGCAGC TGGAATTACA GGCATGCGCC ATGACGCCTG 5251 GCTAATTTTT GCATTTTTAG TAGAGACGGG TTTTCACCAT GTTGCCCCGG 5301 TTGCTCTCAA ACTCCTGACC TCAGGTGATC CACCCGCCTC AGCCTCCCAA 5351 AGTGCTGGGA TTACAGGTGT GAGTCACCCC GCACAGCCTG AAATGAGGCA 5401 TCTCTATCTA TAGTCCAGCA GCCCTACAGG AGGCAGGAGG GGAGCAAGAA 5451 TAAGAAAGGA AATTTGTAAA AGGCACTTAG GAGTGAGCAG AAAGGAAATA 5501 GGACCAGCTT TTACCTGCCC AGTCCTGGCC AGTGACAAGC AGTCTGCTTG 5551 AGTCTGTGCT AAATAAACAA AGGAAGTTCC ATTTAGAGCT CTACAGAGGG 5601 GAAGCCATAG AAATTAACAG GATGAAAATA CAAGAGACAG GAACACAGAT 5651 GAATAAATGT AATAAAATTT GAGAAAT // LOCUS AB002372 5530 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0374 gene, complete cds. ACCESSION AB002372 NID g2224688 VERSION AB002372.1 GI:2224688 KEYWORDS KIAA0374. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0327. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5530) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .5530 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0327" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 643. .2259 /gene="KIAA0374" CDS 643. .2259 /gene="KIAA0374" /codon_start=1 /protein_id="BAA20829.1" /db_xref="PID:d1021670" /db_xref="PID:g2224689" /db_xref="GI:2224689" /translation="MPGSGPSERMTWPGPALSAGPPTRPLSSAPGIPPIPPLTRTHSL MAMSLPGSRRTSAGSRRRTSPPVSVRDAYGTSSLSSSSNSGSYKGSDSSPTPRRSMKY TLCSDNHGIKPPTPEQYLTPLQQKEVCIRHLKARLKDTQDRLQDRDTEIDDLKTQLSR MQEDWIEEECHRVEAQLALKEARKEIKQLKQVIDTVKNNLIDKDKGLQKYFVDINIQN KKLETLLHSMEVAQNGMAKEDGTGESAGGSPARSLTRSSTYTKLSDPAVCGDRQPGDP SSGSAEDGADSGFAAADDTLSRTDALEASSLLSSGVDCGTEETSLHSSFGLGPRFPAS NTYEKLLCGMEAGVQASCMQERAIQTDFVQYQPDLDTILEKVTQAQVCGTDPESGDRC PELDAHPSGPRDPNSAVVVTVGDELEAPEPITRGPTPQRPGANPNPGQSVSVVCPMEE EEEAAVAEKEPKSYWSRHYIVDLLAVVVPAVPTVAWLCRSQRRQGQPIYNISSLLRGC CTVALHSIRRISCRSLSQPSPSPAGGGSQL" BASE COUNT 1069 a 1693 c 1699 g 1069 t ORIGIN 1 GGCGGCCCCT GCAGGGCAGC TGAAGCCATG GAAGCCTCCG CAGGTCGCTG 51 ATCAGGGCCA GGCGGCTGCA GCAGCGACTG CAGAGGCGCT GCGCCAAGCC 101 GGGCCGGAGT GGTGCGAGCC GGCGGGGCTG CGGAGGGCCA GTGGACTCAG 151 GGTTGTTGAG AGGAGTCAAT GGCATAATAC GGGAAGCCCC GAACCACGGG 201 CCAACTGGGA AGTTGATGGT CGGCGAGACC AGCCCATCCT AATTTGGGGT 251 TCCTGGTCCT GCTCCAGGAG TCCTACAGCC TGCAGCCCCT ACCAGAGAGG 301 GAGGACTGAA CAGCAAGGGG GTGTGTGGGT CTGAGTATCA GGGTCCTGGG 351 GAAGAAGCAG GCCTGGCTCG TAGAGTAGAA GACTCGGTGC CGGCAGTCAG 401 GAGGCCCTGC TCCTGAAGCC TCTGTGACCT TGGGCACGGC CTTCACTCTG 451 TCTGGGGCAC ATTCACTCAT CTGGAGAACA CAGGGTTGGA CTGATTACTT 501 TTAGCCACTC CTGACAGTTG TGGTTTTAAG TTGGGTGGTG GGAACCTGTG 551 CACCAGCCCT ATTCAATTCA CTGGTGGAGG CAGCCGTGGT CTGCCAGGCC 601 CTCGCTCCTG GGGTGTCCTG CCGAAGGGTG AGAAGAGAAG CCATGCCGGG 651 CAGCGGCCCC AGCGAGAGGA TGACGTGGCC TGGCCCGGCC CTTTCTGCGG 701 GCCCCCCAAC CCGCCCTCTC TCCTCAGCCC CCGGGATACC GCCCATCCCA 751 CCCCTTACTC GGACCCACAG CCTCATGGCC ATGTCCCTGC CAGGAAGTAG 801 ACGGACCTCT GCTGGATCAC GCAGGCGCAC CTCTCCACCT GTGAGCGTGC 851 GGGATGCCTA CGGCACCTCT TCGCTCAGCA GCAGCAGCAA TTCTGGCTCC 901 TACAAGGGCA GTGACAGCAG TCCCACGCCA AGGCGCTCCA TGAAATACAC 951 GCTGTGCAGT GACAACCATG GCATCAAGCC CCCGACCCCG GAGCAGTACC 1001 TGACCCCCCT GCAGCAGAAG GAGGTGTGCA TCCGGCACCT GAAAGCCCGG 1051 CTGAAGGACA CACAGGACCG GCTCCAGGAC CGGGACACAG AGATTGATGA 1101 CCTGAAGACG CAGCTGTCAC GCATGCAGGA GGACTGGATT GAGGAGGAGT 1151 GCCACCGCGT GGAGGCCCAG CTGGCCCTGA AGGAGGCCCG AAAGGAGATC 1201 AAGCAGCTCA AGCAGGTCAT CGACACTGTC AAGAACAACC TGATTGACAA 1251 GGACAAGGGG CTGCAGAAGT ACTTCGTGGA CATCAACATC CAGAACAAGA 1301 AGCTGGAGAC GCTGCTGCAC AGCATGGAGG TGGCCCAGAA TGGCATGGCC 1351 AAGGAGGATG GCACTGGGGA GTCAGCCGGT GGGTCCCCTG CCCGCTCCCT 1401 CACCCGCAGC TCCACCTACA CCAAGCTGAG TGACCCGGCT GTCTGTGGTG 1451 ACCGCCAGCC GGGTGATCCC TCCAGCGGCT CTGCTGAGGA TGGGGCAGAC 1501 AGTGGCTTTG CAGCAGCCGA TGACACACTG AGCCGGACGG ACGCGCTGGA 1551 AGCCAGCAGC CTGCTGTCGT CGGGGGTGGA CTGTGGCACC GAGGAGACCT 1601 CGCTGCACAG CTCCTTCGGC CTGGGCCCCC GCTTCCCTGC CAGCAACACC 1651 TATGAGAAGC TGCTGTGTGG CATGGAGGCT GGTGTGCAGG CCAGCTGCAT 1701 GCAGGAGCGT GCCATCCAGA CAGACTTCGT GCAGTACCAG CCTGACCTTG 1751 ACACCATCCT GGAGAAAGTG ACCCAGGCCC AGGTCTGTGG GACAGACCCT 1801 GAGTCAGGGG ACAGGTGCCC AGAGCTGGAT GCCCACCCTT CAGGGCCCAG 1851 AGACCCCAAC TCAGCAGTGG TGGTGACAGT GGGTGATGAG CTAGAGGCCC 1901 CAGAGCCCAT CACCCGTGGA CCCACCCCAC AGCGGCCTGG TGCCAACCCC 1951 AACCCTGGCC AGTCGGTGAG CGTGGTGTGC CCCATGGAAG AGGAGGAGGA 2001 GGCTGCCGTG GCTGAGAAGG AGCCCAAGAG CTACTGGAGC CGCCACTACA 2051 TCGTGGATCT GCTGGCTGTG GTGGTGCCGG CCGTGCCCAC GGTGGCCTGG 2101 CTTTGCCGCT CCCAGCGGCG CCAGGGCCAG CCCATCTACA ACATCAGCTC 2151 CCTGCTGCGG GGCTGCTGCA CTGTGGCCTT GCACTCCATC CGCAGGATCA 2201 GCTGCCGCTC GCTGAGCCAG CCGAGTCCCA GCCCAGCGGG CGGCGGCTCC 2251 CAGCTCTGAG GGGGCCCATT CCGGCAGCGG CGCCTGCGGC CTGACCACTG 2301 ATTGTAGGGA TGCCGTTCCC CCCTCCCTTC TCCCATGGGC ATCATCTTAT 2351 TTATTTAGTT TTGGGTGTGG AACTGTTTCT TTTTTTCAAG ATGTTAAAAC 2401 AGTCCCGTGG AAGGAGCAGG GGTTGGAGAA AGGCATCCCA AAGCTTCGAT 2451 GGAGAGCAGG GAAGGGGGAC CCAAGGCAGG AGGTACACCA GCTGGACAAA 2501 TTGCAGGGAG GGGAGGGAGC GAGGGCCAAC CCGGCCCCTC TGTCCCCTTG 2551 GCTCTTCAGA CAGGGCCAGC CCTGCTCAGG AAGTCTCTGG CTGTCTTCAT 2601 GTGGGGAAGC CGGGCTTGAG TTGCCCATAG GCCCCTGCCC TGCACCATCC 2651 TGTCCAGTGC CCTGCGCACT CCATGCCGTC TCTTCCAAGC CACCTTGCCC 2701 GCAGCCCAGG CTCCTGGGCC AGTGCTCTCT CCTCAAATGG AGGCAGCCAT 2751 GGCCTGAAGT GCAGATCACT GACCCAGGGC TCAGAGCAGA GGCCAGAACC 2801 ACTGGGCCGG CCGGCATTCC AGCCTCCCCA GACTGCTGCC CACCTTGGGA 2851 CTCAGGAGCT CAGTCAAGGC CACAGGCTGG AGGAGAGACG GGGCTGGGCG 2901 CAAGGTGGCG GAGGGCAGTG TGGGTTCTGT GTCTGTCTGT TCATCCCAGG 2951 CTTTCCCGTC ATCCCTTTCC TCTTGGCACT TCTGGGTGTG TCAGTCATTA 3001 TTCCTGTGAG GTAGCTAAGC CCGGCAAGCT CAGTGCTGGG GTAGGAGGGC 3051 CTGCCTGAGT CCCAGCTCCC AGCTGGAGAA TCCACCAGCA CAAGAAGAGG 3101 AGGCAGGGGC AGAAACCCAA GGGGGCTCCC CCAGCCTTCC AAGGTGAGGC 3151 CATCTCATCT GCAGGCTGGG AGGCAGGGCT GGACTCAGGA ACCCACAGCT 3201 TACTGAAAAA GCCAGAGGCC ATGACTGCCC CCAGAAACTT GCCCCGAGTT 3251 TCTCTGGGGC CCTCGGGCCC AACTTCTGCT TTGCACTATG TTCACTTTGG 3301 GGTTGGTTCT CAGCCATCCA AGGGTCTCCA GTGAGGTGGC TGCTTGCTGT 3351 CTGAGATGAG GGTTCCTAAA CCTTAAACCT CTCTGCCTCT GGAGGAGGGT 3401 GGGGTATTCT GGCAGGATGA ATCGCAGGAT GGCGCGTACT GAAGCCACGA 3451 TGTTCATCCA GGCCAAAGCA GGGTGTCCTG GGATAGGTTT CCTAGGCAGG 3501 GGCGTGGCAG AGCAGGTAGG CGCCCTGCAC CCTCCCTGCT CCCAGCCCAA 3551 GGCCAGTCGG CCTGGGAAGT TCACTGCCCC AACTCTTTTC CCGGGACCAT 3601 ACTGAGTCCC CCAGCCACGC TGCTGACATT ACCATTATTA TTTTACCAAG 3651 TAAACTCACT CCTTTTCCCC CACAGGGGTT ACACCCATCT GGTTGTCCCA 3701 CCCACTCTTC AGAGGCTAGG CCCCACCTCT GGGGGTTGGA GGGACCCTGT 3751 GTCTTACTCG CCCCTCTGGC CTAAGGGCCA CTCTGGTTAT CTGCCAAGGT 3801 TGCTTGCCCT CACCCCAATG CTCCAACAGC CATTGCCTAA CTCATGGGCT 3851 CTGCCCTTCT GCTCGGTGCC CTCCACGTGA GGCGGGGCAC CTGCATGCAC 3901 TGGGAGGGGG CGGCTGGCCC AGCCCTCGGG GCAGGAGCCC CCTCTGCCAC 3951 ACGCTTTGTG CCTCCAAAGC TCCCCCCGCC TTGGTCAGGG CCTCAGACCA 4001 GCCAACCTTT GTGGAATAAG CCCCAGCCCA GCCAAACCAA ACCCAGATGC 4051 CTGAGGCCTG GCTGGGGCTG CCCCCGCAGG ACACTGTGGC CATGCCACGG 4101 AGGGGGCAGT GGACAAAACC AATCCAAAGC CAAGCCGGGA CTGGCTGCGG 4151 ACCCAGCCTC CTGTGCCGCG CACTCACGGA GCTGCGTAGT CTCCTCAGAC 4201 ATAGTCAAAG CTTTGCCGAG AAAAGAAATG TATGAACTAT ATTTGACAAC 4251 ATAAAATCTC TCTATTTTTC ACCACTGGAA TTTAGTCAAG CTTCAGGCCC 4301 CTCTGCTCCT GTCTGTGTCT TGCGTCTGTG GCCTTCCTAT TGTGTCTTGT 4351 GTTTTGGTGG ATGTGACAGG GCTGGGGCCA CAGTCTACTC TGTCTCTGCT 4401 GCTAGAGAAG CCACCTGTGG AGGACTGGGC TGTGGTTGGG CCTGAGGCCT 4451 GTGGAGGAGG TGAGTGTAGC CAGCAGCGGC CGTCTACTCC TGTTCTGGCC 4501 TGAGACCACC GGTGTGGGTC ACGAGGACCC TGGCCCACAG TGTTGAGGGT 4551 CCTCTCTTTC GGGGGCTCTC CTGGGGCCTT CGATGGGCTT TTCTTCTTGT 4601 CAGTGGAGGG AGCAGCTCCC CACTCAGCCC TGGGACAGGC CCTGGGACTG 4651 TGGTGGCCGG TGGCCCTGGC CCAGCTCTGG AGTGCATGTG TGTGTGCTCA 4701 GATCCCGCAT CTATGCAAAG GTGCAGGCTG CCTGTGAGGC TCCAGGCGGT 4751 GGAGGTGCGG CAGCTGCTGC TCAGGTGCCA TGCCCTGAAG AGGCAGGGTA 4801 CACATGGGGC TGGGAGGGCA AAGATGGGGC GGGGCCTCCC CCTGAGAGAG 4851 CTCACCCTCC ACAGTGACCC TTTTCCTTCC TGCCTAATAC CTCCTCCCGT 4901 TGGGCTGTGA CTTTTCCTCC TGCCCTCAGC CTCTGAAACA GAAATCTTTG 4951 GGGCCTCCCC TTTCCCCAGC GTCTGGCAAG AGCTGATGAG GTCCTGAGGA 5001 ACAGTGTCCC CAGGAACCAC CTTCTGGAGC CTAGCGGCCA GACGTGGCTT 5051 CTCCTGAGTT CTAGGGCTCG GGCCAGAGTG GCACTACTGC CCGGCCAGTC 5101 CAGGCTCAGG CAGGAATCGG ACGCTGGGGC CCGGGCTCTG CACAGAGAGC 5151 TGGGCTTGAG TGGAGTTTAT TGTAGATTTC TCCCAAGGGG ATCACTTAGC 5201 TTCTCACTGA CACACCCTTC CCAATTGCCA CAAGCGCAGG GGTATCTTAC 5251 AGCACTTTGG GGAGGGGTGG GACAAACTGC AAGGTTCTGG GGCCAAGGCC 5301 TACCCAGGGG CCTCTGCCCA GAGAACTCAC AACCCCCTCT CTGACCTAGG 5351 GGACAATGCA AACAGGTCAC AGTGCATTCC CATATTAGGC CATCCCCTTC 5401 ACGAAGAATT CAGGGGATGT GGGAAGTGGG GAGGCGGGGA GGGATCTTGA 5451 CTCTTGTCTC CTTTGTCCTT TTGTTCAGAC AGAGTTGTAC CTGCAGCAGA 5501 CAACTCTGAA TTAAAGCATG AAAACACAGC // LOCUS U17195 10343 bp mRNA PRI 02-JUN-1999 DEFINITION Homo sapiens A-kinase anchor protein (AKAP100) mRNA, complete cds. ACCESSION U17195 NID g5360203 VERSION U17195.2 GI:5360203 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 5122 to 7288) AUTHORS McCartney,S., Little,B.M., Langeberg,L.K. and Scott,J.D. TITLE Cloning and characterization of A-kinase anchor protein 100 (AKAP100). A protein that targets A-kinase to the sarcoplasmic reticulum JOURNAL J. Biol. Chem. 270 (16), 9327-9333 (1995) MEDLINE 95238446 REFERENCE 2 (bases 1 to 10343) AUTHORS Kapiloff,M.S., Shillace,R.V., Westphal,A.M. and Scott,J.D. TITLE mAKAP: An A-Kinase Anchoring Protein Targeted to the Nuclear Membrane of Differentiated Myocytes JOURNAL Unpublished REFERENCE 3 (bases 5122 to 7288) AUTHORS McCartney,S. TITLE Direct Submission JOURNAL Submitted (16-NOV-1994) Vollum Institute, 3181 S.W. Sam Jackson Park Road, Portland, OR 97201-3098, USA REFERENCE 4 (bases 1 to 10343) AUTHORS Kapiloff,M.S., Shillace,R.V., Westphal,A.M. and Scott,J.D. TITLE Direct Submission JOURNAL Submitted (29-MAR-1999) Vollum Institute, Oregon Health Sciences University, 3181 S.W. Sam Jackson Park Road L-474, Portland, OR 97201, USA REMARK Sequence update by submitter COMMENT On Jul 6, 1999 this sequence version replaced gi:687595. FEATURES Location/Qualifiers source 1. .10343 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q" gene 1. .10343 /gene="AKAP100" CDS 127. .7086 /gene="AKAP100" /function="involved in anchoring protein kinase A (PKA) to muscle nuclear membrane" /note="mAKAP" /codon_start=1 /product="A-kinase anchor protein" /protein_id="AAA92354.2" /db_xref="PID:g5360204" /db_xref="GI:5360204" /translation="MLTMSVTLSPLRSQDLDPMATDASPMAINMTPTVEQGEGEEAMK DMDSDQQYEKPPPLHTGADWKIVLHLPEIETWLRMTSERVRDLTYSVQQDSDSKHVDV HLVQLKDICEDISDHVEQIHALLETEFSLKLLSYSVNVIVDIHAVQLLWHQLRVSVLV LRERILQGLQDANGNYTRQTDILQAFSEETKEGRLDSLTEVDDSGQLTIKCSQNYLSL DCGITAFELSDYSPSEDLLSGLGDMTSSQVKTKPFDSWSYSEMEKEFPELIRSVGLLT VAADSISTNGSEAVTEEVSQVSLSVDDKGGCEEDNASAVEEQPGLTLGVSSSSGEALT NAAQPSSETVQQESSSSSHHDAKNQQPVPCENATPKRTIRDCFNYNEDSPTQPTLPKR GLFLKEETFKNDLKGNGGKRQMVDLKPEMSRSTPSLVDPPDRSKLCLVLQSSYPNSPS AASQSYECLHKVGNGNLENTVKFHIKEISSSLGRLNDCYKEKSRLKKPHKTSEEVPPC RTPKRGTGSGKQAKNTKSSAVPNGELSYTSKAIEGPQTNSASTSSLEPCNQRSWNAKL QLQSETSSSPAFTQSSESSVGSDNIMSPVPLLSKHKSKKGQASSPSHVTRNGEVVEAW YGSDEYLALPSHLKQTEVLALKLENLTKLLPQKPRGETIQNIDDWELSEMNSDSEIYP TYHVKKKHTRLGRVSPSSSSDIASSLGESIESGPLSDILSDEESSMPLAGMKKYADEK SERASSSEKNESHSATKSALIQKLMQDIQHQDNYEAIWEKIEGFVNKLDEFIQWLNEA METTENWTPPKAEMDDLKLYLETHLSFKLNVDSHCALKEAVEEEGHQLLELIASHKAG LKDMLRMIASQWKELQRQIKRQHSWILRALDTIKAEILATDVSVEDEEGTGSPKAEVQ LCYLEAQRDAVEQMSLKLYSEQYTSSSKRKEEFADMSKVHSVGSNGLLDFDSEYQELW DCLIDMESLVMDSHDLMMSEEQQQHLYKRYSVEMSIRHLKKTELLSKVEALKKGGVLL PNDLLEKVDSINEKWELLGKTLGEKIQDTMAGHSGSSPRDLLSPESGSLVRQLEVRIK ELKGWLRDTELFIFNSCLRQEKEGTMNTEKQLQYFKSLCREIKQRRRGVASILRLCQH LLDDRETCNLNADHQPMQLIIVNLERRWEAIVMQAVQWQTRLQKKMGKESETLNVIDP GLMDLNGMSEDALEWDEMDISNKLISLNEESNDLDQELQPVIPSLKLGETSNEDPGYD EEADNHGGSQYASNITAPSSPHIYQVYSLHNVELYEDNHMPFLKNNPKVTGMTQPNVL TKSLSKDSSFSSTKSLPDLLGGSNLVKPCACHGGDMSQNSGSESGIVSEGDTETTTNS EMCLLNAVDGSPSNLETEHLDPQMGDAVNVLKQKFTDEGESIKLPNSSQSSISPVGCV NGKVGDLNSITKHTPDCLGEELQGKHDVFTFYDYSYLQGSKLKLPMIMKQSQSEKVHV EDPLLRGFYFDKKSCKSKHQTTELQPDVPPHERILASASHEMDRISYKSGNIEKTFTG MQNAKQLSLLSHSSSIESLSPGGDLFGLGIFKNGSDSLQRSTSLESWLTSYKSNEDLF SCHSSGDISVSSGSVGELSKRTLDLLNRLENIQSPSEQKIKRSVSDITLQSSSQKMSF TGQMSLDIASSINEDSAASLTELSSSDELSLCSEDIVLHKNKIPESNASFRKRLTRSV ADESDVNVSMIVNVSCTSACTDDEDDSDLLSSSTLTLTEEELCIKDEDDDSSIATDDE IYEDCTLMSGLDYIKNELQTWIRPKLSLTRDKKRCNVSDEMKGSKDISSSEMTNPSDT LNIETLLNGSVKRVSENNGNGKNSSHTHELGTKRENKKTIFKVNKDPYVADMENGNIE GIPERQKGKPNVTSKVSENLGSHGKEISESEHCKCKALMDSLDDSNTAGKEFVSQDVR HLPKKCPNHHHFENQSTASTPTEKSFSELALETRFNNRQDSDALKSSDDAPSMAGKSA GCCLALEQNGTEENASISNISCCNCEPDVFHQKDAEDCSVHNFVKEIIDMASTALKSK SQPENEVAAPTSLTQIKEKVLEHSHRPIQLRKGDFYSYLSLSSHDSDCGEVTNYIEEK SSTPLPLDTTDSGLDDKEDIECFFEACVEGDSDGEEPCFSSAPPNESAVPSEAAMPLQ ATACSSEFSDSSLSADDADTVALSSPSSQERAEVGKEVNGLPQTSSGCAENLEFTPSK LDSEKESSGKPGESGMPEEHNAASAKSKVQDLSLKANQPTDKAALHPSPKTLTCEENL LNLHEKRHRNMHR" BASE COUNT 3250 a 2108 c 2204 g 2781 t ORIGIN 1 CATCATGCAG CAGGTCAAAC AAGGCATCTC CTAGTATTGC ATCCTACAGA 51 TGTGCTGTAA ACATCAAAAG AAGACGGTGG GATCAGGAGA TGCTGTTTTG 101 GAAAGAAGTG AGGTTTAGAC TTCTCCATGT TAACCATGAG CGTGACACTT 151 TCCCCCCTGA GGTCACAGGA CCTGGATCCC ATGGCTACTG ATGCTTCACC 201 CATGGCCATC AACATGACAC CCACTGTGGA GCAGGGTGAG GGAGAAGAGG 251 CAATGAAGGA CATGGACTCT GACCAGCAGT ATGAAAAGCC ACCCCCACTA 301 CACACAGGGG CTGACTGGAA GATTGTCCTC CACTTACCTG AAATTGAGAC 351 CTGGCTCCGG ATGACCTCAG AGAGGGTCCG AGACCTAACC TATTCAGTCC 401 AGCAGGATTC GGACAGCAAG CATGTGGATG TACATCTAGT TCAACTAAAG 451 GACATTTGTG AAGATATTTC TGATCATGTT GAGCAAATCC ATGCCCTCCT 501 TGAAACAGAG TTCTCCCTAA AGCTGCTGTC TTACTCTGTC AACGTGATAG 551 TGGACATCCA CGCAGTGCAG CTCCTCTGGC ACCAGCTTCG AGTCTCAGTG 601 CTGGTTCTGC GGGAGCGCAT TCTGCAAGGT CTGCAGGACG CCAATGGCAA 651 CTACACTAGG CAGACGGACA TTCTGCAAGC TTTCTCTGAA GAGACAAAAG 701 AGGGCCGGCT TGATTCTCTA ACAGAAGTGG ATGACTCAGG ACAATTAACC 751 ATCAAATGTT CTCAAAATTA CTTGTCTCTG GATTGTGGCA TTACTGCATT 801 CGAACTGTCT GACTACAGTC CAAGTGAGGA TTTGCTCAGT GGGCTAGGTG 851 ACATGACCTC TAGCCAAGTC AAAACCAAAC CCTTTGACTC TTGGAGCTAC 901 AGTGAGATGG AAAAGGAGTT TCCTGAGCTT ATCCGAAGTG TTGGTTTACT 951 TACGGTAGCT GCTGACTCTA TCTCTACCAA TGGCAGTGAA GCAGTTACTG 1001 AGGAGGTATC TCAAGTATCT CTCTCAGTAG ACGACAAAGG TGGATGTGAG 1051 GAAGACAATG CTTCTGCAGT CGAAGAGCAA CCAGGCTTAA CACTGGGGGT 1101 GTCATCATCT TCAGGAGAAG CTCTGACAAA TGCTGCTCAA CCCTCCTCTG 1151 AGACTGTGCA GCAAGAATCC AGTTCCTCCT CCCATCATGA TGCAAAGAAT 1201 CAGCAGCCTG TTCCTTGTGA AAATGCAACC CCCAAACGAA CCATCAGAGA 1251 TTGCTTTAAT TATAACGAGG ACTCTCCCAC GCAGCCTACA TTGCCAAAAA 1301 GAGGACTTTT TCTTAAAGAG GAAACTTTTA AGAATGATCT GAAAGGCAAT 1351 GGTGGAAAGA GGCAAATGGT TGATCTAAAG CCTGAGATGA GCAGAAGCAC 1401 CCCTTCGCTA GTAGATCCTC CTGACAGATC CAAACTTTGC CTGGTATTGC 1451 AGTCTTCTTA CCCCAACAGC CCTTCTGCTG CCAGCCAGTC TTATGAGTGT 1501 TTACACAAGG TGGGGAATGG GAACCTTGAA AACACAGTCA AATTTCACAT 1551 TAAAGAAATT TCTTCCAGCC TGGGAAGGCT TAACGACTGC TATAAAGAGA 1601 AATCTCGACT TAAAAAGCCA CACAAGACCT CAGAAGAGGT GCCTCCATGC 1651 CGAACACCTA AACGGGGGAC TGGTTCAGGC AAACAAGCTA AAAATACAAA 1701 GAGCTCAGCA GTGCCAAATG GAGAGCTTTC TTATACTTCC AAGGCCATAG 1751 AGGGGCCACA AACAAATTCT GCTTCCACAT CCTCACTTGA GCCTTGTAAT 1801 CAGAGAAGTT GGAATGCCAA ATTGCAATTG CAGTCAGAAA CATCCAGTTC 1851 ACCAGCTTTT ACTCAGAGCA GTGAATCCTC TGTTGGCTCA GACAACATCA 1901 TGTCTCCGGT GCCACTTCTT TCAAAACACA AAAGCAAAAA AGGTCAAGCC 1951 TCCTCTCCAA GTCACGTCAC TAGGAATGGT GAGGTTGTGG AGGCCTGGTA 2001 TGGCTCTGAT GAATACCTAG CACTGCCCTC TCACCTTAAG CAGACAGAAG 2051 TATTGGCTTT GAAGTTGGAA AACCTAACAA AGCTTCTGCC TCAGAAACCC 2101 AGAGGAGAAA CCATCCAGAA TATTGATGAC TGGGAACTGT CTGAAATGAA 2151 TTCAGATTCT GAAATCTATC CAACCTATCA TGTCAAAAAG AAGCATACAA 2201 GGCTAGGCAG GGTGTCTCCA AGCTCATCTA GTGACATAGC CTCTTCACTA 2251 GGGGAGAGCA TTGAATCTGG GCCCCTGAGT GACATTCTTT CTGATGAGGA 2301 GTCCAGTATG CCTCTCGCTG GCATGAAAAA GTATGCTGAT GAGAAGTCAG 2351 AAAGAGCTTC ATCCTCTGAG AAAAATGAGA GCCATTCTGC CACTAAATCA 2401 GCTTTAATTC AGAAACTGAT GCAAGATATT CAGCACCAAG ACAACTATGA 2451 AGCCATATGG GAAAAAATAG AGGGGTTTGT AAACAAACTG GATGAATTCA 2501 TTCAATGGTT AAATGAAGCC ATGGAAACTA CAGAGAATTG GACTCCCCCT 2551 AAAGCAGAGA TGGATGACCT TAAACTGTAT CTGGAGACAC ACTTGAGTTT 2601 TAAGTTGAAT GTAGACAGTC ATTGTGCTCT CAAGGAAGCT GTGGAGGAGG 2651 AAGGACACCA ACTTCTTGAG CTTATTGCAT CTCACAAAGC AGGACTGAAG 2701 GACATGCTGC GGATGATTGC AAGTCAATGG AAGGAGCTGC AGAGGCAAAT 2751 CAAACGGCAG CACAGCTGGA TTCTCAGGGC TCTGGATACC ATCAAAGCCG 2801 AGATACTGGC TACTGATGTG TCTGTGGAGG ATGAGGAAGG GACTGGAAGC 2851 CCCAAGGCTG AGGTTCAACT ATGCTACCTG GAAGCACAAA GAGATGCTGT 2901 TGAGCAGATG TCCCTCAAGC TGTACAGCGA GCAGTATACC AGCAGCAGCA 2951 AGCGAAAGGA AGAGTTTGCT GATATGTCAA AAGTTCATTC AGTGGGAAGC 3001 AATGGGCTTC TGGACTTTGA TTCAGAATAT CAGGAGCTCT GGGATTGCTT 3051 GATTGACATG GAGTCCCTTG TGATGGACAG CCACGACCTG ATGATGTCAG 3101 AGGAGCAGCA GCAGCATCTT TACAAGCGAT ACAGTGTGGA AATGTCCATC 3151 AGACACCTGA AAAAGACGGA GCTGCTTAGT AAGGTTGAAG CTTTGAAGAA 3201 AGGTGGCGTT TTACTACCAA ATGATCTCCT TGAAAAAGTG GATTCAATTA 3251 ATGAAAAATG GGAACTGCTT GGGAAAACCC TAGGAGAGAA GATCCAGGAC 3301 ACAATGGCAG GGCACAGTGG GTCGAGTCCA CGTGACCTGC TCTCTCCTGA 3351 AAGTGGAAGC CTGGTAAGGC AGCTGGAGGT CAGGATCAAA GAACTGAAAG 3401 GATGGCTAAG AGATACAGAG CTTTTCATCT TCAATTCCTG TCTGAGACAA 3451 GAAAAGGAAG GAACAATGAA TACTGAGAAA CAACTGCAAT ACTTTAAGTC 3501 CCTCTGTCGT GAAATCAAGC AACGACGTCG AGGAGTTGCC TCCATTCTGC 3551 GACTATGCCA GCATCTTTTG GATGACCGGG AGACTTGCAA TCTGAATGCA 3601 GACCACCAGC CCATGCAGCT GATCATTGTA AATCTTGAAA GAAGGTGGGA 3651 AGCCATTGTC ATGCAAGCCG TCCAGTGGCA AACACGTCTA CAAAAGAAGA 3701 TGGGAAAGGA ATCTGAGACT TTGAATGTGA TTGATCCTGG CTTGATGGAC 3751 CTAAATGGGA TGAGTGAGGA TGCCCTGGAA TGGGATGAAA TGGACATAAG 3801 TAACAAGTTA ATTAGTTTGA ATGAGGAATC AAATGACCTT GATCAAGAAC 3851 TCCAACCTGT TATCCCTTCC TTGAAGCTTG GAGAGACAAG TAATGAGGAC 3901 CCTGGTTATG ACGAGGAGGC TGATAACCAT GGGGGATCTC AGTATGCCTC 3951 AAATATTACT GCCCCCTCTA GTCCACACAT TTACCAGGTG TACAGCCTCC 4001 ACAATGTTGA ACTCTATGAG GACAACCACA TGCCATTTCT GAAAAACAAT 4051 CCAAAGGTCA CTGGCATGAC ACAGCCTAAT GTTTTAACTA AGAGTCTCAG 4101 TAAAGACTCT TCATTTTCAT CTACCAAATC TTTGCCAGAT CTTCTAGGTG 4151 GTTCCAATTT GGTAAAGCCC TGCGCATGTC ATGGAGGAGA CATGAGCCAG 4201 AATTCAGGCA GTGAGAGTGG AATTGTCAGT GAAGGAGACA CAGAAACCAC 4251 TACCAACTCT GAAATGTGCT TGCTCAATGC AGTGGATGGG TCCCCAAGTA 4301 ACCTTGAAAC TGAACATCTG GACCCACAAA TGGGAGATGC AGTTAACGTG 4351 TTAAAGCAAA AATTTACAGA TGAGGGGGAA AGCATTAAGC TTCCAAATAG 4401 CTCTCAGTCG TCCATTTCAC CAGTGGGTTG TGTAAATGGA AAAGTTGGAG 4451 ATTTAAACAG TATTACCAAA CATACCCCTG ACTGTTTGGG AGAAGAATTA 4501 CAAGGAAAAC ATGATGTGTT TACATTTTAT GATTACTCAT ACCTCCAAGG 4551 CTCAAAACTC AAATTACCAA TGATAATGAA ACAGTCACAA AGCGAAAAAG 4601 TGCATGTGGA GGATCCCCTG CTTCGTGGTT TTTATTTTGA TAAAAAATCA 4651 TGCAAATCTA AACATCAGAC TACAGAGTTA CAACCAGATG TACCTCCCCA 4701 TGAAAGGATT TTGGCAAGTG CATCTCATGA AATGGATCGC ATTTCATATA 4751 AAAGTGGCAA TATAGAAAAG ACATTCACTG GCATGCAGAA TGCCAAACAG 4801 CTCTCCCTTT TATCTCATAG TTCATCTATT GAGTCCCTTT CTCCAGGGGG 4851 TGATTTATTT GGATTGGGCA TCTTTAAAAA TGGCAGTGAC AGCCTCCAGC 4901 GAAGCACTTC TTTAGAAAGT TGGTTGACTT CCTATAAAAG CAATGAAGAT 4951 CTCTTTAGCT GTCACAGCTC TGGGGATATA AGCGTGAGCA GTGGCTCAGT 5001 TGGTGAACTA AGTAAAAGAA CATTAGATCT CCTGAATCGT TTGGAGAATA 5051 TCCAGAGCCC CTCAGAGCAA AAGATAAAAC GAAGTGTTTC TGATATCACT 5101 CTTCAAAGCA GTTCCCAAAA GATGTCCTTT ACTGGCCAGA TGTCATTGGA 5151 CATAGCATCT TCTATCAATG AAGACTCAGC GGCATCTCTA ACAGAACTTA 5201 GCAGCAGTGA CGAGCTCTCT CTTTGCTCAG AGGATATTGT GTTACACAAG 5251 AACAAGATCC CGGAATCGAA TGCATCGTTC AGGAAGCGTC TGACTCGTTC 5301 AGTGGCTGAT GAAAGCGATG TCAATGTCAG CATGATTGTT AATGTCTCTT 5351 GCACCTCTGC TTGCACTGAT GATGAAGATG ACAGCGACCT GCTCTCCAGC 5401 TCTACCCTTA CCTTGACTGA AGAAGAGCTG TGCATCAAAG ATGAGGATGA 5451 CGACTCCAGT ATTGCAACAG ATGATGAAAT TTATGAAGAC TGCACCTTGA 5501 TGTCAGGGCT AGACTACATA AAGAATGAAT TACAGACCTG GATTAGGCCA 5551 AAATTGTCTT TGACAAGAGA TAAGAAAAGG TGCAATGTCA GTGATGAGAT 5601 GAAGGGCAGT AAAGATATAA GTAGCAGTGA GATGACCAAT CCCTCTGATA 5651 CTCTGAATAT TGAGACCCTT CTAAATGGCT CTGTAAAACG TGTCTCTGAA 5701 AATAATGGAA ATGGTAAGAA TTCATCTCAT ACCCATGAGT TAGGGACAAA 5751 GCGTGAAAAT AAGAAAACTA TTTTCAAAGT TAATAAAGAT CCATATGTGG 5801 CTGACATGGA AAATGGCAAT ATTGAAGGTA TTCCAGAAAG GCAAAAGGGC 5851 AAACCGAATG TGACTTCAAA GGTATCAGAA AATCTTGGTT CACATGGGAA 5901 AGAGATTTCA GAGAGTGAGC ATTGTAAGTG TAAAGCACTT ATGGATAGTT 5951 TAGATGATTC AAATACTGCT GGCAAGGAAT TTGTTTCCCA AGATGTTAGA 6001 CATCTTCCAA AGAAATGTCC AAATCACCAC CATTTTGAAA ATCAAAGCAC 6051 TGCCTCTACT CCCACTGAGA AGTCTTTCTC AGAACTGGCT TTAGAAACCA 6101 GGTTTAACAA CAGACAAGAC TCTGATGCAC TGAAATCATC TGATGATGCA 6151 CCGAGTATGG CTGGAAAATC TGCTGGTTGT TGCCTAGCAC TTGAACAAAA 6201 CGGAACAGAG GAAAATGCTT CTATCAGCAA CATTTCCTGT TGCAACTGTG 6251 AGCCAGATGT TTTCCATCAA AAAGATGCCG AAGATTGTTC AGTACACAAC 6301 TTTGTTAAGG AAATCATTGA CATGGCTTCG ACAGCCCTAA AAAGTAAATC 6351 TCAACCTGAA AACGAGGTGG CTGCTCCTAC TTCATTAACT CAAATCAAGG 6401 AGAAAGTGTT GGAGCATTCT CACCGGCCCA TCCAGCTGAG AAAAGGGGAC 6451 TTTTATTCGT ACTTATCTCT CTCATCTCAT GACAGTGATT GTGGGGAGGT 6501 CACCAATTAC ATAGAAGAGA AAAGCAGCAC TCCATTGCCA CTAGACACCA 6551 CTGACTCGGG CTTAGATGAC AAGGAAGATA TTGAATGCTT TTTTGAGGCC 6601 TGTGTTGAGG GTGACTCTGA TGGAGAGGAG CCTTGTTTCT CTAGTGCTCC 6651 TCCAAATGAA TCTGCAGTTC CCAGCGAAGC TGCAATGCCA CTACAAGCAA 6701 CAGCATGTTC TTCTGAGTTC AGTGATAGTT CTCTTTCAGC TGATGATGCA 6751 GATACAGTGG CTCTTTCAAG TCCTTCCTCT CAGGAAAGAG CTGAGGTTGG 6801 AAAGGAAGTG AATGGTTTGC CCCAAACTTC CAGTGGCTGT GCAGAAAACT 6851 TAGAGTTTAC TCCTTCAAAG CTTGACAGTG AAAAGGAAAG TTCCGGAAAA 6901 CCAGGTGAAT CTGGAATGCC AGAAGAACAT AATGCTGCTT CAGCCAAATC 6951 TAAAGTTCAA GACCTCTCCT TGAAGGCAAA TCAGCCAACA GACAAGGCCG 7001 CATTGCATCC CAGCCCCAAA ACTTTAACCT GTGAAGAAAA TCTTCTAAAC 7051 CTTCATGAAA AACGACATAG AAATATGCAT AGGTAGAATG TACCCCCTCC 7101 CCAAGCATGA AAATCATCTC ACTGAAAGAT ACGCCTGGCT GCAACTCAGG 7151 GGTGGCCTCA TCCTCCCGCC CTGGGCTGGC CTCTGGTTCC ATCACGTTTG 7201 TCACTGCCGT TTATTACATT GACTTCTCCC AAGATGAATC TTCCTTCCAA 7251 ATGTGTTTTC TCCACACAAG CCTTGTGATC TGAATGTGTG CGCTGGTTCT 7301 CTTTAGGTGA TCGTCTTTGA AGTTCAGCAA AGCTGCTTGT TCTCCCATGG 7351 ATTCCTGTCC CAAGCTACCT CTACCAACCC TCTCTCTCCA GCTAGACTTT 7401 TCTCTTTGCC TCCTCCCTTC CCTTCCACTC TTTAAAGTTC TGCAGTTCAC 7451 CAACTGGTAG TCCATTAAAT TCTCCTGTCT AGAATGACCC CCCCACCAGT 7501 ACTTGACCAA TTTCATGTAT CAATCTGGAT TTTTTTTTTA ACGGTATAAT 7551 GACTGTGCTT ATTGAAAGAG TTTTACCTAA AAAGCCAACA TTTGAATTGG 7601 TTGCAGCATA GAGAAGAAAC ACTGGTCCTT CTTTCAAAAT TAAGCAACTA 7651 TTAAAAGCGC CATTTTATTT ATTTCATTTA AAAAATAATC TATGCAGCAT 7701 TTCAAGAAAC AACCATATGG TGTTGTATAT TATAAACTGG TGACATTCTA 7751 CTATTGAATT ATGTACAACA TTTTCATTTT TTATGCTTCT TGAGGTGGTA 7801 ATGAGAAAAA AGTTTTTTAA AAAAGTGTGC CTTGCTGTAT TTCTTATACC 7851 ATTTATTAAA AAGCTGCTTT CACGGTAAAA TTATGTTGGT TTGAAAGGAG 7901 GAAATAGCAA GGTTAAGATG TGTGAATAAT TTCTGTATAT ATGTATAACC 7951 AAGTACAAAC ATTGATGTAT AATGACAGTA TAAAATGCTT TCATGTTTGT 8001 GATGTCTAGT GATGTGGAAA ATATAAGCCT TAAATCCATT AGATTGCATG 8051 GTAATTAAAA TTGGCATAAT AAACACAGAT TATTGGGGGA AAAGGAAAAT 8101 TAGTGATCTC TTCTACTATG TTCTTTACCA AATTGTTGCA TCTGGTTCTG 8151 AAAAAGTATA GCATGTAGCA GCTTCCAAAC ATATTCATAT TGCTTAAGAG 8201 GCTTAACATT ACCTAAACTA GAGACTAGAC GTAAAGCCTT CAGTTTTCAA 8251 AATCTTTCTG GTCACTATAA AGATCTTGGA ACAGCAAATG ATTAAATGTC 8301 AGTTCCCCTA AACCAATAAA CATTTATACT AGATTTTTTA TTTCCACTTA 8351 TCATTAATGA TTTAATGTTG GATTTCAGGT ACCTTGTATG TCTTAATTTA 8401 TTTTAAATAT TTATTTTGAA TGAGTTTGAT AGAAAGCTAG TAGAAAAGTA 8451 CAGAAAATTT GACTATTATT TATAGATTTC AGGTATATTT ATATGTGTAA 8501 AAGAAATTGA CAAAGAAATA TTTCATCTGG CCTTTACTGA CTCCTGTTAA 8551 ATGCAGTTTT AAATTTATAT CGTAACACCT ACTTAAGTGC CTGACACAGT 8601 AGGTATTCAA TAAAAATTTA CTGAATTAAA GGATTAAATT AGGTGACATG 8651 GTGACATCTA TCCCTTTATT TTGACACTAA AACATGGACA CAACTAGAAA 8701 GAGGTACAAT GCAATATAAA GTCACAATAG ATAATATATA TCAAATTTCT 8751 AAAAGGTAAA GAATGTTGTG GGTTCATGCA GTCACAGGAA TGACAATCAT 8801 TCAACAGATA GTTCAGAAAC ACTTTTTATC TGCAAGGCAC TATTCTAGAT 8851 CCAGAAGATG CAATGTTGAA CAAACAGACA AAGCCCTGCC CTCAGAAGGC 8901 TGTCCTGCAT TAGGAACAAG TGAACACGCA AATGACATGA AGTATTTGTT 8951 GCAGAGCTGA GGAACAGAGC AAATGTAGTG ATAGAAGCGC AATGAGAGAA 9001 GCAGCAGTGG GTACAAGGAG GAAGAAAAAG GGCTTGCAGA GAGTGGAAAG 9051 TTAGTGGAAT ATTCATGAAA CTTCATTGCA GGGGTAATAG AAGAAAAAGT 9101 AAATTGGGAG GACTTAATGG AAGGTCTTTT AAAAAGTTAA CTTGGAGCTT 9151 CTGTATGTAA AATGCTAGGT AATAAGGACA CTTTGTACAG GCTGTTTTGC 9201 ACCTGATTTT ATTTATCATT AGTGCCACGC CAAGATCATT TAGACGATGC 9251 TTATCTGTAA TTCTACCACT TTAATAACTA TTTGTATTTT TATGCCCCTT 9301 CTGATCTTTT CCATATGTAT TTCTAAATGG ATAAATTATT CTAGGCTTCT 9351 TAATAGGTAG TAATTTGTTC AAAAGCGGTT TTAGCCAGAC ATCTAGTTGC 9401 AGTGTTCAAG AGGATTATGG GGGAAAGAGA TTAGAGATAA TTGTCTAGTT 9451 AGGGGGCAGC TGGAGAAAAT AAGCTAAGTT TGCAATAACA GAGTACACAA 9501 GTATAGTGGC CCAGGATGTA GTGAAAGAAC AAATCCTAGA GTCTTTGAAA 9551 TTTCTAAGGG CATTCTAGAC CTCTGTTGGG ATATGGTATT ATTTTACATA 9601 CTGACACAAC CTAAATTTTC TTTGGGTAGT AACTAATGTC AAGTCTACAT 9651 CGACTGGTAA AACATTCAAA GAACAAACTG ACAATGATGT TCTACCTACT 9701 TGTTACATGC TCATGGAAGA CCGTGCAGTA TTGAAAGTAT TTGTTAATTA 9751 TCTGCTTAGT ATTAACACTA AATTTGTAGA ATGACTTTCA GGTTTGTTGA 9801 ACAATGCCTT TTCAGGTTGG AAGAAGAAAA ATAGCCTCAA TCTCCCACCC 9851 CATGTAGGCA CTACCTCCCC AATTACCCTT AGAAAATGAT CACACCAACT 9901 CTGCCTACAC ACTTCCAGTG ATAGTGGCTC ATTGTCTGTT AAGGCAAACT 9951 GTTCCACTGT TGGGCATATC TCTTTGTTAG AAAGTTCTTT CTTAGGTTGC 10001 TAAAATCTGC CTAGTACCCC GCTACCCTGT TCTGTCTTAT GGAGCAGCCC 10051 AGATTATCTT TACTCCCTCT TTCTCATGGC AACCCTGAAG ATAATCAAGG 10101 CCAGTTACTC ATCATCTCCC AACCACTGTT TCCTCAACTG CCCTTCATAT 10151 GTCATGGTTT TCAGATCCAT TCCAACCTGA CTGAATGTTA ACAGACAGAA 10201 TTCTTCACAT TAAGGAACTG TCTTCATCAT CATACATGTA GAAAAGAATC 10251 TGAACATTTA AGTGCGAAGT TTTCTCTAGA AATATATTCA AGATATGTTT 10301 ATTCTATTAT TGTAAATTTC AAACAATAAA TAAATAAGAA TCC // LOCUS AB002343 6387 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0345 gene, complete cds. ACCESSION AB002343 NID g2224630 VERSION AB002343.1 GI:2224630 KEYWORDS KIAA0345. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1491. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6387) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .6387 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1491" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 725. .3253 /gene="KIAA0345" CDS 725. .3253 /gene="KIAA0345" /codon_start=1 /protein_id="BAA20803.1" /db_xref="PID:d1021641" /db_xref="PID:g2224631" /db_xref="GI:2224631" /translation="MLYSSRGDPEGQPLLLSLLILAMWVVGSGQLHYSVPEEAEHGTF VGRIAQDLGLELAELVPRLFQLDSKGRGDLLEVNLQNGILFVNSRIDREELCGRSAEC SIHLEVIVDRPLQVFHVDVEVKDINDNPPVFPATQKNLFIAESRPLDSRFPLEGASDA DIGENALLTYRLSPNEYFFLDVPTSNQQVKPLGLVLRKLLDREETPELHLLLTATDGG KPELTGTVQLLITVLDNNDNAPVFDRTLYTVKLPENVSIGTLVIHPNASDLDEGLNGD IIYSFSSDVSPDIKSKFHMDPLSGAITVIGHMDFEESRAHKIPVEAVDKGFPPLAGHC TLLVEVVDVNDNAPQLTIKTLSVPVKEDAQLGTVIALISVIDLDADANGQVTCSLTPH VPFKLVSTYKNYYSLVLDRALDRESVSAYELVVTARDGGSPSLWATARVSVEVADVND NAPAFAQSEYTVFVKENNPPGCHIFTVSARDADAQENALVSYSLVERRLGERSLSSYV SVHAESGKVYALQPLDHEELELLQFQVSARDAGVPPLGSNVTLQVFVLDENDNAPALL TPRMRGTDGAVSEMVLRSVGAGVVVGKVRAVDADSGYNAWLSYELQPETASASIPFRV GLYTGEISTTRALDETDAPRQRLLVLVKDHGEPALTATATVLVSLVESGQAPKSSSRA SVGATGPEVTLVDVNVYLIIAICAVSSLLVLTLLLYTVLRCSAMPTEGECAPGKPTLV CSSAVGSWSYSQQRRQRVCSGEGKQKTDLMAFSPGLSPCAGSTERTGEPSASSDSTGK VGFSSILFIYIIFFLERYYRLLPGAVQIVLFIFLEIQQIFFLIK" BASE COUNT 1612 a 1435 c 1494 g 1846 t ORIGIN 1 CTCGCTTTTC TTGCAATATT TTATACCTTT TCAATTCATA GAATTACTCA 51 AGAAAACTAC CTCAGTTGGT TGCTACTTTT TGTTGATTCC TTTTACCAGA 101 CATGACTAAG TTTCTTTTTC ATCAGTAGAT TTCTGGGCTC CTATATTCAC 151 TAGAGATTGC AACTCCTGGA TTTCTCTTAC ACTAGAATCC TATTTCGAGC 201 CATATGGGAG ATTCTGAATT CCAGAACAAA AGAATTTTGT AATTTAAAAT 251 TCGTGATTGC TCAATGGAAT CATTTTAATT GTTACTTCAT TTCTGTCGTT 301 ATTTAAAACT TAAGTGGAGA GTTTTCTCAG GGATAAGAAA ACCACAATCA 351 AGGTCATACA AAACTTTTAG AGGCAGTCAG TCTGCTAAGA AGGCTCCAGC 401 AAGAGAAACG GGATCTTCTG TTTCAACAAT CATTACTTAA GAAAAAATTA 451 AGAAAATGAA ATAAGTTTTG CAGAATAACT GTGAAATTTT TATTCATGAA 501 ATATGTACTT ACACTTTGGG CCACGTGATG TCACTCTTTG CCGCGATGTT 551 CTCTCTGAAT CCAGACAAAT ACAGCCCTTT TCCCATGGGA AAGAGGCTCA 601 ATTCTTTTTC ACTCTCTCTG TGCTGAACGA TGGCGAACAC AGCAGAATGG 651 GACTGACGAA ATCAGATGAT TTCTTCTAAT TTGGAGGCAA TTTTCACTAA 701 TTAGAAGAAG ACTGAGTATT TGAAATGTTA TACTCAAGTC GAGGAGATCC 751 AGAGGGTCAG CCTCTACTGC TCTCGCTTCT GATCCTCGCA ATGTGGGTGG 801 TGGGGAGCGG CCAGCTCCAC TACTCCGTCC CGGAGGAAGC CGAACACGGC 851 ACCTTCGTGG GCCGCATCGC GCAGGACCTG GGGCTGGAGC TGGCGGAGCT 901 GGTGCCGCGC CTGTTCCAGT TGGATTCCAA AGGCCGCGGG GACCTTCTGG 951 AGGTAAATCT GCAGAATGGC ATTTTGTTTG TGAATTCTCG GATCGACCGC 1001 GAGGAGCTGT GCGGGCGGAG CGCGGAGTGC AGCATCCACC TGGAGGTGAT 1051 CGTAGACAGG CCGCTGCAGG TTTTCCATGT GGACGTGGAG GTGAAGGACA 1101 TTAACGACAA CCCTCCAGTG TTCCCAGCGA CACAAAAGAA TCTGTTCATC 1151 GCGGAATCCA GGCCGCTTGA CTCTCGGTTT CCACTAGAGG GCGCGTCCGA 1201 TGCAGATATC GGGGAGAACG CCCTGCTCAC TTACAGACTG AGCCCCAATG 1251 AGTATTTCTT CCTGGACGTG CCAACCAGCA ACCAGCAGGT AAAACCTCTT 1301 GGACTTGTAT TACGGAAACT TTTAGACAGA GAAGAAACTC CGGAGCTTCA 1351 TTTATTGCTC ACGGCCACCG ATGGAGGCAA ACCCGAGCTG ACTGGCACCG 1401 TTCAATTACT CATCACGGTA CTGGACAACA ATGACAATGC CCCAGTGTTC 1451 GACAGAACCC TGTATACGGT GAAATTACCA GAAAACGTTT CTATCGGAAC 1501 GCTGGTGATT CACCCCAATG CCTCAGATTT AGACGAAGGC TTGAATGGGG 1551 ATATTATTTA CTCCTTCTCC AGTGATGTTT CTCCAGATAT AAAATCCAAG 1601 TTCCACATGG ACCCCTTAAG TGGGGCAATC ACAGTGATAG GACATATGGA 1651 TTTTGAAGAA AGTAGAGCAC ACAAGATCCC AGTCGAGGCT GTCGATAAAG 1701 GCTTCCCACC CCTGGCTGGT CATTGTACAC TTCTTGTGGA AGTTGTGGAT 1751 GTAAATGACA ATGCTCCACA GTTGACTATC AAAACGCTCT CGGTTCCTGT 1801 AAAAGAGGAC GCACAACTGG GGACAGTTAT TGCCCTGATT AGTGTGATCG 1851 ACCTAGACGC AGATGCCAAC GGGCAGGTGA CCTGCTCCCT GACGCCCCAC 1901 GTCCCCTTCA AGCTGGTGTC CACCTACAAG AATTACTACT CGTTGGTGCT 1951 GGACAGAGCT CTGGACCGCG AGAGTGTGTC CGCCTACGAG CTGGTGGTTA 2001 CCGCGCGGGA CGGGGGCTCG CCTTCACTGT GGGCCACGGC CAGGGTGTCT 2051 GTGGAGGTGG CCGACGTGAA CGACAACGCA CCAGCGTTCG CGCAGTCCGA 2101 GTACACGGTG TTCGTGAAGG AGAACAACCC GCCGGGCTGC CACATCTTCA 2151 CGGTGTCTGC GCGGGACGCT GACGCGCAGG AGAACGCCCT GGTGTCCTAC 2201 TCGCTGGTGG AGCGGCGGTT GGGCGAGCGC TCGCTGTCGA GCTACGTGTC 2251 AGTGCACGCG GAGAGCGGCA AGGTGTACGC GCTGCAGCCG TTGGACCACG 2301 AGGAGCTGGA GCTGCTACAG TTCCAGGTGA GCGCGCGCGA CGCGGGCGTG 2351 CCGCCTCTGG GCAGCAACGT GACGCTGCAG GTGTTCGTGC TGGACGAGAA 2401 CGACAATGCG CCGGCGCTGC TGACACCTCG GATGAGGGGC ACTGACGGCG 2451 CAGTGAGCGA GATGGTGCTG CGGTCGGTGG GCGCCGGCGT AGTGGTGGGG 2501 AAGGTGCGCG CAGTGGACGC CGACTCGGGC TACAACGCGT GGCTTTCATA 2551 CGAGCTGCAG CCAGAAACGG CCAGCGCGAG CATCCCGTTC CGCGTGGGGC 2601 TGTACACGGG CGAGATCAGC ACAACGCGTG CCCTGGACGA AACGGACGCA 2651 CCGCGCCAGC GCCTACTGGT GCTGGTGAAA GACCACGGGG AGCCAGCGCT 2701 GACGGCCACG GCCACTGTGC TGGTGTCGCT GGTGGAGAGC GGCCAGGCGC 2751 CAAAGTCATC GTCGCGGGCG TCAGTGGGTG CCACGGGCCC CGAGGTGACG 2801 CTGGTGGATG TCAACGTGTA CCTGATCATC GCCATCTGCG CGGTGTCTAG 2851 CCTGTTGGTT CTCACGCTGC TGCTGTACAC TGTGCTGCGG TGCTCGGCGA 2901 TGCCCACCGA GGGCGAGTGC GCGCCTGGCA AGCCGACGCT GGTGTGTTCT 2951 AGCGCGGTGG GGAGTTGGTC GTACTCGCAG CAGAGGAGGC AGAGGGTGTG 3001 CTCTGGCGAG GGTAAGCAGA AGACCGACCT CATGGCCTTC AGCCCGGGCC 3051 TTTCTCCTTG TGCTGGATCT ACAGAGCGAA CGGGAGAACC CTCTGCTTCC 3101 TCAGATTCAA CTGGGAAGGT GGGTTTTTCT AGCATTTTAT TTATTTATAT 3151 AATTTTTTTT CTTGAAAGAT ATTATCGATT ACTCCCAGGG GCCGTTCAAA 3201 TAGTTTTATT CATTTTTCTA GAAATCCAGC AGATTTTTTT TCTGATAAAG 3251 TAAACCCCTT AACATTGGAG CCGACTTTGT CTTGACTTCT AGTGAGAATT 3301 ATAAACTGTA TATTAAATAG ATATTTTTTG GGTGCTGAAT CAATTTTATT 3351 TAAATTTGTG ATTAAAGTGA CATTGAATTT CTGATGCTAT GCTGCCATAA 3401 CACTTGAAAA CCAATTTAGT TGTTAGTCAT TCATTAAACA TTAACATCAC 3451 TATCATTTAT TTATTGCTAA ATGATGCATA GTATTTTAGT CTACTTGTAT 3501 TGTTTATAAG AAACCCAAGC AAAAATATAT AGCAATTGTT ACCTTGTTAA 3551 GTTTGTAGTT CTCTACATTT CTCTGGATGG AGACTGTGAA CATCTGATTG 3601 TTCAGCAACC TTCAGTATCT ATTATTTTAA TAAGAAAGAA ACTTCCCCTA 3651 AACTTTAGAA AACAGTTGCT CCACTTTAGG AATCAAATTA TGTCAATAAA 3701 TGTTATAAAC ACAGCCTTCA TTTCAACTTA TATAAAATAT GTTTTAAAAT 3751 GCCTGACAAT GTAGATAATT CAAGAAATGT TGACTGAAAT TTTGTCTACA 3801 CTTAGAACAT TTTTTGAAAT TCAGTTTACA GAAATTGGAG AAAATGCTTT 3851 TTAAACAAGT GTTTCCTTTC TTCAAGAAGA CATTCTCCTT TTAATTGAAA 3901 TTTTCTCCAT TCAGTGATAA AATGATCAGC CATGTGAAGA TTCGAAACTT 3951 CGAGTTCTTT TGAAATTCAG AGTCTGTAAC TTAAAACATT ACCCTTATGA 4001 ATTTAGATGA GAATTCACTT GTTCTGTCAG TAATCCATAA GACAGAAATC 4051 TGTTTTTTTA AAAATATCTT TTTCTCCTCT CAGCTCATAC ATAACACAAG 4101 GCAGAAATCT GGATATGAGA TTTGCCTCTT TAATGTCACT ACATGTTATG 4151 TTTCCTGAAT TGTAGTTTGT GACTTTCAAA ATGGTGGTTT TCCACACTCT 4201 ACCTTTAGTG CAAGCTATTT GTTTGTTTTC TAATTTATAG TTTTAAAAAC 4251 TTCGCTTATT GAGTTTTTGT TATGTGGTTT ATATTTTTCT TTCTCTTTCA 4301 GCTATTTTAT TTAATATTGT GTCAGATATT TTACAAGGTA TGACCTAATT 4351 AAAAACTCAG TAGAGAAAGA TCAGAATGGC CTTGAGAATA GAGCCACAAA 4401 AATAACTATG AAAATGCCAG TAACGTTTAT TTAAAACAAA ATATTTTAAT 4451 TTTTAAATTT TCCCTTAAAA CACACTTTTG GAATATGCTA CAATATTACA 4501 TGTTTTTTGT CTTTTTATTT TTCTGAGACG GAGTCGTTTT CTGCCACCCA 4551 GGCTGGAGTA CAGTGGCATG ATCTTGGCTC ACTGCAGCGT CTGCCTCCTG 4601 GGTTCGAGCA ATTCTCCTGC CTCAGCCTCC TGAGTAGCTG GGATTATAGG 4651 CACATGCCAC CGCGCCCAGC TAATTTTTGT ATTTTTAGTA GAGATGGGGT 4701 TTCATCATGT TGGCCAGGTT GGTCTCGAAC TCCTGACCTT GTGATGCTCC 4751 CACCTCGGCC TCCCAAAGTG CTGGGATTAA AGCTGTGAGC CACTGTGCCA 4801 AGGCTTTTTT ATTTTTTTTT TTTTGTCATT TTCTTTCAAA ACTTGAGTGG 4851 TCTCTGAGCT CCTGTCATTA AACCTATCTA TATCTGTCTA TCAGCACAAC 4901 TCACCTTGAA TATAGTCTTA TACTTTCAAG TATCTTTGTC TTTGCACGTT 4951 TTTCAAGTTT CATGTGCCAT TTAAACTTGG ACCCAGGTAT CTGATTATTT 5001 GATGTGAATA GAGGGATGCT ACAGATGTCA TTTGTCTCCC GCCCTAAGTC 5051 CTCCAGTCTC CTTAGAGCTA GTACTTACTA AGCATTTACT ATGTCATCAA 5101 TAATCATAAA ACGTATTTTT TTTTTTGAGT CAGAGTCTCG CTCTGTTGCC 5151 CGGGCTGGAG TGCAGTGGTG CCATCTTGGC TCACTCCAGG CTCCCCCTCC 5201 CGTGTTCACG CCATTCTCCT GCCTCAGCCT CCCGAGTGGC TGGGACTGCA 5251 GGCGCCTGCC ACCGTGCCCG CCTAGTTTTT TTGTATTTTT GGTAGAGATG 5301 GGGTTTCACC GTGTTAGTCA GGATGGTCTC GATCTCCTGA CCTCATGATC 5351 CTCCCGCCTC GGCCTCCCAA AATGCTGGGA TTGCAGGCGT GAGCCACCGC 5401 GCCTGGCCTA AAATGTGTTC TTTATTATTG ACGGCTGTAT TGATGGGATT 5451 GGTAATTTAG TCCTTCATAT TAATCTCTAT TCTCTCTCAG AGTACAAGCT 5501 CTCATCATAT GCAAATTCTC AGAAGGGCTG TGAACACCTT AGTAATAAAT 5551 TTATCTTTTG AGGTCATTAG CAAACATGAA CTCACAGGGA TCCAGAGATG 5601 GTAAAATTCA AAACAGCCTG TCAAGTTCAA AACAGAGAGG TGAAAGCAGA 5651 AGAGACACTT TCCTATTTTG CCTAATAGGT CTCCTTATAT GCATCTGTAG 5701 TTAACATTCC TCAATTCAAG TTAGAATCAT GAAACAATAA TGAAGCTCCT 5751 CCTATGTCTC TTTTCAAGTT GTAATTACTA TATAGGAAAA ACTAAGTTGT 5801 CACCCAATAT CTTAGACACT TTGAGAGCAA AGGGGGTGCT GTAAATAAGT 5851 ATACAAGATC ACAGACCTAA ATTGAGCCTG TTCCAGACAA ATTGGGGCCT 5901 ATGGTCAACC TATCCTTAGA CCTGCTAACG CATTAGCATT AGCAGCACCT 5951 AAGTCCTCAT TGAATGTTCT GGTTCAAGGC TCCACCTCAG AAATTCTGAA 6001 ATGGGTAGTA AGAGCAAATT TTCATTTTAA AGCACACCTG AGATGATTCT 6051 CATACAACCG AAATTTTAGA TCCATAGCCC TATTTGATAC TTGACAGTGC 6101 AAGTTTCTGT AATTTAAAAA GATGTGGTGG CCTGACGCCT GCGGTCCCCG 6151 TTTTGGGAGG CCGAGGTGGG AGGGTCCCTT CCTTGAGCCC AGCAGTTTGA 6201 GACCAATGTA GTGAGACTCA TCTCTGCCAG AAAAAAAAGA TTGGCCGGGC 6251 GTGGTGGCAC ACATCTCTGG TCCCAATTAC TCGGGAGGCT GAGGCGAGAG 6301 AATCGCTTGA GCCTGGGACA TTGAGGCTGC AGTGAGCTGT GATGGCACAG 6351 CTGCATTTCA GCCCGGGTGA CAGCGAGATT CTGTCTC // LOCUS HSU47741 8694 bp mRNA PRI 29-SEP-1997 DEFINITION Human CREB-binding protein (CBP) mRNA, complete cds. ACCESSION U47741 NID g2443858 VERSION U47741.1 GI:2443858 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8694) AUTHORS Borrow,J., Stanton,V.P., Andresen,J.M., Becher,R., Behm,F.G., Chaganti,R.S.K., Civin,C.I., Disteche,C., Dube,I., Frischauf,A.M., Horsman,D., Mitelman,F., Volinia,S., Watmore,A.E. and Housman,D.E. TITLE The translocation t(8;16)(p11;p13) of acute myeloid leukaemia fuses a putative acetyltransferase to the CREB-binding protein JOURNAL Nature Genet. 14 (1), 33-41 (1996) MEDLINE 96376968 REFERENCE 2 (bases 1 to 8694) AUTHORS Sobulo,O.M., Borrow,J., Tomek,R., Reshmi,S., Harden,A., Schlegelberger,B., Housman,D., Doggett,N.A., Rowley,J.D. and Zeleznik-Le,N.J. TITLE MLL is fused to CBP, a histone acetyltransferase, in therapy-related acute myeloid leukemia with a t(11;16)(q23;p13.3) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (16), 8732-8737 (1997) MEDLINE 97385172 REFERENCE 3 (bases 1 to 8694) AUTHORS Borrow,J., Stanton,V.P., Andresen,J.M., Becher,R., Behm,F.G., Chaganti,R.S.K., Civin,C.I., Disteche,C., Dube,I., Frischauf,A.M., Horsman,D., Mitelman,F., Volinia,S., Watmore,A.E. and Housman,D.E. TITLE Direct Submission JOURNAL Submitted (30-JAN-1996) Julian Borrow, Center for Cancer Research, E17-540, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA COMMENT On Sep 29, 1997 this sequence version replaced gi:1517911. FEATURES Location/Qualifiers source 1. .8694 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13" /cell_line="U937" gene 1. .8694 /gene="CBP" CDS 199. .7527 /gene="CBP" /note="transcriptional adaptor" /codon_start=1 /product="CREB-binding protein" /protein_id="AAC51770.1" /db_xref="PID:g2443859" /db_xref="GI:2443859" /translation="MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDEL IPNGGELGLLNSGNLVPDAASKHKQLSELLRGGSGSSINPGIGNVSASSPVQQGLGGQ AQGQPNSANMASLSAMGKSPLSQGDSSAPSLPKQAASTSGPTPAASQALNPQAQKQVG LATSSPATSQTGPGICMNANFNQTHPGLLNSNSGHSLINQASQGQAQVMNGSLGAAGR GRGAGMPYPTPAMQGASSSVLAETLTQVSPQMTGHAGLNTAQAGGMAKMGITGNTSPF GQPFSQAGGQPMGATGVNPQLASKQSMVNSLPTFPTDIKNTSVTNVPNMSQMQTSVGI VPTQAIATGPTADPEKRKLIQQQLVLLLHAHKCQRREQANGEVRACSLPHCRTMKNVL NHMTHCQAGKACQVAHCASSRQIISHWKNCTRHDCPVCLPLKNASDKRNQQTILGSPA SGIQNTIGSVGTGQQNATSLSNPNPIDPSSMQRAYAALGLPYMNQPQTQLQPQVPGQQ PAQPQTHQQMRTLNPLGNNPMNIPAGGITTDQQPPNLISESALPTSLGATNPLMNDGS NSGNIGTLSTIPTAAPPSSTGVRKGWHEHVTQDLRSHLVHKLVQAIFPTPDPAALKDR RMENLVAYAKKVEGDMYESANSRDEYYHLLAEKIYKIQKELEEKRRSRLHKQGILGNQ PALPAPGAQPPVIPQAQPVRPPNGPLSLPVNRMQVSQGMNSFNPMSLGNVQLPQAPMG PRAASPMNHSVQMNSMGSVPGMAISPSRMPQPPNMMGAHTNNMMAQAPAQSQFLPQNQ FPSSSGAMSVGMGQPPAQTGVSQGQVPGAALPNPLNMLGPQASQLPCPPVTQSPLHPT PPPASTAAGMPSLQHTTPPGMTPPQPAAPTQPSTPVSSSGQTPTPTPGSVPSATQTQS TPTVQAAAQAQVTPQPQTPVQPPSVATPQSSQQQPTPVHAQPPGTPLSQAAASIDNRV PTPSSVASAETNSQQPGPDVPVLEMKTETQAEDTEPDPGESKGEPRSEMMEEDLQGAS QVKEETDIAEQKSEPMEVDEKKPEVKVEVKEEEESSSNGTASQSTSPSQPRKKIFKPE ELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKNPMDLSTIKRKLDTGQ YQEPWQYVDDVWLMFNNAWLYNRKTSRVYKFCSKLAEVFEQEIDPVMQSLGYCCGRKY EFSPQTLCCYGKQLCTIPRDAAYYSYQNRYHFCEKCFTEIQGENVTLGDDPSQPQTTI SKDQFEKKKNDTLDPEPFVDCKECGRKMHQICVLHYDIIWPSGFVCDNCLKKTGRPRK ENKFSAKRLQTTRLGNHLEDRVNKFLRRQNHPEAGEVFVRVVASSDKTVEVKPGMKSR FVDSGEMSESFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSDCPPPNTRRVYISYLDS IHFFRPRCLRTAVYHEILIGYLEYVKKLGYVTGHIWACPPSEGDDYIFHCHPPDQKIP KPKRLQEWYKKMLDKAFAERIIHDYKDIFKQATEDRLTSAKELPYFEGDFWPNVLEES IKELEQEEEERKKEESTAASETTEGSQGDSKNAKKKNNKKTNKNKSSISRANKKKPSM PNVSNDLSQKLYATMEKHKEVFFVIHLHAGPVINTLPPIVDPDPLLSCDLMDGRDAFL TLARDKHWEFSSLRRSKWSTLCMLVELHTQGQDRFVYTCNECKHHVETRWHCTVCEDY DLCINCYNTKSHAHKMVKWGLGLDDEGSSQGEPQSKSPQESRRVSIQRCIQSLVHACQ CRNANCSLPSCQKMKRVVQHTKGCKRKTNGGCPVCKQLIALCCYHAKHCQENKCPVPF CLNIKHKLRQQQIQHRLQQAQLMRRRMATMNTRNVPQQSLPSPTSAPPGTPTQQPSTP QTPQPPAQPQPSPVSMSPAGFPSVARTQPPTTVSTGKPTSQVPAPPPPAQPPPAAVEA ARQIEREAQQQQHLYRVNINNSMPPGRTGMGTPGSQMAPVSLNVPRPNQVSGPVMPSM PPGQWQQAPLPQQQPMPGLPRPVISMQAQAAVAGPRMPSVQPPRSISPSALQDLLRTL KSPSSPQQQQQVLNILKSNPQLMAAFIKQRTAKYVANQPGMQPQPGLQSQPGMQPQPG MHQQPSLQNLNAMQAGVPRPGVPPQQQAMGGLNPQGQALNIMNPGHNPNMASMNPQYR EMLRRQLLQQQQQQQQQQQQQQQQQQGSAGMAGGMAGHGQFQQPQGPGGYPPAMQQQQ RMQQHLPLQGSSMGQMAAQMGQLGQMGQPGLGADSTPNIQQALQQRILQQQQMKQQIG SPGQPNPMSPQQHMLSGQPQASHLPGQQIATSLSNQVRSPAPVQSPRPQSQPPHSSPS PRIQPQPSPHHVSPQTGSPHPGLAVTMASSIDQGHLGNPEQSAMLPQLNTPSRSALSS ELSLVGDTTGDTLEKFVEGL" misc_feature 994. .995 /gene="CBP" /note="t(8;16) breakpoint position in MOZ-CBP fusion" BASE COUNT 2252 a 2569 c 2144 g 1729 t ORIGIN 1 TGAGGAATCA ACAGCCGCCA TCTTGTCGCG GACCCGACCG GGGCTTCGAG 51 CGCGATCTAC TCGGCCCCGC CGGTCCCGGG CCCCACAACC GCCCGCGCTC 101 GCTCCTCTCC CTCGCAGCCG GCAGGGCCCC CGACCCCCGT CCGGGCCCTC 151 GCCGGCCCGG CCGCCCGTGC CCGGGGCTGT TTTCGCGAGC AGGTGAAAAT 201 GGCTGAGAAC TTGCTGGACG GACCGCCCAA CCCCAAAAGA GCCAAACTCA 251 GCTCGCCCGG TTTCTCGGCG AATGACAGCA CAGATTTTGG ATCATTGTTT 301 GACTTGGAAA ATGATCTTCC TGATGAGCTG ATACCCAATG GAGGAGAATT 351 AGGCCTTTTA AACAGTGGGA ACCTTGTTCC AGATGCTGCT TCCAAACATA 401 AACAACTGTC GGAGCTTCTA CGAGGAGGCA GCGGCTCTAG TATCAACCCA 451 GGAATAGGAA ATGTGAGCGC CAGCAGCCCC GTGCAGCAGG GCCTGGGTGG 501 CCAGGCTCAA GGGCAGCCGA ACAGTGCTAA CATGGCCAGC CTCAGTGCCA 551 TGGGCAAGAG CCCTCTGAGC CAGGGAGATT CTTCAGCCCC CAGCCTGCCT 601 AAACAGGCAG CCAGCACCTC TGGGCCCACC CCCGCTGCCT CCCAAGCACT 651 GAATCCGCAA GCACAAAAGC AAGTGGGGCT GGCGACTAGC AGCCCTGCCA 701 CGTCACAGAC TGGACCTGGT ATCTGCATGA ATGCTAACTT TAACCAGACC 751 CACCCAGGCC TCCTCAATAG TAACTCTGGC CATAGCTTAA TTAATCAGGC 801 TTCACAAGGG CAGGCGCAAG TCATGAATGG ATCTCTTGGG GCTGCTGGCA 851 GAGGAAGGGG AGCTGGAATG CCGTACCCTA CTCCAGCCAT GCAGGGCGCC 901 TCGAGCAGCG TGCTGGCTGA GACCCTAACG CAGGTTTCCC CGCAAATGAC 951 TGGTCACGCG GGACTGAACA CCGCACAGGC AGGAGGCATG GCCAAGATGG 1001 GAATAACTGG GAACACAAGT CCATTTGGAC AGCCCTTTAG TCAAGCTGGA 1051 GGGCAGCCAA TGGGAGCCAC TGGAGTGAAC CCCCAGTTAG CCAGCAAACA 1101 GAGCATGGTC AACAGTTTGC CCACCTTCCC TACAGATATC AAGAATACTT 1151 CAGTCACCAA CGTGCCAAAT ATGTCTCAGA TGCAAACATC AGTGGGAATT 1201 GTACCCACAC AAGCAATTGC AACAGGCCCC ACTGCAGATC CTGAAAAACG 1251 CAAACTGATA CAGCAGCAGC TGGTTCTACT GCTTCATGCT CATAAGTGTC 1301 AGAGACGAGA GCAAGCAAAC GGAGAGGTTC GGGCCTGCTC GCTCCCGCAT 1351 TGTCGAACCA TGAAAAACGT TTTGAATCAC ATGACGCATT GTCAGGCTGG 1401 GAAAGCCTGC CAAGTTGCCC ATTGTGCATC TTCACGACAA ATCATCTCTC 1451 ATTGGAAGAA CTGCACACGA CATGACTGTC CTGTTTGCCT CCCTTTGAAA 1501 AATGCCAGTG ACAAGCGAAA CCAACAAACC ATCCTGGGGT CTCCAGCTAG 1551 TGGAATTCAA AACACAATTG GTTCTGTTGG CACAGGGCAA CAGAATGCCA 1601 CTTCTTTAAG TAACCCAAAT CCCATAGACC CCAGCTCCAT GCAGCGAGCC 1651 TATGCTGCTC TCGGACTCCC CTACATGAAC CAGCCCCAGA CGCAGCTGCA 1701 GCCTCAGGTT CCTGGCCAGC AACCAGCACA GCCTCAAACC CACCAGCAGA 1751 TGAGGACTCT CAACCCCCTG GGAAATAATC CAATGAACAT TCCAGCAGGA 1801 GGAATAACAA CAGATCAGCA GCCCCCAAAC TTGATTTCAG AATCAGCTCT 1851 TCCGACTTCC CTGGGGGCCA CAAACCCACT GATGAACGAT GGCTCCAACT 1901 CTGGTAACAT TGGAACCCTC AGCACTATAC CAACAGCAGC TCCTCCTTCT 1951 AGCACCGGTG TAAGGAAAGG CTGGCACGAA CATGTCACTC AGGACCTGCG 2001 GAGCCATCTA GTGCATAAAC TCGTCCAAGC CATCTTCCCA ACACCTGATC 2051 CCGCAGCTCT AAAGGATCGC CGCATGGAAA ACCTGGTAGC CTATGCTAAG 2101 AAAGTGGAAG GGGACATGTA CGAGTCTGCC AACAGCAGGG ATGAATATTA 2151 TCACTTATTA GCAGAGAAAA TCTACAAGAT ACAAAAAGAA CTAGAAGAAA 2201 AACGGAGGTC GCGTTTACAT AAACAAGGCA TCTTGGGGAA CCAGCCAGCC 2251 TTACCAGCCC CGGGGGCTCA GCCCCCTGTG ATTCCACAGG CACAACCTGT 2301 GAGACCTCCA AATGGACCCC TGTCCCTGCC AGTGAATCGC ATGCAAGTTT 2351 CTCAAGGGAT GAATTCATTT AACCCCATGT CCTTGGGGAA CGTCCAGTTG 2401 CCACAAGCAC CCATGGGACC TCGTGCAGCC TCCCCAATGA ACCACTCTGT 2451 CCAGATGAAC AGCATGGGCT CAGTGCCAGG GATGGCCATT TCTCCTTCCC 2501 GAATGCCTCA GCCTCCGAAC ATGATGGGTG CACACACCAA CAACATGATG 2551 GCCCAGGCGC CCGCTCAGAG CCAGTTTCTG CCACAGAACC AGTTCCCGTC 2601 ATCCAGCGGG GCGATGAGTG TGGGCATGGG GCAGCCGCCA GCCCAAACAG 2651 GCGTGTCACA GGGACAGGTG CCTGGTGCTG CTCTTCCTAA CCCTCTCAAC 2701 ATGCTGGGGC CTCAGGCCAG CCAGCTACCT TGCCCTCCAG TGACACAGTC 2751 ACCACTGCAC CCAACACCGC CTCCTGCTTC CACGGCTGCT GGCATGCCAT 2801 CTCTCCAGCA CACGACACCA CCTGGGATGA CTCCTCCCCA GCCAGCAGCT 2851 CCCACTCAGC CATCAACTCC TGTGTCGTCT TCCGGGCAGA CTCCCACCCC 2901 GACTCCTGGC TCAGTGCCCA GTGCTACCCA AACCCAGAGC ACCCCTACAG 2951 TCCAGGCAGC AGCCCAGGCC CAGGTGACCC CGCAGCCTCA AACCCCAGTT 3001 CAGCCCCCGT CTGTGGCTAC CCCTCAGTCA TCGCAGCAAC AGCCGACGCC 3051 TGTGCACGCC CAGCCTCCTG GCACACCGCT TTCCCAGGCA GCAGCCAGCA 3101 TTGATAACAG AGTCCCTACC CCCTCCTCGG TGGCCAGCGC AGAAACCAAT 3151 TCCCAGCAGC CAGGACCTGA CGTACCTGTG CTGGAAATGA AGACGGAGAC 3201 CCAAGCAGAG GACACTGAGC CCGATCCTGG TGAATCCAAA GGGGAGCCCA 3251 GGTCTGAGAT GATGGAGGAG GATTTGCAAG GAGCTTCCCA AGTTAAAGAA 3301 GAAACAGACA TAGCAGAGCA GAAATCAGAA CCAATGGAAG TGGATGAAAA 3351 GAAACCTGAA GTGAAAGTAG AAGTTAAAGA GGAAGAAGAG AGTAGCAGTA 3401 ACGGCACAGC CTCTCAGTCA ACATCTCCTT CGCAGCCGCG CAAAAAAATC 3451 TTTAAACCAG AGGAGTTACG CCAGGCCCTC ATGCCAACCC TAGAAGCACT 3501 GTATCGACAG GACCCAGAGT CATTACCTTT CCGGCAGCCT GTAGATCCCC 3551 AGCTCCTCGG AATTCCAGAC TATTTTGACA TCGTAAAGAA TCCCATGGAC 3601 CTCTCCACCA TCAAGCGGAA GCTGGACACA GGGCAATACC AAGAGCCCTG 3651 GCAGTACGTG GACGACGTCT GGCTCATGTT CAACAATGCC TGGCTCTATA 3701 ATCGCAAGAC ATCCCGAGTC TATAAGTTTT GCAGTAAGCT TGCAGAGGTC 3751 TTTGAGCAGG AAATTGACCC TGTCATGCAG TCCCTTGGAT ATTGCTGTGG 3801 ACGCAAGTAT GAGTTTTCCC CACAGACTTT GTGCTGCTAT GGGAAGCAGC 3851 TGTGTACCAT TCCTCGCGAT GCTGCCTACT ACAGCTATCA GAATAGGTAT 3901 CATTTCTGTG AGAAGTGTTT CACAGAGATC CAGGGCGAGA ATGTGACCCT 3951 GGGTGACGAC CCTTCACAGC CCCAGACGAC AATTTCAAAG GATCAGTTTG 4001 AAAAGAAGAA AAATGATACC TTAGACCCCG AACCTTTCGT TGATTGCAAG 4051 GAGTGTGGCC GGAAGATGCA TCAGATTTGC GTTCTGCACT ATGACATCAT 4101 TTGGCCTTCA GGTTTTGTGT GCGACAACTG CTTGAAGAAA ACTGGCAGAC 4151 CTCGAAAAGA AAACAAATTC AGTGCTAAGA GGCTGCAGAC CACAAGACTG 4201 GGAAACCACT TGGAAGACCG AGTGAACAAA TTTTTGCGGC GCCAGAATCA 4251 CCCTGAAGCC GGGGAGGTTT TTGTCCGAGT GGTGGCCAGC TCAGACAAGA 4301 CGGTGGAGGT CAAGCCCGGG ATGAAGTCAC GGTTTGTGGA TTCTGGGGAA 4351 ATGTCTGAAT CTTTCCCATA TCGAACCAAA GCTCTGTTTG CTTTTGAGGA 4401 AATTGACGGC GTGGATGTCT GCTTTTTTGG AATGCACGTC CAAGAATACG 4451 GCTCTGATTG CCCCCCTCCA AACACGAGGC GTGTGTACAT TTCTTATCTG 4501 GATAGTATTC ATTTCTTCCG GCCACGTTGC CTCCGCACAG CCGTTTACCA 4551 TGAGATCCTT ATTGGATATT TAGAGTATGT GAAGAAATTA GGGTATGTGA 4601 CAGGGCACAT CTGGGCCTGT CCTCCAAGTG AAGGAGATGA TTACATCTTC 4651 CATTGCCACC CACCTGATCA AAAAATACCC AAGCCAAAAC GACTGCAGGA 4701 GTGGTACAAA AAGATGCTGG ACAAGGCGTT TGCAGAGCGG ATCATCCATG 4751 ACTACAAGGA TATTTTCAAA CAAGCAACTG AAGACAGGCT CACCAGTGCC 4801 AAGGAACTGC CCTATTTTGA AGGTGATTTC TGGCCCAATG TGTTAGAAGA 4851 GAGCATTAAG GAACTAGAAC AAGAAGAAGA GGAGAGGAAA AAGGAAGAGA 4901 GCACTGCAGC CAGTGAAACC ACTGAGGGCA GTCAGGGCGA CAGCAAGAAT 4951 GCCAAGAAGA AGAACAACAA GAAAACCAAC AAGAACAAAA GCAGCATCAG 5001 CCGCGCCAAC AAGAAGAAGC CCAGCATGCC CAACGTGTCC AATGACCTGT 5051 CCCAGAAGCT GTATGCCACC ATGGAGAAGC ACAAGGAGGT CTTCTTCGTG 5101 ATCCACCTGC ACGCTGGGCC TGTCATCAAC ACCCTGCCCC CCATCGTCGA 5151 CCCCGACCCC CTGCTCAGCT GTGACCTCAT GGATGGGCGC GACGCCTTCC 5201 TCACCCTCGC CAGAGACAAG CACTGGGAGT TCTCCTCCTT GCGCCGCTCC 5251 AAGTGGTCCA CGCTCTGCAT GCTGGTGGAG CTGCACACCC AGGGCCAGGA 5301 CCGCTTTGTC TACACCTGCA ACGAGTGCAA GCACCACGTG GAGACGCGCT 5351 GGCACTGCAC TGTGTGCGAG GACTACGACC TCTGCATCAA CTGCTATAAC 5401 ACGAAGAGCC ATGCCCATAA GATGGTGAAG TGGGGGCTGG GCCTGGATGA 5451 CGAGGGCAGC AGCCAGGGCG AGCCACAGTC AAAGAGCCCC CAGGAGTCAC 5501 GCCGGGTGAG CATCCAGCGC TGCATCCAGT CGCTGGTGCA CGCGTGCCAG 5551 TGCCGCAACG CCAACTGCTC GCTGCCATCC TGCCAGAAGA TGAAGCGGGT 5601 GGTGCAGCAC ACCAAGGGCT GCAAACGCAA GACCAACGGG GGCTGCCCGG 5651 TGTGCAAGCA GCTCATCGCC CTCTGCTGCT ACCACGCCAA GCACTGCCAA 5701 GAAAACAAAT GCCCCGTGCC CTTCTGCCTC AACATCAAAC ACAAGCTCCG 5751 CCAGCAGCAG ATCCAGCACC GCCTGCAGCA GGCCCAGCTC ATGCGCCGGC 5801 GGATGGCCAC CATGAACACC CGCAACGTGC CTCAGCAGAG TCTGCCTTCT 5851 CCTACCTCAG CACCGCCCGG GACCCCCACA CAGCAGCCCA GCACACCCCA 5901 GACGCCGCAG CCCCCTGCCC AGCCCCAACC CTCACCCGTG AGCATGTCAC 5951 CAGCTGGCTT CCCCAGCGTG GCCCGGACTC AGCCCCCCAC CACGGTGTCC 6001 ACAGGGAAGC CTACCAGCCA GGTGCCGGCC CCCCCACCCC CGGCCCAGCC 6051 CCCTCCTGCA GCGGTGGAAG CGGCTCGGCA GATCGAGCGT GAGGCCCAGC 6101 AGCAGCAGCA CCTGTACCGG GTGAACATCA ACAACAGCAT GCCCCCAGGA 6151 CGCACGGGCA TGGGGACCCC GGGGAGCCAG ATGGCCCCCG TGAGCCTGAA 6201 TGTGCCCCGA CCCAACCAGG TGAGCGGGCC CGTCATGCCC AGCATGCCTC 6251 CCGGGCAGTG GCAGCAGGCG CCCCTTCCCC AGCAGCAGCC CATGCCAGGC 6301 TTGCCCAGGC CTGTGATATC CATGCAGGCC CAGGCGGCCG TGGCTGGGCC 6351 CCGGATGCCC AGCGTGCAGC CACCCAGGAG CATCTCACCC AGCGCTCTGC 6401 AAGACCTGCT GCGGACCCTG AAGTCGCCCA GCTCCCCTCA GCAGCAACAG 6451 CAGGTGCTGA ACATTCTCAA ATCAAACCCG CAGCTAATGG CAGCTTTCAT 6501 CAAACAGCGC ACAGCCAAGT ACGTGGCCAA TCAGCCCGGC ATGCAGCCCC 6551 AGCCTGGCCT CCAGTCCCAG CCCGGCATGC AACCCCAGCC TGGCATGCAC 6601 CAGCAGCCCA GCCTGCAGAA CCTGAATGCC ATGCAGGCTG GCGTGCCGCG 6651 GCCCGGTGTG CCTCCACAGC AGCAGGCGAT GGGAGGCCTG AACCCCCAGG 6701 GCCAGGCCTT GAACATCATG AACCCAGGAC ACAACCCCAA CATGGCGAGT 6751 ATGAATCCAC AGTACCGAGA AATGTTACGG AGGCAGCTGC TGCAGCAGCA 6801 GCAGCAACAG CAGCAGCAAC AACAGCAGCA ACAGCAGCAG CAGCAAGGGA 6851 GTGCCGGCAT GGCTGGGGGC ATGGCGGGGC ACGGCCAGTT CCAGCAGCCT 6901 CAAGGACCCG GAGGCTACCC ACCGGCCATG CAGCAGCAGC AGCGCATGCA 6951 GCAGCATCTC CCCCTCCAGG GCAGCTCCAT GGGCCAGATG GCGGCTCAGA 7001 TGGGACAGCT TGGCCAGATG GGGCAGCCGG GGCTGGGGGC AGACAGCACC 7051 CCCAACATCC AGCAAGCCCT GCAGCAGCGG ATTCTGCAGC AACAGCAGAT 7101 GAAGCAGCAG ATTGGGTCCC CAGGCCAGCC GAACCCCATG AGCCCCCAGC 7151 AACACATGCT CTCAGGACAG CCACAGGCCT CGCATCTCCC TGGCCAGCAG 7201 ATCGCCACGT CCCTTAGTAA CCAGGTGCGG TCTCCAGCCC CTGTCCAGTC 7251 TCCACGGCCC CAGTCCCAGC CTCCACATTC CAGCCCGTCA CCACGGATAC 7301 AGCCCCAGCC TTCGCCACAC CACGTCTCAC CCCAGACTGG TTCCCCCCAC 7351 CCCGGACTCG CAGTCACCAT GGCCAGCTCC ATAGATCAGG GACACTTGGG 7401 GAACCCCGAA CAGAGTGCAA TGCTCCCCCA GCTGAACACC CCCAGCAGGA 7451 GTGCGCTGTC CAGCGAACTG TCCCTGGTCG GGGACACCAC GGGGGACACG 7501 CTAGAGAAGT TTGTGGAGGG CTTGTAGCAT TGTGAGAGCA TCACCTTTTC 7551 CCTTTCATGT TCTTGGACCT TTTGTACTGA AAATCCAGGC ATCTAGGTTC 7601 TTTTTATTCC TAGATGGAAC TGCGACTTCC GAGCCATGGA AGGGTGGATT 7651 GATGTTTAAA GAAACAATAC AAAGAATATA TTTTTTTGTT AAAAACCAGT 7701 TGATTTAAAT ATCTGGTCTC TCTCTTTGGT TTTTTTTTGG CGGGGGGGTG 7751 GGGGGGGTTC TTTTTTTTCC GTTTTGTTTT TGTTTGGGGG GAGGGGGGTT 7801 TTGTTTGGAT TCTTTTTGTC GTCATTGCTG GTGACTCATG CCTTTTTTTA 7851 ACGGGAAAAA CAAGTTCATT ATATTCATAT TTTTTATTTG TATTTTCAAG 7901 ACTTTAAACA TTTATGTTTA AAAGTAAGAA GAAAAATAAT ATTCAGAACT 7951 GATTCCTGAA ATAATGCAAG CTTATAATGT ATCCCGATAA CTTTGTGATG 8001 TTTCGGGAAG ATTTTTTTCT ATAGTGAACT CTGTGGGCGT CTCCCAGTAT 8051 TACCCTGGAT GATAGGAATT GACTCCGGCG TGCACACACG TACACACCCA 8101 CACACATCTA TCTATACATA ATGGCTGAAG CCAAACTTGT CTTGCAGATG 8151 TAGAAATTGT TGCTTTGTTT CTCTGATAAA ACTGGTTTTA GACAAAAAAT 8201 AGGGATGATC ACTCTTAGAC CATGCTAATG TTACTAGAGA AGAAGCCTTC 8251 TTTTCTTTCT TCTATGTGAA ACTTGAAATG AGGAAAAGCA ATTCTAGTGT 8301 AAATCATGCA AGCGCTCTAA TTCCTATAAA TACGAAACTC GAGAAGATTC 8351 AATCACTGTA TAGAATGGTA AAATACCAAC TCATTTCTTA TATCATATTG 8401 TTAAATAAAC TGTGTGCAAC AGACAAAAAG GGTGGTCCTT CTTGAATTCA 8451 TGTACATGGT ATTAACACTT AGTGTTCGGG GTTTTTTGTT ATGAAAATGC 8501 TGTTTTCAAC ATTGTATTTG GACTATGCAT GTGTTTTTTC CCCATTGTAT 8551 ATAAAGTACC GCTTAAAATT GATATAAATT ACTGAGGTTT TTAACATGTA 8601 TTCTGTTCTT TAAGATCCCC TGTAAGAATG TTTAAGGTTT TTATTTATTT 8651 ATATATATTT TTTGGTCTGT TCTTTGTAAA AAAAAAAAAA AAAA // LOCUS AF104032 4670 bp mRNA PRI 17-MAR-1999 DEFINITION Homo sapiens L-type amino acid transporter subunit LAT1 mRNA, complete cds. ACCESSION AF104032 NID g4426639 VERSION AF104032.1 GI:4426639 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4670) AUTHORS Prasad,P.D., Wang,H., Huang,W., Kekuda,R., Rajan,D.P., Leibach,F.H. and Ganapathy,V. TITLE Human LAT1, a subunit of system L amino acid transporter: molecular cloning and transport function JOURNAL Biochem. Biophys. Res. Commun. 255 (2), 283-288 (1999) MEDLINE 99160855 REFERENCE 2 (bases 1 to 4670) AUTHORS Prasad,P.D., Wang,H., Huang,W., Kekuda,R., Rajan,D.P., Leibach,F.H. and Ganapathy,V. TITLE Direct Submission JOURNAL Submitted (02-NOV-1998) Obstetrics and Gynecology, Medical College of Georgia, 1120 15th Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1. .4670 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 67. .1590 /function="transports large neutral amino acids in association with 4F2HC antigen" /note="LAT1" /codon_start=1 /product="L-type amino acid transporter subunit LAT1" /protein_id="AAD20464.1" /db_xref="PID:g4426640" /db_xref="GI:4426640" /translation="MAGAGPKRRALAAPAAEEKEEAREKMLAAKSADGSAPAGEGEGV TLQRNITLLNGVAIIVGTIIGSGIFVTPTGVLKEAGSPGLALVVWAACGVFSIVGALC YAELGTTISKSGGDYAYMLEVYGSLPAFLKLWIELLIIRPSSQYIVALVFATYLLKPL FPTCPVPEEAAKLVACLCVLLLTAVNCYSVKAATRVQDAFAAAKLLALALIILLGFVQ IGKGDVSNLDPNFSFEGTKLDVGNIVLALYSGLFAYGGWNYLNFVTEEMINPYRNLPL AIIISLPIVTLVYVLTNLAYFTTLSTEQMLSSEAVAVDFGNYHLGVMSWIIPVFVGLS CFGSVNGSLFTSSRLFFVGSREGHLPSILSMIHPQLLTPVPSLVFTCVMTLLYAFSKD IFSVINFFSFFNWLCVALAIIGMIWLRHRKPELERPIKVNLALPVFFILACLFLIAVS FWKTPVECGIGFTIILSGLPVYFFGVWWKNKPKWLLQGIFSTTVLCQKLMQVVPQET" BASE COUNT 896 a 1471 c 1301 g 1002 t ORIGIN 1 GCGGCGCGCA CACTGCTCGC TGGGCCGCGG CTCCCGGGTG TCCCAGGCCC 51 GGCCGGTGCG CAGAGCATGG CGGGTGCGGG CCCGAAGCGG CGCGCGCTAG 101 CGGCGCCGGC GGCCGAGGAG AAGGAAGAGG CGCGGGAGAA GATGCTGGCC 151 GCCAAGAGCG CGGACGGCTC GGCGCCGGCA GGCGAGGGCG AGGGCGTGAC 201 CCTGCAGCGG AACATCACGC TGCTCAACGG CGTGGCCATC ATCGTGGGGA 251 CCATTATCGG CTCGGGCATC TTCGTGACGC CCACGGGCGT GCTCAAGGAG 301 GCAGGCTCGC CGGGGCTGGC GCTGGTGGTG TGGGCCGCGT GCGGCGTCTT 351 CTCCATCGTG GGCGCGCTCT GCTACGCGGA GCTCGGCACC ACCATCTCCA 401 AATCGGGCGG CGACTACGCC TACATGCTGG AGGTCTACGG CTCGCTGCCC 451 GCCTTCCTCA AGCTCTGGAT CGAGCTGCTC ATCATCCGGC CTTCATCGCA 501 GTACATCGTG GCCCTGGTCT TCGCCACCTA CCTGCTCAAG CCGCTCTTCC 551 CCACCTGCCC GGTGCCCGAG GAGGCAGCCA AGCTCGTGGC CTGCCTCTGC 601 GTGCTGCTGC TCACGGCCGT GAACTGCTAC AGCGTGAAGG CCGCCACCCG 651 GGTCCAGGAT GCCTTTGCCG CCGCCAAGCT CCTGGCCCTG GCCCTGATCA 701 TCCTGCTGGG CTTCGTCCAG ATCGGGAAGG GTGATGTGTC CAATCTAGAT 751 CCCAACTTCT CATTTGAAGG CACCAAACTG GATGTGGGGA ACATTGTGCT 801 GGCATTATAC AGCGGCCTCT TTGCCTATGG AGGATGGAAT TACTTGAATT 851 TCGTCACAGA GGAAATGATC AACCCCTACA GAAACCTGCC CCTGGCCATC 901 ATCATCTCCC TGCCCATCGT GACGCTGGTG TACGTGCTGA CCAACCTGGC 951 CTACTTCACC ACCCTGTCCA CCGAGCAGAT GCTGTCGTCC GAGGCCGTGG 1001 CCGTGGACTT CGGGAACTAT CACCTGGGCG TCATGTCCTG GATCATCCCC 1051 GTCTTCGTGG GCCTGTCCTG CTTCGGCTCC GTCAATGGGT CCCTGTTCAC 1101 ATCCTCCAGG CTCTTCTTCG TGGGGTCCCG GGAAGGCCAC CTGCCCTCCA 1151 TCCTCTCCAT GATCCACCCA CAGCTCCTCA CCCCCGTGCC GTCCCTCGTG 1201 TTCACGTGTG TGATGACGCT GCTCTACGCC TTCTCCAAGG ACATCTTCTC 1251 CGTCATCAAC TTCTTCAGCT TCTTCAACTG GCTCTGCGTG GCCCTGGCCA 1301 TCATCGGCAT GATCTGGCTG CGCCACAGAA AGCCTGAGCT TGAGCGGCCC 1351 ATCAAGGTGA ACCTGGCCCT GCCTGTGTTC TTCATCCTGG CCTGCCTCTT 1401 CCTGATCGCC GTCTCCTTCT GGAAGACACC CGTGGAGTGT GGCATCGGCT 1451 TCACCATCAT CCTCAGCGGG CTGCCCGTCT ACTTCTTCGG GGTCTGGTGG 1501 AAAAACAAGC CCAAGTGGCT CCTCCAGGGC ATCTTCTCCA CGACCGTCCT 1551 GTGTCAGAAG CTCATGCAGG TGGTCCCCCA GGAGACATAG CCAGGAGGCC 1601 GAGTGGCTGC CGGAGGAGCA TGCGCAGAGG CCAGTTAAAG TAGATCACCT 1651 CCTCGAACCC ACTCCGGTTC CCCGCAACCC ACAGCTCAGC TGCCCATCCC 1701 AGTCCCTCGC CGTCCCTCCC AGGTCGGGCA GTGGAGGCTG CTGTGAAAAC 1751 TCTGGTACGA ATCTCATCCC TCAACTGAGG GCCAGGGACC CAGGTGTGCC 1801 TGTGCTCCTG CCCAGGAGCA GCTTTTGGTC TCCTTGGGCC CTTTTTCCCT 1851 TCCCTCCTTT GTTTACTTAT ATATATATTT TTTTTAAACT TAAATTTTGG 1901 GTCAACTTGA CACCACTAAG ATGATTTTTT AAGGAGCTGG GGGAAGGCAG 1951 GAGCCTTCCT TTCTCCTGCC CCAAGGGCCC AGACCCTGGG CAAACAGAGC 2001 TACTGAGACT TGGAACCTCA TTGCTACGAC AGACTTGCAC TGAAGCCGGA 2051 CAGCTGCCCA GACACATGGG CTTGTGACAT TCGTGAAAAC CAACCCTGTG 2101 GGCTTATGTC TCTGCCTTAG GGTTTGCAGA GTGGAAACTC AGCCGTAGGG 2151 TGGCACTGGG AGGGGGTGGG GGATCTGGGC AAGGTGGGTG ATTCCTCTCA 2201 GGAGGTGCTT GAGGCCCCGA TGGACTCCTG ACCATAATCC TAGCCCTGAG 2251 ACACCATCCT GAGCCAGGGA ACAGCCCCAG GGTTGGGGGG TGCCGGCATC 2301 TCCCCTAGCT CACCAGGCCT GGCCTCTGGG CAGTGTGGCC TCTTGGCTAT 2351 TTCTGTGTCC AGTTTTGGAG GCTGAGTTCT GGTTCATGCA GACAAAGCCC 2401 TGTCCTTCAG TCTTCTAGAA ACAGAGACAA GAAAGGCAGA CACACCGCGG 2451 CCAGGCACCC ATGTGGGCGC CCACCCTGGG CTCCACACAG CAGTGTCCCC 2501 TGCCCCAGAG GTCGCAGCTA CCCTCAGCCT CCAATGCATT GGCCTCTGTA 2551 CCGCCCGGCA GCCCCTTCTG GCCGGTGCTG GGTTCCCACT CCCGGCCTAG 2601 GCACCTCCCC GCTCTCCCTG TCACGCTCAT GTCCTGTCCT GGTCCTGATG 2651 CCCGTTGTCT AGGAGACAGA GCCAAGCACT GCTCACGTCT CTGCCGCCTG 2701 CGTTTGGAGG CCCCTGGGCT CTCACCCAGT CCCCACCCGC CTGCAGAGAG 2751 GGAACTAGGG CACCCCTTGT TTCTGTTGTT CCCGTGAATT TTTTTCGCTA 2801 TGGGAGGCAG CCGAGGCCTG GCCAATGCGG CCCACTTTCC TGAGCTGTCG 2851 CTGCCTCCAT GGCAGCAGCC AAGGACCCCC AGAACAAGAA GACCCCCCCG 2901 CAGGATCCCT CCTGAGCTCG GGGGGCTCTG CCTTCTCAGG CCCCGGGCTT 2951 CCCTTCTCCC CAGCCAGAGG TGGAGCCAAG TGGTCCAGCG TCACTCCAGT 3001 GCTCAGCTGT GGCTGGAGGA GCTGGCCTGT GGCACAGCCC TGAGTGTCCC 3051 AAGCCGGGAG CCAACGAAGC CGGACACGGC TTCACTGACC AGCGGCTGCT 3101 CAAGCCGCAA GCTCTCAGCA AGTGCCCAGC GGAGCCTGCC GCCCCCACCT 3151 GGGCACCGGG ACCCCCTCAC CATCCAGTGG GCCCGGAGAA ACCTGATGAA 3201 CAGTTTGGGG ACTCAGGACC AGATGTCCGT CTCTCTTGCT TGAGGAATGA 3251 AGACCTTTAT TCACCCCTGC CCCGTTGCTT CCCGCTGCAC ATGGACAGAC 3301 TTCACAGCGT CTGCTCATAG GACCTGCATC CTTCCTGGGG ACGAATTCCA 3351 CTCGTCCAAG GGACAGCCCA CGGTCTGGAG GCCGAGGACC ACCAGCAGGC 3401 AGGTGGACTG ACTGTGTTGG GCAAGACCTC TTCCCTCTGG GCCTGTTCTC 3451 TTGGCTGCAA ATAAGGACAG CAGCTGGTGC CCCACCTGCC TGGTGCATTG 3501 CTGTGTGAAT CCAGGAGGCA GTGGACATCG TAGGCAGCCA CGGCCCCGGG 3551 TCCAGGAGAA GTGCTCCCTG GAGGCACGCA CCACTGCTTC CCACTGGGGC 3601 CGGCGGGGCC CACGCACGAC GTCAGCCTCT TACCTTCCCG CCTCGGCTAG 3651 GGGTCCTCGG GATGCCGTTC TGTTCCAACC TCCTGCTCTG GGACGTGGAC 3701 ATGCCTCAAG GATACAGGGA GCCGGCGGCC TCTCGACGGC ACGCACTTGC 3751 CTGTTGGCTG CTGCGGCTGT GGGCGAGCAT GGGGGCTGCC AGCGTCTGTT 3801 GTGGAAAGTA GCTGCTAGTG AAATGGCTGG GGCCGCTGGG GTCCGTCTTC 3851 ACACTGCGCA GGTCTCTTCT GGGCGTCTGA GCTGGGGTGG GAGCTCCTCC 3901 GCAGAAGGTT GGTGGGGGGT CCAGTCTGTG ATCCTTGGTG CTGTGTGCCC 3951 CACTCCAGCC TGGGGACCCC ACTTCAGAAG GTAGGGGCCG TGTCCCGCGG 4001 TGCTGACTGA GGCCTGCTTC CCCCTCCCCC TCCTGCTGTG CTGGAATTCC 4051 ACAGGGACCA GGGCCACCGC AGGGGACTGT CTCAGAAGAC TTGATTTTTC 4101 CGTCCCTTTT TCTCCACACT CCACTGACAA ACGTCCCCAG CGGTTTCCAC 4151 TTGTGGGCTT CAGGTGTTTT CAAGCACAAC CCACCACAAC AAGCAAGTGC 4201 ATTTTCAGTC GTTGTGCTTT TTTGTTTTGT GCTAACGTCT TACTAATTTA 4251 AAGATGCTGT CGGCACCATG TTTATTTATT TCCAGTGGTC ATGCTCAGCC 4301 TTGCTGCTCT GCGTGGCGCA GGTGCCATGC CTGCTCCCTG TCTGTGTCCC 4351 AGCCACGCAG GGCCATCCAC TGTGACGTCG GCCGACCAGG CTGGACACCC 4401 TCTGCCGAGT AATGACGTGT GTGGCTGGGA CCTTCTTTAT TCTGTGTTAA 4451 TGGCTAACCT GTTACACTGG GCTGGGTTGG GTAGGGTGTT CTGGCTTTTT 4501 TGTGGGGTTT TTATTTTTAA AGAAACACTC AATCATCCTA AAAAAAAAAA 4551 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 4601 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 4651 AAAAAAAAAA AAAAAAAAAA // LOCUS HUMIA1X 2838 bp mRNA PRI 31-DEC-1994 DEFINITION Human zinc-finger DNA-binding motifs (IA-1) mRNA, complete cds. ACCESSION M93119 NID g184510 VERSION M93119.1 GI:184510 KEYWORDS . SOURCE Homo sapiens insulinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2838) AUTHORS Goto,Y., De Silva,M.G., Toscani,A., Prabhakar,B.S., Notkins,A.L. and Lan,M.S. TITLE A novel human insulinoma-associated cDNA, IA-1, encodes a protein with 'zinc-finger' DNA-binding motifs JOURNAL J. Biol. Chem. 267 (21), 15252-15257 (1992) MEDLINE 92340582 FEATURES Location/Qualifiers source 1. .2838 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="beta islet tumor" /tissue_type="insulinoma" gene 148. .1680 /gene="IA-1" CDS 148. .1680 /gene="IA-1" /codon_start=1 /protein_id="AAA58680.1" /db_xref="PID:g184511" /db_xref="GI:184511" /translation="MPRGFLVKRSKKSTPVSYRVRGGEDGDRALLLSPSCGGARAEPP APSPVPGPLPPPPPAERAHAALAAALACAPGPQPPPQGPRAAHFGNPEAAHPAPLYSP TRPVSREHEKHKYFERSFNLGSPVSAESFPTPAALLGGGGGGGASGAGGGGTCGGDPL LFAPAELKMGTAFSAGAEAARGPGPGPPLPPAAALRPPGKRPPPPTAAEPPAKAVKAP GAKKPKAIRKLHFEDEVTTSPVLGLKIKEGPVEAPRGRAGGAARPLGEFICQLCKEEY ADPFALAQHKCSRIVRVEYRCPECAKVFSCPANLASHRRWHKPRPAPAAARAPEPEAA ARAEAREAPGGGSDRDTPSPGGVSESGSEDGLYECHHCAKKFRRQAYLRKHLLAHHQA LQAKGAPLAPPAEDLLALYPGPDEKAPQEAAGDGEGAGVLGLSASAECHLCPVCGESF ASKGAQERHLRLLHAAQVFPCKYCPATFYSSPGLTRHINKCHPSENRQVILLQVPVRP AC" BASE COUNT 492 a 941 c 847 g 558 t ORIGIN 1 GGGCGCAGAG CTGGGCCGAG CCGTCGCCGG CGCCACGCGA GTCCCGCAGC 51 CGCCGCGCCC GGGCAATGGG CCGGGGGCAC TGAGGGCCGC CGGGGCCGAG 101 CGCGGAGGGG GGACCGAGCC AGTGCCGTGC CCTCGGGCCG CGCCAACATG 151 CCCCGCGGCT TCCTGGTGAA GCGCAGCAAG AAGTCCACGC CCGTTTCCTA 201 CCGGGTCCGC GGCGGCGAGG ACGGCGACCG CGCACTGCTG CTCTCGCCCA 251 GCTGCGGGGG CGCCCGCGCC GAGCCCCCGG CGCCGAGCCC GGTCCCCGGG 301 CCGCTGCCGC CGCCGCCGCC CGCGGAGCGC GCCCATGCAG CGCTCGCCGC 351 CGCGCTTGCC TGCGCGCCTG GGCCGCAGCC ACCCCCGCAG GGCCCGCGGG 401 CCGCGCACTT CGGCAACCCC GAGGCTGCGC ACCCCGCGCC GCTCTACAGT 451 CCCACGCGGC CCGTGAGCCG CGAGCACGAG AAGCACAAGT ACTTCGAACG 501 CAGCTTCAAC CTGGGCTCGC CGGTCTCGGC CGAGTCCTTC CCCACGCCCG 551 CCGCGCTGCT CGGAGGGGGC GGCGGCGGCG GCGCGAGCGG AGCTGGCGGA 601 GGCGGCACCT GCGGCGGCGA CCCGCTGCTC TTCGCGCCCG CCGAGCTCAA 651 GATGGGCACG GCGTTCTCGG CTGGCGCCGA GGCGGCCCGC GGCCCGGGCC 701 CCGGCCCCCC ACTGCCCCCT GCCGCCGCCC TGCGGCCCCC GGGAAAGCGG 751 CCCCCGCCCC CTACCGCCGC GGAGCCGCCC GCCAAGGCAG TCAAGGCCCC 801 GGGCGCCAAG AAGCCCAAGG CCATCCGCAA GCTGCACTTC GAGGACGAGG 851 TGACCACGTC GCCCGTGCTG GGGCTCAAGA TCAAGGAGGG CCCGGTGGAG 901 GCGCCGCGGG GCCGCGCGGG GGGCGCGGCG CGGCCGCTGG GCGAGTTCAT 951 CTGCCAGCTG TGCAAGGAGG AGTACGCCGA CCCGTTCGCG CTGGCGCAGC 1001 ACAAATGCTC GCGCATCGTG CGTGTGGAGT ACCGCTGTCC CGAGTGCGCC 1051 AAGGTCTTCA GCTGCCCGGC CAACCTGGCC TCGCACCGCC GCTGGCACAA 1101 ACCGCGGCCC GCGCCCGCCG CCGCCCGCGC GCCGGAGCCA GAAGCAGCAG 1151 CCAGGGCTGA GGCGCGGGAG GCACCCGGCG GCGGCAGCGA CCGGGACACG 1201 CCGAGCCCCG GCGGCGTGTC CGAGTCGGGC TCCGAGGACG GGCTCTACGA 1251 GTGCCATCAC TGCGCCAAGA AGTTCCGCCG CCAGGCCTAC CTACGCAAGC 1301 ACCTGCTGGC GCACCACCAG GCGCTGCAGG CCAAGGGCGC GCCGCTAGCG 1351 CCCCCGGCCG AGGACCTACT GGCCTTGTAC CCCGGGCCCG ACGAGAAGGC 1401 GCCCCAGGAG GCGGCCGGCG ACGGCGAGGG GGCCGGCGTG CTGGGCCTGA 1451 GTGCGTCCGC CGAGTGCCAC CTGTGCCCAG TGTGCGGAGA GTCGTTCGCC 1501 AGCAAGGGCG CTCAGGAGCG CCACCTGCGC CTGCTGCACG CCGCCCAGGT 1551 GTTCCCCTGC AAGTACTGCC CGGCCACCTT CTACAGCTCG CCCGGCCTTA 1601 CGCGGCACAT CAACAAGTGC CACCCATCCG AAAACAGACA GGTGATCCTC 1651 CTGCAGGTGC CCGTGCGCCC GGCCTGCTAG AGCGCGCCCT CCACCCCGGC 1701 CCCCGAACTG TGCCTTCGCT TGGAGACCCA CAAAGAGAGT GCGCCCTGCA 1751 CGCCCCGAAC CCGAGTCCGC GCTGGGGGAG CCTCGCCCCC GCCCCCACCG 1801 GGTGAGAGTG TCGTCTCCGC TTCTCTCGGT GTGGCGTGAC GGTAACCCCA 1851 TACTCTCCTT TTGACTCCTT TTGGAACCCC CACTTTTACG TTGTGTCCCT 1901 CCGCCTCCCC CATGGCGCAA CAGGAGTCAG TCTCTTTCTG TACAAGGGAG 1951 AAAAGCTGTA CGCGTTTGTC TCGTGGTTGG AAGCCTCCCC TTGGCGGGGA 2001 GAAGCTTTTT TTCTTGCTAG TATTCGCTGT GTTCATGGTC TAGAAATGCG 2051 GTCTGGTCTC GCCTCGCCTA CCAATCTCTG CTCTCTATGT ATGTAGCGTA 2101 CGGGTTGTTT TGGGTGAATC TTGAGGAATA AATGCCTTTA TATTTCACAG 2151 GCTGTAAATT GAACTTCCCA CACGATTAGC TTTATTATGG CTTGTGAACT 2201 GCTGGAGTCT GGCTTTACCT TTTTGTATGT GAACAAATCA AATTGCTTAA 2251 AAAAGAGTTT TCTTTAGTAT AGCCACAAAT GCCTTGAACT GTTGTCTGGG 2301 ATTGTTTTGT GGGGGGAGGG AAGGGAGTGT TCCGAAGATG CTGTAGTAAC 2351 TGCCTCAGTG TTTCACGTAA GACTTTTTGG TTTGATCATC TTTGTTGAGG 2401 TAGGACTATC AGTTCCCTCT AAATGTATAT GTTGATTTAT GAGTAATTGT 2451 TATTTATTCT TTATTTATTT ATATTAATTA TGAAGATTAT GATATTATTT 2501 GATTGCAGAT TTTTTTGGCG CGCTGCCCCC TCCCCACCCT GCCACTCTTG 2551 ACATTCCACT GTGCGTTTTA GAAGAGAGCC TTTTTCTAAA GGGATCTGCT 2601 TAAAGTTTTA ACTTTTATAC CTATCTGAGT GAATTACAGA CAACCTATCA 2651 TTTATTCTGC TTCGAGGGTC CCCAGGGCCC TTGTACAACC GACAGCTCTT 2701 ACTTTTAAAT GCAATCTCTT TTCTACATAC ATTATTTTCT TAATTGTTAG 2751 CTATTTATAG AAAGCTTCAA TAGAACTGTT TCAACTGTAT AACTATTTAC 2801 TATTCAAATA AAATATTTTC AAAGTCAAAA AAAAAAAA // LOCUS HUMIL3A 923 bp mRNA PRI 06-JAN-1995 DEFINITION Human interleukin 3 (IL-3) mRNA, complete cds, clone pcD-SR-alpha. ACCESSION M20137 NID g186328 VERSION M20137.1 GI:186328 KEYWORDS interleukin; interleukin 3. SOURCE Human thymocyte cell, cDNA to mRNA, clone pcD-SR-alpha. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 923) AUTHORS Otsuka,T., Miyajima,A., Brown,N., Otsu,K., Abrams,J.S., Saeland,S., Caux,C., Malefijt,R.D.W., DeVries,J., Meyerson,P., Yokota,K., Gemmel,L., Rennick,D., Lee,F., Arai,N., Arai,K.-I. and Yokota,T. TITLE Isolation and characterization of an expressible cDNA encoding human IL-3. Induction of IL-3 mRNA in human T cell clones JOURNAL J. Immunol. 140 (7), 2288-2295 (1988) MEDLINE 88170808 FEATURES Location/Qualifiers source 1. .923 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q23-q31" mRNA <1. .923 /note="IL3 mRNA" gene 54. .512 /gene="IL3" CDS 54. .512 /gene="IL3" /note="interleukin 3" /codon_start=1 /db_xref="GDB:G00-120-095" /protein_id="AAA59147.1" /db_xref="PID:g307060" /db_xref="GI:307060" /translation="MSRLPVLLLLQLLVRPGLQAPMTQTTPLKTSWVNCSNMIDEIIT HLKQPPLPLLDFNNLNGEDQDILMENNLRRPNLEAFNRAVKSLQNASAIESILKNLLP CLPLATAAPTRHPIHIKDGDWNEFRRKLTFYLKTLENAQAQQTTLSLAIF" BASE COUNT 245 a 244 c 196 g 238 t ORIGIN 1 CAGAGCCCCA CGAAGGACCA GAACAAGACA GAGTGCCTCC TGCCGATCCA 51 AACATGAGCC GCCTGCCCGT CCTGCTCCTG CTCCAACTCC TGGTCCGCCC 101 CGGACTCCAA GCTCCCATGA CCCAGACAAC GCCCTTGAAG ACAAGCTGGG 151 TTAACTGCTC TAACATGATC GATGAAATTA TAACACACTT AAAGCAGCCA 201 CCTTTGCCTT TGCTGGACTT CAACAACCTC AATGGGGAAG ACCAAGACAT 251 TCTGATGGAA AATAACCTTC GAAGGCCAAA CCTGGAGGCA TTCAACAGGG 301 CTGTCAAGAG TTTACAGAAC GCATCAGCAA TTGAGAGCAT TCTTAAAAAT 351 CTCCTGCCAT GTCTGCCCCT GGCCACGGCC GCACCCACGC GACATCCAAT 401 CCATATCAAG GACGGTGACT GGAATGAATT CCGGAGGAAA CTGACGTTCT 451 ATCTGAAAAC CCTTGAGAAT GCGCAGGCTC AACAGACGAC TTTGAGCCTC 501 GCGATCTTTT AGTCCAACGT CCAGCTCGTT CTCTGGGCCT TCTCACCACA 551 GAGCCTCGGG ACATCAAAAA CAGCAGAACT TCTGAAACCT CTGGGTCATC 601 TCTCACACAT TCCAGGACCA GAAGCATTTC ACCTTTTCCT GCGGCATCAG 651 ATGAATTGTT AATTATCTAA TTTCTGAAAT GTGCAGCTCC CATTTGGCCT 701 TGTGCGGTTG TGTTCTCATT TTTATCCCAT TGAGACTATT TATTTATGTA 751 TGTATGTATT TATTTATTTA TTGCCTGGAG TGTGAACTGT ATTTATTTTA 801 GCAGAGGAGC CATGTCCTGC TGCTTCTGCA AAAAACTCAG AGTGGGGTGG 851 GGAGCATGTT CATTTGTACC TCGAGTTTTA AACTGGTTCC TAGGGATGTG 901 TGAGAATAAA CTAGACTCTG AAC // LOCUS HUME16GEN 3984 bp mRNA PRI 27-APR-1993 DEFINITION Human E16 mRNA, complete cds. ACCESSION M80244 NID g181907 VERSION M80244.1 GI:181907 KEYWORDS E16 protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3984) AUTHORS Gaugitsch,H.W., Prieschl,E.E., Kalthoff,F., Huber,N.E. and Baumruker,T. TITLE A novel transiently expressed, integral membrane protein linked to cell activation: Molecular cloning via the rapid degradation signal AUUUA JOURNAL J. Biol. Chem. 267, 11267-11273 (1992) MEDLINE 92283834 FEATURES Location/Qualifiers source 1. .3984 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cell" gene 311. .1036 /gene="E16" CDS 311. .1036 /gene="E16" /codon_start=1 /protein_id="AAA35780.1" /db_xref="PID:g181908" /db_xref="GI:181908" /translation="MINPYRNLPLAIIISLPIVTLVYVLTNLAYFTTLSTEQMLSSEA VAVDFGNYHLGVMSWIIPVFVGLSCFGSVNGSLFTSSRLFFVGSREGHLPSILSMIHP QLLTPVPSLVFTCVMTLLYAFSKDIFSVINFFSFFNWLCVALAIIGMIWLRHRKPELE RPIKVNLALPVFFILACLFLIAVSFWKTPVECGIGFTIILSGLPVYFFGVWWKNKPKW LLQGIFSTTVLCQKLMQVVPQET" BASE COUNT 692 a 1268 c 1095 g 929 t ORIGIN 1 GTCCTTTCAC GCGTGTCTTC GTGTTGGTGC GCTTTTCACT GGTCATAAAG 51 TGCTGCTCAC GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC 101 CAGGATGCTT TTGCCGCCGC CAAGCTCCTG GCCCTGGCCC TGATCATCCT 151 GCTGGGCTTC GTCCAGATCG GGAAGGGTGA TGTGTCCAAT CTAGATCCCA 201 AGTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT TGTGCTGGCA 251 TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 301 CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA 351 TCTCCCTGCC CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC 401 TTCACCACCC TGTCCACCGA GCAGATGCTG TCGTCCGAGG CCGTGGCCGT 451 GGACTTCGGG AACTATCACC TGGGCGTCAT GTCCTGGATC ATCCCCGTCT 501 TCGTGGGCCT GTCCTGCTTT GGCTCCGTCA ATGGGTCCCT GTTCACATCC 551 TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 601 CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA 651 CGTGTGTGAT GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC 701 ATCAACTTCT TCAGCTTCTT CAACTGGCTC TGCGTGGCCC TGGCCATCAT 751 CGGCATGATC TGGCTGCGCC ACAGAAAGCC TGAGCTTGAG CGGCCCATCA 801 AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG CCTCTTCCTG 851 ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 901 CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA 951 ACAAGCCCAA GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT 1001 CAGAAGCTCA TGCAGGTGGT CCCCCAGGAG ACATAGCCAG GAGGCCGAGT 1051 GGCTGCCGGA GGAGCATGCG CAGAGGCCAG TTAAAGTAGA TCACCTCCTC 1101 GAACCCACTC CGGTTCCCCG CAACCCACAG CTCAGCTGCC CATCCCAGTC 1151 CTCGCCGTCC CTCCCAGGTC GGGCAGTGGA GGCTGCTGTG AAAACTCTGG 1201 TACGAATCTC ATCCCTCAAC TGAGGGCCAG GGACCCAGGT GTGCCTGTGC 1251 TCCTGCCCAG GAGCAGCTTT TGGTCTCCTT GGGCCCTTTT TCCCTTCCCT 1301 CCTTTGTTTA CTTATATATA TATTTTTTTT AAACTTAAAT TTTGGGTCAA 1351 CTTGACACCA CTAAGATGAT TTTTTAAGGA GCTGGGGGAA GGCAGGAGCC 1401 TTCCTTTCTC CTGCCCCAAG GGCCCAGACC CTGGGCAAAC AGAGCTACTG 1451 AGACTTGGAA CCTCATTGCT ACCACAGACT TGCACTGAAG CCAGACAGCT 1501 GCCCAGACAC ATGGGCTTGT GACATTCGTG AAAACCAACC CTGTGGGCTT 1551 ATGTCTCTGC CTTAGGGTTT GCAGAGTGGA AACTCAGCCG TAGGGTGGCA 1601 CTGGGAGGGG GTGGGGGATC TGGGCAAGGT GGGTGATTCC TCCCAGGAGG 1651 TGCTTGAGGC CCCGATGGAC TCCTGACCAT AATCCTAGCC CCGAGACACC 1701 ATCCTGAGCC AGGGAACAGC CCCAGGGTTG GGGGGTGCCG GCATCTCCCC 1751 TAGCTCACCA GGCCTGGCCT CTGGGCAGTG TGGCCTCTTG GCTATTTCTG 1801 TTCCAGTTTT GGAGGCTGAG TTCTGGTTCA TGCAGACAAA GCCCTGTCCT 1851 TCAGTCTTCT AGAAACAGAG ACAAGAAAGG CAGACACACC GCGGCCAGGC 1901 ACCCATGTGG GCGCCCACCC TGGGCTCCAC ACAGCAGTGT CCCCTGCCCC 1951 AGAGGTCGCA GCTACCCTCA GCCTCCAATG CATTGGCCTC TGTACCGCCC 2001 GGCAGCCCCT TCTGGCCGGT GCTGGGTTCC CACTCCCGGC CTAGGCACCT 2051 CCCCGCTCTC CCTGTCACGC TCATGTCCTG TCCTGGTCCT GATGCCCGTT 2101 GTCTAGGAGA CAGAGCCAAG CACTGCTCAC GTCTCTGCCG CCTGCGTTTG 2151 GAGGCCCCTG GGCTCTCACC CAGTCCCCAC CCGCCTGCAG AGAGGGAACT 2201 AGGGCACCCC TTGTTTCTGT TGTTCCCGTG AATTTTTTTC GCTATGGGAG 2251 GCAGCCGAGG CCTGGCCAAT GCGGCCCACT TTCCTGAGCT GTCGCTGCCT 2301 CCATGGCAGC AGCCAAGGAC CCCCAGAACA AGAAGACCCC CCCGCAGGAT 2351 CCCTCCTGAG CTCGGGGGGC TCTGCCTTCT CAGGCCCCGG GCTTCCCTTC 2401 TCCCCAGCCA GAGGTGGAGC CAAGTGGTCC AGCGTCACTC CAGTGCTCAG 2451 CTGTGGCTGG AGGAGCTGGC CTGTGGCACA GCCCTGAGTG TCCCAAGCCG 2501 GGAGCCAACG AAGCCGGACA CGGCTTCACT GACCAGCGGC TGCTCAAGCC 2551 GCAAGCTCTC AGCAAGTGCC CAGTGGAGCC TGCCGCCCCC ACCTGGGCAC 2601 CGGGACCCCC TCACCATCCA GTGGGCCCGG AGAAACCTGA TGAACAGTTT 2651 GGGGACTCAG GACCAGATGT CCGTCTCTCT TGCTTGAGGA ATGAAGACCT 2701 TTATTCACCC CTGCCCCGTT GCTTCCCGCT GCACATGGAC AGACTTCACA 2751 GCGTCTGCTC ATAGGACCTG CATCCTTCCT GGGGACGAAT TCCACTCGTC 2801 CAAGGGACAG CCCACGGTCT GGAGGCCGAG GACCACCAGC AGGCAGGTGG 2851 ACTGACTGTG TTGGGCAAGA CCTCTTCCCT CTGGGCCTGT TCTCTTGGCT 2901 GCAAATAAGG ACAGCAGCTG GTGCCCCACC TGCCTGGTGC ATTGCTGTGT 2951 GAATCCAGGA GGCAGTGGAC ATCGTAGGCA GCCACGGCCC CAGGTCCAGG 3001 AGAAGTGCTC CCTGGAGGCA CGGACCACTG CTTCCCACTG GGGCCGGCGG 3051 GGCCCACGCA CGACGTCAGC CTCTTACCTT CCCGCCTCGG CTAGGGGTCC 3101 TCGGGATGCC GTTCTGTTCC AACCTCCTGT TCTGGGAGGT GGACATGCCT 3151 CAAGGATACA GGGAGCCGGC GGCCTCTCGA CGGCACGCAC TTCCTGTTGG 3201 CTGCTGCGGC TGTGGGCGAG CATGGGGGCT GCCAGCGTCT GTTGTGGAAA 3251 GTAGCTGCTA GTGAAATGGC TGGGGCCGCT GGGGTCCGTC TTCACACTGC 3301 GCAGGTCTCT TCTGGGCGTC TGAGCTGGGG TGGGAGCTCC TCCGCAGAAG 3351 GTTGGTGGGG GGTCCAGTCT GTGATCCTTG GTGCTGTGTG CCCCACTCCA 3401 GCCTGGGGAC CCCACTTCAG AAGGTAGGGG CCGTGTCCCG CGGTGCTGAC 3451 TGAGGCCTGC TTCCCCCTCC CCCTCCTGCT GTGCTGGAAT TCCACAGGGA 3501 CCAGGGCCAC CGCAGGGGAC TGTCTCAGAA GACTTGATTT TTCCGTCCCT 3551 TTTTCTCCAC ACTCCACTGA CAAACGTCCC CAGCGGTTTC CACTTGTGGG 3601 CTTCAGGTGT TTTCAAGCAC AACCCACCAC AACAAGCAAG TGCATTTTCA 3651 GTCGTTGTGC TTTTTTGTTT TGTGCTAACG TCTTACTAAT TTAAAGATGC 3701 TGTCGGCACC ATGTTTATTT ATTTCCAGTG GTCATGCTCA GCCTTGCTGC 3751 TCTGCGTGGC GCAGGTGCCA TGCCTGCTCC CTGTCTGTGT CCCAGCCACG 3801 CAGGGCCATC CACTGTGACG TCGGCCGACC AGGCTGGACA CCCTCTGCCG 3851 AGTAATGACG TGTGTGGCTG GGACCTTCTT TATTCTGTGT TAATGGCTAA 3901 CCTGTTACAC TGGGCTGGGT TGGGTAGGGT GTTCTGGCTT TTTTGTGGGG 3951 TTTTTATTTT TAAAGAAACA CTCAATCATC CTAG // LOCUS HUM2OGDH 4122 bp mRNA PRI 02-FEB-1999 DEFINITION Human mRNA for 2-oxoglutarate dehydrogenase, complete cds. ACCESSION D10523 D90499 NID g531240 VERSION D10523.1 GI:531240 KEYWORDS 2-oxoglutarate dehydrogenase. SOURCE Homo sapiens cDNA to mRNA, clone_lib:lamda gt11. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4122) AUTHORS Koike,K. TITLE Direct Submission JOURNAL Submitted (11-SEP-1991) to the DDBJ/EMBL/GenBank databases. Kichiko Koike, Atomic Disease Institute, Nagasaki Univ. School of Medicine, Department of Pathological Biochemistry; Sakamoto 1-12-4, Nagasaki, Nagasaki 852, Japan (Tel:0958-47-2111(ex.2347), Fax:0958-45-9790) REFERENCE 2 (sites) AUTHORS Koike,K., Urata,Y. and Goto,S. TITLE Cloning and nucleotide sequence of the cDNA encoding human 2-oxoglutarate dehydrogenase (lipoamide) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (5), 1963-1967 (1992) MEDLINE 92179301 REFERENCE 3 (bases 1 to 4122) AUTHORS Koike,K. JOURNAL Unpublished (1994) REFERENCE 4 (sites) AUTHORS Koike,K. TITLE The gene encoding human 2-oxoglutarate dehydrogenase: structural organization and mapping to chromosome 7p13-p14 JOURNAL Gene 159 (2), 261-266 (1995) MEDLINE 95347609 COMMENT On Aug 20, 1994 this sequence version replaced gi:219394. Submitted (11-Sep-1991) to DDBJ by: Kichiko Koike Department of Pathological Biochemistry Atomic Disease Institute Nagasaki University School of Medicine 12-4 Sakamoto-machi Nagasaki-shi 852 Japan Phone: 0958-47-2111 Fax: 0958-47-8514. FEATURES Location/Qualifiers source 1. .4122 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lamda gt11" 3'UTR 1. .57 old_sequence 39 /citation=[2] /replace="g" sig_peptide 58. .177 CDS 58. .3066 /codon_start=1 /product="2-oxoglutarate dehydrogenase precursor" /protein_id="BAA01393.1" /db_xref="PID:d1001866" /db_xref="PID:g531241" /db_xref="GI:531241" /translation="MFHLRTCAAKLRPLTASQTVKTFSQNRPAAARTFQQIRCYSAPV AAEPFLSGTSSNYVEEMYCAWLENPKSVHKSWDIFFRNTNAGAPPGTAYQSPLPLSRG SLAAVAHAQSLVEAQPNVDKLVEDHLAVQSLIRAYQIRGHHVAQLDPLGILDADLDSS VPADIISSTDKLGFYGLDESDLDKVFHLPTTTFIGGQESALPLREIIRRLEMAYCQHI GVEFMFINDLEQCQWIRQKFETPGIMQFTNEEKRTLLARLVRSTRFEEFLQRKWSSEK RFGLEGCEVLIPALKTIIDKSSENGVDYVIMGMPHRGRLNVLANVIRKELEQIFCQFD SKLEAADEGSGDVKYHLGMYHRRINRVTDRNITLSLVANPSHLEAADPVVMGKTKAEQ FYCGDTEGKKVMSILLHGDAAFAGQGIVYETFHLSDLPSYTTHGTVHVVVNNQIGFTT DPRMARSSPYPTDVARVVNAPIFHVNSDDPEAVMYVCKVAAEWRSTFHKDVVVDLVCY RRNGHNEMDEPMFTQPLMYKQIRKQKPVLQKYAELLVSQGVVNQPEYEEEISKYDKIC EEAFARSKDEKILHIKHWLDSPWPGFFTLDGQPRSMSCPSTGLTEDILTHIGNVASSV PVENFTIHGGLSRILKTRGEMVKNRTVDWALAEYMAFGSLLKEGIHIRLSGQDVERGT FSHRHHVLHDQNVDKRTCIPMNHLWPNQAPYTVCNSSLSEYGVLGFEAGLRMASPNAL VLWEAQFGDFHNTAQCIIDQFICPGQAKWVRQNGIVLLLPHGMEGMGPEHSSARPERF LQMCNDDPDVLPDLKEANFDINQLYDCNWVVVNCSTPGNFFHVLRRQILLPFRKPLII FTPKSLLRHPEARSSFDEMLPGTHFQRVIPEDGPAAQNPENVKRLLFCTGKVYYDLTR ERKARDMVGQVAITRIEQLSPFPFDLLLKEVQKYPNAELAWCQEEHKNQGYYDYVKPR LRTTISRAKPVWYAGRNPAAAPATGNKKTH" mat_peptide 178. .3063 /EC_number="1.2.4.2" /product="2-oxoglutarate dehydrogenase" old_sequence 322 /citation=[2] /replace="" old_sequence 325 /citation=[2] /replace="" old_sequence 332 /citation=[2] /replace="" old_sequence 350 /citation=[2] /replace="" old_sequence 538. .540 /citation=[2] /replace="tgc" old_sequence 627 /citation=[2] /replace="c" old_sequence 1087 /citation=[2] /replace="g" old_sequence 1149 /citation=[2] /replace="c" old_sequence 2247 /citation=[2] /replace="a" old_sequence 2403. .2404 /citation=[2] /replace="ta" old_sequence 2440 /citation=[2] /replace="g" old_sequence 2452 /citation=[2] /replace="g" old_sequence 2779 /citation=[2] /replace="g" old_sequence 2857. .2858 /citation=[2] /replace="cg" old_sequence 3026 /citation=[2] /replace="g" old_sequence 3044. .3045 /citation=[2] /replace="cg" 5'UTR 3067. .4122 old_sequence 3152. .3153 /citation=[2] /replace="" old_sequence 3209 /citation=[2] /replace="t" old_sequence 3231. .3232 /citation=[2] /replace="cc" old_sequence 3425 /citation=[2] /replace="c" old_sequence 3435 /citation=[2] /replace="a" old_sequence 3590. .3595 /citation=[2] /replace="ggagtc" old_sequence 3614 /citation=[2] /replace="g" old_sequence 3620. .3623 /citation=[2] /replace="ggag" old_sequence 3663 /citation=[2] /replace="a" old_sequence 3671 /citation=[2] /replace="g" old_sequence 3674 /citation=[2] /replace="g" old_sequence 3684 /citation=[2] /replace="c" old_sequence 3688. .3689 /citation=[2] /replace="tg" old_sequence 3710 /citation=[2] /replace="a" old_sequence 4066 /citation=[2] /replace="" old_sequence 4078. .4089 /citation=[2] /replace="" polyA_site 4122 BASE COUNT 895 a 1196 c 1142 g 889 t ORIGIN 1 CGGGTTCGGG TGGAGCTGAG CCGGAGACAG GCAATTGTGA AAAACTTCAG 51 GACAAAAATG TTTCATTTAA GGACTTGTGC TGCTAAGTTG AGGCCATTGA 101 CGGCTTCCCA GACTGTTAAG ACATTTTCAC AAAACAGACC AGCAGCAGCT 151 AGGACATTTC AACAGATTCG GTGCTATTCT GCACCTGTTG CTGCTGAGCC 201 CTTTCTCAGT GGGACTAGTT CGAACTATGT GGAGGAGATG TACTGTGCTT 251 GGCTGGAAAA CCCCAAAAGT GTACATAAGT CATGGGACAT TTTTTTTCGC 301 AACACGAATG CCGGAGCCCC ACCGGGCACT GCCTACCAGA GTCCCCTTCC 351 CCTGAGCCGA GGCTCCCTGG CTGCTGTGGC CCATGCACAG TCCCTGGTAG 401 AAGCACAGCC CAACGTGGAC AAGCTCGTGG AGGACCACCT GGCAGTGCAG 451 TCACTCATCA GGGCATATCA GATACGAGGG CACCATGTAG CACAGCTGGA 501 CCCCCTGGGG ATTTTGGATG CTGATCTGGA CTCCTCCGTG CCCGCTGACA 551 TTATCTCATC CACAGACAAA CTTGGGTTCT ATGGCCTGGA TGAGTCTGAC 601 CTCGACAAGG TCTTCCACTT GCCCACCACC ACTTTCATCG GGGGACAGGA 651 ATCAGCACTT CCTCTGCGGG AGATCATCCG TCGGCTGGAG ATGGCCTACT 701 GCCAGCATAT TGGGGTGGAG TTCATGTTCA TCAATGACCT GGAGCAGTGC 751 CAGTGGATCC GGCAGAAGTT TGAGACCCCT GGGATCATGC AGTTCACAAA 801 TGAGGAGAAA CGGACCCTGC TGGCCAGGCT TGTGCGGTCC ACCAGGTTTG 851 AGGAGTTCCT ACAGCGGAAG TGGTCCTCTG AGAAGCGCTT TGGTCTAGAA 901 GGCTGCGAGG TACTGATCCC TGCCCTCAAG ACCATCATTG ACAAGTCTAG 951 TGAGAATGGC GTGGACTACG TGATCATGGG CATGCCACAC AGAGGGCGGC 1001 TGAACGTGCT TGCAAATGTC ATCAGGAAGG AGCTGGAACA GATCTTCTGT 1051 CAATTCGATT CAAAGCTGGA GGCAGCTGAT GAGGGCTCCG GAGATGTGAA 1101 GTACCACCTG GGCATGTATC ACCGCAGGAT CAATCGTGTC ACCGACAGGA 1151 ACATTACCTT GTCCTTGGTG GCCAACCCTT CCCACCTTGA GGCCGCTGAC 1201 CCCGTGGTGA TGGGCAAGAC CAAAGCCGAA CAGTTTTACT GTGGCGACAC 1251 TGAAGGGAAA AAGGTCATGT CCATCCTGTT GCATGGGGAT GCTGCATTTG 1301 CTGGCCAGGG CATTGTGTAC GAGACCTTCC ACCTCAGCGA CCTGCCATCC 1351 TACACAACTC ATGGCACCGT GCACGTGGTC GTCAACAACC AGATCGGCTT 1401 CACCACCGAC CCTCGGATGG CCCGCTCCTC CCCCTACCCC ACTGACGTGG 1451 CCCGAGTGGT GAATGCCCCC ATTTTCCACG TGAACTCAGA TGACCCCGAG 1501 GCTGTCATGT ACGTGTGCAA AGTGGCGGCC GAGTGGAGGA GCACCTTCCA 1551 CAAGGACGTG GTTGTCGATT TGGTGTGTTA CCGGCGCAAC GGCCACAACG 1601 AGATGGATGA GCCCATGTTC ACGCAGCCGC TCATGTACAA GCAGATCCGC 1651 AAGCAGAAGC CTGTGTTACA GAAGTACGCT GAGCTGCTGG TGTCGCAGGG 1701 TGTGGTCAAC CAGCCTGAGT ATGAGGAGGA AATTTCCAAG TATGATAAGA 1751 TCTGTGAGGA AGCTTTTGCC AGATCTAAAG ATGAGAAGAT CTTGCACATT 1801 AAGCACTGGC TGGACTCTCC CTGGCCTGGC TTCTTCACCC TGGACGGGCA 1851 GCCCAGGAGC ATGTCCTGCC CCTCCACGGG TCTGACGGAG GATATTCTGA 1901 CACACATCGG GAATGTGGCT AGTTCTGTGC CTGTGGAAAA CTTTACTATT 1951 CATGGAGGGC TGAGCCGGAT CTTGAAGACT CGTGGGGAAA TGGTGAAGAA 2001 CCGGACTGTG GACTGGGCTC TAGCGGAGTA CATGGCGTTT GGCTCGCTCC 2051 TGAAGGAGGG CATCCACATT CGGCTGAGCG GCCAGGACGT GGAGCGGGGC 2101 ACATTCAGCC ACCGCCACCA TGTGCTCCAT GACCAGAATG TGGACAAGAG 2151 AACCTGCATC CCCATGAACC ATCTCTGGCC CAATCAGGCC CCCTATACTG 2201 TGTGCAACAG CTCACTGTCT GAGTACGGCG TGCTGGGCTT TGAAGCTGGG 2251 CTTCGCATGG CCAGTCCTAA TGCCCTGGTC CTCTGGGAAG CCCAATTTGG 2301 TGACTTCCAC AACACGGCCC AGTGTATCAT CGACCAGTTC ATCTGCCCGG 2351 GACAAGCCAA GTGGGTGCGG CAGAATGGCA TCGTGTTGCT GCTGCCCCAT 2401 GGCATGGAGG GCATGGGTCC AGAACATTCC TCCGCCCGCC CAGAGCGGTT 2451 CTTGCAGATG TGCAACGATG ACCCAGATGT CCTGCCAGAC CTTAAAGAAG 2501 CCAACTTCGA CATCAATCAG CTATATGACT GCAATTGGGT TGTTGTCAAC 2551 TGCTCCACTC CTGGCAACTT CTTCCACGTG CTACGACGCC AGATCCTGCT 2601 GCCATTCCGG AAGCCGTTAA TTATCTTCAC CCCCAAATCC CTGTTGCGCC 2651 ACCCCGAGGC CAGATCCAGC TTTGATGAGA TGCTTCCAGG AACCCACTTC 2701 CAGCGGGTGA TCCCAGAAGA TGGCCCTGCA GCTCAGAACC CAGAAAATGT 2751 CAAAAGGCTT CTCTTCTGCA CCGGCAAAGT GTATTATGAC CTCACCCGGG 2801 AGCGCAAAGC ACGCGACATG GTGGGGCAGG TGGCCATCAC AAGGATTGAG 2851 CAGCTGTCGC CATTCCCCTT TGACCTCCTG CTGAAGGAGG TGCAGAAGTA 2901 CCCCAATGCT GAGCTGGCCT GGTGCCAGGA GGAGCACAAG AACCAAGGCT 2951 ACTATGACTA CGTGAAGCCA AGACTTCGGA CCACCATCAG CCGCGCCAAG 3001 CCCGTCTGGT ATGCCGGCCG GAACCCAGCG GCTGCTCCAG CCACCGGCAA 3051 CAAGAAGACC CACTGACGGA GCTGCAGCGC CTCCTGGACA CGGCCTTCGA 3101 CCTGGACGTC TTCAAGAACT TCTCGTAGAT GCTGCCTAGG GTTGCTTGGG 3151 CCACTGCCCT CTCCACACCC ATGACTGCCC CTTGCTTCTC AACTAAAGAA 3201 TAGTGCCTCA GCGCTGCCCA CACCACCGTC CTCCTCGCTG TGCCACCACC 3251 CCTCCCTCTG CTCTCATAGG AGTTAGGCTG TCGTCCCCCT CCAGTGCTTG 3301 GCTGCCCCAC AGGCCACACG CTGCCCAGGC TCTGCTGACT TCTGAGCAGT 3351 TTTCCAGGAG GCCGGGGGGA GCAGGAGGAG GAAAGGTAGC CCCCGAGGGA 3401 TGTCCTTGGG GAGGGGTCAG CTCTGGCCCC AATCCTCCCC ACCAGTCTCA 3451 CCCACTAGGA TAGGAACTGG GCCTTGTGTG CTGGCTTCCG CTGTCACCCA 3501 GCAAGGCACA GGCTCCTGTA TTTGAGACTA GGATAGCTTC ATCTTGAGCC 3551 TGAGCCTTAG AATCTGTAGA GGAGCCTGGA GTCGGATCTA GCCATGGCTG 3601 GCAGAGGTTT CTAGGGTGGG CCCCAGCCGT GGCGTGAACT GAGGATGACC 3651 CGGGGCAGCT GGCAGGAGAG AGCCTTGGCC TGACCTGGCA CAGAAAGGGC 3701 AGCTTCAGTC TCTGCAGTGT CCATTATCTG CTGTTCCTTC GAGGGTTCCA 3751 GGCTGTGTGT GGGGCCCAAG CATGCCCCAC CCACCCTCCT GGGCCCAGGC 3801 AGCACCTGGA GCCCACAGAG TCTGTGTGTA GCCAGGAAGC CCCGCTCAGG 3851 TAGCCACCGC CGGGGCACTG GCTGCTCTGT CTTGGTCCTG TTAACCCTCC 3901 ACCTCCTCTC TTGGACTCCC TCCCCACCCC AACCACTCTT TCTTTCTCCT 3951 TTAACCCAAT GGAGACTTTC TGATGCATCG TTTTCTTTGC TGTGCCAAAG 4001 CAGGTCAGAA GAGGGAGAGG AGGGGCTGGG GGTGAGGGGC CAGGCCATGG 4051 CCAAGGGGCC AGCTGCCCCT CATTTATCAC TCTGACCTTC ACAGGGACAG 4101 ATCTGATTTA TTTATTTTGG TT // LOCUS D13641 3259 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0016 gene, complete cds. ACCESSION D13641 NID g285986 VERSION D13641.1 GI:285986 KEYWORDS KIAA0019; KIAA0016. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3259) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3259) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REMARK Erratum:[[published erratum appears in DNA Res 1995 Aug 31;2(4):211]] REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 REFERENCE 5 (bases 1 to 3259) AUTHORS Seki,N., Moczko,M., Nagase,T., Zufall,N., Ehmann,B., Dietmeier,K., Schafer,E., Nomura,N. and Pfanner,N. TITLE A human homolog of the mitochondrial protein import receptor Mom19 can assemble with the yeast mitochondrial receptor complex JOURNAL FEBS Lett. 375 (3), 307-310 (1995) MEDLINE 96085231 FEATURES Location/Qualifiers source 1. .3259 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /chromosome="1" /map="1q42" /sex="male" 5'UTR <1. .101 gene 102. .539 /gene="KIAA0016" CDS 102. .539 /gene="KIAA0016" /note="similar to fungal mitochondrial import receptor Mom19" /codon_start=1 /product="mitochondrial outer membrane protein 19" /protein_id="BAA02804.1" /db_xref="PID:d1003309" /db_xref="PID:g285987" /db_xref="GI:285987" /translation="MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKK QKLAKERAGLSKLPDLKDAEAVQKFFLEEIQLGEELLAQGEYEKGVDHLTNAIAVCGQ PQQLLQVLQQTLPPPVFQMLLTKLPTISQRIVSAQSLAEDDVE" 3'UTR 540. .>3259 BASE COUNT 912 a 588 c 729 g 1030 t ORIGIN 1 GGCCGTCGGG TGTGAGCTGC GCCGACCGCT CTGAGGGTTC GTGGCCCACC 51 GCTCCTTCGC GGTCCCTGCC GCCACCGTCC ACGCTCAGCG TTGTAGAGAA 101 GATGGTGGGT CGGAACAGCG CCATCGCCGC CGGTGTATGC GGGGCCCTTT 151 TCATTGGGTA CTGCATCTAC TTCGACCGCA AAAGACGAAG TGACCCCAAC 201 TTCAAGAACA GGCTTCGAGA ACGAAGAAAG AAACAGAAGC TTGCCAAGGA 251 GAGAGCTGGG CTTTCCAAGT TACCTGACCT TAAAGATGCT GAAGCTGTTC 301 AGAAGTTCTT CCTTGAAGAA ATACAGCTTG GTGAAGAGTT ACTAGCTCAA 351 GGTGAATATG AGAAGGGCGT AGACCATCTG ACAAATGCAA TTGCTGTGTG 401 TGGACAGCCA CAGCAGTTAC TGCAGGTCTT ACAGCAAACT CTTCCACCAC 451 CAGTGTTCCA GATGCTTCTG ACTAAGCTCC CAACAATTAG TCAGAGAATT 501 GTAAGTGCTC AGAGCTTGGC TGAAGATGAT GTGGAATGAG AAACAAATGT 551 CAACATAATA AAATCTCAGT TAAAAATATT TTAAAAATTC TTGGTAGTTG 601 AGCAGCTCTG GGGGAATAAG GGCAAATATG CTTGTTATGA ACTACACTGA 651 AATCTACCAA AGTTAATGTT TACTTTGTGT AGATCCATTT GTCTATTTTA 701 TTTATTTTTC CCAGTGAAAA GTGTATTTTG ATAGAGAACT TTTCATTCTA 751 TAAATACACT ATGAGTTACT AAAATATCAT GGATTTTGTT TATTCCTGAA 801 ACATAGTTAC ATAGTTAAAC TGTACATATG ACATGGCTTA TGTTAAAAAT 851 ACCCAGTGCT CAGTTTTGAA AGATAGGCAA AAAAAAAAAA AGTATAGGAG 901 AAACTGAAGA ATGTACACTT TTTTAGAGGG CACATTTTGC TGTAAATCTG 951 GAAATTTGAT AGACTTGACT GTGTTTGTGA AAACTGAGCA TTAAAGGTTT 1001 TGATTGATCC TTTCTTTCCA TTTAATCTCT GAGACGTAAA TATGTGAGGT 1051 GTGCTGCTGT GCTGGGTTAA CAGCTTCCTT CCCTTTCTGT GTAGCAGTCT 1101 TGAAATGTTC TGTTTAAATC AGTAGGCTTA ATGTGTTCTG GGTATTTATC 1151 TCCTTGTATT TTAAATATAT GTAGTTGCAA ATAGCACCAG GAATTAGATT 1201 TCTGTACACC CCTAATCTAG CCTTGTGAGC TTCGCTAGTT AATGTGTGCT 1251 CACTTTCCCT CCATTTGTTA CGTGAGAGAA TGCGTCTGCT GATCACTGAA 1301 GTGTCCCTTT TAGCTTCTGA TTCATTGGGT TCTGTTGGGC ATCTTTAAAT 1351 CCACCTTAAC CTGAGGAATG TATGTGGGCA ACCAGGCCCT GCATTTTTTT 1401 ATATTCTGAA TTTTGCATGC TTGCCTGACT TAGTATTTCT GAATTGATGT 1451 TTTTTTTAAT GGTATAACTA TCTTGATTTT CACTGAAATT ATATGGTTCT 1501 GTCACTACTC TGTAAATTAA TCCGAAACTT TTAAGGTAAC TGGGATGATC 1551 TGCTTGTAAA AATGCTTGTT GCCTTTTGCT TTATCTTCAG TGTACCTCCT 1601 TAATCCTGCT TCAACTTGAT TATCTTGTGA AACGATGAGA GTAAGTTGCA 1651 ACCTTGTGAC TGAAAACTTG AAAAGAGTGG AGCAGGTGGG ACCTCTTATT 1701 CTCAAATAGT GACATATTCT CCGTAGTCAC AGTTTCAGAA CTGAGTAAGG 1751 ATCCTTGGTA CTTGGTGGCA TCTGTTGAAC TGAGGAGCAT TTCTCATTGT 1801 AAAGATTGCC TTTGTTCTGT CTAAAAGTCT GGAGAAATCC CAAAGACTTT 1851 TCCTATGTAC TAGGCATTTT ATTTTGATTG ACTTACAAAC TCTTCTTAAT 1901 CATTATCAAT CTCGGTTTTT TTGTGGTGCA GTGGAAGGAG AAATAGGTCT 1951 AGTTTCTGCC TCTGATTAGC CGCACAGCCT TGAACAAATC ACATTTCATC 2001 TTTGAACTTA CCTCTACTGT TAGACTAGGC GACTCACATT TGAGGACTTT 2051 TCTCGGGTAT CTTGAGGGTT TGTGATCCTG AACCCTTAAA CAGTGCTTTT 2101 TTGTTACACA GGAGGGCTTT TTTGGGGGGA TGACCAGTAC AGACATGCCA 2151 GTTAGTTTTA CTAGTGGGAT CCCAAATCCA AAGCAGTGTA GTGGTGATTG 2201 GTCAGTGACT AACCAGGCAG CTAAGAAGTC TTAGGCAGCA GCCCAGACAT 2251 GTATAGAGGG GCAGTTAGAG GGAGAACAGG GGTGGGAAAG GGAGCAAGGG 2301 GCAGATAGCT CAGCAAGGAA AGAATGGGCT CAGAAAAAGG AGGGCTGGCT 2351 GGAGGAGTGA GGGGCAGCTT AAGTTTGGGG AGGGTAGAAA CGCCGTTTCC 2401 TTGGGAACTG GAGTGCAGTA TGAGCTGGGT GTCACTTGGC TCTGAACATA 2451 CTGGCTTTGC TGTAATGCTT GAAAAGGCGT TGGTATCTTC ATTTTACAGT 2501 TCATTAACCC AAGTACGTTT TCTTATTTAA ATGACAACTT TGGTGCTTTA 2551 AAATGAGGTA CCACTTTTTA AAGCTAGCTG TGTCGAGTTA AAGAAAAAAT 2601 CAGCAGTTTT TTCTCCCAGA AATGTAATTG CCAAACACTT TTCATCCCCA 2651 TCTTAAGTTT TACAAGGTGA TGTAATCAGC TTGTTGTAGT GATGCTGGCC 2701 AAATGGTGCT CAGCAGGTGA GAACAAAAAA ACCCCAGATT TCAGTGAACT 2751 AATACACAGC TTGAGCGTTT CCATGTGCTA ATGTTGCACA CTTACTAAAA 2801 AACTTTGGAA ATGGAAAATA ATGTATTAGT GCAACAGTTG ATGTGCTTCT 2851 TTGGGCAAAG ATATAGTTTT GTTCCACAAT TTGTACTTAA AAGCGAAAGA 2901 ACATTGAAAA CATAGACTTA CTGGCTGTAG CAATGCTGGC CTGTTAACTG 2951 ATAACTAGAA CTTAGGTTCA CGTTTATGTA AAGTGTGTAA AACCTAGTAG 3001 AGCTTGCATA GTCGGCACTC AGTAAATGTT TGGTTCCTTT TGCCCCTTGG 3051 TAAGTTTATT TTACCATCCT CCCACCTGCC ATTCTGACTT TATTAAATCA 3101 ACATGTGGAC CAGAGTGTTA ATGAGATGTT ATTGCAGAAG AGATTGAGAA 3151 AATTGGTATA TCATGCAGAT AACATACAAA ATCTTTTTGT AACGTAAAAA 3201 ATGCAGTTTT ATTATTGCTT GTGCCTCAAC TGTTTAAGTG AATATTAAAG 3251 GGCTTGGAG // LOCUS HSY17394 1597 bp mRNA PRI 08-JUN-1998 DEFINITION Homo sapiens mRNA for prefoldin subunit 3. ACCESSION Y17394 NID g3212111 VERSION Y17394.1 GI:3212111 KEYWORDS pfd3 gene; prefoldin. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1597) AUTHORS Vainberg,I.E., Lewis,S.A., Rommelaere,H., Ampe,C., Vandekerckhove,J., Klein,H.L. and Cowan,N.J. TITLE Prefoldin, a chaperone that delivers unfolded proteins to cytosolic chaperonin JOURNAL Cell 93 (5), 863-873 (1998) MEDLINE 98292183 REFERENCE 2 (bases 1 to 1597) AUTHORS Lewis,S. TITLE Direct Submission JOURNAL Submitted (02-JUN-1998) S. Lewis, NYU Medical Center, 550 First Avenue, New York NY 10016, USA COMMENT Overlapping sequence: U56833. FEATURES Location/Qualifiers source 1. .1597 /organism="Homo sapiens" /db_xref="taxon:9606" gene 13. .606 /gene="pfd3hu" CDS 13. .606 /gene="pfd3hu" /codon_start=1 /product="prefoldin subunit 3" /protein_id="CAA76761.1" /db_xref="PID:e1297425" /db_xref="PID:g3212112" /db_xref="GI:3212112" /translation="MAAVKDSCGKGEMATGNGRRLHLGIPEAVFVEDVDSFMKQPGNE TADTVLKKLDEQYQKYKFMELNLAQKKRRLKGQIPEIKQTLEILKYMQKKKESTNSME TRFLLADNLYCKASVPPTDKVCLWLGANVMLEYDIDEAQALLEKNLSTATKNLDSLEE DLDFLRDQFTTTEVNMARVYNWDVKRRNKDDSTKNKA" BASE COUNT 520 a 266 c 319 g 492 t ORIGIN 1 CGCATCCCCA AGATGGCGGC CGTTAAGGAC AGTTGTGGCA AAGGAGAAAT 51 GGCCACAGGG AATGGGCGGC GGCTCCACCT GGGGATTCCT GAGGCCGTGT 101 TTGTGGAAGA TGTAGATTCC TTCATGAAAC AGCCTGGGAA TGAGACTGCA 151 GATACAGTAT TAAAGAAGCT GGATGAACAG TACCAGAAGT ATAAGTTTAT 201 GGAACTCAAC CTTGCTCAAA AGAAAAGAAG GCTAAAAGGT CAGATTCCTG 251 AAATTAAACA GACTTTGGAA ATTCTAAAAT ACATGCAGAA GAAAAAAGAG 301 TCCACCAACT CAATGGAGAC CAGATTCTTG CTGGCAGATA ACCTGTATTG 351 CAAAGCTTCA GTTCCTCCTA CCGATAAAGT GTGTCTGTGG TTGGGGGCTA 401 ATGTAATGCT TGAATATGAT ATTGATGAAG CTCAGGCATT GTTGGAAAAG 451 AATTTATCGA CTGCCACAAA GAATCTTGAT TCCCTGGAGG AAGACCTTGA 501 CTTTCTTCGA GATCAATTTA CTACCACAGA AGTCAATATG GCCAGGGTTT 551 ATAATTGGGA TGTAAAAAGA AGAAACAAGG ATGACTCTAC CAAGAACAAA 601 GCATAATGCT GGCAATTAAA AATGTGGTTT AGTTTTCCAA ACATGTTATC 651 TTAAATACCC CTTTATCCTT ACAGGTTGAC ATAACTTTGA ATGTTTTAAC 701 AGCAAGAATT TTAAGAAAAG ATAAACACCA TTTTATTTAT TTATAAAAAC 751 AAAATTAGTT TCAAATATTT TTGACATTGT GATTTTTTTT TCCACATTTC 801 TCAGCAAAGC TAATGGTATT TTAATATCAT TATTTTTTGC CTGTCATAAG 851 AAAACTCTTA GCTGAAATGG CCGAAAACTG TGAGACATGC TATGGAAGCT 901 GAATGCCGGA CGCTAGCACA GTTTACTTTT TCCCTTTCTA ATTGGCTGAT 951 GTTACTCTCA CTTGATGTGG TTAAACCATT TTAGAGGTAG AGAAGACAGA 1001 CAGTTTGAAT ATTTGTAAAC TTGTTTTTCT TTGGTATATT TAGGACTTAG 1051 TGGTCCTCTG TTGCTATTGT CTTCTATAAG TGGAGTTTCA TGACTTACTG 1101 CTTAACGAAT AACTAACTAC TATGATATTC TGGACATTTT AGGAAATGGT 1151 AATTTGCCTT GCTACACATT AAGAGGGCTA TTAAGACTAC ATTTTTTCTA 1201 ACCTCAGATA AGTGCAGTGT CTTTGCAATG CCAACATAAG GGAGATCTTG 1251 GCCAACGTGA AATAAAATTA CTCATTCAAA ACTCTGCCTA AGGTGATTTT 1301 GTAGTTCTTA ACAGTTCTCC AGAGCATCTT GAACAGGAAT ATTAAGATAA 1351 ATGTGAATCT GCAATGGCTG AAAAGAGTTG TGAGCTTTTT TATTCATGAT 1401 AAAACCTTAT AGGAATAGTA TAAAAAATCC CTGTGGAAAG CTACTAGTAC 1451 ATTGACCAGC GCTGGGTGAT ACAGATTCTG ATAAAAACAT AAATGTATTA 1501 GTTCATCTCC ATGTAGTAAA AAGTATACTT ATACAATGTT TTGTACTTGT 1551 ATTTCATGAA ATTAAAACAG TGATGCTAAA ACGGCCTTCG TGGCCTC // LOCUS HSSIRPBET 3804 bp mRNA PRI 14-MAY-1997 DEFINITION H.sapiens mRNA for SIRP-beta1. ACCESSION Y10376 NID g2052057 VERSION Y10376.1 GI:2052057 KEYWORDS SIRP-beta1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3804) AUTHORS Kharitonenkov,A., Chen,Z., Sures,I., Wang,H., Schilling,J. and Ullrich,A. TITLE A family of proteins that inhibit signalling through tyrosine kinase receptors JOURNAL Nature 386 (6621), 181-186 (1997) MEDLINE 97215901 REFERENCE 2 (bases 1 to 3804) AUTHORS Chen,Z. TITLE Direct Submission JOURNAL Submitted (07-JAN-1997) Z. Chen, Max-Planck-Institut fuer Biochemie, Department of Molecular Biology, Am Klopferspitz 18A, D- 82152 Martinsried, FRG FEATURES Location/Qualifiers source 1. .3804 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" sig_peptide 41. .103 /gene="SIRP-beta1" gene 41. .1237 /gene="SIRP-beta1" CDS 41. .1237 /gene="SIRP-beta1" /codon_start=1 /protein_id="CAA71404.1" /db_xref="PID:e293717" /db_xref="PID:g2052058" /db_xref="GI:2052058" /db_xref="SPTREMBL:O00241" /translation="MPVPASWPHLPSPFLLMTLLLGRLTGVAGEDELQVIQPEKSVSV AAGESATLRCAMTSLIPVGPIMWFRGAGAGRELIYNQKEGHFPRVTTVSELTKRNNLN FSISISNITPADAGTYYCVKFRKGSPDDVEFKSGAGTELSVRAKPSAPVVSGPAVRAT PEHTVSFTCESHGFSPRDITLKWFKNGNELSDFQTNVDPAGDSVSYSIHSTARVVLTR GDVHSQVICEMAHITLQGDPLRGTANLSEAIRVPPTLEVTQQPMRAENQANVTCQVSN FYPRGLQLTWLENGNVSRTETASTLIENKDGTYNWMSWLLVNTCAHRDDVVLTCQVEH DGQQAVSKSYALEISAHQKEHGSDITHEPALAPTAPLLVALLLGPKLLLVVGVSAIYI CWKQKA" BASE COUNT 914 a 1055 c 797 g 1038 t ORIGIN 1 CACAGACGTT TGGACAGAGC AGGCTCCTAA GGTCTCCAGA ATGCCCGTGC 51 CAGCCTCCTG GCCCCACCTT CCTAGTCCTT TCCTGCTGAT GACGCTACTG 101 CTGGGGAGAC TCACAGGAGT GGCAGGTGAG GACGAGCTAC AGGTGATTCA 151 GCCTGAAAAG TCCGTATCAG TTGCAGCTGG AGAGTCGGCC ACTCTGCGCT 201 GTGCTATGAC GTCCCTGATC CCTGTGGGGC CCATCATGTG GTTTAGAGGA 251 GCTGGAGCAG GCCGGGAATT AATCTACAAT CAGAAAGAAG GCCACTTCCC 301 ACGGGTAACA ACTGTTTCAG AACTCACAAA GAGAAACAAC CTGAACTTTT 351 CCATCAGCAT CAGTAACATC ACCCCAGCAG ACGCCGGCAC CTACTACTGT 401 GTGAAGTTCC GGAAAGGGAG CCCTGACGAC GTGGAGTTTA AGTCTGGAGC 451 AGGCACTGAG CTGTCTGTGC GCGCCAAACC CTCTGCCCCC GTGGTATCGG 501 GCCCTGCGGT GAGGGCCACA CCTGAGCACA CAGTGAGCTT CACCTGCGAG 551 TCCCATGGCT TCTCTCCCAG AGACATCACC CTGAAATGGT TCAAAAATGG 601 GAATGAGCTC TCAGACTTCC AGACCAACGT GGACCCCGCA GGAGACAGTG 651 TGTCCTACAG CATCCACAGC ACAGCCAGGG TGGTGCTGAC CCGTGGGGAC 701 GTTCACTCTC AAGTCATCTG CGAGATGGCC CACATCACCT TGCAGGGGGA 751 CCCTCTTCGT GGGACTGCCA ACTTGTCTGA GGCCATCCGA GTTCCACCCA 801 CCTTGGAGGT TACTCAACAG CCCATGAGGG CAGAGAACCA GGCAAACGTC 851 ACCTGCCAGG TGAGCAATTT CTACCCCCGG GGACTACAGC TGACCTGGTT 901 GGAGAATGGA AATGTGTCCC GGACAGAAAC AGCTTCGACC CTCATAGAGA 951 ACAAGGATGG CACCTACAAC TGGATGAGCT GGCTCCTGGT GAACACCTGT 1001 GCCCACAGGG ACGATGTGGT GCTCACCTGT CAGGTGGAGC ATGATGGGCA 1051 GCAAGCAGTC AGCAAAAGCT ATGCCCTGGA GATCTCAGCA CACCAGAAGG 1101 AGCACGGCTC AGATATCACC CATGAACCAG CGCTGGCTCC TACTGCTCCA 1151 CTCCTCGTAG CTCTCCTCCT GGGCCCCAAG CTGCTACTGG TGGTTGGTGT 1201 CTCTGCCATC TACATCTGCT GGAAACAGAA GGCCTGACTG ACCCTCAGTC 1251 TCTGCTGCCT CCTCCTTTCT TGAGAAGCTC AGCCTGAGAG AAGGAGCTGG 1301 CGAGAACCTT CCCCACACTC AGCTCCAAAC GCCTCCTCTC CCAGGTCATC 1351 TGCCTGCCCA CACGCTCCTG TTCCACCTTC ACAAGACCAT GATGCCCCAA 1401 AGCAGTGTCT CTATTCACGG TCCTGAGCAG GGGCCATGGG ATTGGGCTCT 1451 GGGCACTGAC TCATGGCACC TCCCTAGAAG GTGAGAAACA CTCCAAATCT 1501 AAACACACCA GGACTTCTCC CATCCGTCGC CTTGGGACTG GCCATAAACC 1551 ACAGACTCTC TCCAGGCTCT CAAGAGTTAT CCTGTCTTCT GGATTCCTGC 1601 CTACCCCAAC TCCCCCAGCC TTGTTGAGGT TCTCTACTGC CTCCTGAATA 1651 CACATGAACC CCTATACCAA TTTTAAGAAA AAAATGATTC TCTTTCCTCT 1701 TTGTCCAAGC ATCCTATCCC TCAAACCCAA AAAGAAAGAA GCTCTCCCTT 1751 CTCTCTCTGT GATGGAGACA GTATTTCTTC TAGTATCCTG CAGCCTTCCC 1801 AGTCCTGCTG CTTGTGGTAG AAATTGCTGC CACAGCCCAA CATTGAGGAG 1851 CCCTCGATGA CTGCCCTTTA CAACTCATAT TCAGTTCTGC CTCCAAAATG 1901 CATGTGTCCA CTTACATGAG ATGGTAAATG TTTAACAATG GACTTTCTGA 1951 AAGGGAAAAA CCAAAAGCTG TTTTGCAGTG CTTGCCAATT TCTCTAGTGT 2001 AATAACTCCC AACCTGACCA ATTTCAGCAC TGCCAACAGT TAAACAACCA 2051 GATTCGAAGA TTCCTGAAAT TTAACAATTG GTTTTCAGGG CCCAGTCCAA 2101 GCCTGCTGCT GGAAACCTCA GAGTTAAATC CCTATTCTCC ACACCTCTCA 2151 CCTCCACCAC CCCTCCCTGT CCCAGCCAGC ATCATCTCTT TGGGGACCAC 2201 TCCTCTGGCT TTCATTTTTC AGCCACAGTG ATTCTTTGGA AAAGTCAAAT 2251 CATATCACTT CTCTGCTTCT TCCCCAACAC AGCTGCATGG TCCCGCTCTC 2301 CCTCCTTCAA GTCTCTGCTC AATGTCACTT CATTAAAGGC GGCCTTCTAT 2351 AAACTACCTT GTATAAAATA TTATTTATTT TCTCTATCCC GGCATTCTAA 2401 TTTCTCTTAT CCTAATTAAT TTTTCTTTAG CCCTTATTTT GATGAGTATT 2451 ATGCCGAATA CAGGCAGCCC TCACTTTTCA TGGCCAGTGC AAGATTGCAA 2501 AAAGACTGTG CAACCTGAAA CCCAGGAAAG CAGTCTCCAT AGTCAATCAG 2551 AAAAACAATG ATCATTCTGT GACCTTTACC ATTTTTTGTC AAAATATTAG 2601 AAACTCTCAC ACTCTCAGTT ACAAATGTAG AGGACAATGA AAATATAATG 2651 AAATAAATAT TTATTTGTGC ACTACAATTC AAAGCATTAG AAACATTGAA 2701 GTCAATGGCG TTTCTTGTAA ATGTATCCAG ATGAGGTTGG AAGAGTGCTT 2751 GACCTTTTTG TATATTTCTA ATATGGAGTG ATATAGTTTG GCTCTGTGTC 2801 TCCATCCAAA TCTCATCTTA AATTGTAATC TGCATGTGTT GTGGGAATGG 2851 GACCTAGGTA GGAGGTGACT GAATACATGG GGGCGGACTT CCCCCTTGCT 2901 GTTCTTGTGA TAGTGAGTTC TCATAAGATC TCAGTGAGTT CTCATGAGAT 2951 CTGGTTTTTT GAAAGTGTGT GGCAAGTCCC CCTTCGCTCT CTCTCTCTCT 3001 CTCCCTCCTG CCACCATGTG AAGAAGGTGC CTGCTTCCTT TTCTCCTTCC 3051 ACCATGGTTG TAAGTTTCCT GAGGCCTCCC AGTCATGCTT CCTGTTAAGC 3101 CTGTGGAACT GTGAGTCCAA TTAAACCTCT TTTATTCATA AAATATCCAG 3151 TTTCTGGTAG TTCTTTATAG CAGTGTGAGA ATGGGCTAAT ACACGGAGCA 3201 AGCATCGTTC TTTCATTTTT ATTTATTTTA TTTTTTGAGA TGGAGTTTCA 3251 CCTTATTCCC AGGCTGGAGT GCAATGTCGT GATCTTGGCT CACTGCAACC 3301 CCCGCCTCCA GGGTTCAAGT GATTCTCCTG CCTCAGCCTC CTGAGTAGCT 3351 GGGATTACAG GCATGTACCA CCACACCCAG CTAATTTTGT ATTTTTAGTA 3401 GAGATGGGGT TTCTCCATGT TGATCAGACT AGTCTTGAAC TCCCGACCTC 3451 AGGTGATCCA CCTGTCTTGG CCTCCCAAAG TGCTGGGATT ACAGGCATGA 3501 GCCACCATGC CTAGCCAGCA AGCATCATTT CTATTATACC TTGGTGTTTG 3551 CCTCTTTCTA AGTTTGGACT AGCTTCCAAC ATCTTATCCC TTGAATTTTC 3601 AATATTGTGG AATCACTCCA GAAGATCCTT TCATGTGAAG TTTTTTGCTG 3651 GCATTTCAAC CTTTGGGACA TCTTCAGCCC TTTTATTACC ACTCCTCTCC 3701 CATTTGTGGC AGTTTGCGTT TACTACCTCC CTCTGGCTGC CTATCTGAAG 3751 TTCCTGCATC AGGGTCTACA TTGCCACAGT CAACTATTTG TACTTCTAGA 3801 ATTC // LOCUS AB028996 4680 bp mRNA PRI 04-AUG-1999 DEFINITION Homo sapiens mRNA for KIAA1073 protein, complete cds. ACCESSION AB028996 NID g5689482 VERSION AB028996.1 GI:5689482 KEYWORDS . SOURCE Homo sapiens brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj06550. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kikuno,R., Nagase,T., Ishikawa,K., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6, 197-205 (1999) REFERENCE 2 (bases 1 to 4680) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (17-JUN-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4680 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /clone="hj06550" /clone_lib="pBluescriptII SK plus" /tissue_type="brain" gene 342. .2273 /gene="KIAA1073" CDS 342. .2273 /gene="KIAA1073" /codon_start=1 /product="KIAA1073 protein" /protein_id="BAA83025.1" /db_xref="PID:d1046852" /db_xref="PID:g5689483" /db_xref="GI:5689483" /translation="METSSSCESLGSQPAAARPPSVDSLSSASTSHSENSVHTKSASV VSSDSISTSADNFSPDLRVLRESNKLAEMEEPPLLPGENIKDMAKDVTYICPFTGAVR GTLTVTNYRLYFKSMERDPPFVLDASLGVINRVEKIGGASSRGENSYGLETVCKDIRN LRFAHKPEGRTRRSIFENLMKYAFPVSNNLPLFAFEYKEVFPENGWKLYDPLLEYRRQ GIPNESWRITKINERYELCDTYPALLVVPANIPDEELKRVASFRSRGRIPVLSWIHPE SQATITRCSQPMVGVSGKRSKEDEKYLQAIMDSNAQSHKIFIFDARPSVNAVANKAKG GGYESEDAYQNAELVFLDIHNIHVMRESLRKLKEIVYPNIEETHWLSNLESTHWLEHI KLILAGALRIADKVESGKTSVVVHCSDGWDRTAQLTSLAMLMLDGYYRTIRGFEVLVE KEWLSFGHRFQLRVGHGDKNHADADRSPVFLQFIDCVWQMTRQFPTAFEFNEYFLITI LDHLYSCLFGTFLCNSEQQRGKENLPKRTVSLWSYINSQLEDFTNPLYGSYSNHVLYP VASMRHLELWVGYYIRWNPRMKPQEPIHNRYKELLAKRAELQKKVEELQREISNRSTS SSERASSPAQCVTPVQTVV" BASE COUNT 1354 a 1011 c 920 g 1395 t ORIGIN 1 CTCCTTCCAT AAGGGTCTGG TCCTTCATCC CTCCCGCTTC ACCAGACCCC 51 CTCACCCTGG GAGCCGCCAC CTCCCTTTCC CCAAGACCAG ATGTCGCGCG 101 GCCGGACACA GCCAGCACGG AGAGTCGATG CCGGCGTCTG AGCTGCGCAG 151 TGGGGTCTTC CCGCTGCCCA GCAGCCTACA GGCGCGGTGC ACTCTGGGGG 201 AACATGGCCG CTTCCGGTCT CCCTCCCGGG CCGGCGCTGG CCTGACTGCG 251 GCCCCGGTCC GTAGCACTCC GCCCTCCGCT TCTCCCGCCC TGTAGCCGCG 301 AAGACTGCTT CAGCCTTTCC CTGTGCTGCC CCTGCCGCGC GATGGAGACG 351 AGCTCGAGCT GCGAGAGTCT TGGCTCCCAG CCGGCGGCGG CTCGGCCGCC 401 CAGCGTGGAC TCCTTGTCCA GTGCCTCCAC TTCTCATTCA GAGAATTCAG 451 TGCATACAAA ATCAGCTTCT GTTGTATCAT CAGATTCCAT TTCAACTTCT 501 GCCGACAACT TTTCTCCTGA TTTGAGGGTC CTGAGGGAGT CTAACAAGTT 551 AGCAGAAATG GAAGAACCAC CCTTGCTTCC AGGAGAAAAT ATTAAAGACA 601 TGGCCAAAGA TGTAACTTAT ATATGTCCAT TCACTGGCGC TGTACGAGGA 651 ACTCTGACTG TCACGAATTA TAGGTTATAT TTCAAAAGCA TGGAACGGGA 701 TCCCCCATTT GTTTTAGATG CTTCCCTTGG TGTGATAAAT AGAGTAGAAA 751 AAATTGGTGG TGCTTCTAGT CGAGGTGAAA ATTCTTATGG ACTAGAAACT 801 GTGTGTAAGG ATATTAGGAA TTTACGATTT GCTCATAAAC CTGAGGGGCG 851 GACAAGAAGA TCCATATTTG AGAATCTAAT GAAATATGCA TTTCCTGTCT 901 CTAATAACCT GCCTCTTTTT GCTTTTGAAT ACAAAGAAGT ATTCCCTGAA 951 AATGGGTGGA AGCTATATGA CCCTCTTTTA GAGTATAGAA GGCAGGGAAT 1001 TCCAAATGAA AGCTGGAGAA TAACAAAGAT AAATGAACGA TATGAACTTT 1051 GTGATACATA CCCTGCCCTC CTGGTTGTGC CAGCAAATAT TCCTGATGAA 1101 GAATTAAAGA GAGTGGCATC CTTCAGATCA AGAGGCCGTA TCCCAGTTTT 1151 ATCATGGATT CATCCTGAAA GTCAAGCCAC AATCACTCGG TGTAGCCAGC 1201 CCATGGTTGG AGTGAGTGGA AAGCGAAGCA AAGAAGATGA AAAATACCTT 1251 CAAGCTATCA TGGATTCCAA TGCCCAGTCT CACAAAATCT TTATATTTGA 1301 TGCCCGGCCA AGTGTTAATG CTGTTGCCAA CAAGGCAAAG GGTGGAGGTT 1351 ATGAAAGTGA AGATGCCTAT CAAAATGCTG AACTAGTTTT CCTGGATATC 1401 CACAATATTC ATGTTATGAG AGAATCATTA CGAAAACTTA AGGAGATTGT 1451 GTACCCCAAC ATTGAGGAAA CTCACTGGTT GTCTAACTTG GAATCTACTC 1501 ATTGGCTAGA ACATATTAAG CTTATTCTTG CAGGGGCTCT TAGGATTGCT 1551 GACAAGGTAG AGTCAGGGAA GACGTCTGTG GTAGTGCATT GCAGTGATGG 1601 TTGGGATCGC ACAGCTCAGC TCACTTCCCT TGCCATGCTC ATGTTGGATG 1651 GATACTATCG AACCATCCGA GGATTTGAAG TCCTTGTGGA GAAAGAATGG 1701 CTAAGTTTTG GACATCGATT TCAACTAAGA GTTGGCCATG GAGATAAGAA 1751 CCATGCAGAT GCAGACAGAT CGCCTGTTTT TCTTCAATTT ATTGACTGTG 1801 TCTGGCAGAT GACAAGACAG TTTCCTACCG CATTTGAATT CAATGAGTAT 1851 TTTCTCATTA CCATTTTGGA CCACCTATAC AGCTGCTTAT TCGGAACATT 1901 CCTCTGTAAT AGTGAACAAC AGAGAGGAAA AGAGAATCTT CCTAAAAGGA 1951 CTGTGTCACT GTGGTCTTAC ATAAACAGCC AGCTGGAAGA CTTCACTAAT 2001 CCTCTCTATG GGAGCTATTC CAATCATGTC CTTTATCCAG TAGCCAGCAT 2051 GCGCCACCTA GAGCTCTGGG TGGGATATTA CATAAGGTGG AATCCACGGA 2101 TGAAACCACA GGAACCTATT CACAACAGAT ACAAAGAACT TCTTGCTAAA 2151 CGAGCAGAGC TTCAGAAAAA AGTAGAGGAA CTACAGAGAG AGATTTCTAA 2201 CCGATCAACC TCATCCTCAG AGAGAGCCAG CTCTCCTGCA CAGTGTGTCA 2251 CTCCTGTCCA AACTGTTGTA TAAAGGACTG TAAGATCAGG GGCATCATTG 2301 CTATACACTC TTGATTACAC TGGCAGCTCT ATGAGTAGAA AGTCTTCGGA 2351 ATTTAGAACC CATCTATGAG AGAAAGTTCA GTCACTTTAT TTATTTTAAA 2401 TCTCTCTAGG ATGAGTTTAG AACTGTAGCA GTGCAGGTGG CTTAAGTGAA 2451 GTAACTCCAT ATGTAATTAC ATGATTATGA TACTAATCTT TTAAGTATCC 2501 AAAGAATATT AAAATACTTC AATCCTGGAT TTACAGTGGG AACAAGTTTC 2551 TATTAAAAGG CAAATGCTGT TACAAATTTT TGGCATCTGG TAATATTAAA 2601 ACCATTTTAG AAATACACTC TGTGCTCACT GTGCAGAGGA ACATCAGTTT 2651 TCAAACCAAC ACTGAAATTC TGTGGCATCA CATATATTGG GCCTTGATGT 2701 CATGACAGAT CAAAATCATT TGATATCCCT TTCTCCATTC TAGGTTTTTC 2751 TTTTTTTCAG TAACTGATTT ACCTTGATCA CTTTTCAACT TCCATATTCT 2801 TCATATAGTA AAAGGCAAAG TATTGAAGAT ACTACGGTGT GGTAGTAGTT 2851 GAAAATTATT GCCGTCATTA TTTACATACT TAAGACATAT TAGCAAGTTG 2901 ATCCAAAATG GGAGGCCTTA TAGATGTGCT TGGGGGAAAA TGAAGGGGAG 2951 AAAGTAGCCA TACAGGAGTT CAAAGAATTC CATGCCCTTC AGATTAGCCC 3001 AATTACCAGA AACATCATGA AAGATATTTT AAAAACTAAT TATTACTACA 3051 GTGTATTTCA CTTGTCTTGT GTGTCTGAAC ACACAGAAGC TAATTAGCAA 3101 GTTTTTAAGA AGTATTTAAA AATCTTACTA GGATTGACAT TTTTTCTGAA 3151 TTCTGTATAA ATAGCTTATA GTGAGAAGTA CTGTGCTCAA ATTTTACATT 3201 TTTTTCCTTT GCAAATTCTG TAATTTCACT CAACGATTAA GTCTACCAAA 3251 GAACACACTG CATGTAAAAG ATGTATTACA ATCTCAAAGC CAGTAAAAGA 3301 AATCTTGCTT CACTGTTCAC CTGCTACAAG TAAGAGTTTG GTGCTGGTAG 3351 AAACATTTGA CTCTGATGTC TATTTTATTC TACATAAGAG CCATATGTAA 3401 TGTACTGTAA CAAAGGAGCT TCTTGTCCCC TTGGTCTTTT AATTAAAAGA 3451 AATTCCAACT GACTTTTAAA CTTTGTTCTT GTCCAAAGTT GCCATTTCTT 3501 TTTTTTCCCC AGAAATATTT GGAAATTATT GGAGGAATAT GCACCCCAGA 3551 TGAAAATGTT CAGTTTGTAC CCATTTTTCC TTAACCAACA CCCAAATCAA 3601 ACAATTAAAA TATACAGTGT TTTTCCACTC ACTAATTCAC TATACAGAGA 3651 GTCTGAACCT TAGCCTCCCT CTTGGTCTTG CAGTGAGGAA ATTTCTATTA 3701 GTATATCCAA TTTAGCAAAA TTGGTACCAA AATGATTTCT TTGGTAATTG 3751 TGTGAAATAT AAGCTTTTTA ACAGGGCATT TAAGTGGCTA GCAAATCAGT 3801 AATTAAAAAT TAAGCTTTCT ACTCCAAGTA TTTCACAAAT GCATCTGCCA 3851 TTTTCCTCAT TTAAACCTTG GTTATCTTGG CCTGATACCA CATAAAAGAA 3901 TGTAGAATGG CTGAAGAGAT CAAGAATTTA AAGCTTCTAG TCTTAACATA 3951 CTTGCATCCA CTTCAAATTC AAATCAAAAG CCAGGGAAAT CTAAGTGCAA 4001 CCCTACCACT TCTCTGCTGA GAACCTTCCA GTGGTTCCCC TCACCTTCTG 4051 CAGAAGTCTC CAATATGGAG TACATGCACT TGGGCATTTA ATATATACCA 4101 CTGGTGTGTG TGGGAGGGAG GGAGGAGGAA TACTAGCCCT TTTTATATAT 4151 TTACACAAGC AAAACTTTTA AATATTTGAA TTGACAGTTA CATGTTTCAT 4201 AACTTCGTAT GTCTATTGGT TGTGCAGGTG TAATTTTTTC CCTTTTTGAT 4251 TAGGGTTACA AAATTTAGAG ACCAGTATGA TTAAGTTGAA GCTCCTTAGC 4301 CTCCTTCGAC CTAGTCTCTG CATACCTCAA CTTTTACGTA CCAATGCTAC 4351 TCTGCTGTTC ACAATTGCCT CATGTAATCT GCAGATTCCT GCCTCCCCAC 4401 TTTGGTTCAG TCTGTCTTGT GCACCTGGAA CAACTGTTCT CCCTTGTGAC 4451 TAATTCCTAT TTTCTAGAGT TTAGGCATCA TCTCTTCCTT TGGGAAGTTA 4501 TCTGATTCAC GACTGCCTTC TCTGACATCC CCACCTTCCT CTGTGCCCCC 4551 ATAGCACTGT GTATACCGCT ACTACCACTG CAATTCACAT TATATTGGAA 4601 TGAACAATTC ACATGTCTAC CACAAGTCTG TAAACATAAC CTTATTTGAA 4651 ATGAATTGCA ATAAAGCTCT GTTACAACGT // LOCUS D86985 6025 bp mRNA PRI 07-FEB-1999 DEFINITION Human mRNA for KIAA0232 gene, complete cds. ACCESSION D86985 NID g1504043 VERSION D86985.1 GI:1504043 KEYWORDS KIAA0232. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2598. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6025) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6025) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1. .6025 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 4. (RH_ID :RH25438)" /clone="HA2598" /sex="male" /tissue_type="bone marrow" 5'UTR 1. .596 gene 597. .3623 /gene="KIAA0232" CDS 597. .3623 /gene="KIAA0232" /citation=[3] /codon_start=1 /protein_id="BAA13221.1" /db_xref="PID:d1013910" /db_xref="PID:g1504044" /db_xref="GI:1504044" /translation="MENRKDTEYKEEPLWYTEPIAEYFVPLSRKSKLETTYRNRQDTS DLTSEAVEELSESVHGLCISNNNLHKTYLAAGTFIDGHFVEMPAVINEDIDLTGTSLC SLPEDNKYLDDIHLSELTHFYEVDIDQSMLDPGASETMQGESRILNMIRQKSKENTDF EAECCIVLDGMELQGERAIWTDSTSSVGAEGLFLQDLGNLAQFWECCSSSSGDADGES FGGDSPVRLSPILDSTVLNSHLLAGNQELFSDINEGSGINSCFSVFEVQCSNSVLPFS FETLNLGNENTDSSANMLGKTQSRLLIWTKNSAFEENEHCSNLSTRTCSPWSHSEETR SDNETLNIQFEESTQFNAEDINYVVPRVSSNYVDEELLDFLQDETCQQNSRTLGEIPT LVFKKTSKLESVCGIQLEQKTENKNFETTQVCNESPHGDGYSSGVIKDIWTKMADTNS VATVEIERTDAELFSADVNNYCCCLDAEAELETLQEPDKAVRRSEYHLWEGQKESLEK RAFASSELSNVDGGDYTTPSKPWDVAQDKENTFILGGVYGELKTFNSDGEWAVVPPSH TKGSLLQCAASDVVTIAGTDVFMTPGNSFAPGHRQLWKPFVSFEQNDQPKSGENGLNK GFSFIFHEDLLGACGNFQVEDPGLEYSFSSFDLSNPFSQVLHVECSFEPEGIASFSPS FKPKSILCSDSDSEVFHPRICGVDRTQYRAIRISPRTHFRPISASELSPGGGSESEFE SEKDEANIPIPSQVDIFEDPQADLKPLEEDAEKEGHYYGKSELESGKFLPRLKKSGME KSAQTSLDSQEESTGILSVGKQNQCLECSMNESLEIDLESSEANCKIMAQCEEEINNF CGCKAGCQFPAYEDNPVSSGQLEEFPVLNTDIQGMNRSQEKQTWWEKALYSPLFPASE CEECYTNAKGESGLEEYPDAKETPSNEERLLDFNRVSSVYEARCTGERDSGAKSDGFR GKMCSSASSTSEETGSEGGGEWVGPSEEELFSRTHL" 3'UTR 3624. .6025 BASE COUNT 1789 a 1146 c 1391 g 1699 t ORIGIN 1 CGATCTGCTT CTGATGAAAG CTCTGGTATC GAGACTTTAG TGGAGGAGCT 51 CTGCTCCAGA CTGAAAGACC TTCAGAGTAA GCAAGAAGAG AAGATTCACA 101 AAAAGTTAGA GGGGTCTCCC TCTCCAGAGG CAGAATTATC CCCTCCAGCA 151 AAGGATCAAG TGGAAATGTA CTATGAAGCA TTTCCACCAC TTTCTGAGAA 201 ACCAGTTTGC CTGCAAGAAA TCATGACTGT GTGGAACAAG TCTAAAGTCT 251 GTTCTTACTC TAGCTCTTCT TCATCATCCA CAGCCCCACC AGCTAGCACA 301 GATACTTCCT CTCCTAAGGA CTGCAACAGT GAAAGTGAAG TCACCAAGGA 351 AAGAAGCAGT GAAGTACCCA CCACTGTGCA TGAGAAAACC CAGAGCAAAA 401 GCAAAAACGA GAAGGAAAAC AAATTTAGTA ATGGCTTTCT TTGAAGCACG 451 GTGAAAAGGC TGAAAGGAAC ATTCATACTG GAAGTAGTAG CAGTAGCAGC 501 AGTGGTTCTG TCAAACAGCT GTGCAAGCGG GGTAAGAGAC CTTTAAAAGA 551 AATAGGGAGA AAAGATCCTG GGAGCACTGA AGGAAAAGAC CTGTACATGG 601 AGAATAGAAA GGACACAGAG TATAAAGAGG AGCCCTTGTG GTACACCGAG 651 CCAATTGCTG AATATTTTGT TCCTCTGAGC AGAAAAAGTA AACTAGAGAC 701 CACATACCGA AACAGACAGG ATACAAGTGA TCTGACATCA GAGGCAGTGG 751 AAGAATTGTC TGAATCAGTG CATGGTCTTT GTATCAGCAA CAATAATCTT 801 CATAAAACAT ACCTCGCAGC AGGTACTTTC ATTGATGGTC ATTTTGTAGA 851 AATGCCTGCA GTTATAAATG AGGATATTGA CCTCACTGGG ACCTCATTAT 901 GTTCTCTACC AGAGGACAAT AAATACCTGG ATGATATTCA TCTATCAGAA 951 TTAACGCACT TCTATGAAGT GGATATTGAT CAATCCATGT TGGATCCTGG 1001 TGCCTCAGAA ACAATGCAAG GAGAAAGTCG GATTTTGAAT ATGATTCGAC 1051 AGAAAAGCAA AGAGAACACA GATTTTGAGG CAGAATGTTG CATAGTGTTA 1101 GATGGTATGG AGTTGCAAGG GGAACGTGCA ATATGGACAG ATTCTACCAG 1151 CTCCGTAGGT GCTGAGGGCT TATTCCTGCA GGACCTTGGC AATCTGGCTC 1201 AGTTTTGGGA GTGCTGTTCA TCCAGCTCCG GTGATGCTGA TGGGGAGAGT 1251 TTTGGAGGAG ACTCTCCAGT TAGACTCTCT CCCATCTTAG ACAGCACAGT 1301 GCTCAATTCA CACCTGCTTG CTGGCAATCA AGAGCTCTTT TCAGATATTA 1351 ATGAAGGATC TGGTATAAAC TCTTGTTTTT CAGTGTTTGA AGTGCAATGC 1401 AGTAATTCTG TTTTACCATT TTCTTTTGAA ACACTCAACT TGGGAAATGA 1451 AAATACAGAT TCTAGTGCTA ATATGCTTGG GAAAACACAG TCTAGATTGC 1501 TAATATGGAC CAAAAATAGT GCCTTTGAAG AAAATGAACA CTGTTCTAAT 1551 CTTTCAACAA GAACTTGTAG TCCATGGTCC CATTCAGAAG AAACACGTTC 1601 AGACAATGAA ACATTAAATA TTCAGTTTGA AGAATCCACA CAGTTTAATG 1651 CCGAAGATAT TAATTATGTA GTTCCTAGAG TCTCGTCAAA TTATGTAGAT 1701 GAAGAACTTC TAGATTTTTT GCAAGATGAA ACTTGCCAGC AAAACAGTAG 1751 AACTTTAGGT GAGATTCCTA CATTAGTTTT CAAAAAAACA TCTAAACTAG 1801 AATCCGTCTG TGGTATTCAG CTAGAACAAA AAACAGAAAA CAAAAATTTT 1851 GAAACTACAC AAGTATGTAA TGAAAGTCCA CATGGAGATG GCTACAGCTC 1901 AGGGGTTATT AAAGACATTT GGACAAAGAT GGCAGACACA AATTCTGTGG 1951 CTACAGTAGA AATAGAAAGA ACTGATGCTG AGTTGTTTTC GGCAGATGTA 2001 AATAACTACT GCTGCTGTCT AGATGCTGAA GCTGAACTGG AGACCCTTCA 2051 GGAGCCTGAT AAGGCTGTGC GGAGGTCAGA GTACCATCTG TGGGAGGGAC 2101 AGAAAGAGAG CCTGGAGAAA AGAGCATTTG CTTCTAGTGA GCTATCAAAC 2151 GTGGATGGTG GTGATTATAC AACACCCTCT AAACCCTGGG ATGTAGCCCA 2201 AGATAAAGAA AACACATTCA TTCTTGGAGG AGTTTATGGA GAACTCAAAA 2251 CCTTCAATAG TGATGGGGAG TGGGCAGTCG TACCACCTAG TCACACAAAA 2301 GGAAGTCTGT TACAGTGTGC AGCTTCTGAT GTTGTGACGA TAGCTGGTAC 2351 AGATGTCTTT ATGACCCCAG GAAACAGTTT TGCTCCTGGG CACAGGCAGT 2401 TATGGAAACC CTTCGTGTCA TTTGAACAGA ATGATCAGCC GAAGAGTGGG 2451 GAAAATGGGT TAAATAAGGG ATTTTCTTTT ATCTTCCATG AAGACTTACT 2501 AGGAGCTTGT GGCAACTTTC AAGTCGAAGA TCCTGGACTT GAATACTCAT 2551 TTTCTTCCTT TGACTTAAGC AATCCATTTT CACAAGTTCT TCATGTAGAA 2601 TGCTCATTTG AACCTGAAGG GATTGCATCT TTCAGCCCCA GTTTTAAACC 2651 GAAATCAATC CTCTGTTCTG ATTCAGACAG TGAAGTGTTT CACCCCAGGA 2701 TATGTGGTGT TGACAGAACA CAATACAGGG CTATTCGGAT CTCTCCTCGG 2751 ACTCACTTTC GCCCAATTTC TGCATCCGAA CTGTCCCCAG GAGGAGGAAG 2801 CGAGTCAGAA TTTGAATCTG AGAAAGATGA AGCAAATATT CCCATTCCTT 2851 CTCAAGTTGA TATATTTGAA GATCCGCAGG CAGATCTCAA ACCTTTGGAA 2901 GAAGATGCAG AGAAAGAAGG CCATTACTAT GGAAAATCAG AGCTTGAGTC 2951 TGGAAAATTC CTTCCCAGGT TAAAAAAATC TGGGATGGAA AAGAGTGCTC 3001 AGACATCACT GGATTCCCAG GAGGAATCAA CTGGGATTCT TTCAGTAGGA 3051 AAGCAAAATC AGTGTTTGGA ATGTAGCATG AATGAATCCC TGGAAATAGA 3101 TTTAGAAAGC TCAGAAGCAA ATTGTAAAAT AATGGCACAA TGCGAGGAAG 3151 AAATTAATAA TTTTTGTGGT TGCAAAGCAG GTTGTCAGTT TCCTGCTTAT 3201 GAAGATAATC CAGTTTCTTC GGGACAGCTG GAAGAGTTCC CTGTATTGAA 3251 CACTGATATA CAAGGAATGA ATAGAAGTCA AGAAAAACAG ACCTGGTGGG 3301 AAAAAGCCTT GTACTCTCCT CTTTTTCCTG CATCAGAGTG TGAAGAATGT 3351 TACACAAATG CCAAGGGAGA GAGTGGTTTA GAAGAATATC CAGATGCTAA 3401 AGAGACACCC AGTAATGAAG AGCGCCTGTT AGATTTTAAT AGGGTGTCTT 3451 CTGTTTATGA AGCAAGATGT ACAGGAGAGA GAGATTCTGG AGCAAAGTCA 3501 GATGGCTTCC GCGGAAAGAT GTGCTCCAGC GCCAGCTCCA CCTCGGAAGA 3551 GACAGGCTCA GAAGGCGGAG GCGAGTGGGT GGGCCCTAGT GAAGAGGAGC 3601 TCTTTTCTCG AACTCATCTC TAAACCTGCA AAATAGTACA AATTATTGTT 3651 TAAAAATGAT ATGTGATGGA AAATTACTCT TCAGTGAGAC CTGTTAATCT 3701 AAAACAACAA CTTAGGTTTC CTCTTCAATT AACTGATTCA GATTGGTAAT 3751 AATTATCTTT CTCTTCTTGC TTATTTTAGA GTTGAGGACA GCTATCCTGT 3801 TAAAGATTTT TTTTCCCAGC TGTTAAATTC TTGGCTATTT GAAATAGACT 3851 AGATTGTGTT GTCAAATCAA GAATGGGTGT GCATGTGCTT GTCTTAGAAG 3901 TATCACTGCT TTTTGCATCT TAACTGCAGT TAATTTTCCT TCCGACTGCG 3951 GTTATATCAC TATGACCTTA CTAGCATTGC AGTGTCAACA ACCACTTCTG 4001 CTCTTCAGAG ACTTCAGCTT TGGAGCATTT AGGCTTTGTT CTCCAAGAAC 4051 TGGGATATCC ATTCTTACCC TACAGTGGCT TGATGCCTTT CTGAAGGCGA 4101 GAGGGAAGCC TGGGTGACTC AGCGGTGGTC TCCATTCAGC AAAATCTCAT 4151 GTACATTTCC AGTAGGAACC GCAGAGGTGT GCTTTTCAAG ACTCACCAAA 4201 TACTGTGTTT TCTCTCTTAG GATTTCTTTT CCCCTAAAGT ATCACGGAAG 4251 ATACTATGGT TCGTGACTTT CTTGCTAACT GAAGAAGCCA AGGATTTGGG 4301 GTGTGGGGTC GTATGCGAGA CACAGTGGGG TAAGGGTGCA TACCCCACCC 4351 CTTACCTGCT CTCATACTGC AGTTACATTT ACACCAAAAC CCCATGCAGG 4401 GTTCTTTGTG GTGAGTGTTC CATACGTGCT AAGGACCTTA GTTGCAGATT 4451 GTTACTTTCT GGTGACCTAT GTTGAATTGA AACCCCCAAA ACTTGAAATT 4501 GTGAACATTT GACATGCAGT AAAGGCCACC TCATCACCCA GAGAAATCTT 4551 TGGCTGCTGC AGCTAGCCGC TTCTTGGCTG TGATGTAGTA TAGCTTCGAT 4601 CTCATTTTGT GTTTGAGAGA ATGTTCTGGG CAAGTTCTGT GTGTGGTGGG 4651 TTGGGGCGGG TAGAGTCATG AGTTTTCCAC ATCCCTGTGT GGTGGTTTTG 4701 CTGACTGTCG CTCCGTGGGA CTGGCTCCCG TTTCTCCTTG GTGAGCCCGG 4751 GGAGCCGGCG CATCTTGTGA GTCGCGTCTG TGCATGGCGA TCCGCTCCTC 4801 CGGCTCTCAT GGCATTGTGC CACAGGCAGA GGCCAGGAGG AGCAGTATGT 4851 GCACAGCCGA AACATTTTAC ATTTTTTACA TTGTTTTTCT TTTTTAACCA 4901 ACTCATTGTT TAAAAAACAA AAACAAAAAA AACCTAATCT GTGAAATCAG 4951 CGTAGCATGC CTGGAGCATC AGGAATGGCA GAAAAGTCTG ATGCGCTCTA 5001 GACAGCTTCA CCACTCATTT GGGCAGGCAG TAAACACACA TATAATTTAT 5051 TAGCTGGGAG CTGAACTGGC TGTGAAATCT ATGATTTGCT TTGAACATTT 5101 GGGTTTTGTT GCCTTTTTCT TAATTGATAA CACAGAAAAG AAAGTACCAT 5151 CAAAGACTGT GGAGTCATTG AGGGTCTGTG TGTCCTCACC GAGAGGGACC 5201 TGGTGTGCCC GCCGGGTCGA TCTTCCCACG TGTTAGGGTT TATTTTTATA 5251 CAACACATCT TTTGACACTT TAAGGTGGGT GGGTGTGTGT GTGTGTGTGT 5301 GTGTGCGCGC GTGCGCGCGC GCATGTGTAA GGTTTTATGT TGCTGTTATT 5351 TATTTACGAA CTTCAGATAC GTTTTTATGT ATTTTTCATT CTTCTGGAGC 5401 TTCCTAAAAA TTGATAAGCA TCTGCACTGA AATATAATTT AACAGCAAAA 5451 GTAAAAAAGG ATTGAAAGTT GTAAATTCCT CATATCACTA CAGTGACGAT 5501 TATTCTAGAA ATCGTTGCTT GTGTAGCAAA GACCAAATAA ATAGATTTCA 5551 GACACAACCT TGAGCACAGT TGATTTTGGA CAGCTGCTGT TTATTAGGAA 5601 AGGGCTCCAG GTGGCAAAGG TGCACACTTC CTCAGACACA GGTGAGAAGA 5651 TGCAGCACCT TCCACAGGTG AATGGGACGG ATTCGAAGTG AGCAAAGGGA 5701 TTCACAAATT ATGTATTTAT TTGTTTTCAT AGTTAAGTAG CTGAAGCTCA 5751 GAGGCTTTCA GCAACAGAGA TGAAAGTGTG GCTTTTTAGT TTTGTGAATG 5801 GATGATCACA AAGAAAAAGC ATTTTTAAAA AGTTGGCAAA CGCTGAAACG 5851 CACTGTGGTA TGAAGCGCAT TGCATTTCCA TAGCACTGAA GTACCAGTTT 5901 CCATTCCTGG GCTGAGATTG TTTTTCCCGT GGTTGTATTG TTCTGATTTC 5951 ACGTACACCA GAGTAACTGA TTTTTTTTTG TTTGTTTTCT TGTGGAGTTA 6001 ACACCAAATA AAAATTGTAA AAAAC // LOCUS AF030880 4930 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens pendrin (PDS) mRNA, complete cds. ACCESSION AF030880 NID g2654004 VERSION AF030880.1 GI:2654004 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4930) AUTHORS Everett,L.A., Glaser,B., Beck,J.C., Idol,J.R., Buchs,A., Heyman,M., Adawi,F., Hazani,E., Nassir,E., Baxevanis,A.D., Sheffield,V.S. and Green,E.D. TITLE Pendred syndrome is caused by mutations in a putative sulphate transporter gene (PDS) JOURNAL Nature Genet. 17 (4), 411-422 (1997) MEDLINE 98061089 REFERENCE 2 (bases 1 to 4930) AUTHORS Everett,L.A., Glaser,B., Beck,J.C., Idol,J.R., Buchs,A., Heyman,M., Adawi,F., Hazani,E., Nassir,E., Baxevanis,A.D., Sheffield,V.S. and Green,E.D. TITLE Direct Submission JOURNAL Submitted (21-OCT-1997) Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, 49 Convent Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1. .4930 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q22-q31.1" gene 1. .4930 /gene="PDS" CDS 225. .2567 /gene="PDS" /function="putative sulfate transporter" /note="mutated in Pendred syndrome" /codon_start=1 /product="pendrin" /protein_id="AAC51873.1" /db_xref="PID:g2654005" /db_xref="GI:2654005" /translation="MAAPGGRSEPPQLPEYSCSYMVSRPVYSELAFQQQHERRLQERK TLRESLAKCCSCSRKRAFGVLKTLVPILEWLPKYRVKEWLLSDVISGVSTGLVATLQG MAYALLAAVPVGYGLYSAFFPILTYFIFGTSRHISVGPFPVVSLMVGSVVLSMAPDEH FLVSSSNGTVLNTTMIDTAARDTARVLIASALTLLVGIIQLIFGGLQIGFIVRYLADP LVGGFTTAAAFQVLVSQLKIVLNVSTKNYNGVLSIIYTLVEIFQNIGDTNLADFTAGL LTIVVCMAVKELNDRFRHKIPVPIPIEVIVTIIATAISYGANLEKNYNAGIVKSIPRG FLPPELPPVSLFSEMLAASFSIAVVAYAIAVSVGKVYATKYDYTIDGNQEFIAFGISN IFSGFFSCFVATTALSRTAVQESTGGKTQVAGIISAAIVMIAILALGKLLEPLQKSVL AAVVIANLKGMFMQLCDIPRLWRQNKIDAVIWVFTCIVSIILGLDLGLLAGLIFGLLT VVLRVQFPSWNGLGSIPSTDIYKSTKNYKNIEEPQGVKILRFSSPIFYGNVDGFKKCI KSTVGFDAIRVYNKRLKALRKIQKLIKSGQLRATKNGIISDAVSTNNAFEPDEDIEDL EELDIPTKEIEIQVDWNSELPVKVNVPKVPIHSLVLDCGAISFLDVVGVRSLRVIVKE FQRIDVNVYFASLQDYVIEKLEQCGFFDDNIRKDTFFLTVHDAILYLQNQVKSQEGQG SILETITLIQDCKDTLELIETELTEEELDVQDEAMRTLAS" BASE COUNT 1454 a 937 c 1082 g 1457 t ORIGIN 1 CTCAGCCTTC CCGGTTCGGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT 51 TCTTTCCTGA TAGGATCGGT TGGGAAAGAC CGCAGCCTGT GTGTGTCTTT 101 CCCTTCGACC AAGGTGTCTG TTGCTCCGTA AATAAAACGT CCCACTGCCT 151 TCTGAGAGCG CTATAAAGGC AGCGGAAGGG TAGTCCGCGG GGCATTCCGG 201 GCGGGGCGCG AGCAGAGACA GGTCATGGCA GCGCCAGGCG GCAGGTCGGA 251 GCCGCCGCAG CTCCCCGAGT ACAGCTGCAG CTACATGGTG TCGCGGCCGG 301 TCTACAGCGA GCTCGCTTTC CAGCAACAGC ACGAGCGGCG CCTGCAGGAG 351 CGCAAGACGC TGCGGGAGAG CCTGGCCAAG TGCTGCAGTT GTTCAAGAAA 401 GAGAGCCTTT GGTGTGCTAA AGACTCTTGT GCCCATCTTG GAGTGGCTCC 451 CCAAATACCG AGTCAAGGAA TGGCTGCTTA GTGACGTCAT TTCGGGAGTT 501 AGTACTGGGC TAGTGGCCAC GCTGCAAGGG ATGGCATATG CCCTACTAGC 551 TGCAGTTCCT GTCGGATATG GTCTCTACTC TGCTTTTTTC CCTATCCTGA 601 CATACTTTAT CTTTGGAACA TCAAGACATA TCTCAGTTGG ACCTTTTCCA 651 GTGGTGAGTT TAATGGTGGG ATCTGTTGTT CTGAGCATGG CCCCCGACGA 701 ACACTTTCTC GTATCCAGCA GCAATGGAAC TGTATTAAAT ACTACTATGA 751 TAGACACTGC AGCTAGAGAT ACAGCTAGAG TCCTGATTGC CAGTGCCCTG 801 ACTCTGCTGG TTGGAATTAT ACAGTTGATA TTTGGTGGCT TGCAGATTGG 851 ATTCATAGTG AGGTACTTGG CAGATCCTTT GGTTGGTGGC TTCACAACAG 901 CTGCTGCCTT CCAAGTGCTG GTCTCACAGC TAAAGATTGT CCTCAATGTT 951 TCAACCAAAA ACTACAATGG AGTTCTCTCT ATTATCTATA CGCTGGTTGA 1001 GATTTTTCAA AATATTGGTG ATACCAATCT TGCTGATTTC ACTGCTGGAT 1051 TGCTCACCAT TGTCGTCTGT ATGGCAGTTA AGGAATTAAA TGATCGGTTT 1101 AGACACAAAA TCCCAGTCCC TATTCCTATA GAAGTAATTG TGACGATAAT 1151 TGCTACTGCC ATTTCATATG GAGCCAACCT GGAAAAAAAT TACAATGCTG 1201 GCATTGTTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTGA ACTTCCACCT 1251 GTGAGCTTGT TCTCGGAGAT GCTGGCTGCA TCATTTTCCA TCGCTGTGGT 1301 GGCTTATGCT ATTGCAGTGT CAGTAGGAAA AGTATATGCC ACCAAGTATG 1351 ATTACACCAT CGATGGGAAC CAGGAATTCA TTGCCTTTGG GATCAGCAAC 1401 ATCTTCTCAG GATTCTTCTC TTGTTTTGTG GCCACCACTG CTCTTTCCCG 1451 CACGGCCGTC CAGGAGAGCA CTGGAGGAAA GACACAGGTT GCTGGCATCA 1501 TCTCTGCTGC GATTGTGATG ATCGCCATTC TTGCCCTGGG GAAGCTTCTG 1551 GAACCCTTGC AGAAGTCGGT CTTGGCAGCT GTTGTAATTG CCAACCTGAA 1601 AGGGATGTTT ATGCAGCTGT GTGACATTCC TCGTCTGTGG AGACAGAATA 1651 AGATTGATGC TGTTATCTGG GTGTTTACGT GTATAGTGTC CATCATTCTG 1701 GGGCTGGATC TCGGTTTACT AGCTGGCCTT ATATTTGGAC TGTTGACTGT 1751 GGTCCTGAGA GTTCAGTTTC CTTCTTGGAA TGGCCTTGGA AGCATCCCTA 1801 GCACAGATAT CTACAAAAGT ACCAAGAATT ACAAAAACAT TGAAGAACCT 1851 CAAGGAGTGA AGATTCTTAG ATTTTCCAGT CCTATTTTCT ATGGCAATGT 1901 CGATGGTTTT AAAAAATGTA TCAAGTCCAC AGTTGGATTT GATGCCATTA 1951 GAGTATATAA TAAGAGGCTG AAAGCGCTGA GGAAAATACA GAAACTAATA 2001 AAAAGTGGAC AATTAAGAGC AACAAAGAAT GGCATCATAA GTGATGCTGT 2051 TTCAACAAAT AATGCTTTTG AGCCTGATGA GGATATTGAA GATCTGGAGG 2101 AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT 2151 GAGCTTCCAG TCAAAGTGAA CGTTCCCAAA GTGCCAATCC ATAGCCTTGT 2201 GCTTGACTGT GGAGCTATAT CTTTCCTGGA CGTTGTTGGA GTGAGATCAC 2251 TGCGGGTGAT TGTCAAAGAA TTCCAAAGAA TTGATGTGAA TGTGTATTTT 2301 GCATCACTTC AAGATTATGT GATAGAAAAG CTGGAGCAAT GCGGGTTCTT 2351 TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGACGGTC CATGATGCTA 2401 TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT 2451 TTAGAAACGA TCACTCTCAT TCAGGATTGT AAAGATACCC TTGAATTAAT 2501 AGAAACAGAG CTGACGGAAG AAGAACTTGA TGTCCAGGAT GAGGCTATGC 2551 GTACACTTGC ATCCTGAAAG TGGGTTCGGG AGGTCTCTAT GAGCAAGGAA 2601 TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA GACTCAAAAC 2651 ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2701 TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT 2751 GGCTAGAATT ATTCAGACGA TTTGGCAGCG TCCAGGGTAA GCTGGTGTTA 2801 TAATACGCTG CTGATCTACA TCACAGATTT GCTAATAATG TTCACGTGGG 2851 CCCTGGCATA TCTCTGTTCA GTTAGAGTGA GTGCTGACCC AACAGCCTCT 2901 GTGGTCAAGC GAGTCACGAA TGATTAATCA TAAAGAAAAA TCAGTTTTTG 2951 ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGGTCACATT 3001 TAGTAAATGC TGAAATAAAA TGATTAATGC ATTTATCAAT AAAAGCCTTT 3051 GAAAATACTT TGGATAATAA ATTGGAGTTT TAAAAATGCA AATTTGCTTA 3101 GTATCTAATA ATGAAGTGTT ATTACATATA GCCGGAATTG AGGATCTCTT 3151 TGATCCTGGA AATGGTTTAC CTAAAAGCTA CAGAACCAGG CCAATATATT 3201 TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT TTCATGGTTT 3251 ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3301 AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT 3351 AGTCTATAGA AATGATCATT GCATGGAGGC ATGTATAGGT ATGATCTGTG 3401 TAAAATCTGA CATAAAAACA GTGCTATTCT GAGTGAAAAT TTTTTTGATG 3451 TGCTTACATA ACCATGGTGA TTAAAATGAG TTTATATTTT TTCTCAAAAA 3501 TTTTAGCAGT GTGTAAAGTA AGTAATCTTT AACTGAACTC TGACCACTTA 3551 AAAAAAAATC TAAAAATTGA ACTACCTATA GTAGTCTGTG TTTAAAGTGA 3601 ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT 3651 TGGAATGTAC ATACTGAAAA ATACAGGTTT TTTTGACCAA AAGTTTTTAT 3701 ATCTTTTCTT TTTATTTATT TTTTTCCTAA GTGCCAACAA TTTTCTAGAT 3751 ATTATATACA ACACAGGCTT TGATCTTGGG GACTTTTCCC ATATATTTCA 3801 CACTGGAGTG AATGAAGTTG TACTTCATTT CTAGAGAAAA GTTATACCCA 3851 GGTCCCCAAT TGAGAATGTC TTGCTTGATT GAAAACGACA TCATCCCTTG 3901 GTATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTGTG 3951 CACACTCAAG TCCTGCAGTC ACCCCTGCCT AAAGATAGAA TGGCTTCTCT 4001 GTTTTTCTTC TGAAATACAA CCAGAAACAA TGTGTCTATT TCTGAAAGAA 4051 TAGGATTAAT GATCATACAA ATGGGTTAAT CCTGAATTCT GGTTGTAAAT 4101 CTGGTTACAG CATAACTAGG ATTATAATGC TGCCTCATTT TCACAGCACT 4151 ACTTGCTTAT ATTGACAACA AATCATCTCG CTAAAGAGTG AATGTAGGCC 4201 AGGCGCGGTG GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG 4251 GGTGGATCAC GAGGTCAGGA GATCGAGACC ATCCTGGCTA ACATGGTAAA 4301 ACCCCGTCTC TACTAAAAAT AGAAAAAAAG AAATTAGCCT AGCGTGGTGG 4351 CTGGCGGGCG CCTGTAGTCC CAGCTATTTG GGAGGCTAAG GCAGGAGAAT 4401 GGCGTGAACC CGGGAGGCGG AGCTTGCAGT GAGCCGAGGT CGTGCCACTG 4451 CACTCCAGCC TGGGCGACAG AGCAAGACTC CGTCTCAAAA AAAAAAAAAA 4501 AAAAAAAAAA AGAGTGAATG TAATAGTCTT GCAGAAAATG AATGAATACC 4551 TTTGTTCAAT AAAGGAAATA TGCACTGCTC ACTTTTTTGA AGGAAATGCC 4601 AAAGTTACGT TTTACAACAA GGCTAGAGTT TGTAAATTCT GGGTTCATTT 4651 GTGATGACAT AAGTCAGCAA ACTGCGGGAA TACTGTCTCT TCTATGTATT 4701 TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT GAAAATTTCA 4751 CTTGAAATTA AAGCTGCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 4801 TCCAGTATTG TATATGAGTT TTAACAAATT AAAAAATCAA ATCATGTACA 4851 TTTGAAAATA TTTGCACACA TTTAAAAATA AATGTAAAGT TGTCTTTTAA 4901 ACTACTCGGA TGTGTCCTTT CTGAACAAAA // LOCUS AB011107 6183 bp mRNA PRI 10-APR-1998 DEFINITION Homo sapiens mRNA for KIAA0535 protein, complete cds. ACCESSION AB011107 NID g3043593 VERSION AB011107.1 GI:3043593 KEYWORDS KIAA0535 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG3847. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6183) AUTHORS Ohara,O., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (1), 31-39 (1998) MEDLINE 98290545 FEATURES Location/Qualifiers source 1. .6183 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG3847" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 685. .3828 /gene="KIAA0535" CDS 685. .3828 /gene="KIAA0535" /codon_start=1 /product="KIAA0535 protein" /protein_id="BAA25461.1" /db_xref="PID:d1026391" /db_xref="PID:g3043594" /db_xref="GI:3043594" /translation="MDAEAEDKTLRTRSKGTEVPMDSLIQELSVAYDCSMAKKRTAED QALGVPVNKRKSLLMKPRHYSPKADCQEDRSDRTEDDGPLETHGHSTAEEIMIKPMDE SLLSTAQENSSRKEDRYSCYQELMVKSLMHLGKFEKNVSVQTVSENLNDSGIQSLKAE SDEADECFLIHSDDGRDKIDDSQPPFCSSDDNESNSESAENGWDSGSNFSEETKPPRV PKYVLTDHKKDLLEVPEIKTEGDKFIPCENRCDSETERKDPQNALAEPLDGNAQPSFP DVEEEDSESLAVMTEEGSDLEKAKGNLSLLEQAIALQAERGCVFHNTYKELDRFLLEH LAGERRQTKVIDMGGRQIFNNKHSPRPEKRETKCPIPGCDGTGHVTGLYPHHRSLSGC PHKVRVPLEILAMHENVLKCPTPGCTGRGHVNSNRNTHRSLSGCPIAAAEKLAMSQDK NQLDSPQTGQCPDQAHRTSLVKQIEFNFPSQAITSPRATVSKEQEKFGKVPFDYASFD AQVFGKRPLIQTVQGRKTPPFPESKHFPNPVKFPNRLPSAGAHTQSPGRASSYSYGQC SEDTHIAAAAAILNLSTRCREATDILSNKPQSLHAKGAEIEVDENGTLDLSMKKNRIL DKSAPLTSSNTSIPTPSSSPFKTSSILVNAAFYQALCDQEGWDTPINYSKTHGKTEEE KEKDPVSSLENLEEKKFPGEASIPSPKPKLHARDLKKELITCPTPGCDGSGHVTGNYA SHRSVSGCPLADKTLKSLMAANSQELKCPTPGCDGSGHVTGNYASHRSLSGCPRARKG GVKMTPTKEEKEDPELKCPVIGCDGQGHISGKYTSHRTASGCPLAAKRQKENPLNGAS LSWKLNKQELPHCPLPGCNGLGHVNNVFVTHRSLSGCPLNAQVIKKGKVSEELMTIKL KATGGIESDEEIRHLDEEIKELNESNLKIEADMMKLQTQITSMESNLKTIEEENKLIE QNNESLLKELAGLSQALISSLADIQLPQMGPISEQNFEAYVNTLTDMYSNLERDYSPE CKALLESIKQAVKGIHV" BASE COUNT 1993 a 1244 c 1293 g 1653 t ORIGIN 1 CTTTTTGTAG GGAGAAGGGC AGGATGTTTT TAACTGAATG TGACCTCAGG 51 GGAATACTAG AGAAAATAAT AAAATTTCTG AATGGGGCAG CGTGGAGAAA 101 TCCTAAGAGA AATAGCATAA GAGCATTTTG GAACACATCC AGGAAAAGAT 151 AACTTTCGAC ACACCTGTAG ACGTTCGCCA GGTAAAGGAG TGATGGAAAC 201 TCTCCAGTTC AGATCCAGTA GCTTTTAGGG AAGGAACTAC AGTTGCTGAC 251 TTAAGTTGAA GAAGCATCTA TTTAATGTCT GGTCAAATCC TACAAGAAAC 301 ACAGAAATCT ATGATTAAAA AGCTGAGCAC TTTGATATAC TGCAAAGGGT 351 AGAGAAGGCA GGACGGTAGA AATTTTCTGC AAGAAAGAAT GAATTTCAGG 401 ATTTATCACT AAATAAGACA AAGTCATTTA TTTAGTCCCC CTGACACAGC 451 AGGGCAAACT GAGTTGACAT ACAAGTTACC TGGAGAAAAA GAGAGCAATT 501 CCAGGACTTC CTCTTCAGCC TAAAAGAAGG TACCAGATCT GTGCACTGGG 551 GCGATGTGGA AGAGACCTGC TTATTGCCCC TGATGTAAGC TCCAGTAAGA 601 AAAGACGTCA AGTACAAGTA CTAGGAAATC ACTTTATACA TCTGTTTATA 651 GGAATGACCT CAGGACTTTG TGTTCATGTT ATAGATGGAT GCAGAGGCTG 701 AAGATAAAAC GCTGCGTACT CGCTCTAAAG GAACCGAGGT GCCAATGGAT 751 TCACTAATCC AGGAGCTCAG TGTTGCCTAT GATTGCTCCA TGGCAAAGAA 801 GAGAACAGCT GAAGATCAGG CTTTGGGGGT TCCAGTCAAC AAAAGGAAAT 851 CCCTGCTAAT GAAGCCCCGA CACTACAGCC CAAAAGCAGA CTGCCAAGAA 901 GACCGCAGTG ACAGGACAGA GGACGATGGC CCCTTGGAAA CACATGGTCA 951 CTCTACCGCA GAGGAAATCA TGATAAAACC TATGGATGAA AGTCTTCTTT 1001 CAACTGCACA AGAAAACTCC AGTAGGAAGG AAGACAGATA CTCTTGTTAT 1051 CAAGAGCTCA TGGTCAAGTC TTTAATGCAC TTGGGGAAAT TTGAAAAAAA 1101 TGTATCTGTT CAGACTGTAA GTGAAAATTT AAATGACAGT GGCATCCAGT 1151 CTTTAAAAGC AGAGAGCGAT GAAGCAGACG AGTGCTTTCT GATTCATTCT 1201 GATGATGGAA GAGACAAGAT TGATGATTCT CAGCCACCCT TCTGCTCCTC 1251 TGATGACAAT GAAAGTAACT CTGAAAGTGC AGAAAATGGC TGGGACAGTG 1301 GCTCCAACTT CTCAGAAGAA ACCAAACCAC CTAGAGTCCC AAAGTATGTT 1351 TTAACAGATC ATAAAAAAGA CCTATTGGAA GTTCCTGAAA TAAAAACTGA 1401 AGGTGACAAA TTTATCCCTT GTGAGAACAG GTGTGATTCT GAAACAGAAA 1451 GGAAAGACCC GCAGAATGCT CTCGCAGAAC CCCTGGATGG CAATGCCCAG 1501 CCCTCATTCC CTGACGTTGA GGAGGAAGAT AGCGAGAGCC TGGCAGTAAT 1551 GACGGAAGAG GGTAGTGACC TGGAAAAGGC CAAGGGGAAT TTAAGTTTGC 1601 TGGAGCAGGC AATTGCTCTG CAGGCTGAGC GAGGTTGTGT TTTCCATAAC 1651 ACCTACAAAG AGCTGGATAG GTTCCTGCTG GAGCACCTAG CAGGGGAAAG 1701 GAGGCAAACC AAAGTTATCG ACATGGGTGG AAGACAAATC TTTAACAATA 1751 AACATTCACC AAGGCCTGAA AAGAGGGAGA CCAAGTGCCC GATCCCTGGA 1801 TGTGATGGCA CGGGACACGT GACAGGGCTC TACCCGCACC ACCGCAGCCT 1851 TTCGGGGTGC CCCCACAAAG TGCGGGTTCC CCTGGAAATT CTTGCCATGC 1901 ATGAAAATGT GCTCAAGTGT CCCACGCCGG GATGCACAGG AAGGGGTCAT 1951 GTGAACAGCA ACCGCAACAC CCACAGGAGT CTTTCTGGTT GTCCAATTGC 2001 TGCAGCTGAA AAATTGGCAA TGTCCCAGGA TAAAAATCAG CTTGATTCTC 2051 CCCAAACTGG GCAGTGTCCT GACCAGGCCC ACAGGACAAG TTTGGTGAAG 2101 CAAATTGAAT TCAATTTCCC GTCACAAGCC ATCACCTCTC CCAGAGCCAC 2151 AGTGTCAAAA GAACAAGAGA AGTTTGGAAA AGTACCATTT GATTATGCCA 2201 GTTTTGATGC CCAAGTTTTC GGTAAACGCC CTCTCATACA AACAGTGCAA 2251 GGACGAAAAA CACCACCATT TCCTGAATCA AAGCATTTTC CAAATCCAGT 2301 GAAATTTCCT AATCGACTGC CTAGTGCAGG CGCCCACACC CAGAGCCCTG 2351 GCCGTGCCAG CTCTTATAGC TACGGTCAAT GTAGTGAAGA CACCCACATA 2401 GCAGCAGCTG CTGCCATCCT GAACCTTTCC ACCCGCTGCA GGGAAGCCAC 2451 AGACATCCTC TCCAACAAGC CACAGAGTCT GCATGCCAAG GGAGCCGAAA 2501 TAGAAGTGGA TGAAAATGGC ACATTGGACT TAAGCATGAA AAAAAATCGA 2551 ATCCTGGACA AGTCTGCACC CCTAACTTCC TCTAACACTT CTATTCCAAC 2601 TCCTTCCTCT TCCCCATTCA AAACAAGCAG CATTCTGGTC AATGCAGCAT 2651 TCTATCAGGC TCTTTGTGAC CAAGAGGGCT GGGACACTCC TATCAACTAT 2701 AGCAAAACTC ACGGGAAGAC AGAGGAGGAG AAAGAGAAAG ACCCAGTGAG 2751 CTCTCTAGAA AATTTAGAGG AAAAAAAGTT TCCTGGAGAG GCCTCTATAC 2801 CAAGCCCTAA ACCCAAGCTT CATGCAAGAG ATCTCAAAAA GGAACTAATC 2851 ACCTGTCCAA CACCAGGATG TGATGGAAGT GGCCACGTGA CAGGAAACTA 2901 TGCATCTCAT CGCAGTGTTT CTGGATGTCC TTTAGCAGAT AAGACTCTAA 2951 AATCCCTCAT GGCTGCCAAC TCTCAGGAGC TTAAGTGTCC AACCCCAGGC 3001 TGCGATGGCT CGGGGCACGT GACTGGAAAC TATGCTTCCC ACAGAAGCTT 3051 GTCCGGATGC CCTCGTGCAA GGAAAGGTGG TGTCAAAATG ACCCCTACCA 3101 AGGAAGAAAA AGAAGACCCT GAACTGAAAT GTCCTGTGAT AGGGTGTGAT 3151 GGCCAAGGTC ACATATCAGG TAAATACACA TCACACCGCA CAGCTTCTGG 3201 CTGTCCTCTG GCTGCCAAGA GACAGAAGGA GAATCCTCTC AATGGAGCCT 3251 CCCTCTCCTG GAAACTGAAC AAACAAGAGC TACCACATTG TCCCTTGCCA 3301 GGCTGCAATG GGCTGGGCCA TGTAAATAAT GTTTTTGTCA CCCACCGAAG 3351 CTTATCTGGA TGTCCTCTCA ATGCACAAGT TATCAAAAAG GGCAAGGTTT 3401 CTGAAGAACT CATGACCATC AAGCTCAAAG CAACTGGGGG AATAGAGAGT 3451 GATGAAGAAA TTAGGCATTT GGATGAAGAA ATAAAGGAAC TGAATGAATC 3501 CAACCTTAAA ATTGAAGCAG ATATGATGAA ACTTCAGACC CAGATCACAT 3551 CTATGGAGAG CAACTTAAAG ACGATAGAGG AGGAGAACAA ACTCATAGAA 3601 CAGAACAATG AAAGTCTGCT GAAAGAGCTG GCAGGTCTAA GCCAAGCTCT 3651 CATTTCAAGC CTTGCTGACA TCCAGCTTCC ACAGATGGGA CCTATCAGTG 3701 AGCAGAATTT TGAAGCATAT GTAAATACAC TCACAGATAT GTACAGCAAT 3751 CTGGAACGGG ACTATTCCCC GGAATGCAAA GCTCTACTGG AAAGTATCAA 3801 ACAGGCAGTG AAGGGTATCC ATGTGTAGGA TCACAGCGCT GCCGGGCAAC 3851 AGAAGTTACC AACAGCAGTA AACTCCAGAT GGATCTGTTA GAGGTTCATG 3901 TACTGCTAAG GCGTGGAGGT TGCCGTACTG CATTTACAAT TTGCAACATT 3951 GCACTAATTT TATTTTCCCC AGCTGATATA AAAAGGAAAG AAAAACTATG 4001 ATAGACTTCT TGGATTAAAA GCAATGCAGT CAATTATTAG ATCTTATTTA 4051 TTTTCATATG TTTTTCTTTT ATTTCTTCAT TGTACTCTTC TTTTGTAAAG 4101 TATATGTAAA ATAAATGTGA CATTTTTATA ATTTATTTAT TACTAATCAA 4151 AGAGTTTTTT ATCTTTTAAC TGCATTTTGA AGTCTGCCGT ATTTTTACAA 4201 GTGTGTTTAT TAATTTATTT TCCAATAGGA TTTAAATAGA AATGCTATTC 4251 TCAAGTCATC TTTCTTGCTG GGTTTTAATG AGGAAACAGG AAAGGGTGAA 4301 GGAAATCCTT GTCTAAGGAC TGCACTATAG TTGAGTTTGA TTTTTATTGC 4351 ACACTTCTTC CCCCACCTTT CACTGATTTT TGTATTTATA AATGAATTTG 4401 CGGTAAGGTG AGCTGCACGG AAGGAATAAG AAGACAAATG GCGCCCACTA 4451 GTGGGGAATC CGCACTCACA AAAGCACAGG ATGCTGGAAA ACAGCCTGCT 4501 CAGAATTTGT TAGCAATAAT TAAATATAGC AATCAGCAAA GTATTCGACT 4551 TGGCTGGACG GTTTTCGTTA ATATGAATTA TTTATTTGAA ATGTTTTAAA 4601 GAAACATAAG CCTTTTTAGT GATGCAGATT TGTCTGTTTG TTTTTCAAGT 4651 CATATCAGAT CGTTGGCAAC TCGTATCCCA AGATGAAAAA TAAGACTTGG 4701 TGTGACCAGC CAGGCTTTCC TGCCATATGT TGGTACAATA TACAAGTGAC 4751 AATATTGGTG TAGATTTGTA CTTAGCAAAT ACAAACACAT CCAAATGAAA 4801 AATTTTGTAG ATACCATATC CCCTGAAATA GCATTTATCT TACTGGGTTG 4851 ACTGGAAAGG AATGGAAAAT ATAGTAACAC ATGAAAAAAT GCTACTCCAA 4901 TCTGAATGAT TACTTCAAAC ACTGGCACCT TGGGTCTCAC CCACCATAGG 4951 AAACAAGACA ACATTCAATT TGATAGAAAT CTTGCCACAA AACTTCAAAT 5001 GCTACAAAAT ATACACACAC ACTCACACAC ACAGGCATAC TCACACACAG 5051 ACACACACAC ACACACACAC ACAGACTCAT CCACACTTCA AATTGAGCCC 5101 ACAATCTTGA ATTTCTGAAC GGATCAGAGT TTCATAGTTT CTATAGTAAA 5151 GGCAATGTCT ATTTCAGGGA TTGTAAAGTA GTTAAGCATT GTTTCAAAAG 5201 TTTTTTTATA TTTATTTTTT TTAAGGAAAA GGTATAGACA ACCAGCTAAA 5251 CTGCCTTTTT GGTGTGCACA CACATTTCAT GTGCAGACGT GCCTCTGTGT 5301 AAATGTACAC ATGAACTTCA TGTGGGCTTA ATTTTCTGTG CTATAAACAA 5351 AAGTGTTTAT TTTTTATTAA CCTCATGGAT ATTTAGATGG AAAGTGATGG 5401 CATTCACAGG CTTGATGTAT TCCACTGTTA TTACTGTTAC CTGCACAAAT 5451 GAAAAACAAT ACTCAACAGT AATTCCACTC CCATGAAACT TTGGTCATTG 5501 TTATGCATTA AGTGGGGCTT ATCTTTGGTT TGGAGTTCAT TTGAACTCTT 5551 GAACCTTAGT TTAGTGAAGA TGAACTGTCT GTTCTTAGGT AGAAACGGTG 5601 TTTATTTAAA AATCAGTTTT AAAAAATGAG CTACCATATG TGCTGTCTAT 5651 TATAAATGGG ACACCAAACA AAATTTTCTA TTACAGTTGT GTACTTGCAA 5701 ACATTTTGCT ATACAGTACT TCATAGATGC ATACAAATGA GCTCACTTAT 5751 TACAAAGACA AACGTTTAAT TTGCTAAATA TTTTAACAAG TTTGTTATAT 5801 ATTTTATTTA ATTTAAAAGA AATCTCTTAC CAACCTACAT ATTTATTACT 5851 ATAATTTGCT ATGACTTCAG GTTAATTTAT TTGTGTTTGC ATAGTTTGAG 5901 CAGGATGTTT TGTGAAGTAT GTTTGTATTT ATTTGCCTAC TTTGTACTTG 5951 ATGTGTTTTG TAATGTGCAC TGAATTTGTT TTCTTTTCAA CTATGTTAAT 6001 GATCAATACT GTAAATTGGG TCTTTTGTAA ACAAAAAGGC AATGATGTAT 6051 GCATTTTTTT TAATTTGAGG TAGTTTGTTT GTATACTGTT TCTCCAAACA 6101 CTTAATATTT CTTACATCAA AGCAACAAAA TTGTGTTCAG TGCTGTACAT 6151 TTGGTGTATG GTAGGAAATA AAAATTGATA ACG // LOCUS AF061326 4199 bp mRNA PRI 06-MAR-1999 DEFINITION Homo sapiens T41p (C8orf1) mRNA, complete cds. ACCESSION AF061326 NID g4337461 VERSION AF061326.1 GI:4337461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4199) AUTHORS Tauchi,H., Matsuura,S., Isomura,M., Kinjo,T., Nakamura,A., Sakamoto,S., Kondo,N., Endo,S., Komatsu,K. and Nakamura,Y. TITLE Sequence analysis of an 800-kb genomic DNA region on chromosome 8q21 that contains the nijmegen breakage syndrome gene, NBS1 JOURNAL Genomics 55 (2), 242-247 (1999) MEDLINE 99134304 REFERENCE 2 (bases 1 to 4199) AUTHORS Tauchi,H., Matsuura,S. and Komatsu,K. TITLE Direct Submission JOURNAL Submitted (24-APR-1998) Department of Radiation Biology, RIRBM, Hiroshima University, 1-2-3 Kasumi Minami-ku, Hiroshima 734-8553, Japan FEATURES Location/Qualifiers source 1. .4199 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q21" /tissue_type="testis" gene 1. .4199 /gene="C8orf1" /note="alias hT41" CDS 346. .1863 /gene="C8orf1" /codon_start=1 /product="T41p" /protein_id="AAD18134.1" /db_xref="PID:g4337462" /db_xref="GI:4337462" /translation="MPLVEETSLLEDSSVTFPVVIIGNGPSGICLSYMLSGYRPYLSS EAIHPNTILNSKLEEARHLSIVDQDLEYLSEGLEGRSSNPVAVLFDTLLHPDADFGYD YPSVLHWKLEQHHYIPHVVLGKGPPGGAWHNMEGSMLTISFGSWMELPGLKFKDWVSS KRRSLKGDRVMPEEIARYYKHYVKVMGLQKNFRENTYITSVSRLYRDQDDDDIQDRDI STKHLQIEKSNFIKRNWEIRGYQRIADGSHVPFCLFAENVALATGTLDSPAHLEIEGE DFPFVFHSMPEFGAAINKGKLRGKVDPVLIVGSGLTAADAVLCAYNSNIPVIHVFRRR VTDPSLIFKQLPKKLYPEYHKVYHMMCTQSYSVDSNLLSDYTSFPEHRVLSFKSDMKC VLQSVSGLKKIFKLSAAVVLIGSHPNLSFLKDQGCYLGHKSSQPITCKGNPVEIDTYT YECIKEANLFALGPLVGDNFVRFLKGGALGVTRCLATRQKKKHLFVERGGGDGIA" BASE COUNT 1255 a 712 c 832 g 1400 t ORIGIN 1 TGGGGCGGGC TGATGAGCAC CTGGATTTTC ACCCGGCGTG CTGGTAATTC 51 CTACACTTGT GGGTGTTTGA GAGACATGAA GAGGGGAGGA TGACCTCTTC 101 CCGGAAGCGG GACTTCCATA AAAGAAGCGT GGTGGGCGGG TCCCGGGCAC 151 CTGTGGTTTG GTGAGTCCTC CAGGTAACTG TGATGCGGGG GTCAGCGGGA 201 GAGGCACCCG GAAGCTCCCG GCGTCTGCAC CCCGGCAGCG CGAGGAAATG 251 CCCAAAGAAA CTATAGTGAC ACTGAAACTG AAGGAGAGAT TTTTAATTCC 301 TTAGTGCAAT ACTTTGGTGA CAACTTGGGG CGAAAAGTTA AAGCGATGCC 351 ATTAGTTGAA GAAACTTCTT TATTGGAAGA TTCGTCAGTG ACTTTTCCTG 401 TGGTAATAAT AGGAAATGGA CCCTCAGGAA TATGCCTTTC TTATATGTTA 451 TCAGGCTACA GACCGTATTT ATCATCAGAA GCAATACACC CAAATACAAT 501 CTTAAATAGT AAATTAGAAG AAGCAAGACA TCTTTCCATT GTTGATCAGG 551 ACTTAGAATA CTTGTCTGAG GGCCTTGAGG GCCGATCATC CAATCCAGTT 601 GCAGTACTTT TCGATACACT TCTTCATCCA GATGCTGACT TTGGGTATGA 651 TTATCCATCC GTTTTGCATT GGAAATTAGA GCAACATCAT TATATCCCTC 701 ACGTAGTTCT TGGTAAAGGT CCACCTGGTG GGGCTTGGCA TAATATGGAA 751 GGCTCCATGT TGACAATCAG CTTTGGAAGT TGGATGGAAC TACCTGGACT 801 TAAATTTAAG GACTGGGTAT CAAGTAAACG AAGGAGCCTA AAAGGGGATC 851 GAGTTATGCC AGAGGAAATA GCTCGCTACT ATAAACATTA TGTAAAAGTC 901 ATGGGTCTTC AGAAGAATTT CAGAGAGAAT ACTTACATAA CTTCCGTATC 951 AAGACTCTAC AGAGATCAAG ATGATGATGA TATTCAAGAC AGAGATATTT 1001 CAACAAAGCA TTTACAGATA GAGAAGTCAA ACTTTATCAA GAGAAACTGG 1051 GAAATTAGGG GTTATCAGCG AATAGCTGAT GGTTCTCATG TTCCCTTCTG 1101 CCTCTTTGCT GAGAATGTAG CGCTGGCAAC TGGAACGCTG GATTCTCCTG 1151 CCCATCTGGA AATTGAAGGG GAAGATTTTC CTTTTGTGTT TCATTCAATG 1201 CCTGAATTTG GAGCTGCTAT AAACAAAGGA AAGTTGCGTG GCAAAGTGGA 1251 TCCAGTGTTA ATTGTAGGTT CTGGGCTTAC TGCCGCTGAC GCAGTACTGT 1301 GTGCTTACAA CAGTAATATC CCTGTGATTC ATGTGTTTCG CAGACGAGTA 1351 ACTGATCCAA GCTTAATTTT CAAACAGCTT CCCAAAAAGC TGTATCCTGA 1401 ATATCATAAA GTCTATCATA TGATGTGTAC TCAGTCATAT TCTGTAGACT 1451 CAAATCTTTT ATCTGATTAT ACCAGCTTTC CCGAGCACCG TGTGCTTTCC 1501 TTTAAGTCGG ACATGAAATG TGTTCTCCAA AGCGTTTCTG GATTGAAGAA 1551 AATATTTAAG CTGTCTGCAG CAGTAGTATT GATAGGTTCT CATCCTAATC 1601 TGTCTTTTCT GAAGGATCAA GGGTGTTACC TAGGCCATAA GTCAAGCCAG 1651 CCAATCACAT GTAAGGGTAA TCCTGTGGAA ATAGATACAT ATACCTATGA 1701 GTGTATTAAA GAAGCCAACC TTTTTGCATT GGGTCCTTTG GTTGGAGACA 1751 ATTTTGTTCG ATTTTTAAAG GGAGGGGCGC TGGGTGTTAC ACGCTGTTTA 1801 GCTACAAGAC AGAAGAAAAA GCATTTGTTT GTTGAAAGAG GAGGAGGAGA 1851 TGGGATAGCT TAAAGCAAGT TTACAAGTAA TTAAAATGGA CAGTTTGCCA 1901 TTAAAGATTT TTAATAGTGG TTTTGCAGTG TACTGGCTTG AATTTTCTGG 1951 ACTTGAGTTA ACTGAAGGAG AGCCTCAAAC TATAGTAACT TCATTTTTAA 2001 AAGTTACTAG AATTTGGTAT CCTGATTTAT ATTGCAGTGT TTCAAAGGTG 2051 TCACTGTCAG ACAAATAGAA ACACTGCCAA CTTGGTGTAA CTTAAGCTTT 2101 CATTTAACTA AAACATTCTT TTCTTGCAAA ACTTATTTTT CATGATCATT 2151 TTTGGTTATT TATTATACTT GATTCCAAAA TAGTACAGCC TTGAATCTAT 2201 AAAACTGTGC AGTCATTATG CCAGAAATTA TCTTAAATAT ATAATGGGTC 2251 ACCTTGCTGT TCAAAGGGTG GTGCAAGGTC CTGCAGCATC TTACATCTGT 2301 AGCTTGTTAG AAATGTAAAC TCTCAGGCCC CACAACTTAC TTCCTGCATT 2351 TTAACAAGAT CCCCAAGGGA TATGTATGCT CATAAAAATT TGAGACACTG 2401 GTTTAAATGA AAATGGATAT AAGGTATGTA TAACTGGGGG TGGGGTGAGG 2451 GTAGGAGGCA TTTACAACTC AGATTTTATT TATTTTGAAA TTATCAATTG 2501 TATAAATCTA ATTTATTACC AAATAGGGTC TTTTAAAAAA TATTTTTATC 2551 GTTGAAACCT TGACAGGTAC TTCATATTCT TCTAATAATT TAAACAGTCC 2601 AATAATGTGG TATACACTTT GACATCCAAG AACTCACCAA GATGTTTTTC 2651 AGAGATTTAT TCTCGATTTA ACTATCATAG CATTTAATGA ATCTGATTTG 2701 TAGTTCAATA AATTGTGGGT TGAACTACTT ATCCCTGTGT GAACATTGAA 2751 TTACTTTCTG TCACTGAAAC TGAGGTATTT GGGTGTGGTA AGTACTTCGA 2801 AAATTGTAAT ACTGTTTGGG CATTGTCTAA ATTATTAAAG GTTAAAATAG 2851 AAAATAAAGT CAGAATTTTT CTTTTCCATT CCAAAGGTGT ACTTAGAGAT 2901 CTCTATTAGT ATTCATTCGA GATGACATAG CAGCTCATAT CATGGTTGTT 2951 TATTGGATTT ATCTGTTCTA ATTATATAAG TGTGTTTACT GTCTGTGTTT 3001 TCACACAAAC TGCTAGAATT TTTAATGTTA AGACGAAAAC ATCTGAAGTT 3051 CTCCATGGCA AATTGAATTT TTCAGTCATT TTCTTTTCTT TTTTTGGTAC 3101 AATTACTTCA TCTGGAATGT CTTCATTGAA CTCGTTATTC TATTTTTCTT 3151 AGAATTAAAA GTGGATTAAT GTGGGTTTTT CTGTTCATTT TATTGCAGTA 3201 TTAAATGCTT AAGCTTATTA GGACCATAAT TCACTTTAAA TATAATTGTA 3251 TAGAATATAT TTGCGTCGAT CAAATAATTG CTTCAGATGA ATTCTTAGAC 3301 TCTTGATAAT ATCACACCTA ATTTAACTTG ATTTTACAAG CTGTACAATC 3351 CAGTTTTAGT TTTCTATTGT GATAATAACT TTTTTCAAAC CAGTTTCACA 3401 TCTTAATGAA ATAACATTCT CTGACTGCAC TTGCTTCAGT ACTCTCTTGC 3451 CTGCCTGTTT TTGACCTCTG CATGAGTTGG ATTAGATGTT TTTCTTACTG 3501 TCACTTCTAA ATAGAAAATG ACAGTGTTAT AAAAAAGGGA AGGATAAAAC 3551 CTTTGACATC CCCTTGTGTC TCAAAAGTCC ACAGTTATTC AAACAATGGC 3601 TTTTTTTGTG ATGAGAGTAT TTGTTAAAAA AAAAAAAAGA CTTCAAGAAA 3651 AATAAAAGTT CAGTGGAGCT GCAAATAAAT CTGGTGAATA ATTTCATCTT 3701 TGGTAATCTC CCATTTCCTG AGTTCTTCCT CAATCCAAGC TGTCCTGTGT 3751 AGTATATAAC ATTTGGGCAT TTTCTCTGAT ATACTATACT CTCATGTTCT 3801 ATAAATTTCT GTCCCGTAAT TCTAACACTT TACATTTTTT CTTTGCTATC 3851 AGCTATAGCT ATTCATGGAA GGGAAGAATC ACTAAATACT TGTCTAGTTA 3901 TAGCATGATG TGAGCATCTC CTCCTTATCC CTCGATGCCT GGCTTGGTGT 3951 CTGGCAAACA GTCCATAATT AGCAGATGTT GAAAGACCGT TTACAAAGCA 4001 GAATTTGGGG ATTTAAAGTG CAATGATACA ACAAAAAGAT TTAATTACAG 4051 CTTCCAGTGT TTTGACTATG TGAACCATAT CCAACTACTT TTTTGAAAAT 4101 CTAGTTCTAT GTAATATATT TCTGTGGCAT CAAATTTTAG TTGATTGTAT 4151 TAGTCAATAG GAAGTGGTGG AAAATTTCTA AATAAATTCA ACTATTAAA // LOCUS HSU25676 825 bp mRNA PRI 20-JUL-1995 DEFINITION Human interleukin 2 (IL2) mRNA, complete cds. ACCESSION U25676 NID g847817 VERSION U25676.1 GI:847817 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 825) AUTHORS Xu,D., Wu,Y., Chen,J., Yu,L., Zhong,M., Hui,Y. and Qu,H. TITLE Expression of human IL-2 from gene transferred mouse melanoma cells and its effect on the growth of mouse melanoma JOURNAL Chung-Hua Min Kuo Wei Sheng Wu Chi Mien I Hsueh Tsa Chih 13 (No.2), 78-82 (1993) REFERENCE 2 (bases 1 to 825) AUTHORS Xu,L. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) Lin Xu, Institute of Biophysics, Academia Sinica, Dept. of Protein Engineering, 15 Datun Road, Chaoyang District, Beijng, Peoples Republic of China FEATURES Location/Qualifiers source 1. .825 /organism="Homo sapiens" /db_xref="taxon:9606" gene 48. .518 /gene="IL2" CDS 48. .518 /gene="IL2" /codon_start=1 /label=JC1074 /product="interleukin 2" /protein_id="AAA70092.1" /db_xref="PID:g847818" /db_xref="GI:847818" /translation="MYRMQLLSCIALILALVTNSAPTSSSTKKTKKTQLQLEHLLLDL QMILNGINNYKNPKLTRMLTFKFYMPKKATELKQLQCLEEELKPLEEVLNLAQSKNFH LRPRDLISNINVIVLELKGSETTFMCEYADETATIVEFLNRWITFCQSIISTLT" BASE COUNT 302 a 148 c 116 g 259 t ORIGIN 1 ATCACTCTCT TTAATCACTA CTCACATTAA CCTCAACTCC TGCCACAATG 51 TACAGGATGC AACTCCTGTC TTGCATTGCA CTAATTCTTG CACTTGTCAC 101 AAACAGTGCA CCTACTTCAA GTTCGACAAA GAAAACAAAG AAAACACAGC 151 TACAACTGGA GCATTTACTG CTGGATTTAC AGATGATTTT GAATGGAATT 201 AATAATTACA AGAATCCCAA ACTCACCAGG ATGCTCACAT TTAAGTTTTA 251 CATGCCCAAG AAGGCCACAG AACTGAAACA GCTTCAGTGT CTAGAAGAAG 301 AACTCAAACC TCTGGAGGAA GTGCTGAATT TAGCTCAAAG CAAAAACTTT 351 CACTTAAGAC CCAGGGACTT AATCAGCAAT ATCAACGTAA TAGTTCTGGA 401 ACTAAAGGGA TCTGAAACAA CATTCATGTG TGAATATGCA GATGAGACAG 451 CAACCATTGT AGAATTTCTG AACAGATGGA TTACCTTTTG TCAAAGCATC 501 ATCTCAACAC TAACTTGATA ATTAAGTGCT TCCCACTTAA AACATATCAG 551 GCCTTCTATT TATTTATTTA AATATTTAAA TTTTATATTT ATTGTTGAAT 601 GTATGGTTGC TACCTATTGT AACTATTATT CTTAATCTTA AAACTATAAA 651 TATGGATCTT TTATGATTCT TTTTGTAAGC CCTAGGGGCT CTAAAATGGT 701 TTACCTTATT TATCCCAAAA ATATTTATTA TTATGTTGAA TGTTAAATAT 751 AGTATCTATG TAGATTGGTT AGTAAAACTA TTTAATAAAT TTGATAAATA 801 TAAAAAAAAA AAACAAAAAA AAAAA // LOCUS HSU83192 3995 bp mRNA PRI 14-JUL-1998 DEFINITION Homo sapiens post-synaptic density protein 95 (PSD95) mRNA, complete cds. ACCESSION U83192 NID g3318652 VERSION U83192.1 GI:3318652 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3995) AUTHORS Stathakis,D.G., Hoover,K.B., You,Z. and Bryant,P.J. TITLE Human postsynaptic density-95 (PSD95): location of the gene (DLG4) and possible function in nonneural as well as in neural tissues JOURNAL Genomics 44 (1), 71-82 (1997) MEDLINE 97432822 REFERENCE 2 (bases 1 to 3995) AUTHORS Stathakis,D.G., Hoover,K.H., You,Z. and Bryant,P.J. TITLE Direct Submission JOURNAL Submitted (24-DEC-1996) Developmental Biology Center, University of California, Irvine, 4240 Biological Sciences II, Irvine, CA 92697-2275, USA REFERENCE 3 (bases 1 to 3995) AUTHORS Stathakis,D.G., Hoover,K.H., You,Z. and Bryant,P.J. TITLE Direct Submission JOURNAL Submitted (14-JUL-1998) Developmental Biology Center, University of California, Irvine, 4240 Biological Sciences II, Irvine, CA 92697-2275, USA REMARK Sequence update by submitter COMMENT On Jul 14, 1998 this sequence version replaced gi:1857478. FEATURES Location/Qualifiers source 1. .3995 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p13.1" /clone="P3-1" /tissue_type="mammary" /clone_lib="Clontech 5'-Stretch cDNA Library" gene 1. .3995 /gene="PSD95" misc_feature 61. .149 /gene="PSD95" /note="encodes PDZ1 domain" misc_feature 156. .244 /gene="PSD95" /note="encodes PDZ2 domain" misc_feature 309. .390 /gene="PSD95" /note="encodes PDZ3 domain" misc_feature 375. .429 /gene="PSD95" /note="encodes SH3 domain" misc_feature 532. .712 /gene="PSD95" /note="encodes GUK domain" CDS 860. .3163 /gene="PSD95" /note="similar to Rattus norvegicus PSD95/SAP90, GenBank Accession Number D50621; membrane associated putative guanylate kinase protein member" /codon_start=1 /product="post-synaptic density protein 95" /protein_id="AAC52113.1" /db_xref="PID:g3318653" /db_xref="GI:3318653" /translation="MSQRPRAPRSALWLLAPPLLRWAPPLLTVLHSDLFQALLDILDY YEASLSESQKYRYQDEDTPPLEHSPAHLPNQANSPPVIVNTDTLEAPGYELQVNGTEG EMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVNDSI LFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAEKVMEIKLIKGPKGLGFSIA GGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILAVNSVGLEDVMHEDAVAALK NTYDVVYLKVAKPSNAYLSDSYAPPDITTSYSQHLDNEISHSSYLGTDYPTAMTPTSP RRYSPVAKDLLGEEDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADL SGELRKGDQILSVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEEYSRFEAKIHDL REQLMNSSLGSGTASLRSNPKRGFYIRALFDYDKTKDCGFLSQALSFRFGDVLHVIDA SDEEWWQARRVHSDSETDDIGFIPSKRRVERREWSRLKAKDWGSSSGSQGREDSVLSY ETVTQMEVHYARPIIILGPTKDRANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDYH FVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILDVSANAVRRLQ AAHLHPIAIFIRPRSLENVLEINKRITEEQARKAFDRATKLEQEFTECFSAIVEGDSF EEIYHKVKRVIEDLSGPYIWVPARERL" BASE COUNT 901 a 1113 c 1160 g 821 t ORIGIN 1 GGATCCGCGG GACAGATGAG GAAGGGGCTT AAGTCACTGC AGCCAGAGGG 51 ATGGAGGTGG ACTGATGGGA GGGCTTCTCC GGTGGGGTTA GAAGGGAAAA 101 GTAGGGAAAG AGAAGTGTAA GGTAGATGGC AGAGGCAGAG ACATGGAAAG 151 ACAGACTCTA GGGTTCCTGA TGATATCTAT CTCGGCCAAC ACAAAAGGGA 201 GGGTACAGTG GTGGGGGCAC CCAAGCTAGG GTGTGAGTAC CCTAAGTGTA 251 TTCTTCTGAG ATGTAGGCCA TTCACTAACT CTTGGAACAG CTACAGTTTC 301 ACAGTAGGAA GACCCCCCCA GATTCACTGC CCCTCCCTTA GTAAAGCCTC 351 TGAGACCTTC CTGAACATTC CCTTCTGTCT TTGCCCTCTG TTCCTTCCAG 401 AGACTATGTG CCCAGGCAGA TGGATTCCTC CCGGGCCTGA GAGGAACTGC 451 AGGAATTCTC CTGCCTCTTA CCCGTAAAAC CCCAACTTCT CTAGCCCTAG 501 GGCAGGAAGT CCCAAACAAT TTCTACCCCT TTTTCTGCAA TTCTCATTGG 551 GGTGAGAGGA GGCCCAGGAG GAGAGAGAGC TGGGCTCAGC TTCTTTTTGA 601 GCTGCTGGAG CCCTCTGTGA GGAGGCCCTC TTTGCTGGCT TCTCAGGAGA 651 GTGTGGCTAG GTTCTGCCTG CCTATGGGAA GAGGGGGCCA GGGTGTGTGG 701 AGCAAGATGG TGCGGTGCTG GTGCCTTGGG ACCTGGGGGA ATGGGACAGC 751 TGGTCGGCTC AGAGACGGCC TACTTTACTC ACAGCTGGAA TTTAGTGGGG 801 AGAAGCAGCT CAACTCCAAT CCTGGAGGAT TAGGGAGATT AAAGTGAGAG 851 AAGAGAGAGA TGTCCCAGAG ACCAAGAGCT CCCAGGTCAG CCCTCTGGCT 901 CCTGGCACCC CCACTGCTGC GGTGGGCACC CCCACTCCTC ACAGTGCTGC 951 ATAGCGACCT CTTCCAGGCC TTGCTGGACA TCCTGGACTA TTATGAGGCT 1001 TCCCTCTCAG AGAGTCAGAA ATACCGCTAC CAAGATGAAG ACACGCCCCC 1051 TCTGGAGCAC AGCCCGGCCC ACCTCCCCAA CCAGGCCAAT TCTCCCCCAG 1101 TGATTGTCAA CACAGATACC CTAGAAGCCC CAGGATATGA GTTGCAGGTG 1151 AACGGGACCG AGGGGGAGAT GGAATACGAG GAAATCACAT TGGAAAGGGG 1201 TAACTCAGGT CTGGGCTTCA GCATCGCAGG TGGCACTGAC AACCCACACA 1251 TCGGTGACGA CCCATCCATT TTCATCACCA AGATCATTCC TGGTGGGGCT 1301 GCGGCCCAGG ATGGCCGCCT CAGGGTCAAC GACAGCATCC TGTTTGTAAA 1351 TGAAGTGGAC GTGCGCGAGG TGACCCACTC AGCGGCGGTG GAAGCCCTCA 1401 AAGAGGCAGG CTCCATCGTT CGCCTCTATG TCATGCGCCG GAAGCCCCCG 1451 GCTGAGAAGG TCATGGAGAT CAAGCTCATC AAGGGGCCTA AAGGTCTTGG 1501 CTTCAGCATC GCAGGGGGCG TAGGGAACCA GCACATCCCA GGAGATAATA 1551 GCATCTATGT AACAAAGATC ATCGAAGGGG GTGCTGCCCA CAAGGATGGG 1601 AGGTTGCAGA TTGGAGACAA GATCCTGGCG GTCAACAGTG TGGGGCTAGA 1651 GGACGTCATG CATGAAGATG CTGTGGCAGC CCTGAAGAAC ACGTATGATG 1701 TTGTCTACCT AAAGGTGGCC AAGCCCAGCA ATGCCTACCT GAGTGACAGC 1751 TATGCTCCCC CAGACATCAC AACCTCTTAT TCCCAGCACC TGGACAATGA 1801 GATCAGTCAC AGCAGCTACC TGGGCACCGA CTACCCCACA GCCATGACCC 1851 CCACTTCCCC TCGGCGCTAC TCTCCAGTGG CCAAGGACCT GCTCGGGGAG 1901 GAAGACATTC CCCGAGAACC GAGGCGAATT GTGATCCACC GGGGCTCCAC 1951 GGGCCTGGGC TTCAACATCG TGGGTGGCGA GGACGGTGAA GGCATCTTCA 2001 TCTCCTTTAT CCTGGCCGGG GGCCCTGCAG ACCTCAGTGG GGAGCTGCGG 2051 AAGGGGGACC AGATCCTGTC GGTCAACGGT GTGGACCTCC GAAATGCCAG 2101 CCATGAGCAG GCTGCCATTG CCCTGAAGAA TGCGGGTCAG ACGGTCACGA 2151 TCATCGCTCA GTATAAACCA GAAGAGTACA GCCGATTCGA GGCCAAGATC 2201 CACGACCTTC GGGAACAGCT CATGAACAGC AGCCTGGGCT CAGGGACTGC 2251 GTCCTTGCGG AGCAACCCCA AAAGGGGTTT CTACATCAGG GCCCTGTTTG 2301 ATTACGACAA GACCAAGGAC TGCGGCTTCC TGAGCCAGGC CCTGAGCTTC 2351 CGCTTTGGGG ATGTGCTGCA TGTCATCGAT GCTAGTGATG AGGAGTGGTG 2401 GCAGGCACGG CGGGTCCACT CTGACAGTGA GACCGACGAC ATTGGGTTCA 2451 TCCCCAGCAA ACGGCGGGTT GAGCGACGAG AGTGGTCAAG GTTAAAGGCC 2501 AAGGACTGGG GCTCCAGCTC TGGATCGCAG GGTCGAGAAG ACTCGGTTCT 2551 GAGCTACGAG ACAGTGACGC AGATGGAAGT GCACTATGCT CGCCCCATCA 2601 TCATCCTTGG GCCCACCAAG GACCGCGCCA ACGATGATCT TCTCTCCGAG 2651 TTCCCCGACA AGTTTGGATC CTGTGTTCCC CATACGACAC GGCCCAAGCG 2701 GGAGTATGAG ATAGATGGCC GGGATTACCA CTTTGTGTCG TCCCGGGAGA 2751 AAATGGAGAA GGACATTCAG GCGCACAAGT TCATTGAGGC CGGCCAGTAC 2801 AACAGCCACC TCTATGGGAC CAGCGTCCAG TCCGTGCGAG AGGTGGCAGA 2851 GCAGGGGAAG CACTGCATCC TCGATGTCTC GGCCAATGCC GTGCGGCGGC 2901 TGCAGGCGGC CCACCTGCAC CCCATCGCCA TCTTCATCCG CCCCCGCTCC 2951 CTGGAGAATG TGCTAGAGAT TAACAAGCGG ATCACAGAGG AGCAAGCCCG 3001 CAAAGCCTTC GACAGAGCCA CCAAGCTGGA GCAGGAGTTC ACAGAGTGCT 3051 TCTCAGCCAT CGTGGAGGGT GACAGCTTTG AGGAGATCTA CCACAAGGTG 3101 AAGCGTGTCA TCGAGGACCT CTCAGGCCCC TACATCTGGG TTCCAGCCCG 3151 AGAGAGACTC TGATTCCTGC CCTGGCTTGG CCTGGACTCG CCCTGCCTCC 3201 ATCACCTGGG CCCTTGGTCT GGACTGAATT GCCCAAGCCC TTGGCTCCCC 3251 CCGGCCTCCC TCCCACCCCT TCTTATTTAT TTCCTTTCTA ACTGGATCCA 3301 GCCTGTTGGA GGGGGGACAC TCCTCTGCAT GTATCCCCGC ACCCCAGAAC 3351 TGGGCTCCTG AACGCCAGGA ACCTGGGGTC TGGGGGGGAG CTGGGCTCCT 3401 TGTTCCGAGC CCTTGCTCCT TAGGATCCCC GCCCCCACCT GCCCCCAATG 3451 CACACACAGA CCCACCGGGG GCCACCTGCC CTCCCCCATC CTCTCCCACA 3501 CACATTCCAG AAGTCAGGGC CCCCTCGAGG AGCACCCGCT GCAGGGATGC 3551 AGGGCCACAG GCCTCCGCTC TCTCCTAAGG CAGGGTCTGG GGTCACCCCT 3601 GCCTCATCGT AATTCCCCAT GTTACCTTGA TTTCTCATTT ATTTTTTCCA 3651 CTTTTTTTCT TCTCAAAGGT GGTTTTTTGG GGGGAGAAGC AGGGGACTCC 3701 GCAGCGGGCC CCTGCCTTCC ACATGCCCCC ACCATTTTTC TTTGCCGGTT 3751 TGCATGAGTG GAAGGTCTAA ATGTGGCTTT TTTTTTTTTT TTCCTGGGAA 3801 TTTTTTTGGG GAAAAGGGAG GGATGGGTCT AGGGAGTGGG AAATGCGGGA 3851 GGGAGGGTGG GGCAGGGGTC GGGGGTCGGG TGTCCGGGAG CCAGGGAAGA 3901 CTGGAAATGC TGCCGCCTTC TGCAATTTAT TTATTTTTTT CTTTTGAGAG 3951 AGTGAAAGGA AGAGACAGAT ACTTGAAAAA AAAAAAAAAA AAAAA // LOCUS AF001437 2320 bp mRNA PRI 09-AUG-1997 DEFINITION Homo sapiens dihydrolipoamide dehydrogenase-binding protein mRNA, complete cds. ACCESSION AF001437 NID g2316039 VERSION AF001437.1 GI:2316039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Harris,R.A., Bowker-Kinley,M.M., Wu,P., Jeng,J. and Popov,K.M. TITLE Dihydrolipoamide dehydrogenase-binding protein of the human pyruvate dehydrogenase complex. DNA-derived amino acid sequence, expression, and reconstitution of the pyruvate dehydrogenase complex JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2320) AUTHORS Harris,R.A., Bowker-Kinley,M.M., Wu,P., Jeng,J. and Popov,K.M. TITLE Direct Submission JOURNAL Submitted (28-APR-1997) Biochemistry and Molecular Biology, Indiana University School of Medicine, 635 Barnhill Dr., Indianapolis, IN 46202-5122, USA FEATURES Location/Qualifiers source 1. .2320 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p12-p13" CDS 9. .1514 /note="E3-binding protein; E3BP; dihydrolipoamide dehydrogenase-binding protein of pyruvate dehydrogenase complex; protein X" /codon_start=1 /product="dihydrolipoamide dehydrogenase-binding protein" /protein_id="AAB66315.1" /db_xref="PID:g2316040" /db_xref="GI:2316040" /translation="MAASWRLGCDPRLLRYLVGFPGCRSVGLVKGALGWSVSRGANWR WFHSTQWLRGDPIKILMPSLSPTMEEGNIVKWLKKEGEAVSAGDALCEIETDKAVVTL DASDDGILAKIVVEEGSKNIRLGSLIGLIVEEGEDWKHVEIPKDVGPPPPVSKPSEPR PSPEPQISIPVKKEHIPGTLRFRLSPAARNILEKHSLDASQGTATGPRGIFTKEDALK LVQLKQTGKITESRPTPAPTATPTAPSPLQATSGPSYPRPVIPPVSTPGQPNAVGTFT EIPASNIRRVIAKRLTESKSTVPHAYATADCDLGAVLKVRQDLVKDDIKVSVNDFIIK AAAVTLKQMPDVNVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAAKGIQEIADS VKALSKKARDGKLLPEEYQGGSFSISNLGMFGIDEFTAVINPPQACILAVGRFRPVLK LTEDEEGNAKLQQRQLITVTMSSDSRVVDDELATRFLKSFKANLENPIRLA" BASE COUNT 730 a 446 c 522 g 622 t ORIGIN 1 GGCACGAGAT GGCGGCCTCC TGGAGGCTGG GCTGTGATCC GCGGCTGCTG 51 CGTTATCTTG TGGGCTTCCC TGGCTGCCGA AGCGTAGGGC TGGTGAAGGG 101 GGCTCTTGGG TGGTCCGTAA GCCGCGGAGC TAATTGGAGA TGGTTTCACA 151 GCACGCAGTG GCTTCGGGGT GATCCCATTA AGATACTAAT GCCATCACTG 201 TCTCCTACAA TGGAAGAAGG AAACATTGTG AAATGGCTGA AAAAGGAAGG 251 TGAAGCGGTG AGTGCTGGAG ATGCATTATG TGAAATTGAG ACTGACAAAG 301 CTGTGGTTAC CTTAGATGCA AGTGATGATG GAATCTTGGC CAAAATCGTG 351 GTTGAAGAAG GAAGTAAAAA TATACGGCTA GGTTCACTAA TTGGTTTGAT 401 AGTAGAAGAA GGAGAAGATT GGAAACATGT TGAAATTCCC AAAGACGTAG 451 GTCCTCCACC ACCAGTTTCA AAACCTTCAG AGCCTCGCCC CTCACCAGAA 501 CCACAGATTT CCATCCCTGT CAAGAAGGAA CACATACCCG GGACACTACG 551 GTTCCGTTTA AGTCCAGCTG CCCGCAATAT TCTGGAAAAA CACTCACTGG 601 ATGCTAGCCA GGGCACAGCC ACTGGCCCTC GGGGGATATT CACTAAAGAG 651 GATGCTCTCA AACTTGTCCA GTTGAAACAA ACGGGCAAGA TTACCGAGTC 701 CAGACCAACT CCAGCCCCCA CAGCCACTCC CACAGCACCT TCGCCCCTAC 751 AGGCCACATC TGGACCATCT TATCCCCGGC CTGTGATCCC ACCAGTATCA 801 ACTCCCGGAC AACCCAATGC AGTGGGCACA TTCACTGAAA TCCCCGCCAG 851 CAATATTCGA AGAGTTATTG CCAAGAGATT AACTGAATCT AAAAGTACTG 901 TACCTCATGC ATATGCTACT GCTGACTGTG ACCTTGGAGC TGTTTTAAAA 951 GTTAGGCAAG ATCTGGTCAA AGATGACATT AAAGTATCAG TAAATGATTT 1001 TATCATCAAG GCAGCAGCTG TTACCCTTAA ACAAATGCCA GATGTTAATG 1051 TAAGCTGGGA TGGAGAGGGC CCAAAGCAAC TGCCATTTAT TGACATTTCA 1101 GTGGCTGTGG CAACAGATAA AGGCTTACTT ACTCCAATCA TAAAAGATGC 1151 TGCTGCTAAA GGTATCCAGG AAATTGCTGA CTCTGTAAAG GCTCTATCAA 1201 AGAAAGCAAG AGATGGAAAA TTGTTGCCTG AAGAATACCA AGGAGGATCT 1251 TTTAGTATTT CCAACTTGGG GATGTTTGGC ATCGACGAAT TTACTGCAGT 1301 GATTAACCCT CCTCAGGCCT GCATTTTGGC GGTTGGGAGG TTCCGACCTG 1351 TGCTGAAGCT CACTGAGGAT GAAGAGGGAA ATGCCAAACT GCAGCAGCGC 1401 CAGCTCATAA CAGTCACAAT GTCAAGTGAC AGTCGAGTGG TTGATGACGA 1451 ACTGGCAACC AGGTTTCTTA AAAGTTTTAA AGCAAACCTA GAGAATCCTA 1501 TCCGACTTGC CTAGTCCTCA AAGATAAGAA GTTGGTGTTC AGCTTAGTTG 1551 ATTCAGTAGT TGTTACCAAG AAACATATGT TATAGGAAAA CAACTTGGTA 1601 TTTAAGTATG AAGTGGATGA AATGTTTATT TATTTAAGGT GAAAGCATTT 1651 GACCCAGGGT GTCTTCATCT TCAATTTGGG TTTAATGTTA TAGAAATAAA 1701 TGATGATAAA CTCTAACTAA TAAAGGAAAG AGAATATTTG GTTACTCAGA 1751 TCCATTTTTA ACCTCTGGTG CTGTATAAAG GGAATATTAA ACTAGATGTA 1801 AATCAAAGTA TATGTTTGGC TCATTTGAGC ATTTTGGAAT ATTTGAGAAT 1851 GTATGATACA TGTAAAATTA AAAAAACTAT TAGAACTGTA CCATAATTAT 1901 GTTGAAGGTA GAAGTGATCT TCAAAGAGAT GGCCATTAAC TTAGCAGTGG 1951 GACCTCACTT TTACAAGCAC TGCTCTAGAT ATACTTGAAG AATTTAATAG 2001 GTACAGAAGT TTATTCTGGA TAATAAATAA ATAAGGATCA CACTGTATTA 2051 GGGGTTATGG CAACATTATT GAATTTTTTA TGTACATAAA GCCATATGTT 2101 TAGGGTGGTT TCTATCTGTC TTGTTTTTCA CTTATATAAC ACTGTGAACT 2151 TCTAAAGCAA GAGGATAAAA GAAGCATGAA TGAAAAGAAT GACATTTCAA 2201 AAAAATGGTT CAATGAAAAA CTATAGCTAA AATATGTAAA CCTTTCTAGG 2251 TAAACCGCTT GCCTTCATCT TGAGTCGGAA TATATTTAAA TAAATTGTGT 2301 TATCTCTTGC CAAAAAAAAA // LOCUS AF093774 6184 bp mRNA PRI 14-DEC-1998 DEFINITION Homo sapiens type 2 iodothyronine deiodinase mRNA, complete cds and 3'UTR. ACCESSION AF093774 NID g4009516 VERSION AF093774.1 GI:4009516 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6184) AUTHORS Buettner,C., Harney,J.W. and Larsen,P.R. TITLE The 3'-untranslated region of human type 2 iodothyronine deiodinase mRNA contains a functional selenocysteine insertion sequence element JOURNAL J. Biol. Chem. 273 (50), 33374-33378 (1998) MEDLINE 99057897 REFERENCE 2 (bases 1 to 6184) AUTHORS Buettner,C., Harney,J.W. and Larsen,P.R. TITLE Direct Submission JOURNAL Submitted (23-SEP-1998) Medicine, Brigham and Women's Hospital, Harvard Medical School, 77 Ave Louis Pasteur, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1. .6184 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 135. .977 /note="selenocysteine" /codon_start=1 /transl_except=(pos:552. .554,aa:OTHER) /transl_except=(pos:951. .953,aa:OTHER) /product="type 2 iodothyronine deiodinase" /protein_id="AAC95470.1" /db_xref="PID:g4009517" /db_xref="GI:4009517" /translation="MTQEAEKMGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHV VLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQVKLGEDAPNSSVVHVSSTEGG DNSGNGTQEKIAEGATCHLLDFASPERPLVVNFGSATXPPFTSQLPAFRKLVEEFSSV ADFLLVYIDEAHPSDGWAIPGDSSLSFEVKKHQNQEDRCAAAQQLLERFSLPPQCRVV ADRMDNNANIAYGVAFERVCIVQRQKIAYLGGKGPFSYNLQEVRHWLEKNFSKRXKKT RLAG" 3'UTR 978. .6184 misc_feature 5868. .6000 /note="SECIS element" BASE COUNT 1877 a 1176 c 1319 g 1812 t ORIGIN 1 GGGCTTCCTT TTAATTTAGT TTTTTTTCCC CTTCTCCCCC AACCCCCAAC 51 CTTCCCCCTT ACCTCCCCCA CCCCCTTTAT CACCACCCCC CTTTTAAATA 101 AGAGGGTGAA GGGGAACCAG AGCGCACAAG GGAACTGACT CAGGAGGCAG 151 AGAAGATGGG CATCCTCAGC GTAGACTTGC TGATCACACT GCAAATTCTG 201 CCAGTTTTTT TCTCCAACTG CCTCTTCCTG GCTCTCTATG ACTCGGTCAT 251 TCTGCTCAAG CACGTGGTGC TGCTGTTGAG CCGCTCCAAG TCCACTCGCG 301 GAGAGTGGCG GCGCATGCTG ACCTCAGAGG GACTGCGCTG CGTCTGGAAG 351 AGCTTCCTCC TCGATGCCTA CAAACAGGTG AAATTGGGTG AGGATGCCCC 401 CAATTCCAGT GTGGTGCATG TCTCCAGTAC AGAAGGAGGT GACAACAGTG 451 GCAATGGTAC CCAGGAGAAG ATAGCTGAGG GAGCCACATG CCACCTTCTT 501 GACTTTGCCA GCCCTGAGCG CCCACTAGTG GTCAACTTTG GCTCAGCCAC 551 TTGACCTCCT TTCACGAGCC AGCTGCCAGC CTTCCGCAAA CTGGTGGAAG 601 AGTTCTCCTC AGTGGCTGAC TTCCTGCTGG TCTACATTGA TGAGGCTCAT 651 CCATCAGATG GCTGGGCGAT ACCGGGGGAC TCCTCTTTGT CTTTTGAGGT 701 GAAGAAGCAC CAGAACCAGG AAGATCGATG TGCAGCAGCC CAGCAGCTTC 751 TGGAGCGTTT CTCCTTGCCG CCCCAGTGCC GAGTTGTGGC TGACCGCATG 801 GACAATAACG CCAACATAGC TTACGGGGTA GCCTTTGAAC GTGTGTGCAT 851 TGTGCAGAGA CAGAAAATTG CTTATCTGGG AGGAAAGGGC CCCTTCTCCT 901 ACAACCTTCA AGAAGTCCGG CATTGGCTGG AGAAGAATTT CAGCAAGAGA 951 TGAAAGAAAA CTAGATTAGC TGGTTAAAGG TATGATTATA AGAGAGCTTA 1001 TTGTTTTAAA AAGTTATATA AAGGCAAGGA AATTAAGAAC TGAATCCATA 1051 TTTCAACAGA GCCCTATTGG CTTACTGAAA GACAGGAGTT TATCTATCGG 1101 AAGAACATGA ATCTCTAACA GCTCCATACT TCTTTCACTA CTCAAATGGC 1151 ATTGGGCTGA GTAAGTAACC ATATCACCTC TCTTCTTAGT AAAAAGCCCT 1201 ATGTGAAAAG ATCCCAAGAT GGAGAGGAAG AAACGCTAAT TCAGCATGTG 1251 TTCATTCTGC ATTGAGAAGG AACTGATACA TCTGATGCAT GCTTTGAGAC 1301 CAGAAGAAAA GACTTACCTG AATAATTACT ACATTAGGGA AGCTACTGTC 1351 TACGTTAAGA TAAAGGGTAT TGCCTTGGCT CTATTTGGCA TGGATGGAGC 1401 CCAGTTGGAA AATTCCCAAA TATTACAACA AGTCCTTGAA CCCAGGCCAT 1451 GTGGTTAGAC GTTGGTGTTA AGGTTAGACC TTATGTTAGA GTCATTTCTG 1501 ATGTTCCAGC TTCTAGCCAT GTAGTGCTCT CAGTCTTCAT ACCCCAGAAA 1551 TTATTGGTAT ATTTGTAGAT ACCGAGAATG ATCCCTCAGT CTGAGAGGTT 1601 AGAATGATCA TCTGTAATCT GAGGGTTAAT TTCTAGGCAG GTGGAGAGAG 1651 TGGTAAAAAA GAAATGAAAT TGACAAGCTA GGAAAGAGGA GGCAGAAAGA 1701 TTTGGAAAAT TCACAGAGTT TCACCCTTAA GCTGTAGAGA GTGGGTCACA 1751 TTTGTTAGCC ACGGAAACAT AGAAACATAC ACAAGGCCAG AAAAAGAAGA 1801 AGGAGCTCAA CTAAAAGTGG CATAGAGAAT ACACATATAA AAACAATATA 1851 TTTGTCATAT GCTCCTAGAG AGGAGAAAGG GGTGATTGAA AGAAAAAAAA 1901 ATACTTAAAT ATTTGTAATT GTGAGGGGTT TCTTTTGGAA ATAATTACTT 1951 TTGAACCATG TATGTGGTAT GTATATTTTC AGTGGGTTAA TTATACCCCA 2001 TGATACCTAT TAAAGGAAAA CCAGTGGGTC TGGTGGTGCT GGTCTTTTCC 2051 TCCCCATTCC TACAATTTCT ATGTGGCCCA AGTCATTCCT AATCTTGGTC 2101 TCTATAGCAG TGTTCTCTCT GAATGCTGAG CTGAAGAAAT TATACGTACA 2151 TACACACATA CATACATACA TACAAATATA TGTATATATA TTCTCAGCTG 2201 CTGCGGGAGG TAGGTACCAT GGCCATTCAG CACAGCCTTG ATTTCCTCCC 2251 AAAGTAGGTG AGCTATAGTG AAGAATAGGT GCAAACAAAC AAGCTTACTT 2301 CCATTGCAAA ATAGAAGAAG AGGAAGTTAG AGATAATTCT GATCAATCAT 2351 TTTGGAGGCT TTGTTATAAG GCAACCCCCG GTATATCATG GAATTTCCAT 2401 TGACATTTGA ATTTGGACTT GGATCTTCCC TTGGTCCCAT TAGCTGAGGT 2451 TTAGTAATCT AAAGTCCCTA TAGTATATGA TTATAATGCT ATTTTAAAAA 2501 ATATATATAT AAAATATTTT TTTCTTTTTA AAATAGACAC TATAGTTTTA 2551 CCCATAAGTA ATATTTAAAG ATTATAGCTC CCAAAAGAAT GGACCAACCA 2601 CTTTCGTATC ATAATTTCTT TTTGGTAAAT ATGAGACTAT TATGAAATCA 2651 TAGTATATGA TTGTATTTAA AGGTACAATC AAAGGATCTT TTGTCCATTC 2701 CATTAATAAC TGAATAAAAA ATAAATAAAA TGGATAGAAA AAAACTAAAG 2751 TTGAAAATAC ATTCTTAAAC TAGTTGTCTG AAATGAGAAA AGAGTGAGAA 2801 CTAGGTGTGC AAGAACCAAA CGTATTTTAT TTTATTTTTT AAATGGGAGC 2851 AACATATCAG TCGTGTCACC AGCTGGTATA TTGTGTAAAT ATTAAAGCTC 2901 CATTGGGACT GATTTTTCAT GGCAACATCA GCTTTCTAAT GTTCTAAATT 2951 CTATAAAAAC CACCCACAAA GAAACAAAGC AAATTTCATT ATCTAATGAG 3001 TTGCTGGAAA ATCATATTGA GAATAATTAT TTCAGATTCC TCAGTTGTTA 3051 ACTTCTACAT TCAAGGGCTT ATCTCTGCCC CCATTGATTT TTAACCTCAA 3101 AATGGTGTGA GATTTACTGT GGAACCCTAA AGCAGTAAAA TAAAAAACCT 3151 GGTTGCAGCA CATTCACACT GTTGTCCTTA AAATTCCCCT TTTTTCTCTA 3201 TGTACGATAA AGTAACAGTA TGTCAGATAA GCCGGTGGGG GGATGAGATT 3251 AGGCTGAGGC AGTGCTAGTC AACTGGGGGA AAAGGATGAT GGAAAAATCA 3301 CCCAGTTGTG CTATATTTTT AAAGAAGGAG GTCGTTTATG TGTGCAGACA 3351 ATTCTCCCTG AGGTTAGCCC AATGGAGAAA TGAAGCAGAG GAAGGAAACA 3401 TAGAAAGACA TGGGCTATCA GGGAGGAAGA TGTTCAATAG AACATGCAAG 3451 AATTTCTGGA AGAAAGGCTG TGGAAGGGCC AATGGAGAAA ATGAATGGAC 3501 AAAGCTCAGG AATCCCTACG CTATGTAGAA TGTTCTTGGT GTTATCAGGG 3551 TTAAGCCCTG TAATTATGTA ACCTATTTAT CGCAACATGA ATTTTTATGA 3601 TTTCTTGTGA TGTATTCTTT TATGAAATTA ACAAGAACTC ATTATTTTGA 3651 GGTAGAGGAA AATCAATGCT TTATCTGATA TGCTGAGAAA TTATTAGATT 3701 GCCAATACTC ATGTGCGTTT CATGTGTTTT ATAAGGTTTG TTCCTTTGAA 3751 GAATTGTAGT TCTTAGTCCC ACAGGGAAAT GTGTATCTAT TTATATATCA 3801 TAGTATAAAT CTATGATATA TTTATATCAT ATATAAAAGT CTGAGTTCTC 3851 TTTCTTAGTC CCTAATCATG TTTCTCCCAT AGGCTGTGTT TACATGGAGC 3901 TATCGGTTTA GCCTTTTAAG CTTCATTAGC TTGTCTATTA TTGAAATAGT 3951 TTCCAAGAAA TTTTAGATAT TATCATAACA TCTGGGTCTA CTCAAACACT 4001 TATTGTTTGA AAGACTTATG TCTTGGACCT ATCAAAAACT GACTTTATTT 4051 ATTGCTTAGT GAAAATACTA GTGGGATCAA CAATGATTTT CTTGAATGGG 4101 CATGAATGGA GATGCCCGCA CAGTAATGTA GAAATGTTTC ATACAGCTAT 4151 TAAAATGTAA CTGACCTCCT TAGAGGCAGA TTAGTAACTG TTCCTACTTT 4201 GTATAGCTAA GTGACAGTCA CTTAACTTAC ATGACTTTCT TTTTTCACAT 4251 TGGGTCTCTG GTCCTGTGTC TTCACCTCAT TTATAGCACG TCTCCTTGAT 4301 TTTTGGTAGT ATCAACTTCC CAGTGATCTG TTCAGTTAAG TTCTTCTCCC 4351 GTTAACCAGG AAGTGCTTAT TCTCTCATCA CAGTGGGAAG AATAGCCTAT 4401 TGTCTTTCAT TTTGCCTGAG TGTATTTTAC TATTTGGGCT CTGAAATAAA 4451 AATTATGAAA TATGGTGAGG TCACATGTTG GTGCTGCCTT GCTGCATAAA 4501 ATTCTAGGAG GGCAGGTTAG GAGACAGTTA TGTATGGCCT TTCGGGAAAA 4551 TTCAAAGGGT GGGATTACAA GGGTGTTCCT CAGGCATGCC CCTATGGGCC 4601 CTATGTGGAA GCAAGAAGAA TTGACTGATT TACAGGACTT CTCTTTATGT 4651 CAATCTTAAG AGGATGGATG AATCTGGACA TTTGTTCCAC CCGACCTCTG 4701 ACTGATGGTT TGGAAAATAA CTTTAATTAG GATCATATGA CCATTGAAAA 4751 AGGAAAAATG TAGACTCTGA CTTCCGTCCC ACTGAAGGAT TAATGAAAAC 4801 CTTTACTAGC ATTTAGAGCT TTTCAGAACA TCCCCACTGT CATGTGTCTC 4851 AGCAGTGGAG ACTGCAAGTA AGGCTTTTAA TTTTAGGAGG TTTTTTTTTT 4901 TTTTTTTTTT TCCCCTAAAT GGTATGGCCA AAAGTCAGAG TTAAAATATA 4951 TATAGTTAGA TTCCAACTTC CTCCTTCACT CTAAAAATAG AATCCAAACC 5001 CACTCTTCAT ATATGCTTCC AGAATGGGGC TTAAGTACCA ATCTCTGCTT 5051 TGCAATGGGC ACAATCTTGG TCATGTCCTG AGGCTCTCTA AGAAAAGAGA 5101 GGATCTAGGA TGGGAGAGCT AGAAAGTTGC TAACTGGGAA GAACAAGGCC 5151 CTGAGGGGTT GGTCTACCAA TCTGGGAAGA TTTGAAAACA AACTTCTCGC 5201 AACTGAAGGA AGGCTGAAGG CTGCTGCAAG TCATTGAGTG ACTTTAGGAT 5251 GAGCAAAACA TTGGGCCACT TCCTAATGCC CTATGTGTAT AGTACCAGAA 5301 GCAAGGTCTC AGACTTAACA GACCCAGCTC TGTTCCAAGG TGAGTCTGAA 5351 CCAATAGAAA GCAAACATGT GCAGATATCC AAACAAGACT GCTCATGCAA 5401 GTCGGGGCTG GCTACCCGTC TTAGGCAGCA ACAGCAGAGC TCCAGGGAGC 5451 TTATTCAATA TTTACTGAGA CTTCGAAGAC CCAGCAGATG TTTAATGAAG 5501 TCACTATTTT GGCTCAAACC CTCCACTTCT CCCCCTCCCC TCAAAAAGCC 5551 AACAGGTAAA CACATAAATG AAAGAAACCC ACAGAAGGGG ATGGGAAATA 5601 AAGAAAATTC TCTCAAGACT TCTCCAGGCC CATGTCACTG GTCAGCGTGG 5651 TTTTTATGTG TATTAGGATT GGGGGATGTG AAGAAATAAG TATCCAGTAC 5701 TTTATAACCA AAGCAATTAA ATGATATTGG GGTAGGGAAT GTTGGCCAGT 5751 TTTGTTTAGT TTTGCCATCA CATTGTCACC CAGACCTCAC CTAGCCCCAA 5801 GTAATCGGGC GCCCCGAAGA GGGAGACAGA GATGTGCCAG AGTTGACCCA 5851 GTGTGCGGAT GATAACTACT GACGAAAGAG TCATCGACCT CAGTTAGTGG 5901 TTGGATGTAG TCACATTAGT TTGCCTCTCC CCATCTTTGT CTCCCTGGCA 5951 AGGAGAATAT GCGGGACATG ATGCTAAGAG CCCTGGGTAA ATGTGGTGAG 6001 AATGCACGCG TGCATATGCT ACACATATGT GCTTCTCAGT TGCAGAAAAT 6051 GAACTGCTTT GGGAGATTAT CAGTAGAAAG AGTGTTATCA TATTGGTGCT 6101 GAGTGCTATG TGTGCTTATA CAATTTGTTC TTGTATTTTA ATAAACTTTG 6151 AATAAAAGAA TAAAAAAAAA AAAAAAAAAA AAAA // LOCUS HSU27655 2638 bp mRNA PRI 07-MAR-1996 DEFINITION Human RGP3 mRNA, complete cds. ACCESSION U27655 NID g1216368 VERSION U27655.1 GI:1216368 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2638) AUTHORS Druey,K.M., Blumer,K.J., Kang,V.H. and Kehrl,J.H. TITLE Inhibition of G-protein-mediated MAP kinase activation by a new mammalian gene family JOURNAL Nature 379 (6567), 742-746 (1996) MEDLINE 96178495 REFERENCE 2 (bases 1 to 2638) AUTHORS Druey,K. TITLE Direct Submission JOURNAL Submitted (25-MAY-1995) Kirk Druey, Intramural Research/NIAID/LIR, Rm 11B13, National Institutes of Health, 10 Center Drive, MSC 1876, Bethesda, MD 20892-1876, USA FEATURES Location/Qualifiers source 1. .2638 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphocytes" CDS 288. .1847 /codon_start=1 /product="RGP3" /protein_id="AAC50394.1" /db_xref="PID:g1216369" /db_xref="GI:1216369" /translation="MFETEADEKREMALEEGKGPGAEDSPPSKEPSPGQELPPGQDLP PNKDSPSGQEPAPSQEPLSSKDSATSEGSPPGPDAPPSKDVPPCQEPPPAQDLSPCQD LPAGQEPLPHQDPLLTKDLPAIQESPTRDLPPCQDLPPSQVSLPAKALTEDTMSSGDL LAATGDPPAAPRPAFVIPEVRLDSTYSQKAGAEQGCSGDEEDAEEAEEVEEGEEGEED EDEDTSDDNYGERSEAKRSSMIETGQGAEGGLSLRVQNSLRRRTHSEGSLLQEPRGPC FASDTTLHCSDGEGAASTWGMPSPSTLKKELGRNGGSMHHLSLFFTGHRKMSGADTVG DDDEASRKRKSKNLAKDMKNKLGIFRRRNESPGAPPAGKADKMMKSFKPTSEEALKWG ESLEKLLVHKYGLAVFQAFLRTEFSEENLEFWLACEDFKKVKSQSKMASKAKKIFAEY IAIQACKEVNLDSYTREHTKDNLQSVTRGCFDLAQKRIFGLMEKDSYPRFLRSDLYLD LINQKKMSPPL" BASE COUNT 593 a 801 c 780 g 464 t ORIGIN 1 GAGGAAAGGG GAAATGCGGC CCGCTCCCCA CTCAGTGCCA CTCTGTGCCA 51 CTCCGTGCCA GGCCCTGAGG GCACCCGGTT GCTGCTTCCT TCCGTCTTTC 101 CCCAAGGACT ATCAGAGATG CCAGCGTGAC CCCTGACACG TGTGTGCAGC 151 AGCCTGCAGC TGCCCCAAGC CATGGCTGAA CACTGACTCC CAGCTGTGGG 201 CTTCACCATT ACAGACTCCC CAGGGCTTCA AAGACTTCTC AGCTTCGAGC 251 ATGGCTTTTG GCTGTCAGGG CAGCTGTACA ATAGTGGATG TTTGAGACGG 301 AGGCAGATGA GAAGAGGGAG ATGGCCTTGG AGGAAGGGAA GGGGCCTGGT 351 GCCGAGGATT CCCCACCCAG CAAGGAGCCC TCTCCTGGCC AGGAGCTTCC 401 TCCAGGACAA GACCTTCCAC CCAACAAGGA CTCCCCTTCT GGGCAGGAAC 451 CCGCTCCCAG CCAAGAACCA CTGTCCAGCA AAGACTCAGC TACCTCTGAA 501 GGATCCCCTC CAGGCCCAGA TGCTCCGCCC AGCAAGGATG TGCCACCATG 551 CCAGGAACCC CCTCCAGCCC AAGACCTCTC ACCCTGCCAG GACCTACCTG 601 CTGGTCAAGA ACCCCTGCCT CACCAGGACC CTCTACTCAC CAAAGACCTC 651 CCTGCCATCC AGGAATCCCC CACCCGGGAC CTTCCACCCT GTCAAGATCT 701 GCCTCCTAGC CAGGTCTCCC TGCCAGCCAA GGCCCTTACT GAGGACACCA 751 TGAGCTCCGG GGACCTACTA GCAGCTACTG GGGACCCACC TGCGGCCCCC 801 AGGCCAGCCT TCGTGATCCC TGAGGTCCGG CTGGATAGCA CCTACAGCCA 851 GAAGGCAGGG GCAGAGCAGG GCTGCTCGGG AGATGAGGAG GATGCAGAAG 901 AGGCCGAGGA GGTGGAGGAG GGGGAGGAAG GGGAGGAGGA CGAGGATGAG 951 GACACCAGCG ATGACAACTA CGGAGAGCGC AGTGAGGCCA AGCGCAGCAG 1001 CATGATCGAG ACGGGCCAGG GGGCTGAGGG TGGCCTCTCA CTGCGTGTGC 1051 AGAACTCGCT GCGGCGCCGG ACGCACAGCG AGGGCAGCCT GCTGCAGGAG 1101 CCCCGAGGGC CCTGCTTTGC CTCCGACACC ACCTTGCACT GCTCAGACGG 1151 TGAGGGCGCC GCCTCCACCT GGGGCATGCC TTCGCCCAGC ACCCTCAAGA 1201 AAGAGCTGGG CCGCAATGGT GGCTCCATGC ACCACCTTTC CCTCTTCTTC 1251 ACAGGACACA GGAAGATGAG CGGGGCTGAC ACCGTTGGGG ATGATGACGA 1301 AGCCTCCCGG AAGAGAAAGA GCAAAAACCT AGCCAAGGAC ATGAAGAACA 1351 AGCTGGGGAT CTTCAGACGG CGGAATGAGT CCCCTGGAGC CCCTCCCGCG 1401 GGCAAGGCAG ACAAAATGAT GAAGTCATTC AAGCCCACCT CAGAGGAAGC 1451 CCTCAAGTGG GGCGAGTCCT TGGAGAAGCT GCTGGTTCAC AAATACGGGT 1501 TAGCAGTGTT CCAAGCCTTC CTTCGCACTG AGTTCAGTGA GGAGAATCTG 1551 GAGTTCTGGT TGGCTTGTGA GGACTTCAAG AAGGTCAAGT CACAGTCCAA 1601 GATGGCATCC AAGGCCAAGA AGATCTTTGC TGAATACATC GCGATCCAGG 1651 CATGCAAGGA GGTCAACCTG GACTCCTACA CGCGGGAGCA CACCAAGGAC 1701 AACCTGCAGA GCGTCACGCG GGGCTGCTTC GACCTGGCAC AGAAGCGCAT 1751 CTTCGGGCTC ATGGAAAAGG ACTCGTACCC TCGCTTTCTC CGTTCTGACC 1801 TCTACCTGGA CCTTATTAAC CAGAAGAAGA TGAGTCCCCC GCTTTAGGGG 1851 CCACTGGAGT CGAGCTCAGC GTTCACACCA GGCGGGCTGG GTCCCCTGCC 1901 CACCTGCCTC CCTGCCCCCT GTGACGGAGG GGGCAAGCAA GCCCCCAGAG 1951 GCCGTGTCTC TGGACAGACG GATAGACATA CGGAAGCGAG GCCTGGACCA 2001 AGAGAGGCCC AGGCTACTGG AGGAGTAGAA GGATGGGCCC CGTGGGGTCC 2051 CCACTGCCCC GGTACGAGGG GGCCCAAGAC CCTGGCAGGT CAGGGGCCCT 2101 GGCCAAGCCA GATCTGGAGC TGCTGCTCCC TGCTGCGGAG ACCGCGGAGG 2151 CTTCGCGTTG ACCAAGTTCC TTAAAGAACT GGCTGATGGG GCAGGAGGTC 2201 CAGGCCTGGG CTCTCGGGCC CTCCTAGAGG GCCATTGGAG CTTGCAGCTC 2251 AGACCCCCAC TTTGAGTTTT ATTTATTTAA ATAGTAGTTG GATGCTTGGC 2301 ACGTCGTCCT GTAATAGGAA ACCCTTGCCT CATCAGTTTT CCTGATTTAC 2351 AAGTGCAATA TTTTAGCCAA TGCCTTGGGA GAAGCTGCCA TGCAAAGGTG 2401 GACACCATTC TCCAGCTTCA GGGGATATGC TCGTCCCGGG CACCGGTGGC 2451 AGGCAGCTGG CCTTCTGGAC TAAGGCAGCC TGGGGGGACA CTGCAGTCTG 2501 GCTACACACA GAGATCTGGC ACCCCCTGGG TGGAGTGTCC CTCGGGGGCT 2551 TTGGGAAAGC ATGGCACCCT CAGACCACAC AGTAGCCAAG TTCTGGAGCA 2601 AATAAAAGGC CTGTGTTATT TCTTGTTCTT GAAAAAAA // LOCUS AF067170 2520 bp mRNA PRI 27-MAY-1999 DEFINITION Homo sapiens alpha endosulfine mRNA, complete cds. ACCESSION AF067170 NID g4894373 VERSION AF067170.1 GI:4894373 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 650) AUTHORS Zhang,Q., Fu,G., Wu,J., Zhou,J., Ye,M., Shen,Y., Kan,L., He,K., Gu,B., Chen,S., Mao,M. and Chen,Z. TITLE Human alpha endosulfine gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2520) AUTHORS Zhang,Q. TITLE Direct Submission JOURNAL Submitted (18-MAY-1998) Shanghai Institute of Hematology, Shanghai Second Medical University, Rui-Jin Hospital, 197 Rui-Jin Road II, Shanghai 200025, China FEATURES Location/Qualifiers source 1. .2520 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="CD34+ cell" CDS 110. .463 /codon_start=1 /product="alpha endosulfine" /protein_id="AAD32454.1" /db_xref="PID:g4894374" /db_xref="GI:4894374" /translation="MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSL GQKPGGSDFLMKRLQKGQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQD LPQRKSSLVTSKLAG" BASE COUNT 726 a 535 c 611 g 648 t ORIGIN 1 GGAGTGACAG GAGCCGAAGC AGCAGCGCAG GTTGTCCCCG TTTCCCCTCC 51 CCCTTCCCTT CTCCGGTTGC CTTCCCGGGC CCCTTACACT CCACAGTCCC 101 GGTCCCGCCA TGTCCCAGAA ACAAGAAGAA GAGAACCCTG CGGAGGAGAC 151 CGGCGAGGAG AAGCAGGACA CGCAGGAGAA AGAAGGTATT CTGCCTGAGA 201 GAGCTGAAGA GGCAAAGCTA AAGGCCAAAT ACCCAAGCCT AGGACAAAAG 251 CCTGGAGGCT CCGACTTCCT CATGAAGAGA CTCCAGAAAG GGCAAAAGTA 301 CTTTGACTCA GGAGACTACA ACATGGCCAA AGCCAAGATG AAGAATAAGC 351 AGCTGCCAAG TGCAGGACCA GACAAGAACC TGGTGACCGG TGATCACATC 401 CCCACCCCAC AGGATCTGCC CCAGAGAAAG TCCTCGCTCG TCACCAGCAA 451 GCTTGCGGGG TAACCTGAGC CCCCCTCTCC TCCCCTTCCT CAACCACTGG 501 ACGTTTATAT ATTATAGGCA GGGATGAAAT GGGCACCTAG TCAGATCTTC 551 TCAGCTTGCT AGCCAGAAAT GACTGTGATT CTGCTGGGGG CTGCTGAGAA 601 GGTAATGTAG GTTGAAAAGG GGCTCTAAGT TTATTTATTT CGTTAGATTG 651 ACACTTCCAC ACACTCCCTG TAGTCCAGGT AGGGCCCAGA AATGGGAAAG 701 GCTAGGATTG GATAATGCTG CAAATGCTTT TTTTTGTGTG AGAAACTGGA 751 GAGATGTGAT TTCTCCTTTT GGGAGAGAAT GTCCCAAAAT TGATTAGGCT 801 GAGCCTTGGG AATAGTTTGG CAGGTTTAAC ATCCCAAGGC TAACCTAACG 851 TAGTTGGGAA AGGTAGATTG AATGAGACAT GTTTTCTGTG CTTCTAAGTG 901 TTCTGTCCCT TAGGCTGCTA TTGCTTCATG TTTCCATTAT GGCAGGTTTA 951 GAGAATCCTT AAAAAGAAAA ATTGACTTGC TTGCCTAAAA CTACAGTGCC 1001 CCCTTAGCCT CCATTACTTA GTATCTCTTA CAGTTTGCTC TGGCTCTCAA 1051 ATAATATAAA GATTGATGAA CATTATTCAC AGAAAATGGG CACCTTCTCT 1101 CTCTCTTCCA GCATGTTGCT TTTGAAAGTA TCATGGGATA GTGGGACGAG 1151 CACTGGTTTA GGAGTCAAGA TATCTGGGTT CTCACTCTAC TCAGTCCCAG 1201 CAGAGCTGTT GTATAACCTT GATTAAGTCA TTTAGCCTCT CTGGATTTCT 1251 ATTTCCTCAT CTGTAAAATA GAGTTAAATG TATGTAAGAT TGATTCTGAT 1301 TCTGAAGTCT TAACTGCCAG CAGAAAAACT CCATACTGTT CATTGTAAAA 1351 CTAAAAGTGA GGAAGGCTCG TGGGTGGTGA ACCTCTGCTC TGTAATACTG 1401 GGAAAGTACT ACAGAGGGGA ACTATTTGAG GGATACATGA GGAGAGAGTA 1451 AATTGAGGTT TGGGGATTAT AAATTCAGGC AAGAGAACCT TATGGATATC 1501 ACAGTCTTGA GGGTCAAAAA AAAATACTTA GGAAGCTAGC CATGGAAGCT 1551 TCTTGCCTCT GACCCAGCCC ACTTTCCCAG CCTACCTCTG GGCCTTAGCT 1601 GCTAAAAAGC TTCTCTGGCA GCGGAGCTGC AGGCCTGAGG AAACATGCTC 1651 AGTCATGCAC ATGTGTTGAC CCATGTTTCA GATGCAGTCT GATGCCAGGT 1701 ATATTGTTAG GGAAAGAGGA AGAAGGGTAA ACTGAGCTAG CCCCTGGGCT 1751 GAGTACTCCT GCAGGGCTGG AGGAACAGGT TCTTAGAAGA TGATTCACCT 1801 TGGAGGAAGC TGGGGAAAGT GGTTAGGGAG GGAGAGGAGG AAGACTGAGA 1851 AATCTGCATC CCCAGAACTC AGCAACCCAG CTCTTTTATC ATGAGGAGAG 1901 GTAGCCATGC TGACTCATGA ACAGGTGGAG GAAAGGGGTT GCAGGTGACC 1951 AGCATCTTCA ATCTAGTTAC CACCAGCTTG GCTAGAATTT TATTGAACGC 2001 AACTTGCTAG TTATGACAGA GAGGACCAGG GTGAGAACTA ATGCTCAAGA 2051 GAGCAAGATT CTGTAATCCT CTTGCCCTCT CACATCACAA ATTCTGATAT 2101 GCCAAACATT TGGCCCAATG AGAATGACTG ATAGACTTGC TGATGTGCAT 2151 GTGGGGGGAT GAGGTGTGTA TATGCCTTTC TCCTGACTGG CCATTCAGTG 2201 TTCAGGCTGT CATTCAAATT GGATAGGAGA AAGTTGGTGA GGAAGGAATG 2251 GAGAGAGCAG GATCCCTGGA CTCCCTGGTC CCCAAACTCC TTGACTGTCA 2301 CTTGTAATTG TATATAATTT TTGTGTTTCT TGCTCCATTT CCTCATGCTG 2351 TCATCCTTAA AGTCCATCTG GGAAGGGGAA AATGCTGAAC ACCATTGTAT 2401 AGTTTCTTCA ACTGTCCCAG CCATGTTGTA CATAGATATG TCATGTTATT 2451 ATATATATAA ATATATATAT AATTTTGTAA AAAAAAAAAC AAAAAAAAAA 2501 AAAAAAAAAA AAAAAAAAAA // LOCUS AF013168 8600 bp mRNA PRI 17-AUG-1997 DEFINITION Homo sapiens hamartin (TSC1) mRNA, complete cds. ACCESSION AF013168 NID g2331280 VERSION AF013168.1 GI:2331280 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8600) AUTHORS van Slegtenhorst,M., de Hoogt,R., Hermans,C., Nellist,M., Janssen,B., Verhoef,S., Lindhout,D., van den Ouweland,A., Halley,D., Young,J., Burley,M., Jeremiah,S., Woodward,K., Nahmias,J., Fox,M., Ekong,R., Osborne,J., Wolfe,J., Povey,S., Snell,R.G., Cheadle,J.P., Jones,A.C., Tachataki,M., Ravine,D., Sampson,J.R., Reeve,M.P., Richardson,P., Wilmer,F., Munro,C., Hawkins,T.L., Sepp,T., Ali,J.B.M., Ward,S., Green,A.J., Yates,J.R.W., Kwiatkowska,J., Henske,E.P., Short,M.P., Haines,J.H., Jozwiak,S. and Kwiatkowski,D.J. TITLE Identification of the tuberous sclerosis gene TSC1 on chromosome 9q34 JOURNAL Science 277 (5327), 805-808 (1997) MEDLINE 97390505 REFERENCE 2 (bases 1 to 8600) AUTHORS van Slegtenhorst,M., Young,J., Halley,D., Povey,S., Sampson,J.R. and Kwiatkowski,D.J. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) Clinical Genetics, Erasmus University and University Hospital, Dr Molewaterplein 50, Rotterdam 3015GE, The Netherlands FEATURES Location/Qualifiers source 1. .8600 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34" gene 1. .8600 /gene="TSC1" CDS 222. .3716 /gene="TSC1" /function="tumor suppressor protein" /note="tuberous sclerosis complex 1 protein" /codon_start=1 /product="hamartin" /protein_id="AAC51674.1" /db_xref="PID:g2331281" /db_xref="GI:2331281" /translation="MAQQANVGELLAMLDSPMLGVRDDVTAVFKENLNSDRGPMLVNT LVDYYLETSSQPALHILTTLQEPHDKHLLDRINEYVGKAATRLSILSLLGHVIRLQPS WKHKLSQAPLLPSLLKCLKMDTDVVVLTTGVLVLITMLPMIPQSGKQHLLDFFDIFGR LSSWCLKKPGHVAEVYLVHLHASVYALFHRLYGMYPCNFVSFLRSHYSMKENLETFEE VVKPMMEHVRIHPELVTGSKDHELDPRRWKRLETHDVVIECAKISLDPTEASYEDGYS VSHQISARFPHRSADVTTSPYADTQNSYGCATSTPYSTSRLMLLNMPGQLPQTLSSPS TRLITEPPQATLWSPSMVCGMTTPPTSPGNVPPDLSHPYSKVFGTTAGGKGTPLGTPA TSPPPAPLCHSDDYVHISLPQATVTPPRKEERMDSARPCLHRQHHLLNDRGSEEPPGS KGSVTLSDLPGFLGDLASEEDSIEKDKEEAAISRELSEITTAEAEPVVPRGGFDSPFY RDSLPGSQRKTHSAASSSQGASVNPEPLHSSLDKLGPDTPKQAFTPIDLPCGSADESP AGDRECQTSLETSIFTPSPCKIPPPTRVGFGSGQPPPYDHLFEVALPKTAHHFVIRKT EELLKKAKGNTEEDGVPSTSPMEVLDRLIQQGADAHSKELNKLPLPSKSVDWTHFGGS PPSDEIRTLRDQLLLLHNQLLYERFKRQQHALRNRRLLRKVIKAAALEEHNAAMKDQL KLQEKDIQMWKVSLQKEQARYNQLQEQRDTMVTKLHSQIRQLQHDREEFYNQSQELQT KLEDCRNMIAELRIELKKANNKVCHTELLLSQVSQKLSNSESVQQQMEFLNRQLLVLG EVNELYLEQLQNKHSDTTKEVEMMKAAYRKELEKNRSHVLQQTQRLDTSQKRILELES HLAKKDHLLLEQKKYLEDVKLQARGQLQAAESRYEAQKRITQVFELEILDLYGRLEKD GLLKKLEEEKAEAAEAAEERLDCCNDGCSDSMVGHNEEASGHNGETKTPRPSSARGSS GSRGGGGSSSSSSELSTPEKPPHQRAGPFSSRWETTMGEASASIPTTVGSLPSSKSFL GMKARELFRNKSESQCDEDGMTSSLSESLKTELGKDLGVEAKIPLNLDGPHPSPPTPD SVGQLHIMDYNETHHEHS" misc_feature 2409. .3116 /gene="TSC1" /note="encodes a coiled-coil domain" BASE COUNT 2365 a 2010 c 2137 g 2087 t 1 others ORIGIN 1 GTGCTGTACG TCCAAGATGG CGGCGCCTGT AGGCTGGAGG GACTGTGAGG 51 TAAACAGCTG AGGGGGAGGA GACGGTGGTG ACCATGAAAG ACACCAGGTT 101 GACAGCACTG GAAACTGAAG TACCAGTTGT CGCTAGAACA GTTTGGTAGT 151 GGCCCCAATG AAGAACCTTC AGAACCTGTA GCACACGTCC TGGAGCCAGC 201 ACAGCGCCTT CGAGCGAGAG AATGGCCCAA CAAGCAAATG TCGGGGAGCT 251 TCTTGCCATG CTGGACTCCC CCATGCTGGG TGTGCGGGAC GACGTGACAG 301 CTGTCTTTAA AGAGAACCTC AATTCTGACC GTGGCCCTAT GCTTGTAAAC 351 ACCTTGGTGG ATTATTACCT GGAAACCAGC TCTCAGCCGG CATTGCACAT 401 CCTGACCACC TTGCAAGAGC CACATGACAA GCACCTCTTG GACAGGATTA 451 ACGAATATGT GGGCAAAGCC GCCACTCGTT TATCCATCCT CTCGTTACTG 501 GGTCATGTCA TAAGACTGCA GCCATCTTGG AAGCATAAGC TCTCTCAAGC 551 ACCTCTTTTG CCTTCTTTAC TAAAATGTCT CAAGATGGAC ACTGACGTCG 601 TTGTCCTCAC AACAGGCGTC TTGGTGTTGA TAACCATGCT ACCAATGATT 651 CCACAGTCTG GGAAACAGCA TCTTCTTGAT TTCTTTGACA TTTTTGGCCG 701 TCTGTCATCA TGGTGCCTGA AGAAACCAGG CCACGTGGCG GAAGTCTATC 751 TCGTCCATCT CCATGCCAGT GTGTACGCAC TCTTTCATCG CCTTTATGGA 801 ATGTACCCTT GCAACTTCGT CTCCTTTTTG CGTTCTCATT ACAGTATGAA 851 AGAAAACCTG GAGACTTTTG AAGAAGTGGT CAAGCCAATG ATGGAGCATG 901 TGCGAATTCA TCCGGAATTA GTGACTGGAT CCAAGGACCA TGAACTGGAC 951 CCTCGAAGGT GGAAGAGATT AGAAACTCAT GATGTTGTGA TCGAGTGTGC 1001 CAAAATCTCT CTGGATCCCA CAGAAGCCTC ATATGAAGAT GGCTATTCTG 1051 TGTCTCACCA AATCTCAGCC CGCTTTCCTC ATCGTTCAGC CGATGTCACC 1101 ACCAGCCCTT ATGCTGACAC ACAGAATAGC TATGGGTGTG CTACTTCTAC 1151 CCCTTACTCC ACGTCTCGGC TGATGTTGTT AAATATGCCA GGGCAGCTAC 1201 CTCAGACTCT GAGTTCCCCA TCGACACGGC TGATAACTGA ACCACCACAA 1251 GCTACTCTTT GGAGCCCATC TATGGTTTGT GGTATGACCA CTCCTCCAAC 1301 TTCTCCTGGA AATGTCCCAC CTGATCTGTC ACACCCTTAC AGTAAAGTCT 1351 TTGGTACAAC TGCAGGTGGA AAAGGAACTC CTCTGGGAAC CCCAGCAACC 1401 TCTCCTCCTC CAGCCCCACT CTGTCATTCG GATGACTACG TGCACATTTC 1451 ACTCCCCCAG GCCACAGTCA CACCCCCCAG GAAGGAAGAG AGAATGGATT 1501 CTGCAAGACC ATGTCTACAC AGACAACACC ATCTTCTGAA TGACAGAGGA 1551 TCAGAAGAGC CACCTGGCAG CAAAGGTTCT GTCACTCTAA GTGATCTTCC 1601 AGGGTTTTTA GGTGATCTGG CCTCTGAAGA AGATAGTATT GAAAAAGATA 1651 AAGAAGAAGC TGCAATATCT AGAGAACTTT CTGAGATCAC CACAGCAGAG 1701 GCAGAGCCTG TGGTTCCTCG AGGAGGCTTT GACTCTCCCT TTTACCGAGA 1751 CAGTCTCCCA GGTTCTCAGC GGAAGACCCA CTCGGCAGCC TCCAGTTCTC 1801 AGGGCGCCAG CGTGAACCCT GAGCCTTTAC ACTCCTCCCT GGACAAGCTT 1851 GGGCCTGACA CACCAAAGCA AGCCTTTACT CCCATAGACC TGCCCTGCGG 1901 CAGTGCTGAT GAAAGCCCTG CGGGAGACAG GGAATGCCAG ACTTCTTTGG 1951 AGACCAGTAT CTTCACTCCC AGTCCTTGTA AAATTCCACC TCCGACGAGA 2001 GTGGGCTTTG GAAGCGGGCA GCCTCCCCCG TATGATCATC TTTTTGAGGT 2051 GGCATTGCCA AAGACAGCCC ATCATTTTGT CATCAGGAAG ACTGAGGAGC 2101 TGTTAAAGAA AGCAAAAGGA AACACAGAGG AAGATGGTGT GCCCTCTACC 2151 TCCCCAATGG AAGTGCTGGA CAGACTGATA CAGCAGGGAG CAGACGCGCA 2201 CAGCAAGGAG CTGAACAAGT TGCCTTTACC CAGCAAGTCT GTCGACTGGA 2251 CCCACTTTGG AGGCTCTCCT CCTTCAGATG AGATCCGCAC CCTCCGAGAC 2301 CAGTTGCTTT TACTGCACAA CCAGTTACTC TATGAGCGTT TTAAGAGGCA 2351 GCAGCATGCC CTCCGGAACA GGCGGCTCCT CCGCAAGGTG ATCAAAGCAG 2401 CAGCTCTGGA GGAACATAAT GCTGCCATGA AAGATCAGTT GAAGTTACAA 2451 GAGAAGGACA TCCAGATGTG GAAGGTTAGT CTGCAGAAAG AACAAGCTAG 2501 ATACAATCAG CTCCAGGAGC AGCGTGACAC TATGGTAACC AAGCTCCACA 2551 GCCAGATCAG ACAGCTGCAG CATGACCGAG AGGAATTCTA CAACCAGAGC 2601 CAGGAATTAC AGACGAAGCT GGAGGACTGC AGGAACATGA TTGCGGAGCT 2651 GCGGATAGAA CTGAAGAAGG CCAACAACAA GGTGTGTCAC ACTGAGCTGC 2701 TGCTCAGTCA GGTTTCCCAA AAGCTCTCAA ACAGTGAGTC GGTCCAGCAG 2751 CAGATGGAGT TCTTGAACAG GCAGCTGTTG GTTCTTGGGG AGGTCAACGA 2801 GCTCTATTTG GAACAACTGC AGAACAAGCA CTCAGATACC ACAAAGGAAG 2851 TAGAAATGAT GAAAGCCGCC TATCGGAAAG AGCTAGAAAA AAACAGAAGC 2901 CATGTTCTCC AGCAGACTCA GAGGCTTGAT ACCTCCCAAA AACGGATTTT 2951 GGAACTGGAA TCTCACCTGG CCAAGAAAGA CCACCTTCTT TTGGAACAGA 3001 AGAAATATCT AGAGGATGTC AAACTCCAGG CAAGAGGACA GCTGCAGGCC 3051 GCAGAGAGCA GGTATGAGGC TCAGAAAAGG ATAACCCAGG TGTTTGAATT 3101 GGAGATCTTA GATTTATATG GCAGGTTGGA GAAAGATGGC CTCCTGAAAA 3151 AACTTGAAGA AGAAAAAGCA GAAGCAGCTG AAGCAGCAGA AGAAAGGCTT 3201 GACTGTTGTA ATGACGGGTG CTCAGATTCC ATGGTAGGGC ACAATGAAGA 3251 GGCATCTGGC CACAACGGTG AGACCAAGAC CCCCAGGCCC AGCAGCGCCC 3301 GGGGCAGTAG TGGAAGCAGA GGTGGTGGAG GCAGCAGCAG CAGCAGCAGC 3351 GAGCTTTCTA CCCCAGAGAA ACCCCCACAC CAGAGGGCAG GCCCATTCAG 3401 CAGTCGGTGG GAGACGACTA TGGGAGAAGC GTCTGCCAGC ATCCCCACCA 3451 CTGTGGGCTC ACTTCCCAGT TCAAAAAGCT TCCTGGGTAT GAAGGCTCGA 3501 GAGTTATTTC GTAATAAGAG CGAGAGCCAG TGTGATGAGG ACGGCATGAC 3551 CAGTAGCCTT TCTGAGAGCC TAAAGACAGA ACTGGGCAAA GACTTGGGTG 3601 TGGAAGCCAA GATTCCCCTG AACCTAGATG GCCCTCACCC GTCTCCCCCG 3651 ACCCCGGACA GTGTTGGACA GCTACATATC ATGGACTACA ATGAGACTCA 3701 TCATGAACAC AGCTAAGGAA TGATGGTCAA TCAGTGTTAA CTTGCATATT 3751 GTTGGCACAG AACAGGAGGT GTGAATGCAC GTTTCAAAGC TTTCCTGTTT 3801 CCAGGGTCTG AGTGCAAGTT CATGTGTGGA AATGGGACGG AGGTCCTTTG 3851 GACAGCTGAC TGAATGCAGA ACGGTTTTTG GATCTGGCAT TGAAATGCCT 3901 CTTGACCTTC CCCTCCACCC GCCCTAACCC CCTCTCATTT ACCTCGCAGT 3951 GTGTTCTAAT CCAAGGGCCA GTTGGTGTTC CTCAGTAGCT TTACTTTCTT 4001 CCTTCCCCCC CAAATGGTTG CGTCCTTTGA ACCTGTGCAA TATGAGGCCA 4051 AATTTAATCT TTGAGTCTAA CACACCACTT TCTGCTTTCC CGAAGTTCAG 4101 ATAACTGGGT TGGCTCTCAA TTAGACCAGG TAGTTTGTTG CATTGCAGGT 4151 AAGTCTGGTT TTGTCCCTTC CAGGAGGACA TAGCCTGCAA AGCTGGTTGT 4201 CTTTACATGA AAGCGTTTAC ATGAGACTTT CCGACTGCTT TTTTGATTCT 4251 GAAGTTCAGC ATCTAAAGCA GCAGGTCTAG AAGAACAACG GTTTATTCAT 4301 ACTTGCATTC TTTTGGCAGT TCTGATAAGC TTCCTAGAAA GTTCTGTGTA 4351 AACAGAAGCC TGTTTCAGAA ATCTGGAGCT GGCACTGTGG AGACCACACA 4401 CCCTTTGGGA AAGCTCTTGT CTCTTCTTCC CCCACTACCT CTTATTTATT 4451 TGGTGTTTGC TTGAATGCTG GTACTATTGT GACCACAGGC TGGTGTGTAG 4501 GTGGTAAAAC CTGTTCTCCA TAGGAGGGAA GGAGCAGTCA CTGGGAGAGG 4551 TTACCCGAGA AGCACTTGAG CATGAGGAAC TGCACCTTTA GGCCATCTCA 4601 GCTTGCTGGG CCTTTTGTTA AACCCTTCTG TCTACTGGCC TCCCTTTGTG 4651 TGCATACGCC TCTTGTTCAT GTCAGCTTAT ATGTGACACT GCAGCAGAAA 4701 GGCTCTGAAG GTCCAAAGAG TTTCTGCAAA GTGTATGTGA CCATCATTTC 4751 CCAGGCCATT AGGGTTGCCT CACTGTAGCA GGTTCTAGGC TACCAGAAGA 4801 GGGGCAGCTT TTTCATACCA ATTCCAACTT TCAGGGGCTG ACTCTCCAGG 4851 GAGCTGATGT CATCACACTC TCCATGTTAG TAATGGCAGA GCAGTCTAAA 4901 CAGAGTCCGG GAGAATGCTG GCAAAGGCTG GCTGTGTATA CCCACTAGGC 4951 TGCCCCACGT GCTCCCGAGA GATGACACTA GTCAGAAAAG TGGCAGTGGC 5001 AGAGAATCCA AACTCAACAA GTGCTCCTGA AAGAAATGCT AGAAGCCTAA 5051 GAACTGTGGT CTGGTGTTCC AGCTGAGGCA GGGGGATTTG GTAGGAAGGA 5101 GCCAGTGAAC TTGGCTTTCC TGTTTCTATC TTTCATTAAA AAGAATAGAA 5151 GGATTCAGTC ATAAAGAGGT AAAAAACTGT CACGGTACGA AATCTTAGTG 5201 CCTACGGAGG CCTCGAGCAG AAAGAATGAA AGTCTTTTTT TTTTTTTTTT 5251 TTTTTTAGCA TGGCAATAAA TATTCTAGCA TCCCTAACTA AAGGGGACTA 5301 GACAGTTAGA GACTCTGTCA CCCTAGCTAT ACCAGCAGAA AACCTGTTCA 5351 GGCAGGCTTT CTGGGTGTGA CTGATTCCCA GCCTGTGGCA GGGCGTGGTC 5401 CCAACTACTC AGCCTAGCAC AGGCTGGCAG TTGGTACTGA ATTGTCAGAT 5451 GTGGAGTATT AGTGACACCA CACATTTAAT TCAGCTTTGT CCAAAGGAAA 5501 GCTTAAAACC CAATACAGTC TAGTTTCCTG GTTCCGTTTT AGAAAAGGAA 5551 AACGTGAACA AACTTAGAAA GGGAAGGAAA TCCCATCAGT GAATCCTGAA 5601 ACTGGTTTTA AGTGCTTTCC TTCTCCTCAT GCCCAAGAGA TCTGTGCCAT 5651 AGAACAAGAT ACCAGGCACT TAAAGCCTTT TCCTGAATTG GAAAGGAAAA 5701 GAGGCCCAAG TGCAAAAGAA AAAACATTTT AGAAACGGAC AGCTTATAAA 5751 AATAAAGGGA AGAAAGGAGG CAGCATGGAG AGAGGCCTGT GCTAGAAGCT 5801 CCATGGACGT GTCTGCACAG GGTCCTCAGC TCATCCATGC GGCCTGGGTG 5851 TCCTTTTACT CAGCTTTATA ACAAATGTGG CTCCAAGCTC AGGTGCCTTT 5901 GAGTTCTAGG AGGCTGTGGG TTTTATTCAA CTACGGTTGG GAGAATGAGA 5951 CCTGGAGTCA TGTTGAAGGT GCCCAACCTA AAAATGTAGG CTTTCATGTT 6001 GCAAAGAACT CCAGAGTCAG TAGTTAGGTT TGGTTTGGTT TTGGACATGA 6051 TAAACCTGCC AAGAGTCAAC AGGTCACTTG ATCATGCTGC AGTGGGTAGT 6101 TCTAAGGATG GAAAGGTGAC AGTATTACTC TCGAGAGGCA ATTCAGTCCT 6151 GGGCAAAGGT ATTAGTACAA TAAGCGTTAA GGGCAGAGTC TACCTTGAAA 6201 CCAATTAAGC AGCTTGGTAT TCATAAATAT TGGGATTGGA TGGCCTCCAT 6251 CCAGAAATCA CTATGGGTGA GCATACCTGT CTCAGCTGTT TGGCCAATGT 6301 GCATAACCTA CTCGGATCCC CACCTGACAC TAACCAGAGT CAGCACAGGC 6351 CCCGAGGAGC CCGAAGTTCT CTGCTGTGCA GCATGGAATT CCTTTAAAAA 6401 GGTGCACTAC AGTTTTAGCG GGGAGGGGGA TAGGAAGACG CAGAGCAAAT 6451 GAGCTCCGGA GTCCCTGCAG GTGAATAAAC ACACAGATCT GCATCTGATA 6501 GAACTTTGAT GGATTTTCAA AAAGCCGTTG ACAAGGCTCT GCTATACAGT 6551 CTATAAAAAT TGTTATTATG GGATTGGAAG AAACACATGG TCATGAATAG 6601 AAAAAAAACA AACCCAAAGG TAGGAAGGTC AAGGTCATTT CTTAGATGGA 6651 GAAGTTGTGA AAGATGTCCT TGGAGATGAG TTTTAGGACC AGCATTACTA 6701 AGGCAGGTGG GCAGACAGTG ACCTCTCTAG GTGTGTCCAC AGAGTTTTTC 6751 AGGAGAGAAA ACTGCCTGAC CTTTGGGACT AAGCTGCGGA ATCTTCTTAC 6801 TAAGCTTGAA GAGTGGAGAG GCGAGAGGTG AGCTACTTTG TGAGCCAAAG 6851 CTTATGTGAC ATGGTTGGGG AAACAGTCCA AACTGTTCTG AGAAGGTGAA 6901 CTGTTACGAC CCAGGACAAT TAGAAAAATT CACCCACCAT GCCGCACATT 6951 ACTGGGTAAA AGCAGGGCAG CAGGGAACAA AACTCCAGAC TCTTGGGCCG 7001 TCCCCATTTG CAACAGCACA CATAGTTTCT GGTATATTTG TTGGGAAAGA 7051 TAAAACTCTA GCAGTTGTTG AGGGGAGGAT GTATAAAATG GTCATGGGGA 7101 TGAAAGGATC TCTGAGACCA CAGAGGCTCA GACTCACTGT TAAGAATAGA 7151 AAACTGGGTA TGCGTTTCAT GTAGCCAGCA GAACTGAAGT GTGCTGTGAC 7201 AAGCCAATGT GAATTTCTAC CAAATAGTAG AGCATACCAC TTGAAGAAGG 7251 AAAGAACCGA AGAGCAAACA AAAGTTCTGC GTAATGAGAC TCACCTTTTC 7301 TCGCTGAAAG CACTAAGAGG TGGGAGGAGG CCTGCACAGG CTGGAGGAGG 7351 GTTTGGGCAG AGCGAAGACC CGGCCAGGAC CTTGGTGAGA TGGAGTGCCG 7401 CCCACCTCCT GCGGATACTC TTGGAGAGTT GTTCCCCCAG GGGNCTCTGC 7451 CCCACCTGGA GAAGGAAGCT GCCTGGTGTG GAGTGACTCA AATCAGTATA 7501 CCTATCTGCT GCACCTTCAC TCTCCAGGGT ACATGCTTTA AAACCGACCC 7551 GCAACAAGTA TTGGAAAAAT GTATCCAGTC TGAAGATGTT TGTGTATCTG 7601 TTTACATCCA GAGTTCTGTG ACACATGCCC CCCAGATTGC TGCAAAGATC 7651 CCAAGGCATT GATTGCACTT GATTAAGCTT TTGTCTGTAG GTGAAAGAAC 7701 AAGTTTAGGT CGAGGACTGG CCCCTAGGCT GCTGCTGTGA CCCTTGTCCC 7751 ATGTGGCTTG TTTGCCTGTC CGGGACTCTT CGATGTGCCC AGGGGAGCGT 7801 GTTCCTGTCT CTTCCATGCC GTCCTGCAGT CCTTATCTGC TCGCCTGAGG 7851 GAAGAGTAGC TGTAGCTACA AGGGAAGCCT GCCTGGAAGA GCCGAGCACC 7901 TGTGCCCATG GCTTCTGGTC ATGAAACGAG TTAATGATGG CAGAGGAGCT 7951 TCCTCCCCAC TTCGCAGCGC CACATTATCC ATCCTCTGAG ATAAGTAGGC 8001 TGGTTTAACC ATTGGAATGG ACCTTTCAGT GGAAACCCTG AGAGTCTGAG 8051 AACCCCCAGA CCAACCCTTC CCTCCCTTTC CCCACCTCTT ACAGTGTTTG 8101 GACAGGAGGG TATGGTGCTG CTCTGTGTAG CAAGTACTTT GGCTTATGAA 8151 AGAGGCAGCC ACGCATTTTG CACTAGGAAG AATCAGTAAT CACTTTTCAG 8201 AAGACTTCTA TGGACCACAA ATATATTACG GAGGAACAGA TTTTGCTAAG 8251 ACATAATCTA GTTTTATAAC TCAATCATGA ATGAACCATG TGTGGCAAAC 8301 TTGCAGTTTA AAGGGGTCCC ATCAGTGAAA GAAACTGATT TTTTTTAACG 8351 GACTGCTTTT AGTTAAATTG AAGAAAGTCA GCTCTTGTCA AAAGGTCTAA 8401 ACTTTCCCGC CTCAATCCTA AAAGCATGTC AACAATCCAC ATCAGATGCC 8451 ATAAATATGA ACTGCAGGAT AAAATGGTAC AATCTTAGTG AATGGGAATT 8501 GGAATCAAAA GAGTTTGCTG TCCTTCTTAG AATGTTCTAA AATGTCAAGG 8551 CAGTTGCTTG TGTTTAACTG TGAACAAATA AAAATTTATT GTTTTGCACT // LOCUS HUMA20 4426 bp mRNA PRI 30-OCT-1994 DEFINITION Human tumor necrosis factor alpha inducible protein A20 mRNA, complete cds. ACCESSION M59465 J05610 NID g177865 VERSION M59465.1 GI:177865 KEYWORDS . SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4426) AUTHORS Opipari,A.W. Jr., Boguski,M.S. and Dixit,V.M. TITLE The A20 cDNA induced by tumor necrosis factor alpha encodes a novel type of zinc finger protein JOURNAL J. Biol. Chem. 265 (25), 14705-14708 (1990) MEDLINE 90368626 FEATURES Location/Qualifiers source 1. .4426 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 67. .2439 /gene="TNFAIP1" CDS 67. .2439 /gene="TNFAIP1" /note="tumor necrosis factor alpha inducible protein" /codon_start=1 /db_xref="GDB:G00-127-514" /product="A20" /protein_id="AAA51550.1" /db_xref="PID:g177866" /db_xref="GI:177866" /translation="MAEQVLPQALYLSNMRKAVKIRERTPEDIFKPTNGIIHHFKTMH RYTLEMFRTCQFCPQFREIIHKALIDRNIQATLESQKKLNWCREVRKLVALKTNGDGN CLMHATSQYMWGVQDTDLVLRKALFSTLKETDTRNFKFRWQLESLKSQEFVETGLCYD TRNWNDEWDNLIKMASTDTPMARSGLQYNSLEEIHIFVLCNILRRPIIVISDKMLRSL ESGSNFAPLKVGGIYLPLHWPAQECYRYPIVLGYDSHHFVPLVTLKDSGPEIRAVPLV NRDRGRFEDLKVHFLTDPENEMKEKLLKEYLMVIEIPVQGWDHGTTHLINAAKLDEAN LPKEINLVDDYFELVQHEYKKWQENSEQGRREGHAQNPMEPSVPQLSLMDVKCETPNC PFFMSVNTQPLCHECSERRQKNQNKLPKLNSKPGPEGLPGMALGASRGEAYEPLAWNP EESTGGPHSAPPTAPSPFLFSETTAMKCRSPGCPFTLNVQHNGFCERCHNARQLHASH APDHTRHLDPGKCQACLQDVTRTFNGICSTCFKRTTAEASSSLSTSLPPSCHQRSKSD PSRLVRSPSPHSCHRAGNDAPAGCLSQAARTPGDRTGTSKCRKAGCVYFGTPENKGFC TLCFIEYRENKHFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQ NQRFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPN QRMGPGAHRGEPAPEDPPKQRCRAPACDHFGNAKCNGYCNECFQFKQMYG" BASE COUNT 1192 a 1070 c 1055 g 1109 t ORIGIN 1 TGCCTTGACC AGGACTTGGG ACTTTGCGAA AGGATCGCGG GGCCCGGAGA 51 GGTGTTGGAG AGCACAATGG CTGAACAAGT CCTTCCTCAG GCTTTGTATT 101 TGAGCAATAT GCGGAAAGCT GTGAAGATAC GGGAGAGAAC TCCAGAAGAC 151 ATTTTTAAAC CTACTAATGG GATCATTCAT CATTTTAAAA CCATGCACCG 201 ATACACACTG GAAATGTTCA GAACTTGCCA GTTTTGTCCT CAGTTTCGGG 251 AGATCATCCA CAAAGCCCTC ATCGACAGAA ACATCCAGGC CACCCTGGAA 301 AGCCAGAAGA AACTCAACTG GTGTCGAGAA GTCCGGAAGC TTGTGGCGCT 351 GAAAACGAAC GGTGACGGCA ATTGCCTCAT GCATGCCACT TCTCAGTACA 401 TGTGGGGCGT TCAGGACACA GACTTGGTAC TGAGGAAGGC GCTGTTCAGC 451 ACGCTCAAGG AAACAGACAC ACGCAACTTT AAATTCCGCT GGCAACTGGA 501 GTCTCTCAAA TCTCAGGAAT TTGTTGAAAC GGGGCTTTGC TATGATACTC 551 GGAACTGGAA TGATGAATGG GACAATCTTA TCAAAATGGC TTCCACAGAC 601 ACACCCATGG CCCGAAGTGG ACTTCAGTAC AACTCACTGG AAGAAATACA 651 CATATTTGTC CTTTGCAACA TCCTCAGAAG GCCAATCATT GTCATTTCAG 701 ACAAAATGCT AAGAAGTTTG GAATCAGGTT CCAATTTCGC CCCTTTGAAA 751 GTGGGTGGAA TTTACTTGCC TCTCCACTGG CCTGCCCAGG AATGCTACAG 801 ATACCCCATT GTTCTCGGCT ATGACAGCCA TCATTTTGTA CCCTTGGTGA 851 CCCTGAAGGA CAGTGGGCCT GAAATCCGAG CTGTTCCACT TGTTAACAGA 901 GACCGGGGAA GATTTGAAGA CTTAAAAGTT CACTTTTTGA CAGATCCTGA 951 AAATGAGATG AAGGAGAAGC TCTTAAAAGA GTACTTAATG GTGATAGAAA 1001 TCCCCGTCCA AGGCTGGGAC CATGGCACAA CTCATCTCAT CAATGCCGCA 1051 AAGTTGGATG AAGCTAACTT ACCAAAAGAA ATCAATCTGG TAGATGATTA 1101 CTTTGAACTT GTTCAGCATG AGTACAAGAA ATGGCAGGAA AACAGCGAGC 1151 AGGGGAGGAG AGAGGGGCAC GCCCAGAATC CCATGGAACC TTCCGTGCCC 1201 CAGCTTTCTC TCATGGATGT AAAATGTGAA ACGCCCAACT GCCCCTTCTT 1251 CATGTCTGTG AACACCCAGC CTTTATGCCA TGAGTGCTCA GAGAGGCGGC 1301 AAAAGAATCA AAACAAACTC CCAAAGCTGA ACTCCAAGCC GGGCCCTGAG 1351 GGGCTCCCTG GCATGGCGCT CGGGGCCTCT CGGGGAGAAG CCTATGAGCC 1401 CTTGGCGTGG AACCCTGAGG AGTCCACTGG GGGGCCTCAT TCGGCCCCAC 1451 CGACAGCACC CAGCCCTTTT CTGTTCAGTG AGACCACTGC CATGAAGTGC 1501 AGGAGCCCCG GCTGCCCCTT CACACTGAAT GTGCAGCACA ACGGATTTTG 1551 TGAACGTTGC CACAACGCCC GGCAACTTCA CGCCAGCCAC GCCCCAGACC 1601 ACACAAGGCA CTTGGATCCC GGGAAGTGCC AAGCCTGCCT CCAGGATGTT 1651 ACCAGGACAT TTAATGGGAT CTGCAGTACT TGCTTCAAAA GGACTACAGC 1701 AGAGGCCTCC TCCAGCCTCA GCACCAGCCT CCCTCCTTCC TGTCACCAGC 1751 GTTCCAAGTC AGATCCCTCG CGGCTCGTCC GGAGCCCCTC CCCGCATTCT 1801 TGCCACAGAG CTGGAAACGA CGCCCCTGCT GGCTGCCTGT CTCAAGCTGC 1851 ACGGACTCCT GGGGACAGGA CGGGGACGAG CAAGTGCAGA AAAGCCGGCT 1901 GCGTGTATTT TGGGACTCCA GAAAACAAGG GCTTTTGCAC ACTGTGTTTC 1951 ATCGAGTACA GAGAAAACAA ACATTTTGCT GCTGCCTCAG GGAAAGTCAG 2001 TCCCACAGCG TCCAGGTTCC AGAACACCAT TCCGTGCCTG GGGAGGGAAT 2051 GCGGCACCCT TGGAAGCACC ATGTTTGAAG GATACTGCCA GAAGTGTTTC 2101 ATTGAAGCTC AGAATCAGAG ATTTCATGAG GCCAAAAGGA CAGAAGAGCA 2151 ACTGAGATCG AGCCAGCGCA GAGATGTGCC TCGAACCACA CAAAGCACCT 2201 CAAGGCCCAA GTGCGCCCGG GCCTCCTGCA AGAACATCCT GGCCTGCCGC 2251 AGCGAGGAGC TCTGCATGGA GTGTCAGCAT CCCAACCAGA GGATGGGCCC 2301 TGGGGCCCAC CGGGGTGAGC CTGCCCCCGA AGACCCCCCC AAGCAGCGTT 2351 GCCGGGCCCC CGCCTGTGAT CATTTTGGCA ATGCCAAGTG CAACGGCTAC 2401 TGCAACGAAT GCTTTCAGTT CAAGCAGATG TATGGCTAAC CGGAAACAGG 2451 TGGGTCACCT CCTGCAAGAA GTGGGGCCTC GAGCTGTCAG TCATCATGGT 2501 GCTATCCTCT GAACCCCTCA GCTGCCACTG CAACAGTGGG CTTAAGGGTG 2551 TCTGAGCAGG AGAGGAAAGA TAAGCTCTTC GTGGTGCCCA CGATGCTCAG 2601 GTTTGGTAAC CCGGGAGTGT TCCCAGGTGG CCTTAGAAAG CAAAGCTTGT 2651 AACTGGCAAG GGATGATGTC AGATTCAGCC CAAGGTTCCT CCTCTCCTAC 2701 CAAGCAGGAG GCCAGGAACT TCTTTGGACT TGGAAGGTGT GCGGGGACTG 2751 GCCGAGGCCC CTGCACCCTG CGCATCAGGA CTGCTTCATC GTCTTGGCTG 2801 AGAAAGGGAA AAGACACACA AGTCGCGTGG GTTGGAGAAG CCAGAGCCAT 2851 TCCACCTCCC CTCCCCCAGC ATCTCTCAGA GATGTGAAGC CAGATCCTCA 2901 TGGCAGCGAG GCCCTCTGCA AGAAGCTCAA GGAAGCTCAG GGAAAATGGA 2951 CGTATTCAGA GAGTGTTTGT AGTTCATGGT TTTTCCCTAC CTGCCCGGTT 3001 CCTTTCCTGA GGACCCGGCA GAAATGCAGA ACCATCCATG GACTGTGATT 3051 CTGAGGCTGC TGAGACTGAA CATGTTCACA TTGACAGAAA AACAAGCTGC 3101 TCTTTATAAT ATGCACCTTT TAAAAAATTA GAATATTTTA CTGGGAAGAC 3151 GTGTAACTCT TTGGGTTATT ACTGTCTTTA CTTCTAAAGA AGTTAGCTTG 3201 AACTGAGGAG TAAAAGTGTG TACATATATA ATATACCCTT ACATTATGTA 3251 TGAGGGATTT TTTTAAATTA TATTGAAATG CTGCCCTAGA AGTACAATAG 3301 GAAGGCTAAA TAATAATAAC CTGTTTTCTG GTTGTTGTTG GGGCATGAGC 3351 TTGTGTATAC ACTGCTTGCA TAAACTCAAC CAGCTGCCTT TTTAAAGGGA 3401 GCTCTAGTCC TTTTTGTGTA ATTCACTTTA TTTATTTTAT TACAAACTTC 3451 AAGATTATTT AAGTGAAGAT ATTTCTTCAG CTCTGGGGAA AATGCCACAG 3501 TGTTCTCCTG AGAGAACATC CTTGCTTTGA GTCAGGCTGT GGGCAAGTTC 3551 CTGACCACAG GGAGTAAATT GGCCTCTTTG ATACACTTTT GCTTGCCTCC 3601 CCAGGAAAGA AGGAATTGCA TCCAAGGTAT ACATACATAT TCATCGATGT 3651 TTCGTGCTTC TCCTTATGAA ACTCCAGCTA TGTAATAAAA AACTATACTC 3701 TGTGTTCTGT TAATGCCTCT GAGTGTCCTA CCTCCTTGGA GATGAGATAG 3751 GGAAGGAGCA GGGATGAGAC TGGCAATGGT CACAGGGAAA GATGTGGCCT 3801 TTTGTGATGG TTTTATTTTC TGTTAACACT GTGTCCTGGG GGGGCTGGGA 3851 AGTCCCCTGC ATCCCATGGT ACCCTGGTAT TGGGACAGCA AAAGCCAGTA 3901 ACCATGAGTA TGAGGAAATC TCTTTCTGTT GCTGGCTTAC AGTTTCTCTG 3951 TGTGCTTTGT GGTTGCTGTC ATATTTGCTC TAGAAGAAAA AAAAAAAAGG 4001 AGGGGAAATG CATTTTCCCC AGAGATAAAG GCTGCCATTT TGGGGGTCTG 4051 TACTTATGGC CTGAAAATAT TTGTGATCCA TAACTCTACA CAGCCTTTAC 4101 TCATACTATT AGGCACACTT TCCCCTTAGA GCCCCCTAAG TTTTTCCCAG 4151 ACGAATCTTT ATAATTTCCT TTCCAAAGAT ACCAAATAAA CTTCAGTGTT 4201 TTCATCTAAT TCTCTTAAAG TTGATATCTT AATATTTTGT GTTGATCATT 4251 ATTTCCATTC TTAATGTGAA AAAAAGTAAT TATTTATACT TATTATAAAA 4301 AGTATTTGAA ATTTGCACAT TTAATTGTCC CTAATAGAAA GCCACCTATT 4351 CTTTGTTGGA TTTCTTCAAG TTTTTCTAAA TAAATGTAAC TTTTCACAAG 4401 AGTCAACATT AAAAAATAAA TTATTT // LOCUS AB014526 6184 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0626 protein, complete cds. ACCESSION AB014526 NID g3327065 VERSION AB014526.1 GI:3327065 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH00753. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6184) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .6184 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH00753" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 179. .1408 /gene="KIAA0626" CDS 179. .1408 /gene="KIAA0626" /codon_start=1 /product="KIAA0626 protein" /protein_id="BAA31601.1" /db_xref="PID:d1032562" /db_xref="PID:g3327066" /db_xref="GI:3327066" /translation="MDRLKSHLTVCFLPSVPFLILVSTLATAKSVTNSTLNGTNVVLG SVPVIIARTDHIIVKEGNSALINCSVYGIPDPQFKWYNSIGKLLKEEEDEKERGGGKW QMHDSGLLNITKVSFSDRGKYTCVASNIYGTVNNTVTLRVIFTSGDMGVYYMVVCLVA FTIVMVLNITRLCMMSSHLKKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLE LAKVTQFKTMEFARYIEELARSVPLPPLIMNCRTIMEEIMEVVGLEEQGQNFVRHTPE GQEAADRDEVYTIPNSLKRSDSPAADSDASSLHEQPQQIAIKVSVHPQSKKEHADDQE GGQFEVKDVEETELSAEHSPETAEPSTDVTSTELTSEEPTPVEVPDKVLPPAYLEATE PAVTHDKNTCIIYESHV" BASE COUNT 1815 a 1235 c 1281 g 1853 t ORIGIN 1 GGCGAGGGGT GCACGGCGGC CACCTGAGTG GCGCGGCGGT GTCAGGTTCT 51 TGCTCAAGTA CCAACTCTAT GGACCCAGGA CAGGTTTGTC CCATGACCTG 101 CTGTGAACAG TGTGTTGTCT GATAGAAGAT TCGGTTGGCA AACCATCTCT 151 CTATTGCCTT ACAGAGCAAG CAAAGAAGAT GGATCGATTG AAGAGCCATC 201 TGACTGTGTG CTTTCTACCT TCTGTGCCCT TTTTAATCCT AGTATCCACT 251 CTAGCCACCG CTAAGAGTGT GACTAACAGC ACTTTAAATG GCACTAACGT 301 GGTCTTGGGC TCTGTGCCCG TAATCATTGC CAGAACTGAC CATATCATAG 351 TCAAGGAAGG GAACAGTGCC TTGATTAACT GTAGTGTTTA TGGCATCCCT 401 GACCCACAGT TCAAGTGGTA TAATTCCATT GGCAAGCTGC TGAAAGAAGA 451 AGAGGATGAG AAGGAGAGAG GAGGAGGAAA ATGGCAAATG CACGACAGCG 501 GCCTCCTGAA CATCACCAAG GTATCCTTCT CAGACCGAGG TAAATACACG 551 TGTGTGGCTT CTAACATCTA CGGCACCGTG AACAACACGG TGACCTTGCG 601 CGTCATCTTC ACTTCTGGAG ACATGGGTGT CTACTACATG GTCGTGTGCC 651 TGGTGGCCTT CACCATCGTC ATGGTCCTCA ATATCACCCG CCTGTGCATG 701 ATGAGCAGCC ATCTAAAGAA GACTGAGAAG GCCATCAATG AGTTCTTTAG 751 GACCGAAGGT GCAGAGAAGC TGCAGAAGGC ATTTGAGATC GCCAAGCGCA 801 TCCCCATCAT CACCTCCGCC AAAACTCTAG AGCTTGCCAA AGTCACCCAG 851 TTCAAAACCA TGGAGTTCGC CCGCTACATC GAAGAGCTTG CCAGGAGCGT 901 GCCTCTGCCG CCTCTCATTA TGAACTGCAG GACTATCATG GAGGAGATTA 951 TGGAGGTGGT TGGGCTGGAG GAGCAGGGGC AGAATTTTGT GAGGCATACT 1001 CCAGAGGGCC AGGAGGCCGC AGACAGGGAT GAGGTCTACA CAATCCCCAA 1051 CTCTCTGAAG CGGAGCGACT CCCCTGCCGC TGACTCGGAC GCCTCATCGC 1101 TGCACGAGCA ACCTCAGCAA ATTGCCATCA AGGTGTCAGT TCACCCGCAG 1151 TCCAAAAAAG AGCATGCAGA TGACCAAGAG GGTGGACAGT TTGAAGTCAA 1201 AGATGTAGAG GAGACAGAAC TGTCGGCGGA ACATTCCCCC GAAACTGCAG 1251 AACCTTCTAC CGATGTCACG TCCACCGAGC TAACATCTGA AGAGCCAACA 1301 CCTGTTGAGG TACCAGATAA GGTACTGCCG CCAGCTTACC TGGAAGCCAC 1351 AGAGCCAGCA GTGACACATG ACAAAAACAC CTGCATTATT TACGAAAGCC 1401 ATGTCTAATA CCAACCCCGA AAAGCTATGC ATATCAAGAA AATCAGGGGC 1451 TGCTCCTTGT AATACAGATG TAGTACGCAC TTGCCGCTAA GCCTTACCAG 1501 GAGACTCTCA TCCCTTAGGT AGGAGTGATG CCACTTTAAA AGGAGAAACA 1551 CCTGCCTGCA GTGAATGGGA CTGGAATTTC CCCAGTAGAG AAGGGTGCGA 1601 GAAACATCAG GGTGCAGAAT TGATACCAGA CAGAAGGTGT CTATGTGATA 1651 ATGAGTTTCA GAGGCTGATC TCTGCCAAAT ACCTTAATTG GTGATGCCTT 1701 CTTGGCAAAG AGTACACCAC TGTAAGATAT TCTGAGTTCA AGAACCCTGT 1751 CCAGTGCCCC CTGCATTGCT TTTCCTTTTA AAAAGTATAG GTCTGCTACA 1801 ATAGCAAATG CACGTACGTG GGTTTTTTGC AGTTTCTTCT CAGTTTTAAT 1851 TTTGCTTTTC CTTTATAATG GGGTCATTGT TATTAATACT AATTGTTCTT 1901 TCTGGTTTAG TCCTCATTGC CACTTTTGTC CTTATGTTTC CCTAGAACAC 1951 GTACCTCAGA GACTTTGGTA TCAGTCACCA GTACCAGGGC TGATATCTAC 2001 AAGTCACATT ACATTTGTCA TGTTCCAAAG TAGTTACGAG GCTTGTTATT 2051 TTTTTTTCAT TCCCCAGGCC TATTTCCATA GATAGCTTTT TTTGTTTGTT 2101 TCCAACGAAG CTGCTGTTAA ACGAAACTGA GAAAAACTTT GCCCCGGAAT 2151 AGCACTTTAA TAGTCAAAAA TGTGTTTACC TGTCTGATTG AGTGAGCCTT 2201 TTGGTGAGCT CAGCTGAGAT GTAGAGGGAG ATTGTAAAAG GTTAAATATA 2251 CCCACACCAC CCATGAAAGT CACTGTTTAA GTTACATCAT CCTCCAAATA 2301 AAGACTGATT CTTTACCTGG AAAATATATT GCTTCCAAAG ACATCAGATT 2351 CAGTGGATTC CTGTAGGTTA TAGAATATTG GCTTCCAAAC AGGCTTGCAG 2401 GGACCATATG CTGTTGGATG ACATATAACC AGGTCCACTT TTATGAACTG 2451 CATAGCTGAC TTGGTTGTCC TTAAAGAGGA AAGCGAAAGG TTAGGGTAAT 2501 AGCAAAGGGA ACTGTGCCAT CAGATTTTAT GCCAAAACTG TTGAATAATT 2551 ATGCAGTCCT GCAAGAAAGT GGTTATATGT GAGGTGCGTG ATGTTATGGA 2601 AAGAAGACAA AATTAGTCAT CCAAAGGCTT AATACCCACT GTGCCAATAA 2651 CCAGCTGCCT GGCTTTGGAC AAGTCTGGAC CTCAGGTCCC TTATCTGTAG 2701 AAGGGGCAGA TGACATGAGC TCTGAGCACT GTTGAAATGG TATCACTGTC 2751 ACACAGAACC AAACCAATAT TCACATCCTT GCTCCTTTTC ACAATGACTT 2801 TAAAGATTTT TGCTTTCATC TCTTGGTCCA CCTAACATTT TCATGCTTCA 2851 TTACTTAAAT AAGAATGTTG GTTTTGAGAA ATAGCATTTT AAACAAATTG 2901 TGGATCTTCT CCTTCCAAAA AAACCATTAG GACCACATCT GCAATTAAGA 2951 TTTAATATTG GTGAGAATGA GTGGTTTTAT TTAATTTTCC CTTAAAAGCA 3001 AAGGAGACAG TAATCTTAAT AAATTCATAG GGGCCGTGGC CACATCAGGT 3051 AATGGGGTTA TGATGTCCAA GATTGCATGG ATCACATTGG TGATGAGAGC 3101 AGACCCAGAT GTTTAGTCCT CACTCTGTCA CCATCTGAGG AGGTGACCTT 3151 GGACAACTCC CTTCCTCTCT CTGGGATTTA ATCTTTTTCA TCTGTAAAAT 3201 ATGCAGGTAG TACTCGAGGG TCTACAGGAT CCCTTCTAGT TGAAACATTT 3251 ATAGTTCACA GAAAGTTTGC AGTCTTCCAG GATAACCAAC CCCCGTTGCA 3301 TGAGACAAGC AAAAAATGGG TCCATGAAAT TGGATACTTT TGCCATCCAA 3351 ACTTTACAAC AAACATTATC TGGCTCTGTA ATTGAGAGCA GTGGGCTTGG 3401 TTTTAAACCT AGCCTTGATT AGTTTGTTTA TAGATAACTG TTGTGGAAGG 3451 TGATAGAACT AGTCATGGAG TTTGATGAGA CATCTCTTGA AAAGGACTGA 3501 ACTGTTGACT TCTGGTTAGA AGTGCTTTGG GCAGTCACAT AAAGAAATGA 3551 GCAGTGAGAA ATCAGGAGAA ATTATGACTC CTGTTGGGCT TTCTGGACTA 3601 GCATTGTATG TTTTTGGGTT GCAGAAAAGT TTTAACACCA CCTCTTAGAA 3651 TATAAAAATT TTCCAGTTGT CATGGAGGTC CACAGATTCA TTACCATGGG 3701 TTTATATGCC CAAAGCAACA ACAGAGGACT TAAGTTCATT TTGTGATACT 3751 GTATGGATGT TACCCCATCC TATTCAGTTG TCATTCCACC CAAACCCATG 3801 TGTAGGTTTC CACATGGAAA GGAGAAGGCA TCCATTCCAC CTAGACATTG 3851 AATAGTGATA ATAAGCTAAA AGTGGGCAGA TTTTCAGTGG AGCAAGAGCA 3901 GAAATATGCG GCCAAAGAAT GTTTCCTGAT TGGTTTTGCT GCTTTAGACT 3951 GCAGTGGGGA GAGCTTATGT AGATTTTCAA AACTTTCTCC CTCTTTAAGG 4001 CATCATAATG CTCTCGGTTT TGATAACAAC TGACATAAAG GGAGGTTGAC 4051 TTAAAATGGG AATTTCTCCT TCCAAAAATG CTACACTCTT CCTATCCATC 4101 CTACAGCTTC TTTATGAAAT GAGAGGCCCT CCTGCTAGAA TATGAAATGC 4151 AGAAGACCTC ATGACTTTCA GCTGATTTTT CAAAGATAAA GTGAACTGTT 4201 CAGCTTCATA GAAATTCATG CGAGTGTGAC TGAACGTGTG TGCATACACA 4251 CTCGTGCACA TTGGACTCAT TTGGGCAGTT TTAAAAGCTT CACACTAAAT 4301 CCAAAGCCTC GTCCTTTGGG TCGTATGTAG TCGTTTGTAA AATCAATTTC 4351 TGGCTTCTGA GTCATCCTGG TCATATCTCT AGCAATGTTT TTCTTGAAAT 4401 TCTGAAAATG ATTCACATAT GTGTGTACAT TTAATTCACT TAGATGATCT 4451 GTAAACTTGG ATGGTATTTA TTCTAAATGG GGAAAACAAT TTTATATGGA 4501 AAAATCTATG TAATTTATAA TGGTTTTGTT TTATATATTA TATTTTCATA 4551 TCTCTAGGGC ACATCTATCC TCATCTTTTT GTATACCATA CTTAGCAAAA 4601 AGAAATACTA ATACTTGACT AAAATCTCTA GGAACCAAAC GTGATACATG 4651 TGATATATAG CTTCTAGAAA TCGCTCTAAA AATCTCTGAA TGTCTCATCC 4701 ATCCCAAGCA TTATTGTGCT GTGTCATTAT GTCCAGAATG ATTTGTCTTG 4751 GATGCTTATG AGCATTTGTT TTTCACAACT AAGGTTGAAA GACCTGACAT 4801 CTCACACAAT GGGGTTCTGG AATTCCCCTT TCCTCCTTTA TCTGTTTTTA 4851 TTGTTTGTTT CATTTTTAAT TGCACCAGTC TATGTTGTCG AAACTTTGTT 4901 TTGAAGGGCA AATGTGAGAT AACAAGAAAG CAATGTGATG GAAAGACTGG 4951 ATGAATTTAC CTATGGCTAT GTAAATTATT TTAATGGACT GATAAGATGT 5001 TTCAAGTCTC ATGCTTGGAT CTTTATTTAT TGGTGATCTA GGATCTGCTC 5051 AGCTCTTTAG CACATGAAGA AAATCAGGTA CAAAGGACAT TTGCATGTTT 5101 GGAACAGCAT GCTCTAAGCC CCGTGCAGCC AACACAAATT AACTTGACTG 5151 TAGAAACACC AATTCCAGCT GCTGGAAGAA ATGGTTTAGA AAGGCAAACC 5201 AGATACCTTT TATTCTGCCC TAGGAAATAC AGTGTTGATC AGTGCTAAAA 5251 CTCTTCAGTG GCAGTCACTG TGGTTCTTTT AACTGGGGAT TTCCTTTCAG 5301 TGTTTCATTT GGTACCAAAA CAGAACATTT ACCTTACATT TCAGATACTC 5351 TGTTTTCTCA GCATTGTTCA GATACTTTCC TTTACCGCTC TTCACGTACC 5401 CTTTTGGCAT TGAGTAATTC TATAAATGTT TCTATCCTTG GTTTTTAAAC 5451 CAAGTTATTC ATACTCTTAA AATATCTACC AAATCTCATT GTATTTTCAC 5501 ATATTTTGAG CATCAAGATA CTGGTCATTT TAAAAAATCC TTCAGTAAAT 5551 AGCACAGTTT ATTTTCCTAA TGACATTTTT AGGGTTTCTT CATTGATCAA 5601 CCAGGTTTGG GTTACACAAA TCAATTGTGG GGGAAAAATC AAATAAAACA 5651 ATTGCTTATT ATATTTTCCA AAGGACTGAG CATTTATCTT TTATTCACGA 5701 AGATATCATA TGAGGATGAT AATGATCTTT AACAGATTTT TTAGAGATAG 5751 AATTTATAAA GAGGCTGATA CTAAGAATAC TACAATCAAA ATTGAAGCTA 5801 GAGAATGTAA AAATAGAAAG TAAATAGTTC TAAGAATATT CTGGCATAAA 5851 TTATTTTTAT TTAGCCAATA AAATAGCCTC CAAATGTATA TCTCAGACAC 5901 CATAGAGCTG CTAACAATGA GAATCAAGGA AGATGCTTGC ACTTAGATTT 5951 CGTTTGTTGT ATTTCAGTAG TTCTGGATGT CCTTTGTTAA AATTGGAAAA 6001 TGGAAAAATG TCTCGACAGA AATGTCAATC TGGTGATTCT GTGAACTGTA 6051 AAATGTTCAC TTTTAAAAAT AAAGTTGTAA ACAAGTTACT CATATAAGTT 6101 GGTATTACAG TAGCAAAAAC AGAAAACCAT GTGATCCATC CTGTATTTTG 6151 ATTGATGCTT TAATAAAGGG TTTGCACAGC TGTG // LOCUS AF022375 3166 bp mRNA PRI 07-OCT-1998 DEFINITION Homo sapiens vascular endothelial growth factor mRNA, complete cds. ACCESSION AF022375 NID g3719220 VERSION AF022375.1 GI:3719220 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3166) AUTHORS Claffey,K.P., Shih,S.-C., Mullen,A., Dziennis,S., Cusick,J.L., Abrams,K.R., Lee,S.W. and Detmar,M. TITLE Identification of a human VPF/VEGF 3' untranslated region mediating hypoxia-induced mRNA stability JOURNAL Mol. Biol. Cell 9 (2), 469-481 (1998) MEDLINE 98119755 REFERENCE 2 (bases 1 to 3166) AUTHORS Detmar,M., Claffey,K.P. and Lee,S.W. TITLE Direct Submission JOURNAL Submitted (03-SEP-1997) Pathology, Beth Israel Deaconess Med. Ctr., 99 Brookline Ave., Boston, MA 02215, USA FEATURES Location/Qualifiers source 1. .3166 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epithelial" /tissue_type="breast" CDS 702. .1277 /codon_start=1 /product="vascular endothelial growth factor" /protein_id="AAC63143.1" /db_xref="PID:g3719221" /db_xref="GI:3719221" /translation="MNFLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFM DVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRCGGCSNDEGLECVPTEESNI TMQIMRIKPHQGQHIGEMSFLQHNKCECRPKKDRARQENPCGPCSERRKHLFVQDPQT CKCSCKNTHSRCKARQLELNERTCRCDKPRR" 3'UTR 1278. .3166 BASE COUNT 790 a 793 c 842 g 741 t ORIGIN 1 AAGAGCTCCA GAGAGAAGTC GAGGAAGAGA GAGACGGGGT CAGAGAGAGC 51 GCGCGGGCGT GCGAGCAGCG AAAGCGACAG GGGCAAAGTG AGTGACCTGC 101 TTTTGGGGGT GACCGCCGGA GCGCGGCGTG AGCCCTCCCC CTTGGGATCC 151 CGCAGCTGAC CAGTCGCGCT GACGGACAGA CAGACAGACA CCGCCCCCAG 201 CCCCAGTTAC CACCTCCTCC CCGGCCGGCG GCGGACAGTG GACGCGGCGG 251 CGAGCCGCGG GCAGGGGCCG GAGCCCGCCC CCGGAGGCGG GGTGGAGGGG 301 GTCGGAGCTC GCGGCGTCGC ACTGAAACTT TTCGTCCAAC TTCTGGGCTG 351 TTCTCGCTTC GGAGGAGCCG TGGTCCGCGC GGGGGAAGCC GAGCCGAGCG 401 GAGCCGCGAG AAGTGCTAGC TCGGGCCGGG AGGAGCCGCA GCCGGAGGAG 451 GGGGAGGAGG AAGAAGAGAA GGAAGAGGAG AGGGGGCCGC AGTGGCGACT 501 CGGCGCTCGG AAGCCGGGCT CATGGACGGG TGAGGCGGCG GTGTGCGCAG 551 ACAGTGCTCC AGCGCGCGCG CTCCCCAGCC CTGGCCCGGC CTCGGGCCGG 601 GAGGAAGAGT AGCTCGCCGA GGCGCCGAGG AGAGCGGGCC GCCCCACAGC 651 CCGAGCCGGA GAGGGACGCG AGCCGCGCGC CCCGGTCGGG CCTCCGAAAC 701 CATGAACTTT CTGCTGTCTT GGGTGCATTG GAGCCTTGCC TTGCTGCTCT 751 ACCTCCACCA TGCCAAGTGG TCCCAGGCTG CACCCATGGC AGAAGGAGGA 801 GGGCAGAATC ATCACGAAGT GGTGAAGTTC ATGGATGTCT ATCAGCGCAG 851 CTACTGCCAT CCAATCGAGA CCCTGGTGGA CATCTTCCAG GAGTACCCTG 901 ATGAGATCGA GTACATCTTC AAGCCATCCT GTGTGCCCCT GATGCGATGC 951 GGGGGCTGCT CCAATGACGA GGGCCTGGAG TGTGTGCCCA CTGAGGAGTC 1001 CAACATCACC ATGCAGATTA TGCGGATCAA ACCTCACCAA GGCCAGCACA 1051 TAGGAGAGAT GAGCTTCCTA CAGCACAACA AATGTGAATG CAGACCAAAG 1101 AAAGATAGAG CAAGACAAGA AAATCCCTGT GGGCCTTGCT CAGAGCGGAG 1151 AAAGCATTTG TTTGTACAAG ATCCGCAGAC GTGTAAATGT TCCTGCAAAA 1201 ACACACACTC GCGTTGCAAG GCGAGGCAGC TTGAGTTAAA CGAACGTACT 1251 TGCAGATGTG ACAAGCCGAG GCGGTGAGCC GGGCAGGAGG AAGGAGCCTC 1301 CCTCAGGGTT TCGGGAACCA GATCTCTCTC CAGGAAAGAC TGATACAGAA 1351 CGATCGATAC AGAAACCACG CTGCCGCCAC CACACCATCA CCATCGACAG 1401 AACAGTCCTT AATCCAGAAA CCTGAAATGA AGGAAGAGGA GACTCTGCGC 1451 AGAGCACTTT GGGTCCGGAG GGCGAGACTC CGGCGGAAGC ATTCCCGGGC 1501 GGGTGACCCA GCACGGTCCC TCTTGGAATT GGATTCGCCA TTTTATTTTT 1551 CTTGCTGCTA AATCACCGAG CCCGGAAGAT TAGAGAGTTT TATTTCTGGG 1601 ATTCCTGTAG ACACACCCAC CCACATACAT ACATTTATAT ATATATATAT 1651 TATATATATA TAAAAATAAA TATCTCTATT TTATATATAT AAAATATATA 1701 TATTCTTTTT TTAAATTAAC AGTGCTAATG TTATTGGTGT CTTCACTGGA 1751 TGTATTTGAC TGCTGTGGAC TTGAGTTGGG AGGGGAATGT TCCCACTCAG 1801 ATCCTGACAG GGAAGAGGAG GAGATGAGAG ACTCTGGCAT GATCTTTTTT 1851 TTGTCCCACT TGGTGGGGCC AGGGTCCTCT CCCCTGCCCA AGAATGTGCA 1901 AGGCCAGGGC ATGGGGGCAA ATATGACCCA GTTTTGGGAA CACCGACAAA 1951 CCCAGCCCTG GCGCTGAGCC TCTCTACCCC AGGTCAGACG GACAGAAAGA 2001 CAAATCACAG GTTCCGGGAT GAGGACACCG GCTCTGACCA GGAGTTTGGG 2051 GAGCTTCAGG ACATTGCTGT GCTTTGGGGA TTCCCTCCAC ATGCTGCACG 2101 CGCATCTCGC CCCCAGGGGC ACTGCCTGGA AGATTCAGGA GCCTGGGCGG 2151 CCTTCGCTTA CTCTCACCTG CTTCTGAGTT GCCCAGGAGG CCACTGGCAG 2201 ATGTCCCGGC GAAGAGAAGA GACACATTGT TGGAAGAAGC AGCCCATGAC 2251 AGCGCCCCTT CCTGGGACTC GCCCTCATCC TCTTCCTGCT CCCCTTCCTG 2301 GGGTGCAGCC TAAAAGGACC TATGTCCTCA CACCATTGAA ACCACTAGTT 2351 CTGTCCCCCC AGGAAACCTG GTTGTGTGTG TGTGAGTGGT TGACCTTCCT 2401 CCATCCCCTG GTCCTTCCCT TCCCTTCCCG AGGCACAGAG AGACAGGGCA 2451 GGATCCACGT GCCCATTGTG GAGGCAGAGA AAAGAGAAAG TGTTTTATAT 2501 ACGGTACTTA TTTAATATCC CTTTTTAATT AGAAATTAGA ACAGTTAATT 2551 TAATTAAAGA GTAGGGTTTT TTTTCAGTAT TCTTGGTTAA TATTTAATTT 2601 CAACTATTTA TGAGATGTAT CTTTTGCTCT CTCTTGCTCT CTTATTTGTA 2651 CCGGTTTTTG TATATAAAAT TCATGTTTCC AATCTCTCTC TCCCTGATCG 2701 GTGACAGTCA CTAGCTTATC TTGAACAGAT ATTTAATTTT GCTAACACTC 2751 AGCTCTGCCC TCCCCGATCC CCTGGCTCCC CAGCACACAT TCCTTTGAAA 2801 GAGGGTTTCA ATATACATCT ACATACTATA TATATATTGG GCAACTTGTA 2851 TTTGTGTGTA TATATATATA TATATGTTTA TGTATATATG TGATCCTGAA 2901 AAAATAAACA TCGCTATTCT GTTTTTTATA TGTTCAAACC AAACAAGAAA 2951 AAATAGAGAA TTCTACATAC TAAATCTCTC TCCTTTTTTA ATTTTAATAT 3001 TTGTTATCAT TTATTTATTG GTGCTACTGT TTATCCGTAA TAATTGTGGG 3051 GAAAAGATAT TAACATCACG TCTTTGTCTC TAGTGCAGTT TTTCGAGATA 3101 TTCCGTAGTA CATATTTATT TTTAAACAAC GACAAAGAAA TACAGATATA 3151 TCTTAAAAAA AAAAAA // LOCUS HSRNACINP 1901 bp mRNA PRI 05-SEP-1995 DEFINITION H.sapiens mRNA for cytokine inducible nuclear protein. ACCESSION X83703 NID g793840 VERSION X83703.1 GI:793840 KEYWORDS ankyrin-like repeat; nuclear localisation signal; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1901) AUTHORS Chu,W., Burns,D.K., Swerlick,R.A. and Presky,D.H. TITLE Identification and characterization of a novel cytokine-inducible nuclear protein from human endothelial cells JOURNAL J. Biol. Chem. 270 (17), 10236-10245 (1995) MEDLINE 95247734 REFERENCE 2 (bases 1 to 1901) AUTHORS Chu,W. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) W. Chu, Hoffmann-La Roche, 340 Kingsland Street, Dept. of Inflammation/Autoimmune Disease, Hoffmann-La Roche, Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1. .1901 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /tissue_type="skin" /cell_type="endothelial" /clone_lib="HDMEC cDNA" /clone="C-193" mRNA 1. .1901 misc_feature 94. .98 /note="nuclear localization signal" repeat_unit 152. .283 /note="ankyrin-like repeats" CDS 250. .1209 /note="cytokine-inducible expression" /codon_start=1 /product="nuclear protein" /protein_id="CAA58676.1" /db_xref="PID:g793841" /db_xref="GI:793841" /db_xref="SPTREMBL:Q15327" /translation="MMVLKVEELVTGKKNGNGEAGEFLPEDFRDGEYEAAVTLEKQED LKTLLAHPVTLGEQQWKSEKQREAELPKKKLEQRSKLENLEDLEIIIQLKKRKKYRKT KVPVVKEPEPEIITEPVDVPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRA CLEGHLAIVEKLMEAGAQIEFRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKL LSTALHVAVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGA DLNIKNCAGKTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF" BASE COUNT 592 a 378 c 460 g 471 t ORIGIN 1 AAAAAACAGC AGGGTTAGCT TGTCCCTCCC CTCCCTCTTC AGCTTCCCAG 51 ACACTGATTC TGGAATGAAA ATTCACCTGC CTCTGAGTTG GCTCCTAATG 101 GGGGTGGGAG TGTTACTTCG GTTCCCAGGT TGGAAGATTA TCTCACCCGG 151 CCCCAGCTAT ATAAGCTGAC CGGTGTGGAG GGGCCCAGCA GGGCCAACTC 201 CAGGGATTCC TTCCACGACA GAAAAACATA CAAGACTCCT TCAGCCAACA 251 TGATGGTACT GAAAGTAGAG GAACTGGTCA CTGGAAAGAA GAATGGCAAT 301 GGGGAGGCAG GGGAATTCCT TCCTGAGGAT TTCAGAGATG GAGAGTATGA 351 AGCTGCTGTT ACTTTAGAGA AGCAGGAGGA TCTGAAGACA CTTCTAGCCC 401 ACCCTGTGAC CCTGGGGGAG CAACAGTGGA AAAGCGAGAA ACAACGAGAG 451 GCAGAGCTCC CAAAGAAAAA ACTAGAACAA AGATCCAAGC TTGAAAATTT 501 AGAAGACCTT GAAATAATCA TTCAACTGAA GAAAAGGAAA AAATACAGGA 551 AAACTAAAGT TCCAGTTGTA AAGGAACCAG AACCTGAAAT CATTACGGAA 601 CCTGTGGATG TGCCTACGTT TCTGAAGGCT GCTCTGGAGA ATAAACTGCC 651 AGTAGTAGAA AAATTCTTGT CAGACAAGAA CAATCCAGAT GTTTGTGATG 701 AGTATAAACG GACAGCTCTT CATAGAGCAT GCTTGGAAGG ACATTTGGCA 751 ATTGTGGAGA AGTTAATGGA AGCTGGAGCC CAGATCGAAT TCCGTGATAT 801 GCTTGAATCC ACAGCCATCC ACTGGGCAAG CCGTGGAGGA AACCTGGATG 851 TTTTAAAATT GTTGCTGAAT AAAGGAGCAA AAATTAGCGC CCGAGATAAG 901 TTGCTCAGCA CAGCGCTGCA TGTGGCGGTG AGGACTGGCC ACTATGAGTG 951 CGCGGAGCAT CTTATCGCCT GTGAGGCAGA CCTCAACGCC AAAGACAGAG 1001 AAGGAGATAC CCCGTTGCAT GATGCGGTGA GACTGAACCG CTATAAGATG 1051 ATCCGACTCC TGATTATGTA TGGCGCGGAT CTCAACATCA AGAACTGTGC 1101 TGGGAAGACG CCGATGGATC TGGTGCTACA CTGGCAGAAT GGAACCAAAG 1151 CAATATTCGA CAGCCTCAGA GAGAACTCCT ACAAGACCTC TCGCATAGCT 1201 ACATTCTGAG GCAAACGACA GACTCTTAAT CAGTAAATGT TCACTGGCAT 1251 TTTGAAGGCA TGGCCCAGGA GAAGAGACAC TAGCCATAAA ATCTAGTTTC 1301 TATTTATCAA CGTGTTGTGA AGATGTACCT AATGAAGTTT TGAGAAAGCA 1351 CAGGGTTATA GGTGTTTAAA TTTCCTTTAG TGAAACTCTT ATTTATTTTT 1401 ATGTATTCCT GTTTATTTAT TTACTGCCAC GCTACTGATA TTCAGACCTT 1451 CATGATCATC CATCTGGTGA GCAGAGCTTC ATTTGTATAT AACACTTTCA 1501 GAGCCTTCCC ACCCATAGGT AGTTCTTAAA CCAGGTGAAA GAGCAAAGTT 1551 CAAGTGCCTA CTTATGTGTC ATTCGCTCAT GTAAGAGTTT TTAAGAGAGG 1601 GCTGATTATC ACAGCCCTCT TTTCTCCTGA ATTTTTAATG CAGAAGTTTG 1651 AATGAAGCAA GGGAAGGCAT GTAGGGACAG GAAAGGAAAC AATGGAAGGA 1701 AAGTGATTCT GTGAAAAGGA CAGTGAAGCC AGCTATTTTA CCCCCAGGCT 1751 GGATTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTACCGA GTACACAGAG 1801 TACCCAAGTG AAGAGAACGT CATGAGTGTA AGTGCAAATC AGTGGAAGGA 1851 GCGGCAAACT GGGACATGCA GAATTGAATT TGCTCAAAAA AAAAAAAAAA 1901 A // LOCUS D87717 5615 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0013 gene, complete cds. ACCESSION D87717 D13638 NID g1663709 VERSION D87717.1 GI:1663709 KEYWORDS KIAA0013. SOURCE Homo sapiens bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript clone:HA0450. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5615) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (09-SEP-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5615) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REMARK Erratum:[[published erratum appears in DNA Res 1995 Aug 31;2(4):211]] REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 COMMENT On Nov 8, 1996 this sequence version replaced gi:285980. FEATURES Location/Qualifiers source 1. .5615 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="15" /clone="HA0450" /clone_lib="pBluescript" /map="RH-ID RH25252" /tissue_type="bone marrow" gene 722. .3793 /gene="KIAA0013" CDS 722. .3793 /gene="KIAA0013" /note="similar to human GTPase-activating protein(A49869)" /codon_start=1 /protein_id="BAA13442.1" /db_xref="PID:d1014132" /db_xref="PID:g285981" /db_xref="GI:285981" /translation="MWDQRLVRLALLQHLRAFYGIKVKGVRGQCDRRRHETAATEIGG KIFGVPFNALPHSAVPEYGHIPSFLVDACTSLEDHIHTEGLFRKSGSVIRLKALKNKV DHGEGCLSSAPPCDIAGLLKQFFRELPEPILPADLHEALLKAQQLGTEEKNKATLLLS CLLADHTVHVLRYFFNFLRNVSLRSSENKMDSSNLAVIFAPNLLQTSEGHEKMSSNTE KKLRLQAAVVQTLIDYASDIGRVPDFILEKIPAMLGIDGLCATPSLEGFEEGEYETPG EYKRKRRQSVGDFVSGALNKFKPNRTPSITPQEERIAQLSESPVILTPNAKRTLPVDS SHGFSSKKRKSIKHNFNFELLPSNLFNSSSTPVSVHIDTSSEGSSQSSLSPVLIGGNH LITAGVPRRSKRIAGKKVCRVESGKAGCFSPKISHKEKVRRSLRLKFNLGKNGREVNG CSGVNRYESVGWRLANQQSLKNRIESVKTGLLFSPDVDEKLPKKGSEKISKSEETLLT PERLVGTNYRMSWTGPNNSSFQEVDANEASSMVENLEVENSLEPDIMVEKSPATSCEL TPSNLNNKHNSNITSSPLSGDENNMTKETLVKVQKAFSESGSNLHALMNQRQSSVTNV GKVKLTEPSYLEDSPEENLFETNDLTIVESKEKYEHHTGKGEKCFSERDFSPLQTQTF NRETTIKCYSTQMKMEHEKDIHSNMPKDYLSKQEFSSDEEIKKQQSPKDKLNNKLKEN ENMMEGNLPKCAAHSKDEARSSFSQQSTCVVTNLSKPRPMRIAKQQSLETCEKTVSES SQMTEHRKVSDHIQWFNKLSLNEPNRIKVKSPLKFQRTPVRQSVRRINSLLEYSRQPT GHKLASLGDTASPLVKSVSCDGALSSCIESASKDSSVSCIKSGPKEQKSMSCEESNIG AISKSSMELPSKSFLKMRKHPDSVNASLRSTTVYKQKILSDGQVKVPLDDLTNHDIVK PVVNNNMGISSGINNRVLRRPSERGRAWYKGSPKHPIGKTQLLPTSKPVDL" BASE COUNT 1783 a 981 c 1275 g 1576 t ORIGIN 1 GAAACTGCGG GTGTGACCCC CCCGTGGTGG CTCTGGGTGT CTGCGGAGGA 51 GCTGGGGGCG GAAGATGAGG CTAACGGCTT GGCTTCAGTG AACGCACCGG 101 GATGTGCAGG CCGGGAGGTA GAGGCAGGCT GATGGGGGAG GGAACGAGCA 151 GCCTGTGAGA CGGGGTGACG GCGGCTACCA GCCCGGGCGG GCACCGGGAC 201 TGGAAGAGTT GCCTGAGCAG CCGGCTGGTC CGGCGGCCAG GCTAGGGCGG 251 GGGCGAGCGC CCAGTTGAGC CTGCTGGGGC TGGAGGAGCG AGAAGGGTTT 301 TCTTCACATT TCAGAGCGAA CCAGACGGGG ACAGTAAGGT TTGGAGGAAG 351 GGGGATCGTT GGAAGTAGCA AGAAGTGGAG AGAATCTGGC AATAGACGAG 401 AAACCGAAAG AATCAGAAAG AAGTCTATGT GAGTAGCTGA AAGCATTGGG 451 TGACCAGAAA GAAGGTCGGT GTAAGTGAAG GAAGAGTGAG GTGTGGCTGG 501 ATCAAAGGGC TAAGAGAAGC GGGTCTGTGT AAGTGGATGT GAGTGAGGAT 551 CAAGGAAAAG CCGTGGAAGT GGCCGGGGGT CGGGGCCGCA GAAGTGCCAG 601 ACGGGGCCGG AAAGCAGCCG AGCGGAGTTC AAATTTGAGA GCGTTTGGAA 651 ATTGGAAGAC TTGGTGGCGA ACGAGGGTCA GGACCTGCAT CCTGCCTCAG 701 AGAGTTATCG ACGTATCCGG AATGTGGGAT CAGAGGCTGG TGAGGTTGGC 751 CCTGTTGCAG CATCTGCGGG CCTTCTATGG TATTAAGGTG AAGGGTGTCC 801 GTGGGCAGTG CGATCGCAGG AGACATGAAA CAGCAGCCAC GGAAATAGGG 851 GGTAAAATAT TTGGAGTACC TTTTAATGCA CTGCCCCATT CTGCTGTACC 901 AGAATATGGA CACATTCCAA GCTTTCTTGT CGATGCTTGC ACATCTTTAG 951 AAGACCATAT TCATACCGAA GGGCTTTTTC GGAAATCAGG ATCTGTGATT 1001 CGCCTAAAAG CACTAAAGAA TAAAGTGGAT CATGGTGAAG GTTGCCTATC 1051 TTCTGCACCT CCTTGTGATA TTGCGGGACT TCTTAAGCAG TTTTTTAGGG 1101 AACTGCCAGA GCCCATTCTC CCAGCTGATT TGCATGAAGC ACTTTTGAAA 1151 GCTCAACAGT TAGGCACAGA GGAAAAGAAT AAAGCTACAC TGTTGCTCTC 1201 CTGTCTTCTG GCTGACCACA CAGTTCATGT ATTAAGATAC TTCTTTAACT 1251 TTCTCAGGAA TGTTTCTCTT AGATCCAGTG AGAATAAGAT GGACAGCAGC 1301 AATCTTGCAG TAATATTTGC ACCGAATCTT CTTCAGACAA GTGAAGGACA 1351 TGAAAAGATG TCTTCTAACA CAGAAAAGAA GCTACGATTA CAGGCTGCAG 1401 TAGTACAGAC TCTTATCGAT TATGCATCAG ATATTGGGCG TGTACCAGAT 1451 TTTATCCTGG AAAAGATACC AGCCATGTTG GGTATTGATG GTCTCTGTGC 1501 TACTCCATCA CTGGAAGGCT TTGAAGAAGG TGAATATGAA ACTCCTGGTG 1551 AATATAAGAG AAAGAGAAGA CAAAGTGTAG GAGATTTTGT TAGTGGAGCA 1601 CTAAATAAAT TTAAACCTAA CAGAACACCT TCTATTACAC CTCAAGAAGA 1651 AAGAATTGCC CAGCTATCTG AATCACCAGT GATTCTTACA CCAAATGCTA 1701 AGCGTACATT GCCAGTAGAT TCTTCTCATG GTTTCTCAAG TAAGAAAAGG 1751 AAGTCCATCA AGCACAATTT TAACTTTGAG CTGTTGCCAA GTAATCTCTT 1801 CAATAGCAGT TCTACACCGG TATCAGTTCA CATCGATACA AGCTCAGAAG 1851 GGTCATCTCA GAGTTCACTC TCTCCTGTAC TCATTGGTGG AAACCATTTG 1901 ATCACTGCAG GTGTGCCAAG GCGAAGTAAA AGAATTGCAG GCAAAAAAGT 1951 TTGCAGAGTG GAATCAGGAA AAGCAGGCTG CTTTTCTCCT AAAATCAGCC 2001 ATAAAGAAAA GGTTCGAAGA TCTCTGCGTT TGAAATTCAA TCTAGGGAAA 2051 AATGGCAGAG AAGTAAATGG ATGTTCTGGT GTCAATAGAT ATGAAAGTGT 2101 TGGTTGGCGA CTTGCAAATC AACAAAGTTT AAAAAATCGA ATTGAATCTG 2151 TAAAAACAGG TTTGCTTTTT AGCCCAGATG TTGATGAAAA GTTACCAAAG 2201 AAAGGTTCAG AAAAGATCAG TAAGTCTGAG GAAACCTTAC TAACTCCAGA 2251 GCGACTAGTT GGAACAAATT ACCGGATGTC TTGGACAGGA CCTAATAATT 2301 CAAGTTTTCA AGAAGTAGAT GCAAATGAAG CTTCTTCAAT GGTGGAAAAT 2351 CTTGAGGTAG AAAACTCTTT GGAGCCTGAT ATTATGGTAG AAAAGTCACC 2401 TGCTACTTCA TGTGAACTCA CCCCTTCCAA TTTAAACAAT AAGCATAATA 2451 GCAACATAAC AAGTAGCCCT CTTAGCGGGG ATGAAAATAA CATGACCAAA 2501 GAGACTTTGG TGAAAGTTCA AAAAGCGTTT TCTGAATCTG GAAGTAATCT 2551 TCACGCATTG ATGAATCAGA GGCAGTCATC AGTAACTAAT GTGGGGAAAG 2601 TAAAATTAAC TGAACCATCT TATTTAGAAG ATAGCCCAGA GGAAAATCTA 2651 TTTGAAACTA ATGATTTGAC TATAGTAGAA TCAAAGGAGA AATATGAACA 2701 CCACACTGGT AAAGGTGAAA AATGTTTTTC AGAGAGGGAC TTTTCACCCC 2751 TTCAAACTCA AACATTTAAT AGAGAAACAA CTATAAAATG TTATTCAACT 2801 CAGATGAAGA TGGAACATGA AAAAGACATT CATTCAAATA TGCCAAAAGA 2851 TTATTTAAGC AAGCAAGAAT TCTCCAGTGA TGAAGAAATA AAGAAACAGC 2901 AGTCCCCAAA GGATAAACTA AATAATAAAT TAAAAGAGAA TGAGAATATG 2951 ATGGAAGGTA ACTTACCGAA GTGTGCAGCA CATAGCAAGG ACGAGGCTAG 3001 ATCCTCTTTC TCACAGCAGA GTACATGTGT TGTAACAAAC TTGTCAAAAC 3051 CTAGGCCTAT GAGAATTGCT AAACAGCAGT CATTGGAAAC ATGTGAGAAA 3101 ACAGTTTCTG AAAGTTCACA AATGACAGAA CATAGAAAGG TTTCTGATCA 3151 CATACAGTGG TTTAACAAGC TTTCTTTAAA TGAACCAAAT AGAATAAAAG 3201 TCAAGTCACC TCTTAAGTTT CAGCGTACTC CTGTTCGTCA GTCCGTCAGA 3251 AGAATTAATT CTTTGTTGGA GTATAGCAGA CAACCTACAG GGCATAAGTT 3301 GGCGAGTCTT GGTGATACAG CTTCTCCTTT GGTCAAATCA GTGAGCTGTG 3351 ACGGTGCTCT TTCCTCTTGT ATAGAAAGTG CATCAAAAGA TTCCTCTGTT 3401 TCATGTATCA AATCAGGTCC TAAAGAACAG AAGTCCATGT CATGTGAAGA 3451 GTCAAATATT GGTGCAATTT CAAAGTCAAG CATGGAGTTA CCCTCGAAAT 3501 CTTTCTTAAA GATGAGGAAG CACCCAGATT CAGTGAATGC TTCTCTTAGG 3551 TCTACTACAG TTTATAAACA GAAGATCTTA TCTGATGGCC AAGTTAAGGT 3601 TCCCTTGGAT GATCTGACTA ATCATGATAT AGTAAAACCA GTTGTAAATA 3651 ACAACATGGG CATTTCTTCT GGGATAAATA ACAGGGTCCT TAGGAGACCA 3701 TCAGAAAGAG GAAGGGCCTG GTACAAAGGT TCTCCAAAAC ATCCTATCGG 3751 AAAAACTCAA TTACTACCAA CAAGTAAACC TGTAGATTTG TAATTGGTAA 3801 ATGTTATACT TGTCATTAAT GTAAATAAAG TGAGTAATTG GTATGACTTG 3851 CAGGATGATG TACATGTTAG TTTGTAGCTC AGGATGATTG TTAAGCAATA 3901 GATTTGCTCT ATTGAAAATG TTTCATTTTT TTCACTGTAC AAGCAACTTA 3951 GATTTTTATT TGTACAAATT ACTTCTTTGT TTTTCTTAAT GATGGCAATT 4001 TTTAAACTTT AATTTTATTG TGATCTCTTA AAGCAGAGGT TAGACTTTAC 4051 CTTTCTGACT CTGTCGTCCA GGCTGGAGTG CAGTGGCGCA ATCTCACTGC 4101 AAGCTCCACT TCCTGGGTTC ATGCCATTTT CCTGCCTCAG CCTCCCGAGT 4151 AGCTGGGACT ACAGGTGCCC GCCACCACGC CCAGCTAATT TTTTGTATTT 4201 TTAGTAGAGA CGGTTTCACC GTGTTAGCCA GGATGGTCTC GATCTCCTGA 4251 CCTTGTGATC CGCCCGCCTC AGCCTCCCAA AGTGCTGGGA TTACAGGCAT 4301 GAGCCACCAC GCCCGGCTAG ACTTTACCTT TCTAAAGAAA TTGTTTACTG 4351 GATTTATAAG AAGTTAATTT TTGAAAATGA CATATTTTTG TGTGATAGAA 4401 AGAATGGAGC AAGTTGTGCC TATTTCCTCC AAGTCAGATA AGGTTTCTAA 4451 AATAAATAAA TTTCTAGCAT ATAAAGGGTA GAGATAAACT CTGCAAATCT 4501 TATGTCTGGA ATTATATTAA TGTTTATTGT CCTTGCCAAA ATTCCTAGAA 4551 ATTAATTTCC TTCAATAGCA TCCTAAAACT CTATTTTTAT TTGGGGCAGA 4601 GTAATTTCAT TTATAGTGCC AGTAGGTGTA CCTTGTGTTC ACTCGAACTA 4651 AGAACAATGG TTAAGGCAGA ATAATGACTA AAATATGTTC ATATATTATG 4701 ATGTGGAAAT AATTGATAAC TTTTAAGCCA TACTATGTTT TTAAAGATAA 4751 TTTGCACAAA CACGTTTGTG TCTGTTCTGT CCAATATAGA TTTGGCAATT 4801 ATTTAAAGAG GGATAATCTT GAAAAAAATT AACCAAGGTG ATTTCTTATA 4851 TGTAGATGCT CGATTTTGGA ATTTGAAATA GTAGATGCAC CTCTTTACCT 4901 TTTTTACTTG GATAAAAACC TATGATGATT TTGTCCTGTG TGTAAATGTT 4951 ATTTATTTAG CATAGACATT AAAGATAACT CTCTGGAAAA TGACTTGACT 5001 AAGGCTCTCA TGAAATTCAA AGTGCCATTT AGAACATGCA CCAAATTGTC 5051 AAGTAAATCT GTCTAAATTT ATATTTTAAA TTATTACAAA TTACACATCT 5101 TTGAGGAAAG AGTATTATGA ACAATAGAAC ATATTCTCTA GGTTGTAGAG 5151 GAAGGAATAA GCAGACAGAA TCAACCACTA AAGGTAGTTT TTCAGATTGG 5201 TTGTTAGAAT GTCATGTTTA GATGTTGGAG CAGATTAGAG CAGCATTCAT 5251 GCCACTCGGA GCAACCAGAC TTACAGCATA AGTATGTACG AGGAATTTCA 5301 AATCATCAGA TGTTTGCTTG GCTAGGTTCT ACTTTGTTTA TTTGATATCA 5351 AATAGGTTTG TAGATGTTTA TGGCATTTCT AATTGTAAGT AGAGACAAAA 5401 TATTCATATA GTCAGATATA TGTTGTCTGC TTTAAACAAT TTTTAAATTT 5451 TAAAAATGCA TTAACGTCTT TTTATATCCA TCAAGGGAAG GATGAAATGT 5501 TGAATTTGAA GACTAATTCA GTAAGAAGTC CTAGGGGTTT AACTGTACAT 5551 ACTACCTGAA CTGGCTTTTC TGAGAGATGA ATCAATAATG AAACATGTCT 5601 GTTTTAAAAA CTACC // LOCUS AB007944 5983 bp mRNA PRI 13-AUG-1998 DEFINITION Homo sapiens mRNA for KIAA0475 protein, complete cds. ACCESSION AB007944 NID g3413911 VERSION AB007944.1 GI:3413911 KEYWORDS KIAA0475 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0451. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5983) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Seki,N., Ohira,M., Nagase,T., Ishikawa,K., Miyajima,N., Nakajima,D., Nomura,N. and Ohara,O. TITLE Characterization of cDNA clones in size-fractionated cDNA libraries from human brain JOURNAL DNA Res. 4 (5), 345-349 (1997) MEDLINE 98116662 FEATURES Location/Qualifiers source 1. .5983 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="HH0451" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 337. .1566 /gene="KIAA0475" CDS 337. .1566 /gene="KIAA0475" /codon_start=1 /product="KIAA0475 protein" /protein_id="BAA32320.1" /db_xref="PID:d1033282" /db_xref="PID:g3413912" /db_xref="GI:3413912" /translation="MKLKQRVVLLAILLVIFIFTKVFLIDNLDTSAANREDQRAFHRM MTGLRVELAPKLDHTLQSPWEIAAQWVVPREVYPEETPELGAVMHAMATKKIIKADVG YKGTQLKALLILEGGQKVVFKPKRYSRDHVVEGEPYAGYDRHNAEVAAFHLDRILGFH RAPLVVGRFVNLRTEIKPVATEQLLSTFLTVGNNTCFYGKCYYCRETEPACADGDIME GSVTLWLPDVWPLQKHRHPWGRTYREGKLARWEYDESYCDAVKKTSPYDSGPRLLDII DTAVFDYLIGNADRHHYESFQDDEGASMLILLDNAKSFGNPSLDERSILAPLYQCCII RVSTWNRLNYLKNGVLKSALKSAMAHDPISPVLSDPHLDAVDQRLLSVLATVKQCTDQ FGMDTVLVEDRMPLSHL" BASE COUNT 1690 a 1227 c 1404 g 1662 t ORIGIN 1 GTGTGGGACT GCCGCTCTGC GCGGCGAGAG GTGGCCTGGG AATGGCCGGG 51 CCGGGGGTGG GCCGGAGCCG CTGTGGCGGC GGCGGCGGCT GGGGGCGGTG 101 AGCGCGGCGT GGGGCTGCCC CTCCCCGGAG GCGGCGGGGG CGGCCGGGGC 151 CGCGCCGCAC CGCACCGCGC GGGCGGCCAT GGAGCGAGCC TAGGGCCCGA 201 CAGGAACTGT GGAAGGTGCA TCAGTGAAGA AATGGACCAA TGTGTATAAT 251 CATGGAATCT CCTTGCTAAC CATCACCACC AGCTCTCCTT AATACATGAG 301 CAAGAGTGGG TCAGGGGAGA AGGAAAAGAG GTCAACATGA AGCTAAAGCA 351 GCGAGTCGTG CTGTTAGCAA TTCTCCTTGT CATTTTTATC TTCACCAAAG 401 TTTTCCTGAT TGACAACTTA GATACATCAG CTGCCAACCG GGAGGACCAG 451 AGGGCCTTTC ACCGAATGAT GACTGGCTTG CGGGTGGAGC TGGCACCCAA 501 GCTGGACCAT ACCTTGCAGT CTCCCTGGGA GATTGCAGCC CAGTGGGTGG 551 TTCCCCGGGA AGTGTACCCT GAAGAGACAC CAGAGCTGGG GGCAGTCATG 601 CATGCCATGG CCACCAAGAA AATCATTAAA GCTGATGTGG GTTATAAAGG 651 GACACAGCTG AAAGCCTTAC TGATACTTGA AGGAGGCCAG AAAGTTGTTT 701 TCAAACCTAA GCGGTATAGC CGAGACCATG TGGTGGAAGG GGAACCGTAT 751 GCTGGTTATG ATAGACACAA TGCAGAGGTA GCAGCCTTTC ACTTGGACAG 801 GATTCTGGGT TTCCACCGAG CCCCCTTGGT AGTTGGCAGA TTTGTTAATC 851 TTCGGACAGA GATCAAACCT GTCGCCACAG AGCAGCTGTT GAGCACCTTC 901 CTAACTGTAG GAAACAATAC TTGTTTTTAT GGGAAGTGCT ATTACTGCCG 951 AGAAACAGAA CCAGCTTGTG CTGATGGAGA CATAATGGAG GGATCTGTCA 1001 CACTTTGGCT TCCAGATGTG TGGCCTCTGC AGAAGCACCG TCACCCATGG 1051 GGCAGGACTT ACCGAGAAGG CAAATTGGCC AGGTGGGAGT ATGATGAGAG 1101 CTACTGTGAT GCTGTGAAGA AAACGTCCCC TTATGACTCT GGCCCGCGCC 1151 TCTTGGACAT CATTGACACA GCTGTCTTTG ATTACCTGAT TGGCAATGCT 1201 GACCGCCATC ACTATGAGAG CTTTCAAGAT GATGAAGGCG CTAGTATGCT 1251 CATCCTTCTT GATAATGCCA AAAGCTTTGG GAACCCCTCG CTGGATGAAA 1301 GAAGCATTCT TGCCCCTCTC TATCAGTGTT GCATCATTCG GGTGTCCACC 1351 TGGAACAGAC TGAACTACCT AAAGAATGGT GTGCTAAAGT CTGCCTTAAA 1401 ATCTGCCATG GCCCATGACC CCATCTCCCC AGTGCTCTCT GATCCTCATC 1451 TGGACGCCGT GGACCAGCGG CTCCTGAGTG TCCTGGCCAC CGTGAAGCAG 1501 TGCACCGACC AGTTTGGGAT GGACACAGTA CTGGTGGAAG ACAGGATGCC 1551 TCTCTCACAC TTGTAATTCT CGACACAAAA TAAGTGAAAC TTCTTTTTAC 1601 AAAGATAGAG AAACAGCACA ATCAATTCCA AATGGTATGA GATGGATTGG 1651 AAGTGGCCAG CAGCAAGTTC TGGTGACAGG ACAGGGTGGC CTTGGATGTC 1701 TTTGGTATTT TCTGTAGTAG AAACTAAAGC AAAGACCACA AGTTTCAGAG 1751 CATGGAGACA TTCCTGCTGA ATCGCCTTCT CACCTCCTCG GCAATTGCTC 1801 ATTCTAGGGT TGGGCATCAT AGTTGGTCAG TCTTAATTCC CATGCCAAAG 1851 GACAAACAGG TGTGACATTT GGATAGATGA ATACTGGGAT TGGCTCTGGA 1901 GCATGTGTTT TGAGTTGAAC CTTGCAGTCC TTTCTCTACG CCCGTGGATT 1951 TTGTGGAAAC ACTTTGCAAT CTCTTTGTCT TTTTTTTTTT ACCAGAACTA 2001 GTTACATTGG AATGCTTACT GTCCTACAGA GTGGCAGCAA ATAAAACCTT 2051 GCATTCCATC AAGCCAAAAT AGCACACTCT GTTAGAGGAG ATACATGTTT 2101 AAGATAGAAT TGGAGGGAAG GACAAAAACA GAAAAATGTT TGGGCTTTTA 2151 AGCCATTGGG TAGTATTGTT TTGATGATCT TAGAGGAGGG AAGAAGAGAG 2201 AGAGACCCAA TGGTAGAACC AGAATCAGGG AGATGACTGA ACTACTGAAA 2251 AACAGGTTCC CTTGTATTTA GGATCTTAAG GTGTATAAAA AGCAAACATG 2301 ACTTTGCACC TAAGTAAATT CTGCATTCTC ATAGTTGTGT CCCAATTAAC 2351 CAAAAAGTTG TCTCTAGAGA AAATAATATT ACAATCTAAG CATGATTCTC 2401 TGTGGAGACT AATTTTTTCC CCTTTTGCCA AAAGCAGTCC TTCCCAAATT 2451 AACAAAGCAA ACTGAAATAA TACCTTGAAT AACAGGTTGC CTGTGGTCTC 2501 TGTCATCCTC GTCTCTCTTC TGAAATGAAT TTCCACCTCT GCCTTTAAGG 2551 CATTTTTGTC ACTGAAGCTG CTGTTCCCAA GAGATAGGCA ACCTTTTTGT 2601 CCCTTTCTCA TAAGAAAGGG ACACTCCTAC AGGTGAGAGT GTATACCTTA 2651 CTCTCTCAGA TAAGTGGCTG GACTTATCTT GTGATTTGGG GCCATGGAAG 2701 ATTGGAAACA AAGATTTTAA GCCTTCTTCT TTTTTGCTTT TTTCTTTTTT 2751 TTTTGAGACC AAGTCTCACT CTGTTGCCCA GGCTGGAGTG CAGTGGCACG 2801 ATCTTGGCTC ACTGCAACCT CCGTCTCCCA GGTTCAAGCG ATTCTCTTGC 2851 CTCAGCCTCC AGAGTAGCTG GGATTACAGG CGCCCGCCAT CGTGCCCAGC 2901 TAATTTTTAT ATTTTTAGTG GAGACAGGGT TTCGGGTTTC ACCATGTTGG 2951 CCAGGTTGAT CTTGGACTCC TGACCCCAGG TGATCCACCT GCCTCAGCCT 3001 TCCAAAGTGC TGGGATTACA GGCATGAGCC ACCGTGGCCG GCCAAGATTT 3051 TAAGCCTTCT GAGCCTTGAA ATTGAGGAGG TTAAAAGGAA GAGCCTTAAG 3101 ATTTTGATTT ATGTCAAATC CTAATTCTAT CATTCAGTCT TGTTTGGAGT 3151 TCTGAACCCA TGATGTTGTA TTATGCTTCT TTCTCCTCTT AGCACTCTCA 3201 AATTTCAGGT TTGTAAAACA CAGTTTTTGT TTTGTGTTCT GGCAAAGTGA 3251 TCTCAACATG TAAGTAGTTG CAGTAAAACA CAGGGGCAAA GGAAGACAGG 3301 CCTGATGTGC CCACTCATCT ATGGACTCAG AGCTGTGTGC TTTGCTCCTG 3351 CATCTTGTTG AGGTGCTGTT CCAGCTTTGC ATTTCTGTCA AGTAGAGGCG 3401 AATATATAAA CAGTGTGGTT GAATACATTT AATGCCAGCC ATTGGAAACT 3451 AGTTTTAGGC AACCACTCTC AAAAACAGCT TTAGAATTTA TGCCCAGTTT 3501 TCTTGCATTG AAAGATAACT GAGTAATAAC CTGTAACTAT TTTTAAATGG 3551 CATGAAATTA GGAAACTTTT GTACATTTTA TATACATTTT GAGATGAACA 3601 GAACAATGGG CTGAGTTATA AAAAGCGTGT ATTGAATTTA AGAAGACAGA 3651 CTAGCACAAA ACACAGAATT CGTGTTAACC AAAGGAGGCA TTGATTTCAG 3701 TTTTAAGGCT ACTCAGTGTT GTGTGTCCAG GGAAATTCAC AGCTCAGTAT 3751 GAGAATACCT TGGTTAGTGC TCACCCACAA GCTTCCAGGA GCCAGCTGGG 3801 AGGAGACAAT AGGAAGAGAT GTCAGCTCTG CTCTCCCTGT AAATGTTAGT 3851 TGAACTAAGT TATGGATTTG TGGTCTTTCA AATACATGAC GCCTTTAGTA 3901 TGCCACACTG AAATGAATAA GAAGTCTTCT GAAACTGGGA ACTTCATAAC 3951 ATTGAAGGCA GAAGATTCTG CTAAGGAAAA AAGCAGGCAG GAAAGAAAAT 4001 GTCTCATCCT TTCTTGAAAG CATTTGCAGA AAATATATCA TTTCATTTTA 4051 TTCCCATCTG TTTTCAAACT CGTGATCTTA AAAGGCATTC TGATGATAAA 4101 TTTAGAATTT TCATCTATAA AATTTAGAAC TCTAATCCAT AAAGTTAGAA 4151 TTGAGCTAAT AGAGTGGTAT GACATGGCAC TAAAAATATA AATTTTTGTT 4201 GTAAGTCAGG ATTGGAGTAA GCTGGAAAAG TATGTTTAGG CAAATCTTGG 4251 AGAAAACCAA CCATAAACTT ACAGCTCTAA AATTCAGAAA GCCCTAAAAT 4301 TTCAAACACT GTTTGAAAGA AGAGGTGGGG GCCGGGTGCC GTGGCTCATG 4351 CCTGTCATCC CAGCATTTGG GAGGCTGAGG CAGGCAGATC ACCTGAGGCC 4401 AGGAGTTCGA GACCAGCCTG GCTGGCTAGC ATGGTGAGAC CGTCTCTACT 4451 AAAAATGCAA AAATTAACAG GGCACGGTGG CATGCGCCTG TAGTCCCAGC 4501 TACTCGGGAG GCTGAGGCAG GAGAATCACT TGAATCCAGG AGGCGAAGGT 4551 TGCAGTGAGC TGAGATTGTG CTGCTGCACT CCAGCCTGGG AGACAGAGCG 4601 AGACTCTGTC TTAAAAAAAA AAAAAGGAGG TGAATTTTTT TTTAAGTTTT 4651 GTAACACTGT CCTACTTTAT TTATTAGAAT CTAAGGCTGT TACAATCAAG 4701 TCGTTGCAGG GTTTGGATCA GCTGTAAGTT AGGTATGCCT ACCAAACATC 4751 CAAAGGTAGA CGTGGAGACA TTTTAATACT ACAAAACTAG GAAAATCAGA 4801 ACTCATGGCC ATTTCCTGCC CTCCTCCAAC TTGTTAAAAC ATGTTTATTC 4851 TAAAGTTCGA ATGGATAAAT TTGAGTATAA AGGTTTTGTT ATAAAACTGT 4901 TCTTTAGTGT AAGGCTGCAT TGTGGGTTTG GGGGAAATGT AAATAATTTT 4951 CTGTGTAAAA CAAATTCATA GGATCTGATT TGCTCAGAGT ATTATTCAAG 5001 AATGTATTAA TAAGGCATTG CCCCCTGTTT GCACTCAGGG TTAATATGTC 5051 AAATGAAATT TAAGAAGGAA ATGGAAGAAT TCAGGTACAT TAATTGCATA 5101 TTATTTTGGG AAAGATGAGT CCTATACGTG GCAATTTTTC AATGTCATCT 5151 GAAGCCAGCA TTATCTTCCA AAGAAATCGA TCTTTTTTTT CTAAAAAAAA 5201 AAAATGCTTT TGCCTTCCCT TCCCTTCCCA TCCGCCATAT TTCTTCAGCC 5251 TTTCTTCTCG ATCACCCGTG TATTCTTTGA CCAGTAAATG ACCACACCTC 5301 AATGATGGTA AAACAGCATC ATCAGTAAGC TATCTTATAT GCCTCATCCT 5351 GTGAGTTTGA GCTTCAGGAA ACATGAGTAA AAGTATATGT AATGTATATA 5401 GTCGTATATG TATTCTAGCA AGAAAAACAT ATTTATTTTG ACAAAGGGGA 5451 ACACTGACTT TCTGAAGGAT TCAGAAAGAA CCTTAGTGAA AGGTTCTCAG 5501 TCTCTGAGAG TGGACCCTAA TTAACATAAA GACCATTCAT CAGCGAATAA 5551 CTACTGAGCA ACTCTAGTGT GCCAGCACAG GCCAGACATA CTAGTGAGCC 5601 AGGCACATCT GGCCTTGGGA AACTCATCCT ACAGGGGAAG GCCAGTTTTT 5651 TTCCCTTCAA TTCCTCAAGT CTGGGTGGTG ACAAGGTAGG GGCTAGGTAC 5701 TGGACTACCA CAGGTTTTTA GGAACTAAGG TGTTTCTCAT AAACACAAAA 5751 TGTTGGGTGA AACTGGGAAC AACTACTCAG AAGCTCATTT ATTTGCTTAA 5801 ATGGAAAGTG TGGGAGCCAC TACCCTCTCT TTTGATCTGC CAAGGATTTC 5851 CTCTCAGAGC TGTTGCACAG ACAGAGATTG TACTTGGTAA GATACCAAAC 5901 AAGACAGATA TGGATCTAAA TTTCTAATGT GTTCTATGGG TTTCAATTCT 5951 GAAAAAAGAA AATGAATAAA GATTTTAATA AAT // LOCUS AF070674 5212 bp mRNA PRI 21-JUN-1999 DEFINITION Homo sapiens inhibitor of apoptosis protein-1 (MIHC) mRNA, complete cds. ACCESSION AF070674 NID g3978243 VERSION AF070674.1 GI:3978243 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5212) AUTHORS Horrevoets,A.J., Fontijn,R.D., van Zonneveld,A.J., de Vries,C.J., ten Cate,J.W. and Pannekoek,H. TITLE Vascular endothelial genes that are responsive to tumor necrosis factor-alpha in vitro are expressed in atherosclerotic lesions, including inhibitor of apoptosis protein-1, stannin, and two novel genes JOURNAL Blood 93 (10), 3418-3431 (1999) MEDLINE 99252096 REFERENCE 2 (bases 1 to 5212) AUTHORS Horrevoets,A.J.G., Fontijn,R.D., van Zonneveld,A.J. and Pannekoek,H. TITLE Direct Submission JOURNAL Submitted (05-JUN-1998) Biochemistry, Academic Medical Center, Meibergdreef 15, Amsterdam 1105 AZ, The Netherlands FEATURES Location/Qualifiers source 1. .5212 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="D11S1339-D11S1325" gene 1. .5212 /gene="MIHC" CDS 2752. .4566 /gene="MIHC" /note="hIAP-1; c-IAP2" /codon_start=1 /product="inhibitor of apoptosis protein-1" /protein_id="AAC83232.1" /db_xref="PID:g3978244" /db_xref="GI:3978244" /translation="MNIVENSIFLSNLMKSAYTFELKYDLSCELYRMSTYSTFPAGVP VSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKRGDSPTEKHKKLYPSCRFVQSLNSV NNLEATSQPTFPSSVTNSTHSLLPGTENSGYFRGSYSNSPSNPVNSRANQDFSALMRS SYHCAMNNENARLLTFQTWPLTFLSPTDLAKAGFYYIGPGDRVACFACGGKLSNWEPK DNAMSEHLRHFPKCPFIENQLQDTSRYTVSNLSMQTHAARFKTFFNWPSSVLVNPEQL ASAGFYYVGNSDDVKCFCCDGGLRCWESGDDPWVQHAKWFPRCEYLIRIKGQEFIRQV QASYPHLLEQLLSTSDSPGDENAESSIIHFEPGEDHSEDAIMMNTPVINAAVEMGFSR SLVKQTVQRKILATGENYRLVNDLVLDLLNAEDEIREEERERATEEKESNDLLLIRKN RMALFQHLTCVIPILDSLLTAGIINEQEHDVIKQKTQTSLQARELIDTILVKGNIAAT VFRNSLQEAEAVLYEHLFVQQDIKYIPTEDVSDLPVEEQLRRLQEERTCKVCMDKEVS IVFIPCGHLVVCKDCAPSLRKCPICRSTIKGTVRTFLS" BASE COUNT 1782 a 839 c 968 g 1623 t ORIGIN 1 GGCCAGGCGA CAGGTGTCGC TTGAAAAGAC TGGGCTTGTC CTTGCTGGTG 51 CATGCGTCGT CGGCCTCTGG GCAGCAGGTT TACAAAGGAG GAAAACGACT 101 TCTTCTAGAT TTTTTTTTCA GTTTCTTCTA TAAATCAAAA CATCTCAAAA 151 TGGAGACCTA AAATCCTTAA AGGGACTTAG TCTAATCTCG GGAGGTAGTT 201 TTGTGCATGG GTAAACAAAT TAAGTATTAA CTGGTGTTTT ACTATCCAAA 251 GAATGCTAAT TTTATAAACA TGATCGAGTT ATATAAGGTA TACCATAATG 301 AGTTTGATTT TGAATTTGAT TTGTGGAAAT AAAGGAAAAG TGATTCTAGC 351 TGGGGCATAT TGTTAAAGCA TTTTTTTCAG AGTTGGCCAG GCAGTCTCCT 401 ACTGGCACAT TCTCCCATTA TGTAGAATAG AAATAGTACC TGTGTTTGGG 451 AAAGATTTTA AAATGAGTGA CAGTTATTTG GAACAAAGAG CTAATAATCA 501 ATCCACTGCA AATTAAAGAA ACATGCAGAT GAAAGTTTTG ACACATTAAA 551 ATACTTCTAC AGTGACAAAG AAAAATCAAG AACAAAGCTT TTTGATATGT 601 GCAACAAATT TAGAGGAAGT AAAAAGATAA ATGTGATGAT TGGTCAAGAA 651 ATTATCCAGT TATTTACAAG GCCACTGATA TTTTAAACGT CCAAAAGTTT 701 GTTTAAATGG GCTGTTACCG CTGAGAATGA TGAGGATGAG AATGATGGTT 751 GAAGGTTACA TTTTAGGAAA TGAAGAAACT TAGAAAATTA ATATAAAGAC 801 AGTGATGAAT ACAAAGAAGA TTTTTATAAC AATGTGTAAA ATTTTTGGCC 851 AGGGAAAGGA ATATTGAAGT TAGATACAAT TACTTACCTT TGAGGGAAAT 901 AATTGTTGGT AATGAGATGT GATGTTTCTC CTGCCACCTG GAAACAAAGC 951 ATTGAAGTCT GCAGTTGAAA AGCCCAACGT CTGTGAGATC CAGGAAACCA 1001 TGCTTGCAAA CCACTGGTAA AAAAAAAAAA AAAAAAAAAA AAAAAGCCAC 1051 AGTGACTTGC TTATTGGTCA TTGCTAGTAT TATCGACTCA GAACCTCTTT 1101 ACTAATGGCT AGTAAATCAT AATTGAGAAA TTCTGAATTT TGACAAGGTC 1151 TCTGCTGTTG AAATGGTAAA TTTATTATTT TTTTTGTCAT GATAAATTCT 1201 GGTTCAAGGT ATGCTATCCA TGAAATAATT TCTGACCAAA ACTAAATTGA 1251 TGCAATTTGA TTATCCATCT TAGCCTACAG ATGGCATCTG GTAACTTTTG 1301 ACTGTTTTAA AAATAAATCC ACTATCAGAG TAGATTTGAT GTTGGCTTCA 1351 GAAACATTTT GAAAAACAAA AGTTCAAAAA TGTTTTCAGG AGGTGATAAG 1401 TTGAATAACT CTACAATGTT AGTTCTTTGA GGGGGACAAA AAATTTAAAA 1451 TCTTTGAAAG GTCTTATTTT ACAGCCCATA TCTAAATTAT CTTAAGAAAA 1501 TTTTTAACAA AGGGAATGAA ATATATATCA TGATTCTCTT TTTCCAAAAG 1551 TAACCTGAAT ATAGCTATGA AGTTCAGTTT TGTTATTGGT AGTTTGGGCA 1601 GAGTCTCTTT TTGCAGCACC TGTTGTCTAC CATAATTACA GAGGACATTT 1651 CCATGTTCTA GCCAAGTATA CTATTAGAAT AAAAAAACTT AACATTGAGT 1701 TGCTTCAACA GCATGAAACT GAGTCCAAAA GACCAAATGA ACAAACACAT 1751 TAATCTCTGA TTATTTATTT TAAATAGAAT ATTTAATTGT GTAAGATCTA 1801 ATAGTATCAT TATACTTAAG CAATCATATT CCTGATGATC TATGGGAAAT 1851 AACTATTATT TAATTAATAT TGAAACCAGG TTTTAAGATG TGTTAGCCAG 1901 TCCTGTTACT AGTAAATCTC TTTATTTGGA GAGAAATTTT AGATTGTTTT 1951 GTTCTCCTTA TTAGAAGGAT TGTAGAAAGA AAAAAATGAC TAATTGGAGA 2001 AAAATTGGGG ATATATCATA TTTCACTGAA TTCAAAATGT CTTCAGTTGT 2051 AAATCTTACC ATTATTTTAC GTACCTCTAA GAAATAAAAG TGCTTCTAAT 2101 TAAAATATGA TGTCATTAAT TATGAAATAC TTCTTGATAA CAGAAGTTTT 2151 AAAATAGCCA TCTTAGAATC AGTGAAATAT GGTAATGTAT TATTTTCCTC 2201 CTTTGAGTTA GGTCTTGTGC TTTTTTTTCC TGGCCACTAA ATTTCACAAT 2251 TTCCAAAAAG CAAAATAAAC ATATTCTGAA TATTTTTGCT GTGAAACACT 2301 TGACAGCAGA GCTTTCCACC ATGAAAAGAA GCTTCATGAG TCACACATTA 2351 CATCTTTGGG TTGATTGAAT GCCACTGAAA CATTCTAGTA GCCTGGAGAA 2401 GTTGACCTAC CTGTGGAGAT GCCTGCCATT AAATGGCATC CTGATGGCTT 2451 AATACACATC ACTCTTCTGT GAAGGGTTTT AATTTTCAAC ACAGCTTACT 2501 CTGTAGCATC ATGTTTACAT TGTATGTATA AAGATTATAC AAAGGTGCAA 2551 TTGTGTATTT CTTCCTTAAA ATGTATCAGT ATAGGATTTA GAATCTCCAT 2601 GTTGAAACTC TAAATGCATA GAAATAAAAA TAATAAAAAA TTTTTCATTT 2651 TGGCTTTTCA GCCTAGTATT AAAACTGATA AAAGCAAAGC CATGCACAAA 2701 ACTACCTCCC TAGAGAAAGG CTAGTCCCTT TTCTTCCCCA TTCATTTCAT 2751 TATGAACATA GTAGAAAACA GCATATTCTT ATCAAATTTG ATGAAAAGCG 2801 CCTACACGTT TGAACTGAAA TACGACTTGT CATGTGAACT GTACCGAATG 2851 TCTACGTATT CCACTTTTCC TGCTGGGGTT CCTGTCTCAG AAAGGAGTCT 2901 TGCTCGTGCT GGTTTCTATT ACACTGGTGT GAATGACAAG GTCAAATGCT 2951 TCTGTTGTGG CCTGATGCTG GATAACTGGA AAAGAGGAGA CAGTCCTACT 3001 GAAAAGCATA AAAAGTTGTA TCCTAGCTGC AGATTCGTTC AGAGTCTAAA 3051 TTCCGTTAAC AACTTGGAAG CTACCTCTCA GCCTACTTTT CCTTCTTCAG 3101 TAACAAATTC CACACACTCA TTACTTCCGG GTACAGAAAA CAGTGGATAT 3151 TTCCGTGGCT CTTATTCAAA CTCTCCATCA AATCCTGTAA ACTCCAGAGC 3201 AAATCAAGAT TTTTCTGCCT TGATGAGAAG TTCCTACCAC TGTGCAATGA 3251 ATAACGAAAA TGCCAGATTA CTTACTTTTC AGACATGGCC ATTGACTTTT 3301 CTGTCGCCAA CAGATCTGGC AAAAGCAGGC TTTTACTACA TAGGACCTGG 3351 AGACAGAGTG GCTTGCTTTG CCTGTGGTGG AAAATTGAGC AATTGGGAAC 3401 CGAAGGATAA TGCTATGTCA GAACACCTGA GACATTTTCC CAAATGCCCA 3451 TTTATAGAAA ATCAGCTTCA AGACACTTCA AGATACACAG TTTCTAATCT 3501 GAGCATGCAG ACACATGCAG CCCGCTTTAA AACATTCTTT AACTGGCCCT 3551 CTAGTGTTCT AGTTAATCCT GAGCAGCTTG CAAGTGCGGG TTTTTATTAT 3601 GTGGGTAACA GTGATGATGT CAAATGCTTT TGCTGTGATG GTGGACTCAG 3651 GTGTTGGGAA TCTGGAGATG ATCCATGGGT TCAACATGCC AAGTGGTTTC 3701 CAAGGTGTGA GTACTTGATA AGAATTAAAG GACAGGAGTT CATCCGTCAA 3751 GTTCAAGCCA GTTACCCTCA TCTACTTGAA CAGCTGCTAT CCACATCAGA 3801 CAGCCCAGGA GATGAAAATG CAGAGTCATC AATTATCCAT TTTGAACCTG 3851 GAGAAGACCA TTCAGAAGAT GCAATCATGA TGAATACCCC TGTGATTAAT 3901 GCTGCCGTGG AAATGGGCTT TAGTAGAAGC CTGGTAAAAC AGACAGTTCA 3951 GAGAAAAATC CTAGCAACTG GAGAGAATTA TAGACTAGTC AATGATCTTG 4001 TGTTAGACTT ACTCAATGCA GAAGATGAAA TAAGGGAAGA GGAGAGAGAA 4051 AGAGCAACTG AGGAAAAAGA ATCAAATGAT TTATTATTAA TCCGGAAGAA 4101 TAGAATGGCA CTTTTTCAAC ATTTGACTTG TGTAATTCCA ATCCTGGATA 4151 GTCTACTAAC TGCCGGAATT ATTAATGAAC AAGAACATGA TGTTATTAAA 4201 CAGAAGACAC AGACGTCTTT ACAAGCAAGA GAACTGATTG ATACGATTTT 4251 AGTAAAAGGA AATATTGCAG CCACTGTATT CAGAAACTCT CTGCAAGAAG 4301 CTGAAGCTGT GTTATATGAG CATTTATTTG TGCAACAGGA CATAAAATAT 4351 ATTCCCACAG AAGATGTTTC AGATCTACCA GTGGAAGAAC AATTGCGGAG 4401 ACTACAAGAA GAAAGAACAT GTAAAGTGTG TATGGACAAA GAAGTGTCCA 4451 TAGTGTTTAT TCCTTGTGGT CATCTAGTAG TATGCAAAGA TTGTGCTCCT 4501 TCTTTAAGAA AGTGTCCTAT TTGTAGGAGT ACAATCAAGG GTACAGTTCG 4551 TACATTTCTT TCATGAAGAA GAACCAAAAC ATCATCTAAA CTTTAGAATT 4601 AATTTATTAA ATGTATTATA ACTTTAACTT TTATCCTAAT TTGGTTTCCT 4651 TAAAATTTTT ATTTATTTAC AACTCAAAAA ACATTGTTTT GTGTAACATA 4701 TTTATATATG TATCTAAACC ATATGAACAT ATATTTTTTA GAAACTAAGA 4751 GAATGATAGG CTTTTGTTCT TATGAACGAA AAAGAGGTAG CACTACAAAC 4801 ACAATATTCA ATCAAAATTT CAGCATTATT GAAATTGTAA GTGAAGTAAA 4851 ACTTAAGATA TTTGAGTTAA CCTTTAAGAA TTTTAAATAT TTTGGCATTG 4901 TACTAATACC GGGAACATGA AGCCAGGTGT GGTGGTATGT GCCTGTAGTC 4951 CCAGGCTGAG GCAAGAGAAT TACTTGAGCC CAGGAGTTTG AATCCATCCT 5001 GGGCAGCATA CTGAGACCCT GCCTTTAAAA ACAAACAGAA CAAAAACAAA 5051 ACACCAGGGA CACATTTCTC TGTCTTTTTT GATCAGTGTC CTATACATCG 5101 AAGGTGTGCA TATATGTTGA ATGACATTTT AGGGACATGG TGTTTTTATA 5151 AAGAATTCTG TGAGAAAAAA TTTAATAAAG CAACAAAAAT TACTCTTAAA 5201 AAAAAAAAAA AA // LOCUS HSU70426 2383 bp mRNA PRI 22-MAY-1998 DEFINITION Homo sapiens A28-RGS14p mRNA, complete cds. ACCESSION U70426 NID g1813543 VERSION U70426.1 GI:1813543 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2383) AUTHORS Buckbinder,L., Velasco-Miguel,S., Chen,Y., Xu,N., Talbott,R., Gelbert,L., Gao,J., Seizinger,B.R., Gutkind,J.S. and Kley,N. TITLE The p53 tumor suppressor targets a novel regulator of G protein signaling JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (15), 7868-7872 (1997) MEDLINE 97368284 REFERENCE 2 (bases 1 to 2383) AUTHORS Buckbinder,L., Kley,N., Talbott,R. and Gao,J. TITLE Direct Submission JOURNAL Submitted (11-SEP-1996) Oncology, Molecular Genetics, Bristol-Myers Squibb, PRI, PO Box 4000, Princeton, NJ 08543, USA FEATURES Location/Qualifiers source 1. .2383 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 93. .701 /codon_start=1 /product="A28-RGS14p" /protein_id="AAC16912.1" /db_xref="PID:g1813544" /db_xref="GI:1813544" /translation="MCRTLAAFPTTCLERAKEFKTRLGIFLHKSELGCDTGSTGKSEW GSKHSKENRNFSEDVLGWRESFDLLLSSKNGVAAFHAFLKTEFSEENLEFWLACEEFK KIRSATKLASRAHQIFEEFICSEAPKEVNIDHETRELTRMNLQTATATCFDAAQGKTR TLMEKDSYPRFLKSPAYRDLAAQASAASATLSSCSLDEPSHT" BASE COUNT 561 a 601 c 657 g 562 t 2 others ORIGIN 1 GGGGCTACCG CGCCTTTGCT TCCTGGCGCA CGCGGAGCCT CCTGGAGCCT 51 GCCACCATCC TGCCTACTAC GTGCTGCCCT GCGCCCGCAG CCATGTGCCG 101 CACCCTGGCC GCCTTCCCCA CCACCTGCCT GGAGAGAGCC AAAGAGTTCA 151 AGACACGTCT GGGGATCTTT CTTCACAAAT CAGAGCTGGG CTGCGATACT 201 GGGAGTACTG GCAAGTCCGA GTGGGGCAGT AAACACAGCA AAGAGAATAG 251 AAACTTCTCA GAAGATGTGC TGGGGTGGAG AGAGTCGTTC GACCTGCTGC 301 TGAGCAGTAA AAATGGAGTG GCTGCCTTCC ACGCTTTCCT GAAGACAGAG 351 TTCAGTGAGG AGAACCTGGA GTTCTGGCTG GCCTGTGAGG AGTTCAAGAA 401 GATCCGATCA GCTACCAAGC TGGCCTCCAG GGCACACCAG ATCTTTGAGG 451 AGTTCATTTG CAGTGAGGCC CCTAAAGAGG TCAACATTGA CCATGAGACC 501 CGCGAGCTGA CGAGGATGAA CCTGCAGACT GCCACAGCCA CATGCTTTGA 551 TGCGGCTCAG GGGAAGACAC GTACCCTGAT GGAGAAGGAC TCCTACCCAC 601 GCTTCCTGAA GTCGCCTGCT TACCGGGACC TGGCTGCCCA AGCCTCAGCC 651 GCCTCTGCCA CTCTGTCCAG CTGCAGCCTG GACGAGCCCT CACACACCTG 701 AGTCTCCACG GCAGTGAGGA AGCCAGCCGG GAAGAGAGGT TGAGTCACCC 751 ATCCCCGAGG TGGCTGCCCC TGTGTGGGAG GCAGGTTCTG CAAAGCAAGT 801 GCAAGAGGAC AAAAAAAAAA AAAAAAAAAA AAAAAATGCG CTCCAGCAGC 851 CTGTTTGGGA AGCAGCAGTC TCTCCTTCAG ATACTGTGGG ACTCATGCTG 901 GAGAGGAGCC GCCCACTTCC AGGACCTGTG AATAAGGGCT AATGATGAGG 951 GTTGGTGGGG CTCTCTGTGG GGCAAAAAGG TGGTATGGGG GTTAGCACTG 1001 GCTCTCGTTC TCACCGGAGA AGGAAGTGTT CTAGTGTGGT TTAGGAAACA 1051 TGTGGATAAA GGGAACCATG AAAATGAGAG GAGGAAAGAC ATCCAGATCA 1101 GCTGTTTTGC CTGTTGCTCA GTTGACTCTG ATTGCATCCT GTTTTCCTAA 1151 TTCCCAGACT GTTCTGGGCA CGGAAGGGAC CCTGGATGTG GAGTCTTCCC 1201 CTTTGGCCCT CCTCACTGGC CTCTGGGCTA GCCCAGAGTC CCTTAGCTTG 1251 TACCTCGTAA CACTCCTGTG TGTCTGTCCA GCCTTGCAGT CATGTCAAGG 1301 CCAGCAAGCT GATGTGACTC TTGCCCCATG CGAGATATTT ATACCTCAAA 1351 CACTGGCCTG TGAGCCCTTT CCAAGTCAGT GGAGAGCCCT GAAAGGAGGC 1401 TCACTTGAAT CCAGCTCAGT GCTCTGGGTG GCCCCCTGCA GGTGGCCCCT 1451 GACCCTGCGT TGCAGCAGGG TCCACCTGTG AGCAGGCCCG CCCTGGGGCC 1501 TCTTCCTGGA TGTGCCCTCT CTGAGTTCTG TGCTGTCTCT TGGAGGCAGG 1551 GCCCAGGAGA ACAAAGTGTG GAGGCCTCGG GGAGTGGCTT TTCCAGCTCT 1601 CATGCCCCGC AGTGTGGAAC AAGGCAGAAA AGGATCCTAG GAAATAAGTC 1651 TCTTGGCGGT CCCTGAGAGT CCTGCTGAAA TCCAGCCAGT GTTTTTTGTG 1701 GTATGAGAAC AGGCAAAAAG AGATGCCCCG AGATAGAAGG GGAGCCTTGT 1751 GTTTCTTTCC TGCAGACGTG AGATGAACAC TGGAGTGGGC AGAGGTGGCM 1801 CAGGACCATG GCACCCTTAG AGTGCAGAAG CTGGGGGGAG AGGCTGCTTC 1851 GAAGGGCAGG ACTGGGGATA CCTGCCTGTC ACCTCAGGGC ATCACTGAAC 1901 AAACATTTCC TGATGGSAAC TCCTGCGGCA GAGCCCAGGC TGGGGAAGTG 1951 AACTACCCAG GGCAGCCCCT TTGTGGCCCA GGATAATCAA CACTGTTCTC 2001 TCTGTACCAT GAGCTCCTCC AGGAGATTAT TTAAGTGTAT TGTATCATTG 2051 GTTTTCTGTG ATTGTCATAA CATTGTTTTT GTTATTGTTG GTGCTGTTGT 2101 TATTTATTAT TGTAATTTCA GTTTGCCTCT ACTGGAGAAT CTCAGCAGGG 2151 GTTTCAGCCT GACTGTCTCC CTTTCTCTAC CAGACTCTAC CTCTGAATGT 2201 GCTGGGAACC TCTTGGAGCC TGTCAGGAAC TCCTCACTGT TTAAATATTT 2251 ATTTATTGTG ACAAATGGAG CTGGTTTCCT AGATATGAAT GATGTTTGCA 2301 ATCCCCATTT TCCTGTTTCA GCATGTTATA TTCTTATAAA ATAAAAGCAA 2351 AAGTCAAATA TGAAAAAAAA AAAAAAAAAA AAA // LOCUS HUMSCPB 2563 bp mRNA PRI 18-JAN-1996 DEFINITION Homo sapiens TNFR2-TRAF signalling complex protein mRNA, complete cds. ACCESSION L49432 NID g1160974 VERSION L49432.1 GI:1160974 KEYWORDS signalling protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2563) AUTHORS Rothe,M., Pan,M.G., Henzel,W.J., Ayres,T.M. and Goeddel,D.V. TITLE The TNFR2-TRAF signaling complex contains two novel proteins related to baculoviral inhibitor of apoptosis proteins JOURNAL Cell 83 (7), 1243-1252 (1995) MEDLINE 96128127 FEATURES Location/Qualifiers source 1. .2563 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 123. .1937 /codon_start=1 /product="TNFR2-TRAF signalling complex protein" /protein_id="AAC41943.1" /db_xref="PID:g1160975" /db_xref="GI:1160975" /translation="MNIVENSIFLSNLMKSANTFELKYDLSCELYRMSTYSTFPAGVP VSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKRGDSPTEKHKKLYPSCRFVQSLNSV NNLEATSQPTFPSSVTNSTHSLLPGTENSGYFRGSYSNSPSNPVNSRANQDFSALMRS SYHCAMNNENARLLTFQTWPLTFLSPTDLAKAGFYYIGPGDRVACFACGGKLSNWEPK DNAMSEHLRHFPKCPFIENQLQDTSRYTVSNLSMQTHAARFKTFFNWPSSVLVNPEQL ASAGFYYVGNSDDVKCFCCDGGLRCWESGDDPWVQHAKWFPRCEYLIRIKGQEFIRQV QASYPHLLEQLLSTSDSPGDENAESSIIHFEPGEDHSEDAIMMNTPVINAAVEMGFSR SLVKQTVQRKILATGENYRLVNDLVLDLLNAEDEIREEERERATEEKESNDLLLIRKN RMALFQHLTCVIPILDSLLTAGIINEQEHDVIKQKTQTSLQARELIDTILVKGNIAAT VFRNSLQEAEAVLYEHLFVQQDIKYIPTEDVSDLPVEEQLRRLQEERTCKVCMDKEVS IVFIPCGHLVVCKDCAPSLRKCPICRSTIKGTVRTFLS" BASE COUNT 839 a 467 c 501 g 756 t ORIGIN 1 GGGCAGCAGG TTTACAAAGG AGGAAAACGA CTTCTTCTAG ATTTTTTTTT 51 CAGTTTCTTC TATAAATCAA AACTACCTCC CTAGAGAAAG GCTAGTCCCT 101 TTTCTTCCCC ATTCATTTCA TTATGAACAT AGTAGAAAAC AGCATATTCT 151 TATCAAATTT GATGAAAAGC GCCAACACGT TTGAACTGAA ATACGACTTG 201 TCATGTGAAC TGTACCGAAT GTCTACGTAT TCCACTTTTC CTGCTGGGGT 251 CCCTGTCTCA GAAAGGAGTC TTGCTCGCGC TGGTTTCTAT TACACTGGTG 301 TGAATGACAA GGTCAAATGC TTCTGTTGTG GCCTGATGCT GGATAACTGG 351 AAAAGAGGAG ACAGTCCTAC TGAAAAGCAT AAAAAGTTGT ATCCTAGCTG 401 CAGATTCGTT CAGAGTCTAA ATTCCGTTAA CAACTTGGAA GCTACCTCTC 451 AGCCTACTTT TCCTTCTTCA GTAACAAATT CCACACACTC ATTACTTCCG 501 GGTACAGAAA ACAGTGGATA TTTCCGTGGC TCTTATTCAA ACTCTCCATC 551 AAATCCTGTA AACTCCAGAG CAAATCAAGA TTTTTCTGCC TTGATGAGAA 601 GTTCCTACCA CTGTGCAATG AATAACGAAA ATGCCAGATT ACTTACTTTT 651 CAGACATGGC CATTGACTTT TCTGTCGCCA ACAGATCTGG CAAAAGCAGG 701 CTTTTACTAC ATAGGACCTG GAGACAGAGT GGCTTGCTTT GCCTGTGGTG 751 GAAAATTGAG CAATTGGGAA CCGAAGGATA ATGCTATGTC AGAACACCTG 801 AGACATTTTC CCAAATGCCC ATTTATAGAA AATCAGCTTC AAGACACTTC 851 AAGATACACA GTTTCTAATC TGAGCATGCA GACACATGCA GCCCGCTTTA 901 AAACATTCTT TAACTGGCCC TCTAGTGTTC TAGTTAATCC TGAGCAGCTT 951 GCAAGTGCGG GTTTTTATTA TGTGGGTAAC AGTGATGATG TCAAATGCTT 1001 TTGCTGTGAT GGTGGACTCA GGTGTTGGGA ATCTGGAGAT GATCCATGGG 1051 TTCAACATGC CAAGTGGTTT CCAAGGTGTG AGTACTTGAT AAGAATTAAA 1101 GGACAGGAGT TCATCCGTCA AGTTCAAGCC AGTTACCCTC ATCTACTTGA 1151 ACAGCTGCTA TCCACATCAG ACAGCCCAGG AGATGAAAAT GCAGAGTCAT 1201 CAATTATCCA TTTTGAACCT GGAGAAGACC ATTCAGAAGA TGCAATCATG 1251 ATGAATACTC CTGTGATTAA TGCTGCCGTG GAAATGGGCT TTAGTAGAAG 1301 CCTGGTAAAA CAGACAGTTC AGAGAAAAAT CCTAGCAACT GGAGAGAATT 1351 ATAGACTAGT CAATGATCTT GTGTTAGACT TACTCAATGC AGAAGATGAA 1401 ATAAGGGAAG AGGAGAGAGA AAGAGCAACT GAGGAAAAAG AATCAAATGA 1451 TTTATTATTA ATCCGGAAGA ATAGAATGGC ACTTTTTCAA CATTTGACTT 1501 GTGTAATTCC AATCCTGGAT AGTCTACTAA CTGCCGGAAT TATTAATGAA 1551 CAAGAACATG ATGTTATTAA ACAGAAGACA CAGACGTCTT TACAAGCAAG 1601 AGAACTGATT GATACGATTT TAGTAAAAGG AAATATTGCA GCCACTGTAT 1651 TCAGAAACTC TCTGCAAGAA GCTGAAGCTG TGTTATATGA GCATTTATTT 1701 GTGCAACAGG ACATAAAATA TATTCCCACA GAAGATGTTT CAGATCTACC 1751 AGTGGAAGAA CAATTGCGGA GACTACAAGA AGAAAGAACA TGTAAAGTGT 1801 GTATGGACAA AGAAGTGTCC ATAGTGTTTA TTCCTTGTGG TCATCTAGTA 1851 GTATGCAAAG ATTGTGCTCC TTCTTTAAGA AAGTGTCCTA TTTGTAGGAG 1901 TACAATCAAG GGTACAGTTC GTACATTTCT TTCATGAAGA AGAACCAAAA 1951 CATCATCTAA ACTTTAGAAT TAATTTATTA AATGTATTAT AACTTTAACT 2001 TTCATCCTAA TTTGGTTTCC TTAAAATTTT TATTTATTTA CAACTCAACA 2051 AACATTGTTT TGTGTAACAT ATTTAATATA TGTATCTAAA CCATATGAAC 2101 ATATATTTTT TAGAAACTAA GAGAATGATA GGCTTTTGTT CTTATGAACG 2151 AAAAAGAGGT AGCACTACAA ACACAATATT CAATCAAAAT TTCAGCATTA 2201 TTGAAATTGT AAGTGAAGTA AAACTTAAGA TATTTGAGTT AACCTTTAAG 2251 AATTTTAAAT ATTTTGGCAT TGTACTAATA CCGGGAACAT GAAGCCAGGT 2301 GTGGTGGTAT GTGCCTGTAG TCCCAGGCTG AGGCAAGAGA ATTACTTGAG 2351 CCCAGGAGTT TGAATCCATC CTGGGCAGCA TACTGAGACC CTGCCTTTAA 2401 AAACAAACAG AACAAAAACA AAACACCAGG GACACATTTC TCTGTCTTTT 2451 TTGATCAGTG TCCTATACAT CGAAGGTGTG CATATATGTT GAATGACATT 2501 TTAGGGACAT GGTGTTTTTA TAAAGAATTC TGTGAGAAAA AATTTAATAA 2551 AACCCCCCAA ATT // LOCUS HUMAPR 1885 bp mRNA PRI 28-APR-1993 DEFINITION Human ATL-derived PMA-responsive (APR) peptide mRNA. ACCESSION D90070 M57246 NID g219475 VERSION D90070.1 GI:219475 KEYWORDS 12-myristate 13-acetate; ATL; PMA; PMA-inducible mRNA. SOURCE Human adult T-cell leukemia cell line IKD, cDNA to mRNA, clone ICP82-23. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1885) AUTHORS Hijikata,M., Kato,N., Sato,T., Kagami,Y. and Shimotohno,K. TITLE Molecular cloning and characterization of a cDNA for a novel phorbol-12-myristate-13-acetate-responsive gene that is highly expressed in an adult T-cell leukemia cell line JOURNAL J. Virol. 64 (10), 4632-4639 (1990) MEDLINE 90376412 COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511 Fax: 03-545-3567. FEATURES Location/Qualifiers source 1. .1885 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 174. .338 /note="APR peptide" /codon_start=1 /protein_id="BAA14111.1" /db_xref="PID:d1014813" /db_xref="PID:g219476" /db_xref="GI:219476" /translation="MPGKKARKNAQPSPARAPAELEVECATQLRRFGDKLNFRQKLLN LISKLFCSGT" misc_feature 1551. .1558 /note="mRNA destabilizer like sequence" misc_feature 1566. .1570 /note="mRNA destabilizer like sequence" misc_feature 1743. .1747 /note="mRNA destabilizer like sequence" misc_feature 1801. .1805 /note="mRNA destabilizer like sequence" misc_feature 1819. .1823 /note="mRNA destabilizer like sequence" polyA_signal 1859. .1864 /note="put. polyadenylation signal" BASE COUNT 560 a 303 c 388 g 634 t ORIGIN 1 CGGGCACTCA CCGTGTGTAG TTGGCATCTC CGCGCGTCCG GACACCCGAT 51 CCCAGCATCC CTGCCTGCAG GACTGTTCGT GTTCAGCTCG CGTCCTGCAG 101 CTGTCCGAGG TGCTCCAGTT GGAGGCTGAG GTTCCCGGGC TCTGTCGCTG 151 AGTGGGCGGC GGCACCGGCG GAGATGCCTG GGAAGAAGGC GCGCAAGAAC 201 GCTCAACCGA GCCCCGCGCG GGCTCCAGCA GAGCTGGAAG TCGAGTGTGC 251 TACTCAACTC AGGAGATTTG GAGACAAACT GAACTTCCGG CAGAAACTTC 301 TGAATCTGAT ATCCAAACTC TTCTGCTCAG GAACCTGACT GCATCAAAAA 351 CTTGCATGAG GGGACTCCTT CAAAAGAGTT TTCTCAGGAG GTGCACGTTT 401 CATCAATTTG AAGAAAGACT GCATTGTAAT TGAGAGGAAT GTGAAGGTGC 451 ATTCATGGGT GCCCTTGGAA ACGGAAGATG GAATACATCA AAGTGAATTT 501 CTGTTCAAGT TTTCCCAGAT TATCATTCTT TGGGATGAGA GAACATTATA 551 AAACCACTTT GTTTATTTTA AAGCAAGAAT GGAAGACCCT TGAAAATAAA 601 GAAGTAATTA TTGACACATT TCTTTTTTAC TTAGAGAATC GTTCTAGTGT 651 TTTTGCCGAA GATTACCGCT GGCCTACTGT GAAGGTAGAT GACCTGTGAT 701 TAGACTGGGC GGCTGGGGAG AAACAGTTCA GTGCATTGTT GTTGTTGCTG 751 TTTTTGGTGT TTTGCTTTTC AGTGCCAACT CAGCACATTG TATATGATTC 801 GGTTTATACA TATTACCTTG TTATAATGAA AAAACTCATT CTGAGAACAC 851 TGAAATGTTA TACTCAGTGT TGATTTCTTC GGTCACTACA CAACGTAAAA 901 TCATTTGTTT CTTTTGACTC AAATTGTATT GCTTCTGTTC AGATGATCTT 951 TCATTCAATG TGTTCCTGTT GGGCGTTACT AGAAACTATG GAAAACTGGA 1001 AAATAACTTT GAAAAAATTG GATAAAGTAT AGGAGGGTTA CTTGGGGCCA 1051 GTAAATCAGT AGACTGAACA TTCAATATAA TAAAAGAACA TGGGGATTTT 1101 GTATAACCAG GGATAATAAA AAGAAAAAGA AGTTAATTTT TAATTGATGT 1151 TTTTGAAACT TAGTAGAACA AATATTCAGA AGTAACTTGA TAAGATATGA 1201 ATGTTTCTAA AGAGTTTCTA AAGGTTCGAA ATGCTCCTTG TCACATTAGT 1251 GTGCATCCTA CAAAAAGTGA TCTCTTAATG TAAATTAAGA ATATTTTCAT 1301 AATTGGAATA TACTTTTCTT AAAAAAAAGG AACAGTTAGT TCTCATCTAG 1351 AATGAAAGTT CCATATATGC ATTGGTGAAT ATATATGTAT ACACATACTT 1401 ACATACTTAT ATGGGTATCT GTATAGATAA TTTGTATTAG AGTATTATAT 1451 AGCTTCTTAG TAGGGTCTCA AGTAAGTTCA TTTTTTTTAT CTGGGCTATA 1501 TACAGTCCTC AAATAAATAA TGTCTTGATT TTATTTCAGC AGGAATAATT 1551 TTATTTATTT TGCCTATTTA TAATTAAAGT ATTTTTCTTT AGTTTGAAAT 1601 GTGTATTAAA GTTACATTTT TGAGTTACAA GAGTCTTATA ACTACTTGAA 1651 TTTTTAGTTA AAATGTCTTA ATGTAGGTTG TAGTCACTTT AGATGGAAAA 1701 TTACCTCACA TCTGTTTTCT TCAGTATTAC TTAAGATTGT TTATTTAGTG 1751 GTAGAGAGAT TTTTTTTTTC AGCCTAGAGG CAGCTATTTT ACCATCTGGT 1801 ATTTATGGTC TAATTTGTAT TTAAACATAT GCACACATAT AAAAGTTGAT 1851 ACTGTGGCAG TAAACTATTA AAAGTTTTCA CTGTT // LOCUS HSU02882 5706 bp mRNA PRI 14-MAY-1994 DEFINITION Human rolipram-sensitive 3',5'-cyclic AMP phosphodiesterase mRNA, complete cds. ACCESSION U02882 NID g433346 VERSION U02882.1 GI:433346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5706) AUTHORS Baecker,P.A., Obernolte,R., Bach,C., Yee,C. and Shelton,E.R. TITLE Isolation of a cDNA encoding a human rolipram-sensitive cyclic AMP phosphodiesterase (PDE IV D) JOURNAL Gene 138, 253-256 (1994) MEDLINE 94171048 REFERENCE 2 (bases 1 to 5706) AUTHORS Baecker,P.A. TITLE Direct Submission JOURNAL Submitted (25-OCT-1993) Preston A. Baecker, Department of Molecular Biology, Syntex Discovery Research, 3401 Hillview Avenue, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1. .5706 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="random primed cDNA library in lambda Zap II" /tissue_type="heart" CDS 109. .1923 /EC_number="3.1.4.17" /note="rolipram-sensitive" /codon_start=1 /product="3',5'-cyclic AMP phosphodiesterase" /protein_id="AAC13745.1" /db_xref="PID:g433347" /db_xref="GI:433347" /translation="MSRNSSIASDIHGDDLIVTPFAQVLASLRTVRNNFAALTNLQDR APSKRSPMCNQPSINKATITEEAYQKLASETLEELDWCLDQLETLQTRHSVSEMASNK FKRMLNRELTHLSEMSRSGNQVSEFISNTFLDKQHEVEIPSPTQKEKEKKKRPMSQIS GVKKLMHSSSLTNSSIPRFGVKTEQEDVLAKELEDVNKWGLHVFRIAELSGNRPLTVI MHTIFQERDLLKTFKIPVDTLITYLMTLEDHYHADVAYHNNIHAADVVQSTHVLLSTP ALEAVFTDLEILAAIFASAIHDVDHPGVSNQFLINTNSELALMYNDSSVLENHHLAVG FKLLQEENCDIFQNLTKKQRQSLRKMVIDIVLATDMSKHMNLLADLKTMVETKKVTSS GVLLLDNYSDRIQVLQNMVHCADLSNPTKPLQLYRQWTDRIMEEFFPQGDRERERGME ISPMCDKHNASVEKSQVGFIDYIVHPLWETWADLVHPDAQDILDTLEDNREWYQSTIP QSPSPAPDDPEEGRQGQTEKFQFELTLEEDGESDTEKDSGSQVEEDTSCSDSKTLCTQ DSESTEIPLDEQVEEEAVGEEEESQPEACVIDDRSPDT" 3'UTR 1921. .5698 polyA_signal 3727. .3732 polyA_signal 3871. .3876 polyA_signal 4155. .4160 polyA_signal 4283. .4288 polyA_signal 4480. .4485 polyA_signal 4485. .4490 polyA_signal 4552. .4557 polyA_signal 4755. .4760 BASE COUNT 1764 a 1100 c 1080 g 1762 t ORIGIN 1 GGAATTCCAT TTTCCCTCCC TTTTCTCCAA GCAAATTTTG TCCACAGTCA 51 ACGACGGGAG TCCTTCCTGT ATCGATCCGA CAGCGATTAT GACCTCTCTC 101 CAAAGTCTAT GTCCCGGAAC TCCTCCATTG CCAGTGATAT ACACGGAGAT 151 GACTTGATTG TGACTCCATT TGCTCAGGTC TTGGCCAGTC TGCGAACTGT 201 ACGAAACAAC TTTGCTGCAT TAACTAATTT GCAAGATCGA GCACCTAGCA 251 AAAGATCACC CATGTGCAAC CAACCATCCA TCAACAAAGC CACCATAACA 301 GAGGAGGCCT ACCAGAAACT GGCCAGCGAG ACCCTGGAGG AGCTGGACTG 351 GTGTCTGGAC CAGCTAGAGA CCCTACAGAC CAGGCACTCC GTCAGTGAGA 401 TGGCCTCCAA CAAGTTTAAA AGGATGCTTA ATCGGGAGCT CACCCATCTC 451 TCTGAAATGA GTCGGTCTGG AAATCAAGTG TCAGAGTTTA TATCAAACAC 501 ATTCTTAGAT AAGCAACATG AAGTGGAAAT TCCTTCTCCA ACTCAGAAGG 551 AAAAGGAGAA AAAGAAAAGA CCAATGTCTC AGATCAGTGG AGTCAAGAAA 601 TTGATGCACA GCTCTAGTCT GACTAATTCA AGTATCCCAA GGTTTGGAGT 651 TAAAACTGAA CAAGAAGATG TCCTTGCCAA GGAACTAGAA GATGTGAACA 701 AATGGGGTCT TCATGTTTTC AGAATAGCAG AGTTGTCTGG TAACCGGCCC 751 TTGACTGTTA TCATGCACAC CATTTTTCAG GAACGGGATT TATTAAAAAC 801 ATTTAAAATT CCAGTAGATA CTTTAATTAC ATATCTTATG ACTCTCGAAG 851 ACCATTACCA TGCTGATGTG GCCTATCACA ACAATATCCA TGCTGCAGAT 901 GTTGTCCAGT CTACTCATGT GCTATTATCT ACACCTGCTT TGGAGGCTGT 951 GTTTACAGAT TTGGAGATTC TTGCAGCAAT TTTTGCCAGT GCAATACATG 1001 ATGTAGATCA TCCTGGTGTG TCCAATCAAT TTCTGATCAA TACAAACTCT 1051 GAACTTGCCT TGATGTACAA TGATTCCTCA GTCTTAGAGA ACCATCATTT 1101 GGCTGTGGGC TTTAAATTGC TTCAGGAAGA AAACTGTGAC ATTTTCCAGA 1151 ATTTGACCAA AAAACAAAGA CAATCTTTAA GGAAAATGGT CATTGACATC 1201 GTACTTGCAA CAGATATGTC AAAACACATG AATCTACTGG CTGATTTGAA 1251 GACTATGGTT GAAACTAAGA AAGTGACAAG CTCTGGAGTT CTTCTTCTTG 1301 ATAATTATTC CGATAGGATT CAGGTTCTTC AGAATATGGT GCACTGTGCA 1351 GATCTGAGCA ACCCAACAAA GCCTCTCCAG CTGTACCGCC AGTGGACGGA 1401 CCGGATAATG GAGGAGTTCT TCCCCCAAGG AGACCGAGAG AGGGAACGTG 1451 GCATGGAGAT AAGCCCCATG TGTGACAAGC ACAATGCTTC CGTGGAAAAA 1501 TCACAGGTGG GCTTCATAGA CTATATTGTT CATCCCCTCT GGGAGACATG 1551 GGCAGACCTC GTCCACCCTG ACGCCCAGGA TATTTTGGAC ACTTTGGAGG 1601 ACAATCGTGA ATGGTACCAG AGCACAATCC CTCAGAGCCC CTCTCCTGCA 1651 CCTGATGACC CAGAGGAGGG CCGGCAGGGT CAAACTGAGA AATTCCAGTT 1701 TGAACTAACT TTAGAGGAAG ATGGTGAGTC AGACACGGAA AAGGACAGTG 1751 GCAGTCAAGT GGAAGAAGAC ACTAGCTGCA GTGACTCCAA GACTCTTTGT 1801 ACTCAAGACT CAGAGTCTAC TGAAATTCCC CTTGATGAAC AGGTTGAAGA 1851 GGAGGCAGTA GGGGAAGAAG AGGAAAGCCA GCCTGAAGCC TGTGTCATAG 1901 ATGATCGTTC TCCTGACACG TAACAGTGCA AAAACTTTCA TGCCTTTTTT 1951 TTTTTTAAGT AGAAAAATTG TTTCCAAAGT GCATGTCACA TGCCACAACC 2001 ACGGTCACAC CTCACTGTCA TCTGCCAGGA CGTTTGTTGA ACAAAACTGA 2051 CCTTGACTAC TCAGTCCAGC GCTCAGGAAT ATCGTAACCA GTTTTTTCAC 2101 CTCCATGTCA TCCGAGCAAG GTGGACATCT TCACGAACAG CGTTTTTAAC 2151 AAAATTTCAG CTTGGTAGAG CTGACAAAGC AGATAAAATC TACTCCAAAT 2201 TATTTTCAAG AGAGTGTGAC TCATCAGGCA GCCCAAAAGT TTATTGGACT 2251 TGGGGTTTCT ATTCCTTTTT ATTTGTTTGC AATATTTTCA GAAGAAAGGC 2301 ATTGCACAGA GTGAACTTAA TGGACGAAGC AACAAATATG TCAAGAACAG 2351 GACATAGCAC GAATCTGTTA CCAGTAGGAG GAGGATGAGC CACAGAAATT 2401 GCATAATTTT CTAATTTCAA GTCTTCCTGA TACATGACTG AATAGTGTGG 2451 TTCAGTGAGC TGCACTGACC TCTACATTTT GTATGATATG TAAAACAGAT 2501 TTTTTGTAGA GCTTACTTTT ATTATTAAAT GTATTGAGGT ATTATATTTA 2551 AAAAAAACTA TGTTCAGAAC TTCATCTGCC ACTGGTTATT TTTTTCTAAG 2601 GAGTAACTTG CAAGTTTTCA GTACAAATCT GTGCTACACT GGATAAAAAT 2651 CTAATTTATG AATTTTACTT GCACCTTATA GTTCATAGCA ATTAACTGAT 2701 TTGTAGTGAT TCATTGTTTG TTTTATATAC CAATGACTTC CATATTTTAA 2751 AAGAGAAAAA CAACTTTATG TTGCAGGAAA CCCTTTTTGT AAGTCTTTAT 2801 TATTCACTTT GCATTTTGTT TCACTCTTTC CAGATAAGCA GAGTTGCTCT 2851 TCACCAGTGT TTTTCTTCAT GTGCAAAGTG ACTATTTGTT CTATAATACT 2901 TTTATGTGTG TTATATCAAA TGTGTCTTAA GCTTCATGCA AACTCAGTCA 2951 TCAGTTCGTG TTGTCTGAAG CAAGTGGGAA ATATATAAAT ACCCAGTAGC 3001 TAAAATGGTC AGTCTTTTTT AGATGTTTTC CTACTTAGTA TCTCCTAATA 3051 ACGTTTTGCT GTGTCACTAG ATGTTCATTT CACAAGTGCA TGTCTTTCTA 3101 ATAATCCACA CATTTCATGC TCTAATAATC CACACATTTC ATGCTCATTT 3151 TTATTGTTTT TACAGCCAGT TATAGCAAGA AAAAGGTTTT TCCCCTTGTG 3201 CTGCTTTATA ATTTAGCGTG TGTCTGAACC TTATCCATGT TTGCTAGATG 3251 AGGTCTTGTC AAATATATCA CTACCATTGT CACCGGTGAA AAGAAACAGG 3301 TAGTTAAGTT AGGGTTAACA TTCATTTCAA CCACGAGGTT GTATATCATG 3351 ACTAGCTTTT ACTCTTGGTT TACAGAGAAA AGTTAAACAA CCAACTAGGC 3401 AGTTTTTAAG AATATTAACA ATATATTAAC AAACACCAAT ACAACTAATC 3451 CTATTTGGTT TTAATGATTT CACCATGGGA TTAAGAACTA TATCAGGAAC 3501 ATCCCTGAGA AACGGCTTTA AGTGTAGCAA CTACTCTTCC TTAATGGACA 3551 GCCACATAAC GTGTAGGAAG TCCTTTATCA CTTATCCTCG ATCCATAAGC 3601 ATATCTTGCA GAGGGGAACT ACTTCTTTAA ACACATGGAG GGAAAGAAGA 3651 TGATGCCACT GGCACCAGAG GGTTAGTACT GTGATGCATC CTAAAATATT 3701 TATTATATTG GTAAAAATTC TGGTTAAATA AAAAATTAGA GATCACTCTT 3751 GGCTGATTTC AGCACCAGGA ACTGTATTAC AGTTTTAGAG ATTAATTCCT 3801 AGTGTTTACC TGATTATAGC AGTTGGCATC ATGGGGCATT TAATTCTGAC 3851 TTTATCCCCA CGTCAGCCTT AATAAAGTCT TCTTTACCTT CTCTATGAAG 3901 ACTTTAAAGC CCAAATAATC ATTTTTCACA TTGATATTCA AGAATTGAGA 3951 TAGATAGAAG CCAAAGTGGG TATCTGACAA GTGGAAAATC AAACGTTTAA 4001 GAAGAATTAC AACTCTGAAA AGCATTTATA TGTGGAACTT CTCAAGGAGC 4051 CTCCTGGGGA CTGGAAAGTA AGTCATCAGC CAGGCAAATG ACTCATGCTG 4101 AAGAGAGTCC CCATTTCAGT CCCCTGAGAT CTAGCTGATG CTTAGATCCT 4151 TTGAAATAAA AATTATGTCT TTATAACTCT GATCTTTTAC ATAAAGCAGA 4201 AGAGGAATCA ACTAGTTAAT TGCAAGGTTT CTACTCTGTT TCCTCTGTAA 4251 AGATCAGATG GTAATCTTTC AAATAAGAAA AAAATAAAGA CGTATGTTTG 4301 ACCAAGTAGT TTCACAAGAA TATTTGGGAA CTTGTTTCTT TTAATTTTAT 4351 TTGTCCCTGA GTGAAGTCTA GAAAGAAAGG TAAAGAGTCT AGAGTTTATT 4401 CCTCTTTCCA AAACATTCTC ATTCCTCTCC TCCCTACACT TAGTATTTCC 4451 CCCACAGAGT GCCTAGAATC TTAATAATGA ATAAAATAAA AAGCAGCAAT 4501 ATGTCATTAA CAAATCCAGA CCTGAAAGGG TAAAGGGTTT ATAACTGCAC 4551 TAATAAAGAG AGGCTCTTTT TTTTTCTTCC AGTTTGTTGG TTTTTAATGG 4601 TACCGTGTTG TAAAGATACC CACTAATGGA CAATCAAATT GCAGAAAAGG 4651 CTCAATATCC AAGAGACAGG GACTAATGCA CTGTACAATC TGCTTATCCT 4701 TGCCCTTCTC TCTTGCCAAA GTGTGCTTCA GAAATATATA CTGCTTTAAA 4751 AAAGAATAAA AGAATATCCT TTTACAAGTG GCTTTACATT TCCTAAAATG 4801 CCATAAGAAA ATGCAATATC TGGGTACTGT ATGGGGAAAA AAATGTCCAA 4851 GTTTGTGTAA AACCAGTGCA TTTCAGCTTG CAAGTTACTG AACACAATAA 4901 TGCTGTTTTA ATTTTGTTTT ATATCAGTTA AAATTCACAA TAATGTAGAT 4951 AGAACAAATT ACAGACAAGG AAAGAAAAAA CTTGAATGAA ATGGATTTTA 5001 CAGAAAGCTT TATGATAATT TTTGAATGCA TTATTTATTT TTTGTGCCAT 5051 GCATTTTTTT TCTCACCAAA TGACCTTACC TGTAATACAG TCTTGTTTGT 5101 CTGTTTACAA CCATGTATTT ATTGCAATGT ACATACTGTA ATGTTAATTG 5151 TAAATTATCT GTTCTTATTA AAACATCATC CCATGATGGG GTGGTGTTGA 5201 TATATTTGGA AACTCTTGGT GAGAGAATGA ATGGTGTGTA TACATACTCT 5251 GTACATTTTT CTTTTCTCCT GTAATATAGT CTTGTCACCT TAGAGCTTGT 5301 TTATGGAAGA TTCAAGAAAA CTATAAAATA CTTAAAGATA TATAAATTTA 5351 AAAAAACATA GCTGCAGGTC TTTGGTCCCA GGGCTGTGCC TTAACTTTAA 5401 CCAATATTTT CTTCTGTTTT GCTGCATTTG AAAGGTAACA GTGGAGCTAG 5451 GGCTGGGCAT TTTACATCCA GGCTTTTAAT TGATTAGAAT TCTGCCAATA 5501 GGTGGATTTT ACAAAACCAC AGACAACCTC TGAAAGATTC TGAGACCCTT 5551 TTGAGACAGA AGCTCTTAAG TACTTCTTGC CAGGGAGCAG CACTGCATGT 5601 GTGATGGTTG TTTGCCATCT GTTGATCAGG AACTACTTCA GCTACTTGCA 5651 TTTGATTATT TCCTTTTTTT TTTTTTTTAA CTCGGAAACA CAACTGGGGG 5701 AATTCC // LOCUS HSU03272 10172 bp mRNA PRI 11-JUN-1994 DEFINITION Human fibrillin-2 mRNA, complete cds. ACCESSION U03272 NID g437971 VERSION U03272.1 GI:437971 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10172) AUTHORS Zhang,H., Apfelroth,S.D., Hu,W., Davis,E.C., Sanguineti,C., Bonadio,J., Mecham,R.P. and Ramirez,F. TITLE Structure and expression of fibrillin-2, a novel microfibrillar component preferentially located in elastic matrices JOURNAL J. Cell Biol. 124, 855-863 (1994) MEDLINE 94165150 REFERENCE 2 (bases 1 to 10172) AUTHORS Ramirez,F. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Francesco Ramirez, Brookdale Center for Molecular Biology, Mt.Sinai Medical Center, 1 Gustave L. Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1. .10172 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MG-63, osteosarcoma cell line" CDS 1. .8736 /codon_start=1 /product="fibrillin-2" /protein_id="AAA18950.1" /db_xref="PID:g437972" /db_xref="GI:437972" /translation="MGRRRRLCLQLYFLWLGCVVLWAQGTAGQPQPPPPKPPRPQPPP QQVRSATAGSEGGFLAPEYREEGAAVASRVRRRGQQDVLRGPNVCGSRFHSYCCPGWK TLPGGNQCIVPICRNSCGDGFCSRPNMCTCSSGQISSTCGSKSIQQCSVRCMNGGTCA DDHCQCQKGYIGTYCGQPVCENGCQNGGRCIAQPCACVYGFTGPQCERDYRTGPCFTQ VNNQMCQGQLTGIVCTKTLCCATTGRAWGHPCEMCPAQPQPCRRGFIPNIRTGACQDV DECQAIPGICQGGNCINTVGSFECRCPAGHKQSETTQKCEDIDECSIIPGICETGECS NTVGSYFCVCPRGYVTSTDGSRCIDQRTGMCFSGLVNGRCAQELPGRMTKMQCCCEPG RCWGIGTIPEACPVRGSEEYRRLCMDGLPMGGIPGSAGSRPGGTGGNGFAPSGNGNGY GPGGTGFIPIPGGNGFSPGVGGAGVGAGGQGPIITGLTILNQTIDICKHHANLCLNGR CIPTVSSYRCECNMGYKQDANGDCIDVDECTSNPCTNGDCVNTPGSYYCKCHAGFQRT PTKQACIDIDECIQNGVLCKNGRCVNSDGSFQCICNAGFELTTDGKNCVDHDECTTTN MCLNGMCINEDGSFKCICKPGFVLAPNGRYCTDVDECQTPGICMNGHCINSEGSFRCD CPPGLAVGMDGRVCVDTHMRSTCYGGIKKGVCVRPFPGAVTKSECCCANPDYGFGEPC QPCPAKNSAEFHGLCSSGVGITVDGRDINECALDPDICANGICENLRGSYRCNCNSGY EPDASGRNCIDIDECLVNRLLCDNGLCRNTPGSYSCTCPPGYVFRTETETCEDINECE SNPCVNGACRNNLGSFNCECSPGSKLSSTGLICIDSLKGTCWLNIQDSRCEVNINGAT LKSECCATLGAAWGSPCERCELDTACPRGLARIKGVTCEDVNECEVFPGVCPNGRCVN SKGSFHCECPEGLTLDGTGRVCLDIRMEQCYLKWDEDECIHPVPGKFRMDACCCAVGA AWGTECEECPKPGTKEYETLCPRGAGFANRGDVLTGRPFYKDINECKAFPGMCTYGKC RNTIGSFKCRCNSGFALDMEERNCTDIDECRISPDLCGSGICVNTPGSFECECFEGYE SGFMMMKNCMDIDGCERNPLLCRGGTCVNTEGSFQCDCPLGHELSPSREDCVDINECS LSDNLCRNGKCVNMIGTYQCSCNPGYQATPDRQGCTDIDECMIMNGGCDTQCTNSEGS YECSCSEGYALMPDGRSCADIDECENNPDICDGGQCTNIPGEYRCLCYDGFMASMDMK TCIDVNECDLNSNICMFGECENTKGSFICHCQLGYSVKKGTTGCTDVDECEIGAHNCD MHASCLNIPGSFKCSCREGWIGNGIKCIDLDECSNGTHQCSINAQCVNTPGSYRCACS EGFTGDGFTCSDVDECAENINLCENGQCLNVPGAYRCECEMGFTPASDSRSCQDIDEC SFQNICVSGTCNNLPGMFHCICDDGYELDRTGGNCTDIDECADPINCVNGLCVNTPGR YECNCPPDFQLNPTGVGCVDNRVGNCYLKFGPRGDGSLSCNTEIGVGVSRSSCCCSLG KAWGNPCETCPPVNSTEYYTLCPGGEGFRPNPITIILEDIDECQELPGLCQGGNCINT FGSFQCECPQGYYLSEDTRICEDIDECFAHPGVCGPGTCYNTLGNYTCICPPEYMQVN GGHNCMDMRKSFCYRSYNGTTCENELPFNVTKRMCCCTYNVGKAGNKPCEPCPTPGTA DFKTICGNIPGFTFDIHTGKAVDIDECKEIPGICANGVCINQIGSFRCECPTGFSYND LLLVCEDIDECSNGDNLCQRNADCINSPGSYRCECAAGFKLSPNGACVDRNECLEIPN VCSHGLCVDLQGSYQCICHNGFKASQDQTMCMDVDECERHPCGNGTCKNTVGSYNCLC YPGFELTHNNDCLDIDECSSFFGQVCRNGRCFNEIGSFKCLCNEGYELTPDGKNCIDT NECVALPGSCSPGTCQNLEGSFRCICPPGYEVKSENCIDINECDEDPNICLFGSCTNT PGGFQCLCPPGFVLSDNGRRCFDTRQSFCFTNFENGKCSVPKAFNTTKAKCCCSKMPG EGWGDPCELCPKDDEVAFQDLCPYGHGTVPSLHDTREDVNECLESPGICSNGQCINTD GSFRCECPMGYNLDYTGVRCVDTDECSIGNPCGNGTCTNVIGSFECNCNEGFEPGPMM NCEDINECAQNPLLCALRCMNTFGSYECTCPIGYALREDQKMCKDLDECAEGLHDCES RGMMCKNLIGTFMCICPPGMARRPDGEGCVDENECRTKPGICENGRCVNIIGSYRCEC NEGFQSSSSGTECLDNRQGLCFAEVLQTICQMASSSRNLVTKSECCCDGGRGWGHQCE LCPLPGTAQYKKICPHGPGYTTDGRDIDECKVMPNLCTNGQCINTMGSFRCFCKVGYT TDISGTSCIDLDECSQSPKPCNYICKNTEGSYQCSCPRGYVLQEDGKTCKDLDECQTK QHNCQFLCVNTLGGFTCKCPPGFTQHHTACIDNNECGSQPLLCGGKGICQNTPGSFSC ECQRGFSLDATGLNCEDVDECDGNHRCQHGCQNILGGYRCGCPQGYIQHYQWNQCVDE NECSNPNACGSASCYNTLGSYKCACPSGFSFDQFSSACHDVNECSSSKNPCNYGCSNT EGGYLCGCPPGYYRVGQGHCVSGMGFNKGQYLSLDTEVDEENALSPEACYECKINGYP KKDSRQKRSIHEPDPTAVEQISLESVDMDSPVNMKFNLSHLGSKEHILELRPAIQPLN NHIRYVISQGNDDSVFRIHQRNGLSYLHTAKKKLMPGTYTLEITSIPLYKKKELKKLE ESNEDDYLLGELGEALRMRLQIQLY" 3'UTR 8737. .10172 BASE COUNT 2694 a 2248 c 2609 g 2621 t ORIGIN 1 ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG 51 CTGTGTGGTG CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC 101 CGCCCAAGCC GCCCCGGCCC CAGCCGCCGC CGCAACAGGT TCGGTCCGCT 151 ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG CCCGAGTATC GCGAGGAGGG 201 TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG GACGTGCTCC 251 GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 301 TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA 351 TAGTTGTGGA GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA 401 GTGGGCAAAT ATCATCAACC TGTGGATCAA AATCAATTCA GCAGTGCAGT 451 GTGAGATGCA TGAATGGTGG GACCTGTGCA GATGACCACT GCCAGTGCCA 501 GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT GAAAATGGAT 551 GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 601 TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC 651 TCAGGTCAAC AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA 701 CGAAGACTCT GTGCTGTGCC ACCACTGGAC GGGCGTGGGG CCATCCCTGT 751 GAGATGTGTC CAGCCCAGCC TCAGCCCTGC CGACGGGGTT TCATCCCCAA 801 CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG GCTATCCCAG 851 GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 901 AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA 951 CATTGATGAG TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT 1001 CCAACACCGT GGGAAGCTAT TTTTGTGTTT GTCCACGTGG ATATGTAACC 1051 TCAACAGATG GCTCTCGATG CATCGATCAG AGAACAGGCA TGTGTTTCTC 1101 GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG AGAATGACGA 1151 AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1201 CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT 1251 GGATGGACTT CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG 1301 GAGGCACTGG GGGAAATGGC TTTGCCCCAA GTGGCAATGG CAATGGCTAT 1351 GGCCCAGGAG GGACAGGCTT CATCCCCATC CCTGGAGGCA ATGGCTTTTC 1401 TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG GGACCTATCA 1451 TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1501 GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG 1551 ATGTGAATGC AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG 1601 ATGTTGATGA ATGCACATCA AATCCCTGCA CTAATGGAGA TTGTGTTAAC 1651 ACACCTGGTT CCTATTATTG TAAATGTCAT GCTGGATTCC AGAGGACTCC 1701 TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG AATGGGGTTC 1751 TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1801 TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA 1851 TGATGAATGT ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG 1901 AAGATGGCAG CTTCAAGTGC ATCTGCAAAC CAGGATTTGT CTTGGCTCCA 1951 AATGGGCGTT ACTGTACTGA TGTTGATGAA TGCCAGACCC CAGGAATCTG 2001 CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC TGTGACTGTC 2051 CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2101 ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC 2151 TTTCCCCGGT GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT 2201 ATGGTTTTGG AGAACCCTGC CAGCCATGCC CTGCAAAAAA TTCAGCTGAA 2251 TTCCACGGCC TTTGTAGTAG TGGAGTAGGT ATCACTGTGG ATGGAAGAGA 2301 TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT GGGATTTGTG 2351 AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2401 GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG 2451 ACTGCTTTGT GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT 2501 GTACGTGCCC ACCAGGGTAT GTGTTCAGGA CTGAGACAGA GACCTGTGAA 2551 GATATAAATG AATGTGAAAG CAACCCATGT GTCAATGGGG CCTGCAGAAA 2601 CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC AAACTCAGCT 2651 CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2701 ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC 2751 TGAATGCTGT GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT 2801 GTGAACTAGA TACAGCTTGC CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT 2851 ACGTGTGAAG ATGTTAATGA GTGTGAGGTG TTCCCTGGCG TTTGTCCAAA 2901 TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG TGCCCTGAAG 2951 GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3001 CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG 3051 AAAGTTCCGC ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA 3101 CCGAGTGTGA GGAGTGCCCC AAACCTGGCA CCAAGGAATA CGAGACACTG 3151 TGCCCCCGCG GGGCTGGCTT TGCTAACCGA GGGGATGTTC TTACTGGGCG 3201 GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT GGGATGTGCA 3251 CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3301 AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA 3351 GTGCAGGATT TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC 3401 CGGGCAGCTT TGAGTGCGAG TGCTTCGAAG GCTATGAAAG TGGCTTCATG 3451 ATGATGAAGA ACTGCATGGA CATTGACGGA TGTGAACGTA ACCCTCTCCT 3501 TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT CAGTGTGACT 3551 GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3601 AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA 3651 CATGATTGGA ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC 3701 CAGACCGCCA GGGCTGTACA GATATTGATG AATGTATGAT AATGAACGGA 3751 GGCTGTGACA CCCAGTGCAC AAATTCAGAG GGAAGCTACG AATGCAGCTG 3801 CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT GCAGACATTG 3851 ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3901 ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT 3951 GGACATGAAA ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA 4001 TCTGCATGTT TGGGGAATGT GAGAACACAA AGGGATCCTT CATTTGCCAC 4051 TGTCAGCTGG GTTACTCAGT GAAGAAGGGG ACCACAGGAT GTACAGATGT 4101 GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT GCCTCATGTC 4151 TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4201 AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA 4251 GTGTAGCATC AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG 4301 CCTGCTCCGA AGGTTTCACT GGTGATGGCT TTACCTGCTC AGATGTTGAT 4351 GAGTGTGCAG AAAACATAAA CCTCTGTGAG AACGGACAGT GCCTTAATGT 4401 CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT CCAGCCTCAG 4451 ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4501 GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA 4551 TGATGGTTAT GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG 4601 AGTGTGCAGA TCCTATAAAC TGTGTCAATG GCCTATGTGT CAACACGCCT 4651 GGTCGCTATG AGTGTAACTG CCCACCCGAT TTTCAGTTGA ACCCAACTGG 4701 TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG AAGTTTGGAC 4751 CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4801 AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG 4851 TGAGACATGC CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG 4901 GAGGTGAAGG CTTCAGACCT AACCCCATCA CAATCATTTT AGAAGACATT 4951 GACGAATGCC AGGAGTTACC AGGTCTCTGC CAGGGTGGAA ACTGCATCAA 5001 CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC TACCTCAGCG 5051 AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5101 GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT 5151 TTGCCCACCT GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA 5201 TGAGAAAAAG CTTTTGCTAC CGAAGCTATA ATGGAACCAC TTGTGAGAAT 5251 GAGTTGCCTT TCAATGTGAC AAAAAGGATG TGCTGCTGCA CATATAATGT 5301 GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT CCAGGAACAG 5351 CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5401 CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT 5451 TTGTGCAAAT GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT 5501 GCCCTACAGG ATTCAGTTAC AATGACCTGC TGTTGGTTTG TGAAGATATA 5551 GATGAGTGCA GCAATGGTGA TAATCTCTGC CAGCGGAATG CAGACTGCAT 5601 CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT TTCAAACTTT 5651 CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5701 GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT 5751 CTGCCACAAT GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG 5801 TTGATGAGTG CGAGCGGCAC CCATGTGGAA ATGGAACTTG TAAAAACACC 5851 GTTGGATCCT ATAACTGTCT GTGCTACCCA GGGTTTGAAC TCACTCATAA 5901 TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT GGTCAGGTGT 5951 GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6001 AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA 6051 TGAGTGTGTC GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT 6101 TGGAGGGATC CTTCAGATGC ATCTGTCCCC CAGGGTATGA AGTAAAAAGC 6151 GAGAACTGCA TTGATATAAA TGAATGTGAT GAAGATCCCA ACATTTGTCT 6201 TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC CTCTGCCCCC 6251 CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6301 AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC 6351 TTTCAACACC ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG 6401 GCTGGGGGGA CCCCTGTGAG CTGTGCCCCA AAGACGATGA AGTTGCATTT 6451 CAGGATTTGT GTCCATATGG CCATGGAACT GTCCCTAGTC TTCATGATAC 6501 ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT TGTTCAAATG 6551 GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6601 TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC 6651 AATCGGCAAT CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT 6701 TTGAATGCAA TTGCAATGAA GGCTTTGAGC CAGGGCCCAT GATGAATTGT 6751 GAAGATATCA ACGAATGTGC CCAGAACCCA CTGCTGTGTG CTTTACGCTG 6801 CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT GGCTATGCCC 6851 TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6901 TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC 6951 CTTCATGTGC ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG 7001 GCTGTGTAGA TGAAAATGAA TGCAGGACCA AGCCAGGAAT CTGTGAAAAT 7051 GGACGTTGTG TTAACATTAT TGGAAGCTAT AGATGTGAGT GTAATGAAGG 7101 ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT CGACAGGGTC 7151 TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7201 CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG 7251 GGGCCACCAG TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA 7301 AGATATGTCC TCATGGCCCA GGATATACAA CTGATGGAAG AGATATTGAT 7351 GAATGTAAGG TAATGCCAAA CCTCTGCACC AATGGTCAGT GCATCAATAC 7401 CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC ACAGACATCA 7451 GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7501 TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC 7551 GAGGGGGTAT GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG 7601 AATGTCAAAC AAAGCAGCAT AACTGCCAGT TCCTCTGTGT CAACACCCTG 7651 GGGGGGTTTA CCTGTAAATG TCCACCTGGT TTCACACAGC ATCACACTGC 7701 TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT TGTGGAGGAA 7751 AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7801 GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG 7851 TGATGGGAAC CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT 7901 ACAGATGTGG CTGCCCCCAA GGCTACATCC AGCACTACCA GTGGAATCAG 7951 TGTGTCGATG AGAATGAATG CTCCAATCCC AATGCCTGTG GCTCTGCTTC 8001 CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC TCGGGGTTCT 8051 CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8101 TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT 8151 CTGTGGCTGC CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT 8201 CAGGAATGGG ATTTAACAAG GGGCAGTACC TGTCACTGGA TACAGAGGTC 8251 GATGAGGAAA ATGCTCTGTC CCCAGAAGCA TGCTACGAGT GCAAAATCAA 8301 CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT CATGAACCTG 8351 ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8401 CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT 8451 CCTGGAACTA AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG 8501 TCATCTCTCA AGGGAACGAT GACAGCGTCT TCCGCATCCA CCAAAGGAAT 8551 GGGCTCAGCT ACTTGCACAC GGCCAAGAAG AAGCTCATGC CCGGCACATA 8601 CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG GAGCTTAAGA 8651 AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8701 GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT 8751 GGGCCCAGGC TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA 8801 GTCAAGGACT AATTTTAAAG AGGAAAAATA ATAATAACTC TTGTTTCTTT 8851 CCTCCCTGTC TTAGACTTTG AATGTTGACC CTCACAGGGA GGGATAATTT 8901 AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC GTGGTTACTG 8951 TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9001 TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT 9051 TTTTTTTTTT CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG 9101 TTGCTAATTA AAAAAATATA GATGTTTATT TATTTTTAAT GCAGTAATAT 9151 ATGGAGAAAT GAACAAACTA TGTAAACAAA AAGGGAAACT CACTTGTTTT 9201 TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG CTTTTTAAAA 9251 ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9301 CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG 9351 GCAGTGCTAA TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT 9401 AACGTAGATC GAAGCCAAAG CAGCTCATTT GTGACAACAT TTCATATCAC 9451 CAGACACACC AGGCAACAGA AGTTGAAGCA CAACCACTGT AGCAAAATAC 9501 CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC GTACTGTATT 9551 TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9601 CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC 9651 CGCCGTTCCC TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG 9701 TCTTAAATAA TGGTGCATTT ATTATGTTCA AGTTATTTCA GGATTGCCAT 9751 ATGTGCAAAC AAATCATGCA ATGCAGCCAA GGAATATATG TTGTTGTTGT 9801 TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC TGTAGTTATA 9851 CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9901 ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC 9951 CAGGATGTGC TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA 10001 CCTGTTTCCT GGACCAGTGA CCAAACCTCA TATGTGAAAT GGCCAAAGCA 10051 CATGCAGGCT CCTGGTTGTT CCTCTCAAAC CTGTGCTGAC CAAAGATTAG 10101 TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT TTAATAACTA 10151 AAAAAAAACT CGTGCCGAAT TC // LOCUS AB011141 5523 bp mRNA PRI 10-APR-1998 DEFINITION Homo sapiens mRNA for KIAA0569 protein, complete cds. ACCESSION AB011141 NID g3043661 VERSION AB011141.1 GI:3043661 KEYWORDS KIAA0569 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH2356. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5523) AUTHORS Ohara,O., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (1), 31-39 (1998) MEDLINE 98290545 FEATURES Location/Qualifiers source 1. .5523 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH2356" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 445. .4089 /gene="KIAA0569" CDS 445. .4089 /gene="KIAA0569" /codon_start=1 /product="KIAA0569 protein" /protein_id="BAA25495.1" /db_xref="PID:d1026425" /db_xref="PID:g3043662" /db_xref="GI:3043662" /translation="MKQPIMADGPRCKRRKQANPRRKNVVNYDNVVDTGSETDEEDKL HIAEDDGIANPLDQETSPASVPNHESSPHVSQALLPREEEEDEIREGGVEHPWHNNEI LQASVDGPEEMKEDYDTMGPEATIQTAINNGTVKNANCTSDFEEYFAKRKLEERDGHA VSIEEYLQRSDTAIIYPEAPEELSRLGTPEANGQEENDLPPGTPDAFAQLLTCPYCDR GYKRLTSLKEHIKYRHEKNEENFSCPLCSYTFAYRTQLERHMVTHKPGTDQHQMLTQG AGNRKFKCTECGKAFKYKHHLKEHLRIHSGEKPYECPNCKKRFSHSGSYSSHISSKKC IGLISVNGRMRNNIKTGSSPNSVSSSPTNSAITQLRNKLENGKPLSMSEQTGLLKIKT EPLDFNDYKVLMATHGFSGTSPFMNGGLGATSPLGVHPSAQSPMQHLGVGMEAPLLGF PTMNSNLSEVQKVLQIVDNTVSRQKMDCKAEEISKLKGYHMKDPCSQPEEQGVTSPNI PPVGLPVVSHNGATKSIIDYTLEKVNEAKACLQSLTTDSRRQISNIKKEKLRTLIDLV TDDKMIENHNISTPFSCQFCKESFPGPIPLHQHERYLCKMNEEIKAVLQPHENIVPNK AGVFVDNKALLLSSVLSEKGMTSPINPYKDHMSVLKAYYAMNMEPNSDELLKISIAVG LPQEFVKEWFEQRKVYQYSNSRSPSLERSSKPLAPNSNPPTKDSLLPRSPVKPMDSIT SPSIAELHNSVTNCDPPLRLTKPSHFTNIKPVEKLDHSRSNTPSPLNLSSTSSKNSHS SSYTPNSFSSEELQAEPLDLSLPKQMKEPKSIIATKNKTKASSISLDHNSVSSSSENS DEPLNLTFIKKEFSNSNNLDNKSTNPVFSMNPFSAKPLYTALPPQSAFPPATFMPPVQ TSIPGLRPYPGLDQMSFLPHMAYTYPTGAATFADMQQRRKYQRKQGFQGELLDGAQDY MSGLDDMTDSDSCLSRKKIKKTESGMYACDLCDKTFQKSSSLLRHKYEHTGKRPHQCQ ICKKAFKHKHHLIEHSRLHSGEKPYQCDKCGKRFSHSGSYSQHMNHRYSYCKREAEER EAAEREAREKGHLEPTELLMNRAYLQSITPQGYSDSEERESMPRDGESEKEHEKEGED GYGKLGRQDGDEEFEEEEEESENKSMDTDPETIRDEEETGDHSMDDSSEDGKMETKSD HEEDNMEDGM" BASE COUNT 1686 a 1220 c 1155 g 1462 t ORIGIN 1 GGCGATCACG TTTTCACATG ATGCTCACGC TCAGGGCGCT TCAATTATCC 51 CTCCCCACAA AGATAGGTGG CGCGTGTTTC AGGGTCTCTC GTCTCTCTCC 101 TACAGAAAAG AAAAAGAAAA AAATGTCATT AGAAGAGGCG TAACACGTCA 151 GTCCGTCCCC AGGTTTGTGT TTCCTGGAGT GGCCGAAAGA GATCAGTTCT 201 AACCTGCTCT GCAGGAATAA CGGTCCTGCC TCCCGACACT CTTGGCGAGG 251 TTTTTGTACA GTTTGCTCCG GGAGCTGTTT CTTCGCTTCC ACCTTTTTCT 301 CCCCCACACT TCGCGGCTTC TTCATGCTTT TTCTTCTCAC CATTTCTGGC 351 CAAAACTACA AACAAGACTT CGCAGATCGA GCCTGCGTGC TGCCGAAGCA 401 GGGCGCCGAG TCCATGCGAA CTGCCATCTG ATCCGCTCTT ATCAATGAAG 451 CAGCCGATCA TGGCGGATGG CCCCCGGTGC AAGAGGCGCA AACAAGCCAA 501 TCCCAGGAGG AAAAACGTGG TGAACTATGA CAATGTAGTG GACACAGGTT 551 CTGAAACAGA TGAGGAAGAC AAGCTTCATA TTGCTGAGGA TGACGGTATT 601 GCCAACCCTC TGGACCAGGA GACGAGTCCA GCTAGTGTGC CCAACCATGA 651 GTCCTCCCCA CACGTGAGCC AAGCTCTGTT GCCAAGAGAG GAAGAGGAAG 701 ATGAAATAAG GGAGGGTGGA GTGGAACACC CCTGGCACAA CAACGAGATT 751 CTACAAGCCT CTGTAGATGG TCCAGAAGAA ATGAAGGAAG ACTATGACAC 801 TATGGGGCCA GAAGCCACGA TCCAGACCGC AATTAACAAT GGTACAGTGA 851 AGAATGCAAA TTGCACATCA GATTTTGAGG AATACTTTGC CAAAAGAAAA 901 CTGGAGGAAC GCGATGGTCA TGCAGTCAGC ATCGAGGAGT ACCTTCAGCG 951 CAGTGACACA GCCATTATTT ACCCAGAAGC CCCTGAGGAG CTGTCTCGCC 1001 TTGGCACGCC AGAGGCCAAT GGGCAAGAAG AAAATGACCT GCCACCTGGA 1051 ACTCCAGATG CTTTTGCCCA ACTGCTGACC TGCCCCTACT GCGACCGGGG 1101 CTACAAGCGC TTGACATCAC TGAAGGAGCA CATCAAGTAC CGCCACGAGA 1151 AGAATGAAGA GAACTTTTCC TGCCCTCTCT GTAGCTACAC GTTTGCCTAC 1201 CGCACCCAGC TCGAGCGGCA TATGGTGACA CACAAGCCAG GGACAGATCA 1251 GCACCAAATG CTAACCCAAG GAGCAGGTAA TCGCAAGTTC AAATGCACAG 1301 AGTGTGGCAA GGCCTTCAAA TATAAACACC ATCTGAAAGA ACACCTGCGA 1351 ATTCACAGTG GTGAAAAACC TTACGAGTGC CCAAACTGCA AGAAACGTTT 1401 CTCCCATTCT GGTTCCTACA GTTCGCACAT CAGCAGCAAG AAATGTATTG 1451 GTTTAATCTC TGTAAATGGC CGAATGAGAA ACAATATCAA GACGGGTTCT 1501 TCCCCTAATT CTGTTTCTTC TTCTCCTACT AATTCAGCCA TTACCCAGTT 1551 AAGAAACAAG TTGGAGAATG GAAAACCACT TAGTATGTCT GAACAGACAG 1601 GCTTACTTAA AATTAAAACA GAACCACTAG ACTTCAATGA CTATAAAGTT 1651 CTTATGGCTA CACACGGGTT TAGTGGCACT AGTCCCTTTA TGAATGGTGG 1701 GCTTGGAGCC ACCAGCCCTT TAGGAGTTCA TCCATCTGCT CAGAGTCCAA 1751 TGCAGCACTT AGGTGTAGGG ATGGAAGCCC CTTTACTTGG GTTTCCCACC 1801 ATGAATAGTA ATTTAAGTGA GGTACAAAAG GTTCTACAGA TTGTGGACAA 1851 TACTGTTTCC AGGCAAAAAA TGGACTGCAA GGCTGAAGAA ATTTCAAAGT 1901 TGAAAGGTTA TCACATGAAG GATCCATGCT CTCAACCTGA GGAACAAGGA 1951 GTTACTTCTC CTAATATTCC GCCTGTCGGT CTTCCGGTAG TGAGTCATAA 2001 TGGTGCCACT AAAAGTATTA TTGACTATAC GTTGGAAAAA GTCAATGAAG 2051 CCAAAGCTTG CCTCCAGAGC TTGACTACTG ACTCAAGGAG ACAGATCAGT 2101 AATATAAAGA AAGAGAAGCT ACGTACTTTA ATAGATTTGG TCACTGATGA 2151 CAAAATGATT GAGAACCACA ACATATCCAC TCCATTTTCA TGCCAGTTCT 2201 GTAAAGAAAG TTTTCCTGGC CCCATCCCTT TGCATCAGCA TGAACGTTAC 2251 CTTTGTAAGA TGAATGAAGA GATCAAGGCG GTCCTGCAGC CTCATGAAAA 2301 CATAGTCCCC AACAAAGCCG GAGTTTTTGT TGATAATAAA GCCCTCCTCT 2351 TGTCATCTGT ACTTTCTGAG AAAGGAATGA CAAGCCCCAT CAACCCATAC 2401 AAGGACCACA TGTCTGTACT CAAAGCATAC TATGCTATGA ACATGGAGCC 2451 CAACTCCGAT GAACTGCTGA AAATTTCCAT TGCTGTGGGC CTTCCTCAGG 2501 AATTTGTGAA GGAATGGTTT GAACAACGAA AAGTCTACCA GTACTCAAAT 2551 TCCAGGTCCC CATCCCTGGA AAGAAGCTCC AAGCCGTTAG CTCCCAACAG 2601 TAACCCTCCC ACAAAAGACT CTTTATTACC CAGGTCTCCT GTAAAACCTA 2651 TGGACTCCAT AACATCACCA TCTATAGCAG AACTCCACAA CAGTGTTACG 2701 AATTGTGATC CTCCTCTCAG GCTAACAAAA CCTTCCCATT TTACCAATAT 2751 TAAACCAGTT GAAAAATTGG ACCACTCCAG GAGTAATACT CCTTCTCCCT 2801 TAAATCTTTC CTCCACATCT TCTAAAAACT CCCACAGTAG TTCATACACT 2851 CCAAACAGCT TCTCTTCTGA GGAGCTCCAG GCTGAGCCTT TAGACTTGTC 2901 ATTACCAAAA CAAATGAAAG AACCCAAAAG TATTATAGCC ACAAAGAACA 2951 AAACAAAAGC TAGTAGCATC AGTTTAGATC ATAACAGTGT TTCTTCCTCA 3001 TCTGAAAACT CAGATGAGCC TCTGAACTTG ACTTTTATCA AGAAGGAATT 3051 TTCAAATTCA AATAATCTGG ACAACAAAAG CACTAACCCA GTGTTCAGCA 3101 TGAACCCATT TAGTGCCAAA CCTTTATACA CAGCTCTTCC ACCTCAAAGC 3151 GCATTTCCCC CTGCTACTTT CATGCCACCA GTCCAGACCA GTATTCCTGG 3201 GCTACGACCA TACCCAGGAC TGGATCAGAT GAGCTTCCTA CCACATATGG 3251 CCTACACCTA CCCAACTGGA GCAGCTACTT TTGCTGATAT GCAGCAAAGG 3301 AGAAAGTACC AGCGGAAACA AGGATTTCAG GGAGAATTGC TTGATGGAGC 3351 ACAAGACTAC ATGTCAGGCC TAGATGATAT GACAGACTCC GACTCCTGTC 3401 TGTCTCGCAA AAAGATCAAG AAGACAGAGA GTGGCATGTA TGCATGTGAC 3451 TTATGTGACA AGACATTCCA GAAAAGCAGT TCCCTTCTGC GACATAAATA 3501 CGAACACACA GGAAAAAGAC CACATCAGTG TCAGATTTGT AAGAAAGCGT 3551 TTAAACACAA GCACCACCTT ATCGAGCACT CAAGGCTTCA CTCGGGCGAG 3601 AAGCCCTATC AGTGTGATAA ATGTGGCAAG CGCTTCTCAC ACTCGGGCTC 3651 GTACTCGCAG CACATGAATC ACAGGTATTC CTACTGCAAG CGGGAGGCGG 3701 AGGAGCGGGA AGCGGCGGAG CGCGAGGCGC GCGAGAAAGG GCACTTGGAA 3751 CCCACCGAGC TGCTGATGAA CCGGGCTTAC TTGCAGAGCA TTACCCCTCA 3801 GGGGTACTCT GACTCGGAGG AGAGGGAGAG TATGCCGAGG GATGGCGAGA 3851 GCGAGAAGGA GCACGAGAAA GAAGGCGAGG ATGGCTACGG GAAGCTGGGC 3901 AGACAGGATG GCGACGAGGA GTTCGAGGAG GAAGAGGAAG AAAGTGAAAA 3951 TAAAAGTATG GATACGGATC CCGAAACGAT ACGAGATGAA GAAGAGACTG 4001 GAGATCACTC CATGGACGAT AGTTCGGAGG ATGGGAAAAT GGAAACCAAA 4051 TCAGACCACG AGGAAGACAA TATGGAAGAT GGCATGTAAT AAACTACTGC 4101 ATTTTAAGCT TCCTATTTTT TTTTCCAGTA GTATTGTTAC CTGCTTGAAA 4151 ACACTGCTGT GTTAAGCTGT TCATGCACGT GCCTGACGCT TCCAGGAAGC 4201 TGTAGAGAGG GACAGAAGGG GCGGTTCAGC CAAGACAGAT GTAGACGGAG 4251 TTGGAGCTGG GTATTGTTAA AAACTGCATT ATGCAAAAAT TTTGTACAGT 4301 GTTAAGGCCT AAAAACTGTG TGGTTCAGAG ACTAATTCCT GTGTTTAATA 4351 GCATTTATAC TTTAAGCACA ACTAGAAAAT TGTAAGAATT GCACTCTACT 4401 TATGTATCAC TACAAACTTT AAAAAACTAT GTCTAATTTA TATTAATACA 4451 TTTTAAAAAG GTGCCCGCAC TACCATACAT CAGTATTTTT ATTATTATTA 4501 TTGTTATTCC TTTTTAATTT AATGTGCTCG CACTACAATG CATCAGTATT 4551 ATGATTCCTC TGTACTTTCC TTTCGCTATT CATCAATTTC CCATTTTTTT 4601 TTTTCAGCTT AAGTAACCAC ACAATTTTAG GCCTCAATTT TTTTTTTTTT 4651 CTGTGAAGGA ACTTGAAGTG ATGCATGTGT GAATTTAAGA TACCGAAGTC 4701 TTAAAGTGAC CTGGACGTGA AGGAAAAAGT AAGATGAGAA ATAAAGAAAG 4751 CCTTTGTAAG GTGGTTTTAA AAGCCTTATA TGCAAACCTT TTAATCTGTG 4801 TTTCTGCAAG TGCCATCCTT GTACAGTGTT AAGAGGGTAA CATGGGTTAC 4851 CTTTGCACCA GCTTCAGTGT TAAGCTCACC CTGTTCTTTG AAGCACCCAT 4901 GTCAGTATTA GAAGAATAGG CAGCAGTTCC TTAGTTTACA TATGTTTGTG 4951 CAATTATTTT CTGTACTTTT TTGTTCATTA ATTTTGTCAG TATTACACCA 5001 AACTGTTTTT GCAACAAAAA AATTTTTTTT GCATTCATTT AATTTTAGGT 5051 CAAATAACAT TTTATTTATG TGGCTCATTT TATATTTCCT AATTTTATTT 5101 ATTTCATACT GTAGTGTACA GTATTATAGT TCTTCAATAT ATAGATATAT 5151 TTTAGTAAAA AAGGAACATG ACGTTGATCA TTTGGGCAAA TTTTACGTAA 5201 AGAGAAGAGC ATTTATTGTG TTTTGGAACA TTAATTGTGA GATGGGATTT 5251 TTCAATTTTA TTATTTTATT TTTGTTTTTT TCCAATTACT GGAAATTCCA 5301 AATTTGGGAA CTTTTGATAC GATCTTGTGA AAACACTGTA TTTTCGACTG 5351 AAAATTCCAC TTTCTTCATC TTGTTTTTTA GCTAAAAAGA GGGACTGTTA 5401 AATACAATGT ATGATACCAT GACAAAAATC TTTCCTGAAT TGTCTTTGTA 5451 AAAGTATTAT TGAATTTTCA ATTTGTAATT TCTTTTGAAA ATGACCATGC 5501 TCGAATAAAA ATGTAGCCAA ACT // LOCUS HSU37546 3076 bp mRNA PRI 05-JUN-1996 DEFINITION Human IAP homolog C (MIHC) mRNA, complete cds. ACCESSION U37546 NID g1145290 VERSION U37546.1 GI:1145290 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3076) AUTHORS Uren,A.G., Pakusch,M., Hawkins,C.J., Puls,K.L. and Vaux,D.L. TITLE Cloning and expression of apoptosis inhibitory protein homologs that function to inhibit apoptosis and/or bind tumor necrosis factor receptor-associated factors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (10), 4974-4978 (1996) MEDLINE 96209843 REFERENCE 2 (bases 1 to 3076) AUTHORS Uren,A.G. and Vaux,D.L. TITLE Direct Submission JOURNAL Submitted (04-OCT-1995) Anthony G. Uren, The Walter and Eliza Hall Institute, Royal Parade, Parkville, Victoria 3050, Australia FEATURES Location/Qualifiers source 1. .3076 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" gene 725. .2539 /gene="MIHC" CDS 725. .2539 /gene="MIHC" /note="IAP homolog C; interacts with TRAF1 and TRAF2 in yeast two hybrid system; homolog of Baculovirus IAP genes; Mammalian IAP homolog C" /codon_start=1 /product="MIHC" /protein_id="AAC50507.1" /db_xref="PID:g1145291" /db_xref="GI:1145291" /translation="MNIVENSIFLSNLMKSANTFELKYDLSCELYRMSTYSTFPAGVP VSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKRGDSPTEKHKKLYPSCRFVQSLNSV NNLEATSQPTFPSSVTNSTHSLLPGTENSGYFRGSYSNSPSNPVNSRANQDFSALMRS SYHCAMNNENARLLTFQTWPLTFLSPTDLAKAGFYYIGPGDRVACFACGGKLSNWEPK DNAMSEHLRHFPKCPFIENQLQDTSRYTVSNLSMQTHAARFKTFFNWPSSVLVNPEQL ASAGFYYVGNSDDVKCFCCDGGLRCWESGDDPWVQHAKWFPRCEYLIRIKGQEFIRQV QASYPHLLEQLLSTSDSPGDENAESSIIHFEPGEDHSEDAIMMNTPVINAAVEMGFSR SLVKQTVQRKILATGENYRLVNDLVLDLLNAEDEIREEERERATEEKESNDLLLIRKN RMALFQHLTCVIPILDSLLTAGIINEQEHDVIKQKTQTSLQARELIDTILVKGNIAAT VFRNSLQEAEAVLYEHLFVQQDIKYIPTEDVSDLPVEEQLRRLQEERTCKVCMDKEVS IVFIPCGHLVVCKDCAPSLRKCPICRSTIKGTVRTFLS" misc_feature 809. .1012 /gene="MIHC" /note="encodes BIR repeat 1" misc_feature 1229. .1429 /gene="MIHC" /note="encodes BIR repeat 2" misc_feature 1487. .1690 /gene="MIHC" /note="encodes BIR repeat 3" misc_feature 2393. .2500 /gene="MIHC" /note="encodes RING finger motif" BASE COUNT 1012 a 542 c 563 g 959 t ORIGIN 1 GAATTCAAAA TGTCTTCAGT TGTAAATCTT ACCATTATTT TACGTACCTC 51 TAAGAAATAA AAGTGCTTCT AATTAAAATA TGATGTCATT AATTATGAAA 101 TACTTCTTGA TAACAGAAGT TTTAAAATAG CCATCTTAGA ATCAGTGAAA 151 TATGGTAATG TATTATTTTC CTCCTTTGAG TTAGGTCTTG TGCTTTTTTT 201 TCCTGGCCAC TAAATTTCAC AATTTCCAAA AAGCAAAATA AACATATTCT 251 GAATATTTTT GCTGTGAAAC ACTTGACAGC AGAGCTTTCC ACCATGAAAA 301 GAAGCTTCAT GAGTCACACA TTACATCTTT GGGTTGATTG AATGCCACTG 351 AAACATTCTA GTAGCCTGGA GAAGTTGACC TACCTGTGGA GATGCCTGCC 401 ATTAAATGGC ATCCTGATGG CTTAATACAC ATCACTCTTC TGTGAAGGGT 451 TTTAATTTTC AACACAGCTT ACTCTGTAGC ATCATGTTTA CATTGTATGT 501 ATAAAGATTA TACAAAGGTG CAATTGTGTA TTTCTTCCTT AAAATGTATC 551 AGTATAGGAT TTAGAATCTC CATGTTGAAA CTCTAAATGC ATAGAAATAA 601 AAATAATAAA AAATTTTTCA TTTTGGCTTT TCAGCCTAGT ATTAAAACTG 651 ATAAAAGCAA AGCCATGCAC AAAACTACCT CCCTAGAGAA AGGCTAGTCC 701 CTTTTCTTCC CCATTCATTT CATTATGAAC ATAGTAGAAA ACAGCATATT 751 CTTATCAAAT TTGATGAAAA GCGCCAACAC GTTTGAACTG AAATACGACT 801 TGTCATGTGA ACTGTACCGA ATGTCTACGT ATTCCACTTT TCCTGCTGGG 851 GTTCCTGTCT CAGAAAGGAG TCTTGCTCGT GCTGGTTTCT ATTACACTGG 901 TGTGAATGAC AAGGTCAAAT GCTTCTGTTG TGGCCTGATG CTGGATAACT 951 GGAAAAGAGG AGACAGTCCT ACTGAAAAGC ATAAAAAGTT GTATCCTAGC 1001 TGCAGATTCG TTCAGAGTCT AAATTCCGTT AACAACTTGG AAGCTACCTC 1051 TCAGCCTACT TTTCCTTCTT CAGTAACAAA TTCCACACAC TCATTACTTC 1101 CGGGTACAGA AAACAGTGGA TATTTCCGTG GCTCTTATTC AAACTCTCCA 1151 TCAAATCCTG TAAACTCCAG AGCAAATCAA GATTTTTCTG CCTTGATGAG 1201 AAGTTCCTAC CACTGTGCAA TGAATAACGA AAATGCCAGA TTACTTACTT 1251 TTCAGACATG GCCATTGACT TTTCTGTCGC CAACAGATCT GGCAAAAGCA 1301 GGCTTTTACT ACATAGGACC TGGAGACAGA GTGGCTTGCT TTGCCTGTGG 1351 TGGAAAATTG AGCAATTGGG AACCGAAGGA TAATGCTATG TCAGAACACC 1401 TGAGACATTT TCCCAAATGC CCATTTATAG AAAATCAGCT TCAAGACACT 1451 TCAAGATACA CAGTTTCTAA TCTGAGCATG CAGACACATG CAGCCCGCTT 1501 TAAAACATTC TTTAACTGGC CCTCTAGTGT TCTAGTTAAT CCTGAGCAGC 1551 TTGCAAGTGC GGGTTTTTAT TATGTGGGTA ACAGTGATGA TGTCAAATGC 1601 TTTTGCTGTG ATGGTGGACT CAGGTGTTGG GAATCTGGAG ATGATCCATG 1651 GGTTCAACAT GCCAAGTGGT TTCCAAGGTG TGAGTACTTG ATAAGAATTA 1701 AAGGACAGGA GTTCATCCGT CAAGTTCAAG CCAGTTACCC TCATCTACTT 1751 GAACAGCTGC TATCCACATC AGACAGCCCA GGAGATGAAA ATGCAGAGTC 1801 ATCAATTATC CATTTTGAAC CTGGAGAAGA CCATTCAGAA GATGCAATCA 1851 TGATGAATAC TCCTGTGATT AATGCTGCCG TGGAAATGGG CTTTAGTAGA 1901 AGCCTGGTAA AACAGACAGT TCAAAGAAAA ATCCTAGCAA CTGGAGAGAA 1951 TTATAGACTA GTCAATGATC TTGTGTTAGA CTTACTCAAT GCAGAAGATG 2001 AAATAAGGGA AGAGGAGAGA GAAAGAGCAA CTGAGGAAAA AGAATCAAAT 2051 GATTTATTAT TAATCCGGAA GAATAGAATG GCACTTTTTC AACATTTGAC 2101 TTGTGTAATT CCAATCCTGG ATAGTCTACT AACTGCCGGA ATTATTAATG 2151 AACAAGAACA TGATGTTATT AAACAGAAGA CACAGACGTC TTTACAAGCA 2201 AGAGAACTGA TTGATACGAT TTTAGTAAAA GGAAATATTG CAGCCACTGT 2251 ATTCAGAAAC TCTCTGCAAG AAGCTGAAGC TGTGTTATAT GAGCATTTAT 2301 TTGTGCAACA GGACATAAAA TATATTCCCA CAGAAGATGT TTCAGATCTA 2351 CCAGTGGAAG AACAATTGCG GAGACTACAA GAAGAAAGAA CATGTAAAGT 2401 GTGTATGGAC AAAGAAGTGT CCATAGTGTT TATTCCTTGT GGTCATCTAG 2451 TAGTATGCAA AGATTGTGCT CCTTCTTTAA GAAAGTGTCC TATTTGTAGG 2501 AGTACAATCA AGGGTACAGT TCGTACATTT CTTTCATGAA GAAGAACCAA 2551 AACATCATCT AAACTTTAGA ATTAATTTAT TAAATGTATT ATAACTTTAA 2601 CTTTTATCCT AATTTGGTTT CCTTAAAATT TTTATTTATT TACAACTCAA 2651 AAAACATTGT TTTGTGTAAC ATATTTATAT ATGTATCTAA ACCATATGAA 2701 CATATATTTT TTAGAAACTA AGAGAATGAT AGGCTTTTGT TCTTATGAAC 2751 GAAAAAGAGG TAGCACTACA AACACAATAT TCAATCAAAA TTTCAGCATT 2801 ATTGAAATTG TAAGTGAAGT AAAACTTAAG ATATTTGAGT TAACCTTTAA 2851 GAATTTTAAA TATTTTGGCA TTGTACTAAT ACCTGGTTTT TTTTTTGTTT 2901 TGTTTTTTTG TACAGACAGG GCAGCATACT GAGACCCTGC CTTTAAAAAC 2951 AAACAGAACA AAAACAAAAC ACCAGGGACA CATTTCTCTG TCTTTTTTGA 3001 TCAGTGTCCT ATACATCGAA GGTGTGCATA TATGTTGAAT GACATTTTAG 3051 GGACATGGTG TTTTTATAAA GAATTC // LOCUS AB011109 6828 bp mRNA PRI 10-APR-1998 DEFINITION Homo sapiens mRNA for KIAA0537 protein, complete cds. ACCESSION AB011109 NID g3043597 VERSION AB011109.1 GI:3043597 KEYWORDS KIAA0537 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG3925. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6828) AUTHORS Ohara,O., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (1), 31-39 (1998) MEDLINE 98290545 FEATURES Location/Qualifiers source 1. .6828 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG3925" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1381. .3366 /gene="KIAA0537" CDS 1381. .3366 /gene="KIAA0537" /codon_start=1 /product="KIAA0537 protein" /protein_id="BAA25463.1" /db_xref="PID:d1026393" /db_xref="PID:g3043598" /db_xref="GI:3043598" /translation="MEGAAAPVAGDRPDLGLGAPGSPREAVAGATAALEPRKPHGVKR HHHKHNLKHRYELQETLGKGTYGKVKRATERFSGRVVAIKSIRKDKIKDEQDMVHIRR EIEIMSSLNHPHIISIYEVFENKDKIVIIMEYASKGELYDYISERRRLSERETRHFFR QIVSAVHYCHKNGVVHRDLKLENILLDDNCNIKIADFGLSNLYQKDKFLQTFCGSPLY ASPEIVNGRPYRGPEVDSWALGVLLYTLVYGTMPFDGFDHKNLIRQISSGEYREPTQP SDARGLIRWMLMVNPDRRATIEDIANHWWVNWGYKSSVCDCDALHDSESPLLARIIDW HHRSTGLQADTEAKMKGLAKPTTSEVMLERQRSLKKSKKENDFAQSGQDAVPESPSKL SSKRPKGILKKRSNSEHRSHSTGFIEGVVGPALPSTFKMEQDLCRTGVLLPSSPEAEV PGKLSPKQSATMPKKGILKKTQQRESGYYSSPERSESSELLDSNDVMGSSIPSPSPPD PARVTSHSLSCRRKGILKHSSKYSAGTMDPALVSPEMPTLESLSEPGVPAEGLSRSYS RPSSVISDDSVLSSDSFDLLDLQENRPARQRIRSCVSAENFLQIQDFEGLQNRPRPQY LKRYRNRLADSSFSLLTDMDDVTQVYKQALEICSKLN" BASE COUNT 1541 a 1827 c 1877 g 1583 t ORIGIN 1 AATCTCCCAA GGCCTAAGGA GGCAAGAGGC CTGCAAATCG CCTCCTGCTC 51 AGCAAACGGG TTGCTCAGCA GGCCCGGGGT CCTGGTCCAC CCCAGGTCCC 101 TGGTTTGCCC ACCTCCGATG GCGGCCTTCG CTGGCAGGGT GGGCGCCTCT 151 GGGGAGCCAG CTCCGTCCCG GCGCCTTTAG AGCCCCATCT CTTCCACGTC 201 CCTGGCCTTC CTCCCCTTCC AGGCGGCTGT CCCCGCCGGG GTCCAGATGG 251 TGTCGGAGGG CCGGCGGTTC GACGGCGGGC CCGGGGTTCA GCCTCCCGGC 301 CTCCCTCCGT CCCTGACTCT CCTTTCTTCG GAGAGGGCGC GGGGGCCGGG 351 GCCAAAGCGC CGCTCTTGGG GTTCTCCTGG ACTCGGAGTT GCCCCAGGCG 401 GGCGCAGCTC TGCCCCGCGG GGTGCCAGCC TCGGGCGGGC AAGGTCCGTG 451 AGTCACCGCC TGTAACCGAA CACCAGGCCT CCCTGCCCCC TCCCCCAGCT 501 CCGGCCGCCA GGCTGCGGCG ACACCTACAA GAAAATGAAG GGGCGCCCAG 551 GCCCGCGGCG GCCCCGGCCG TATCGCGAGC AGGTCCCGGC GGCCCCCGGC 601 TCGCGGCGCT CTTTCTTCCC CGGCCCCGGG GCTCGGCCAG CCGCAACCGC 651 CGCCCCGGCG CCAGCAGGAA TCCAGGCCGA GCGACCGGCC CCGGAGCCCG 701 AGGCGGCGGA GGGCCCGCGG TAGCTGCGAC TGGCGAGCCC GAGAGCGCCC 751 GGGGAGGGGG CGCCCGGCTT GGAATTTCCC GGTCCCTTCC GGCCCAGCGA 801 GGACAAAGCA CTCCTGGCCG CCGCCGCCGC CGCCGCCGTG GCCTACGCCG 851 CGCCGCACAA AGGGCGAGTC GCGACACGCT CCCATCCCCC TCCCAGCTCA 901 CGGCGGCCCC GGCCCCGGGT GGCTGCAGGG AGGTGGGGGA AGCCCTGGCT 951 GCACCGCCCC TCGCTCCCCC TCCCCTGGGG CCGCGCGAGC GCCGCCCCCG 1001 CCCCGTCTGC GCGTCCTCCC GGGGAGGGGT TGGGGGGCGC GGCGCCCCAC 1051 ATAACACTCC CCCTCCTGCG CTGCGAGCCA CCCTCTCCCC TCCCTCCTGC 1101 AAACACCACC GCCTCCCCTG CCACCGCCGC CACCTCGCCC GACGCTCCAC 1151 AGCTCGCCGC GGCCGGGGGG CGGTGCGCGG ACCGTGCGCG CCGCGGGCGC 1201 CAGATGTGCA GTCCCCGCCG CCGCCAGTGA CCGAGCCGCA GTCCGAGCGG 1251 TATCGGGCCG CCTCCCTGAT GCTGCGGGGG CGACCTTGAG CGTACAGCGG 1301 CTTCCCTCGG TGGGGACCCC GACATCCCAG CGCTGTGCCC GGTCTTGCCC 1351 TCTGTAGCCC GGCTCGCCCC GCGCTTGGAC ATGGAAGGGG CCGCCGCGCC 1401 TGTGGCGGGG GACCGCCCCG ACTTGGGGCT GGGGGCGCCG GGCTCTCCCC 1451 GAGAGGCGGT GGCGGGGGCG ACTGCAGCCC TGGAGCCCAG GAAGCCGCAC 1501 GGGGTGAAGC GGCATCACCA CAAGCACAAC TTGAAGCACC GCTACGAGCT 1551 GCAGGAGACC CTGGGCAAAG GCACCTACGG CAAAGTCAAG CGGGCCACCG 1601 AGAGGTTTTC TGGCCGAGTG GTTGCTATAA AATCCATTCG TAAGGACAAA 1651 ATTAAGGATG AACAAGACAT GGTTCACATC AGACGAGAGA TTGAGATCAT 1701 GTCATCTCTC AACCATCCTC ATATCATCAG TATTTATGAA GTGTTTGAGA 1751 ACAAAGATAA GATTGTGATC ATCATGGAAT ATGCCAGCAA AGGGGAGCTG 1801 TACGATTACA TCAGTGAGCG GCGACGCCTC AGTGAGAGGG AGACCCGGCA 1851 CTTCTTCCGG CAGATCGTCT CTGCTGTGCA CTATTGTCAC AAGAACGGTG 1901 TGGTCCACCG GGACTTGAAG CTGGAAAATA TACTGCTCGA TGACAACTGC 1951 AATATTAAGA TTGCTGACTT TGGGCTTTCC AACCTGTACC AGAAGGATAA 2001 GTTCTTACAA ACGTTTTGTG GGAGTCCACT CTATGCATCT CCTGAGATTG 2051 TCAATGGGAG ACCTTACCGA GGGCCAGAGG TGGACAGCTG GGCCCTGGGT 2101 GTGTTGCTTT ACACTCTTGT TTATGGAACA ATGCCCTTCG ATGGTTTCGA 2151 TCACAAAAAC CTCATTCGGC AAATCAGCAG CGGAGAGTAC CGGGAGCCAA 2201 CACAGCCCTC AGATGCTCGA GGACTCATAC GGTGGATGCT GATGGTGAAC 2251 CCCGATCGCC GGGCCACTAT TGAGGACATT GCCAACCACT GGTGGGTGAA 2301 CTGGGGCTAT AAGAGCAGCG TGTGTGACTG TGATGCCCTC CATGACTCTG 2351 AGTCCCCACT CCTGGCTCGG ATCATTGACT GGCACCACCG TTCCACAGGG 2401 CTGCAGGCTG ACACCGAAGC CAAAATGAAG GGCCTGGCCA AACCCACGAC 2451 CTCTGAGGTC ATGCTAGAGC GGCAGCGGTC GCTGAAGAAA TCCAAGAAAG 2501 AGAATGACTT TGCTCAGTCT GGTCAGGATG CAGTGCCTGA AAGCCCATCC 2551 AAGTTGAGTT CTAAGAGGCC CAAGGGGATC CTGAAGAAGC GAAGCAACAG 2601 CGAGCATCGC TCTCACAGCA CTGGCTTCAT TGAAGGTGTA GTTGGTCCTG 2651 CCTTACCCTC TACTTTCAAG ATGGAGCAGG ACTTGTGCAG GACTGGCGTG 2701 CTCCTCCCAA GCTCACCAGA GGCAGAGGTG CCGGGAAAAC TCAGCCCCAA 2751 GCAGTCGGCC ACGATGCCCA AGAAAGGCAT CTTGAAAAAG ACCCAGCAGA 2801 GAGAATCAGG TTACTACTCT TCCCCAGAGC GCAGTGAGTC TTCGGAGCTG 2851 TTGGACAGTA ATGATGTGAT GGGCAGCAGC ATCCCCTCCC CCAGCCCCCC 2901 GGACCCAGCC AGGGTAACCT CCCACAGCCT CTCCTGCCGG AGGAAGGGCA 2951 TCTTGAAACA CAGCAGCAAA TACTCAGCGG GCACCATGGA CCCAGCCCTG 3001 GTCAGCCCTG AAATGCCCAC ACTGGAATCC CTGTCAGAGC CTGGTGTCCC 3051 TGCCGAGGGC CTCTCCCGGA GCTACAGCCG CCCTTCCAGT GTCATCAGCG 3101 ATGACAGCGT GCTGTCCAGC GACTCTTTTG ACTTGCTGGA TTTGCAGGAG 3151 AATCGCCCTG CCCGCCAGCG CATCCGCAGC TGCGTCTCTG CAGAAAACTT 3201 CCTCCAGATC CAGGACTTTG AGGGGCTCCA GAACCGGCCC CGGCCCCAGT 3251 ACCTGAAGCG GTACCGGAAC CGGCTGGCAG ACAGCAGCTT CTCCCTCCTC 3301 ACAGACATGG ATGATGTGAC TCAGGTCTAC AAGCAAGCGC TGGAGATCTG 3351 CAGCAAGCTC AACTAGCATT CCAGGGCGCC CAGGGGCGGG CGGGGGTACG 3401 AGGGAGGAAG GGGAGCAAGA CTTGGGCTCA CAGGCTGGTT ACCTCTTTGC 3451 TGGCTGTGAC AACAGACTGA AAAAGGATTG GCACTGTCTC ACTTGGCCAA 3501 GTTTGCAGCC TTGAGCCAAC ACCTAAAAGG GAGAGGTGGG CTCTTCTGCC 3551 AGTTCTGTCA ATTGTCAGTC AGAATTTGGG CCCTGTTTGG CATTTGCTTT 3601 ATGGCACCTC CTAGAGGACC AGCTGTCCAG GGGAGGTGGT ATTGACCGGC 3651 ACTCAGTGGG TGGAGAGGAA GCATATGTGG AAGGAGCATT TCCTTAGAAA 3701 TGCTTCATTC ATCCAGATGC TTCTGGAGGA GGGGCAGGAG ACACTTGGGC 3751 TGTTTGCCTT GGGCGAGCCC AAAGAACTTG CCCCTTTTCT CCTTGCATTA 3801 GCAAGCTAGG TCTGGCTGCG TGGAGCTGGC AAGTAGATTT CAGCAACTTG 3851 AGCTTGAGTT GATGATCAAT TAATAGTGGC TGCCAGTTGT GCTGGCGTAA 3901 GTGGCCCACA TCATGGGGAA GGAGTGCTGT GATTGACTAG TAATGGCTAC 3951 CACGGGAAAG GGAAGGGGAA GCAGTAGCAC TAATGCTATG TAGTTGTCAT 4001 CTTTGATCTG GCTAGGCCCT GGGAATCGGG TTTAGTCATC CTGTGGGATC 4051 TGTGTTAACT CTTTCATGCC ACTGGTGAGG CATTTGTTAA TTTGCTACTC 4101 AACTTTGAGG AAAGACAGGG CCTTGGTCAG AGAGAGAATG CTCTGAACTC 4151 TGCTAAGGAC ATAGAGTCAG CCCATGGTGA TTTAGCTCCT TGCTGTTCAC 4201 CTCCTCTTTC CTGATGTCTG CCTTGCTCTA CAGCACAACC TCTTGAGGGT 4251 GGACAGGGAG AAAGATGATG GTGTCAGAGG TCAAAACTAT TATATATGAC 4301 AGGGCACAAG ATGGTCTGTG ATCTTTGCAC AGATGAATGG AAGTTGATGC 4351 ACACCAACAA GAGGCAACTT GTCACTTTCT TTCTCAATAT TAACTGGAAT 4401 GCTGCCTCTT GGGTTCTCAC CTGCATGGAT GCTTTGAGTT GGATGTGATA 4451 CTGTCCATAT TCTCCAGAGG ATTACCTGGC TGAACCATTG GCTCTGTTCA 4501 CCAGTGACAG ATGGTTTCCC CATCCACTGA GTGTAGCATC CTCAGAGGTA 4551 GGCAAGTTTG CTTCTAGGGA GTTAGCATGT AGATGGGATA TTGGGATGAG 4601 GAAAGGAAAA TCAGGTAGAT GGTGCTTTTT TTCCCCCAAA TCTAAGTATT 4651 CTATGTCATG GTTTTAAACT TTGCCATGAA CTCCTGGGCT TTGGGGGAAG 4701 AGAAAGTTCC ATTCATTTAA ATGAATAAGG TGTTGAAAGA GTGCAGGGGG 4751 TTGGGAGGAA GCATGTAAGA GAGGGAACAT TTCCTTAGAT GTTACCCAGA 4801 TGGTTCTGGG GGAGACAGAA AAGAGGTGCG GCAGGACTCT TATCTTAAAA 4851 AGTAAACAAA ACAAAACAAA ACAAAACAAA AAAACTAGAT ATGTAATTTC 4901 TAAACACCCA GATCACAATG ACAAGATGCC ACTCCAACCA TGGGACACCT 4951 TCATGATACT AGGTTTGTAC TTCCTGGTCT CTGGGATGAC TTCAGATTCT 5001 GCTGGCCAAG GCAAATTGAA CTCAGTTCAA GATGGCCACC ACTGGTAGAC 5051 GTGTAGATAG AAAAGAGGAC TGGTCTTGGG AACATCTTTG GAAAAACCAA 5101 CAAACAATAG TTCTAGGGAG ATGAGAAAAA AATTCACCTT ACAGTGCTAA 5151 GAAAGTGCAT TAGAATGGAA TTGCCCTTTC CTTAAGGAGA CAGTTTGGGC 5201 TCTCCCCTTG CCACCGGCTC TGGTGTTTTG GCTTATGCGT TCCTTCAGGT 5251 TGAGCTGAGC AGTGTGTTAT GGGAAGCTGC TCAATTTCCT TTCATTCAAT 5301 TCCACCTCCT TCCTGAACTC TAATAGAGGT TAAAAGGGAA AAAAAAAATT 5351 CTGTAGATAG CAAATTGTGT GTGTGGGGGG GGGTGGGGGT GTGGGTGCAT 5401 GGAGGACAAC CTGCAACTCT GAGCTCCCTA CTTCCTGCCT CATTTCATGC 5451 AGTCTTTTCT GAACAGCCTA TGCTGCTGCC CTGCTGGCCC CTTGTGCACG 5501 GCAGCTGGCC GTGTCCGTAG CTGTCAGTAT GACTTAGATC TAGCTCCTAC 5551 CTACTGGTTG ATGTGTTTTT TCCTTTTGCC AAGTGATTGA GTCTGTTTAG 5601 TAGTTTCCAT CATTCTAGTC TTTAAGTAAA AATGACACTA TTGAGGAAAG 5651 TCAGTCTACT CCCTTCTTCC TCCCCCCAAA CACGTGTTCT CTTTTGTCAG 5701 GAAACTCAGC CAGTGGGCTG TGGCAGAGAA AGTCCTCCAC TCAGAGGCAG 5751 AGACTGAGTT AAGTCATAGG TGGCCTTAGG CATCTGCATT GTTTGCAGGG 5801 GTTAAGTTTT CCTTCCAGTG AGGGCTGGAG GGATGAATTA GCTGGTACCT 5851 GAAGCCCCGC TTAGCTCTGA CACTCTGCCA ACATCCTCTG ATTCTAGGTG 5901 TGGTGTTGAC TGTCCTTTCA AGGAAAAACT TGCAATAGAG GGAAAAGCCA 5951 TTAAAGCAGC TCCCTGCTTC ATCATTAAGT CCTGTCATCC CTACCAGCCA 6001 ATCCCAGTCA AAGAAGTTAT GCTTTATTCA CTTCTGTGGA ATTACAAGTG 6051 AGAGACACTT TTAGGACCTG ATGGACAAAG CAGGAGATTC ACTGTCAGCT 6101 TTCCTGGTCC TCTCCTTACT TCTGTGGGCC TTGCACCGTC TTAGTTTACA 6151 CATCTGCCAA AGGGGTAGAA TTACACTTCT TTTTACAGGT AAATGTCAAG 6201 GCACAATCAG TTTTCAGGAA GTGCTTCAAG ACCCCAGGTG AAATGAAAAT 6251 GCTAAGTACC CTCTGAATGG CCATGCCTGT TACCAGGTGC TGCTTCTTCA 6301 GATGATGGGG AGCACTTTTC AGGGTGAAAT TCAGGCGAGT TTTGCCCAGG 6351 CCTGCTGTCT TGAGTACAAA TGTGAATGAT CGACTGACTG CTTGTTGCCA 6401 AACTGGAAAT GTTCTGTAGG GATTTACTGG CATGGTATCA TTCCTAGAAG 6451 AAAAAAAGAG AGAAACTTGA CTGCACATTA AAAAAAAAAA AATCCACATT 6501 GTGACTTTTA TTTAATTTCT ATTTTTTTTG GTAATAAAAA GTTGACTTTT 6551 TTATTTGAAT TTGTCTTTTT TATTTATTGG TCTGAAAGGC ATTTCAAAGG 6601 TATTATAATA ATATATTGGT GTAATTTAAT TGGTGCAACA TGCTTTATGG 6651 CTCCTGTCAA AATTGGTTTT CACTCATTTG ATTGGTTTGA GCCCAGAACA 6701 GCCTACAGGG GAAAAACAAG CTGGATAACC ACCCAAAGTG TTTGTATTTT 6751 CGTTGGAAAC TGATTTTTGT TTCATTTTGG TTTTTGTTTC TGTTTTTATT 6801 TTTAAATTAA ATAAATTGCA ATGAACTG // LOCUS AB023172 5059 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0955 protein, complete cds. ACCESSION AB023172 NID g4589553 VERSION AB023172.1 GI:4589553 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj05544. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 5059) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .5059 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hj05544" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 314. .1609 /gene="KIAA0955" CDS 314. .1609 /gene="KIAA0955" /codon_start=1 /product="KIAA0955 protein" /protein_id="BAA76799.1" /db_xref="PID:d1040552" /db_xref="PID:g4589554" /db_xref="GI:4589554" /translation="MMRQRQSHYCSVLFLSVNYLGGTFPGDICSEENQIVSSYASKVC FEIEEDYKNRQFLGPEGNVDVELIDKSTNRYSVWFPTAGWYLWSATGLGFLVRDEVTV TIAFGSWSQHLALDLQHHEQWLVGGPLFDVTAEPEEAVAEIHLPHFISLQGEVDVSWF LVAHFKNEGMVLEHPARVEPFYAVLESPSFSLMGILLRIASGTRLSIPITSNTLIYYH PHPEDIKFHLYLVPSDALLTKAIDDEEDRFHGVRLQTSPPMEPLNFGSSYIVSNSANL KVMPKELKLSYRSPGEIQHFSKFYAGQMKEPIQLEITEKRHGTLVWDTEVKPVDLQLV AASAPPPFSGAAFVKENHRQLQARMGDLKGVLDDLQDNEVLTENEKELVEQEKTRQSK NEALLSMVEKKGDLALDVLFRSISERDPYLVSYLRQQNL" BASE COUNT 1305 a 1024 c 1047 g 1683 t ORIGIN 1 CTGGTTCTCA ACTTCTTTTG AAATAATGTT CATAGAGAAG GAGGGCTGTC 51 TGAGATTCGA GGGAAACAAG CTCTCAGGAC TTCCGGTCGC CATGATGGCT 101 GTGGGCGGTA AACGCGGTTA GTGCAAGCAT CTGGGCCATC TTCAATGGTA 151 AAAAAGATAC AGTAAAGACA TAAATACCAC ATTTGACAAA TGGAAAAAAA 201 GGAGTGTCCA GAAAAGAGTA GCAGCAGTGA GGAAGAGCTG CCGAGACGGG 251 TATACAGGGA GCTACCCTGT GTTTCTGAGA CCCTTTGTGA CATCTCACAT 301 TTTTTCCAAG AAGATGATGA GACAGAGGCA GAGCCATTAT TGTTCCGTGC 351 TGTTCCTGAG TGTCAACTAT CTGGGGGGGA CATTCCCAGG AGACATTTGC 401 TCAGAAGAGA ATCAAATAGT TTCCTCTTAT GCTTCTAAAG TCTGTTTTGA 451 GATCGAAGAA GATTATAAAA ATCGTCAGTT TCTGGGGCCT GAAGGAAATG 501 TGGATGTTGA GTTGATTGAT AAGAGCACAA ACAGATACAG CGTTTGGTTC 551 CCCACTGCTG GCTGGTATCT GTGGTCAGCC ACAGGCCTCG GCTTCCTGGT 601 AAGGGATGAG GTCACAGTGA CGATTGCGTT TGGTTCCTGG AGTCAGCACC 651 TGGCCCTGGA CCTGCAGCAC CATGAACAGT GGCTGGTGGG CGGCCCCTTG 701 TTTGATGTCA CTGCAGAGCC AGAGGAGGCT GTCGCCGAAA TCCACCTCCC 751 CCACTTCATC TCCCTCCAAG GTGAGGTGGA CGTCTCCTGG TTTCTCGTTG 801 CCCATTTTAA GAATGAAGGG ATGGTCCTGG AGCATCCAGC CCGGGTGGAG 851 CCTTTCTATG CTGTCCTGGA AAGCCCCAGC TTCTCTCTGA TGGGCATCCT 901 GCTGCGGATC GCCAGTGGGA CTCGCCTCTC CATCCCCATC ACTTCCAACA 951 CATTGATCTA TTATCACCCC CACCCCGAAG ATATTAAGTT CCACTTGTAC 1001 CTTGTCCCCA GCGACGCCTT GCTAACAAAG GCGATAGATG ATGAGGAAGA 1051 TCGCTTCCAT GGTGTGCGCC TGCAGACTTC GCCCCCAATG GAACCCCTGA 1101 ACTTTGGTTC CAGTTATATT GTGTCTAATT CTGCTAACCT GAAAGTAATG 1151 CCCAAGGAGT TGAAATTGTC CTACAGGAGC CCTGGAGAAA TTCAGCACTT 1201 CTCAAAATTC TATGCTGGGC AGATGAAGGA ACCCATTCAA CTTGAGATTA 1251 CTGAAAAAAG ACATGGGACT TTGGTGTGGG ATACTGAGGT GAAGCCAGTG 1301 GATCTCCAGC TTGTAGCTGC ATCAGCCCCT CCTCCTTTCT CAGGTGCAGC 1351 CTTTGTGAAG GAGAACCACC GGCAACTCCA AGCCAGGATG GGGGACCTGA 1401 AAGGGGTGCT CGATGATCTC CAGGACAATG AGGTTCTTAC TGAGAATGAG 1451 AAGGAGCTGG TGGAGCAGGA AAAGACACGG CAGAGCAAGA ATGAGGCCTT 1501 GCTGAGCATG GTGGAGAAGA AAGGGGACCT GGCCCTGGAC GTGCTCTTCA 1551 GAAGCATTAG TGAAAGGGAC CCTTACCTCG TGTCCTATCT TAGACAGCAG 1601 AATTTGTAAA ATGAGTCAGT TAGGTAGTCT GGAAGAGAGA ATCCAGCGTT 1651 CTCATTGGAA ATGGATAAAC AGAAATGTGA TCATTGATTT CAGTGTTCAA 1701 GACAGAAGAA GACTGGGTAA CATCTATCAC ACAGGCTTTC AGGACAGACT 1751 TGTAACCTGG CATGTACCTA TTGACTGTAT CCTCATGCAT TTTCCTCAAG 1801 AATGTCTGAA GAAGGTAGTA ATATTCCTTT TAAATTTTTT CCAACCATTG 1851 CTTGATATAT CACTATTTTA TCCATTGACA TGATTCTTGA AGACCCAGGA 1901 TAAAGGACAT CCGGATAGGT GTGTTTATGA AGGATGGGGC CTGGAAAGGC 1951 AACTTTTCCT GATTAATGTG AAAAATAATT CCTATGGACA CTCCGTTTGA 2001 AGTATCACCT TCTCATAACT AAAAGCAGAA AAGCTAACAA AAGCTTCTCA 2051 GCTGAGGACA CTCAAGGCAT ACATGATGAC AGTCTTTTTT TTTTTTGTAT 2101 GTTAGGACTT TAACACTTTA TCTATGGCTA CTGTTATTAG AACAATGTAA 2151 ATGTATTTGC TGAAAGAGAG CACAAAAATG GGAGAAAATG CAAACATGAG 2201 CAGAAAATAT TTTCCCACTG GTGTGTAGCC TGCTACAAGG AGTTGTTGGG 2251 TTAAATGTTC ATGGTCAACT CCAAGGAATA CTGAGATGAA ATGTGGTAAA 2301 TCAACTCCAC AGAACCACCA AAAAGAAAAT GAGGGTAATT CAGCTTATTC 2351 TGAGACAGAC ATTCCTGGCA ATGTACCATA CAAAAAATAA GCCAACTCTG 2401 ACATTTGGAT TCTACCATAG ACTCTGTCAT TTTGTAGCCA TTTCAGCTGT 2451 CTTTTGATTA ATGTTTTCGT GGCACACATA TTTCCATCCT TTTATGTTTA 2501 ATCTGTTTAA AACAAGTTCC TAGTAGACAC CATCTGGTTG AGTCAGTTTT 2551 TTTTATGGTG TATTTTGAAC CCATTCTGAT AGTCTCTTTT AACTGGAAGA 2601 TTTCAATTAC TTACGTTAAT GTAATTATTA ATATGTTAGG ATTTATCCTC 2651 AGTCAGCCAG TTTGTTATGT CTTTTCTATT CTACTGTTAT CACATTTGTA 2701 CCACTTAAAG TGGAATCTAG GCACTTTATC ACCATTTAGA TCCTATTACC 2751 TTTTCTCATC TAGGATATAG TTATCTTCTA CATAATCTTT CTGTATCTTA 2801 AAACCCATCA ATAAATTATT ATATATTTTC TACTTTTAAT CACTCAGAAG 2851 ATTTAAAAAA CTCATGAGAA GAGTAATCTG TTATGTTTTT CCAGATATTT 2901 ACCATTTCTG TTGCTCTTCC TTCATTATTT TCCAAATTTC GTTCTGCAAA 2951 TTTCCACTTC TTCTGATAGA CGTTTTTTAG TTCTTTTAGA GTGGTTCTGA 3001 TAGGTACAGA TTCTCTTATT TTTTGCTTCC TCTGAGGACA TCTTTTTCTC 3051 ACCTTCATTC TCAGTGATGT TTTTTGCTTG TAGTATTTTT AGTTGACATT 3101 GTTTTCTGTT CAGCAGTTTC CTTTTAGCTT CCGTATTTCC TGATGAGAAA 3151 TCTGCAGTCA TTCAAATTGT TGTTTCCCTG TATGTAGTGT GTCATTTTTC 3201 TGTCAGATTT CAAGGTATTT ATCTTTAGTT TTTAGCCATT TCATTATGTT 3251 GGGGATGAGT TTCCTTGTTT TATTCCCTTT GGAATTTGCT CCAATTCATA 3301 AATTTGCAGT TTTATGTCTT TTACCAAACT TAGAGGTTTT CAGCCTAATT 3351 TCTAAAAATA CTTTTTATTA GCCTGATTTT CATCTTTATA GGAAATAGTT 3401 TAAGTGATGA CAAGTTCCAA TAGCTTATAT GCCCAGAAGG CCTTCAAAAT 3451 AAGAATTTTG AAAGAATACA GAAAACAAAC TTTTATATCC TTCTCATGTC 3501 TTCTACTGTA AAATTCATAT GCTTTGCTAC TCTAAACCTA GTTTGAAATC 3551 AACAGTCTTG AGAATAGATG AAAATTTTGA TGAATAGTGG AATTCTTTTA 3601 AATGGAAACC TCTTACATGT GATTTTCCTT GCCATCTAGA AATAAACCAT 3651 AGTATTTATG TTGAATCAAT CAATATTATA TTTTGTTTTT TTCCTCCTCT 3701 TCTGAGACTC TTATTGTGGA AATGTTAGAC TTTTATGTTT TCCTAAATGT 3751 CCCTGATATT CTACTTATTT AGAACATCTT TTCATTTTTT CCATTATTCT 3801 GATTGGGTAA TTTTAATTTG TCTATTTTCA AATTTGCTGG AGTGTTCACC 3851 TGTTGTTGTC TGTGTCGTCC CACTGAGTGC ATTCACCACC TTTTAAATTT 3901 TGGTCACTGT ATGTATCAGT TCTAAAATTT CCATTTTGTT CTCTATATTT 3951 TAAATTTCTT GGCTTATATT CTATTTTCCT GCAAATGTGT CAGCATTTGC 4001 TTGTTTGAGC TTTTTTTTTT TCAAGACAGG GTCTCAACTC TGTTACCCAG 4051 GCTGGAGTGC AGTGGTGCGA TCTCAGCTCA CTGCAACCTC TGCCTCCTGG 4101 TTCAAGCGAT TATTGTGCCT CAGCCTCCTG AGTAGCTGGG ATTACAGGCA 4151 TGCACCACCA CAGCCCAGCT AATTTTTTGT ATTTTTAGTA GAGACAGAGT 4201 TTTGCTATGT TGGCCAGGCT GGTTTTGAAC TCCTGGCCTC AAGTGATCCA 4251 CCCACCTCAG CCTCCCAAAG TGCTGGGATT ACAGGCCACT ACACCTGGCA 4301 CATTTGAGTA TTTTTTTTTT TTTTTTTTTT TTGAGATGGA GTCTCGCTCT 4351 GTCATCTAGG CTGGAGTGCA GTGGTGTGAT CTCAGCTCAC TGCAGCCTCT 4401 GTCTCCCGGG CTCAAGCGAT TCTCTTGCCT CAGCCTCCTG AGTAGCTAGG 4451 ACTACAGGTG CATGCCAACA CGCCCGGCTA ATTTTTTTAA AAAATATTTT 4501 TAGTAGAGAC AGGGTTTCAC CATTTTGGCC AGGATGGTCT CGATCTCCTG 4551 ACCTCATGAT CCACCCGCCT CGGCCTTCCA AAGTGCTGGG ATTACAGGCA 4601 TGAGCCACCG TGCCTGGCCT CATTTGAGTA TTTTTATAAT GTCTCTTTTA 4651 AAGTCTTTGT CAGATAATTC CACTGTACAT GTTATTCAGT GTTTGGTGTC 4701 CACTGAGTTG TCATTTGCCA GACAAGTGGA GATTTTTGCA GCTCATCCTT 4751 GTATTCTCAG TAGTTCCGAT ATGTACCCTC GACATGTGAA TGTTATCTTA 4801 TGAGACTCTG TTTTATTTGT ATCCAACAGA AGATGTTTAT TATTTATTTG 4851 GCTTTCTGTG AACTGAGGTC TTAATATCAG CTCATTTTAA AAGTCTTTGC 4901 AGTGGTATTC GGATCTATCC TGTGTGTGCC TATGAGATTG GGTGCAGTGT 4951 ATCCTGTTAG CTCCATTCTC AGGGCGTTTG AATGTGAATT AGGACCAGCG 5001 CAATGAATGC TCAAGTTGGG GTTGGGCGTT AGAATTCATA AAAGTCTTTA 5051 TATGCTCAG // LOCUS AF035582 3937 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens CASK mRNA, complete cds. ACCESSION AF035582 NID g2661105 VERSION AF035582.1 GI:2661105 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3937) AUTHORS Zha,D. and Hu,G. TITLE The human homolog of the rat CASK, Drosophila Camguk and C.elegans Lin-2 genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 3937) AUTHORS Zha,D. and Hu,G. TITLE Direct Submission JOURNAL Submitted (24-NOV-1997) Max-Planck Junior Group Number 2, Shanghai Institute of Cell Biology, 320 Yue-Yang Road, Shanghai, Shanghai 200031, P. R. China FEATURES Location/Qualifiers source 1. .3937 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1. .3937 /gene="CASK" CDS 16. .2709 /gene="CASK" /note="N-terminal contains a Ca/calmodulin-dependent kinase domain next to a PDZ domain which links to a C-terminal guanylate kinase domain; similar to rat CASK, Drosophila Camguk, and C. elegans Lin-2" /codon_start=1 /product="CASK" /protein_id="AAB88198.1" /db_xref="PID:g2661106" /db_xref="GI:2661106" /translation="MADDDVLFEDVYELCEVIGKGPFSVVRRCINRETGQQFAVKIVD VAKFTSSPGLSTEDLKREASICHMLKHPHIVELLETYSSDGMLYMVFEFMDGADLCFE IVKRADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPHCVLLASKENSAPVKL GGFGVAIQLGESGLVAGGRVGTPHFMAPEVVKREPYGKPVDVWGCGVILFILLSGCLP FYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRMLMLDPAERITVYEALNHPWL KERDRYAYKIHLPETVEQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSED PTSSGAVSQVLDSLEEIHALTDCSEKDLDFLHSVFQDQHLHTLLDLYDKINTKSSPQI RNLPSDAVQRAKEVLEEISCYPENNDAKELKRILTQPHFMALLQTHDVVAHEVYSDEA LRVTPPPTSPYLNGDSPESANGDMDMENVTRVRLVQFQKNTDEPMGITLKMNELNHCI VARIMHGGMIHRQGTLHVGDEIREINGISVANQTVEQLQKMLREMRGSITFKIVPSYR TQSSSCEDLPSTTQPKGRQIYVRAQFEYDPAKDDLIPCKEAGIRFRVGDIIQIISKDD HNWWQGKLENSKNGTAGLIPSPELQEWRVACIAMEKTKQEQQASCTWFGKKKKQYKDK YLAKHNAVFDQLDLVTYEEVVKLPAFKRKTLVLLGAHGVGRRHIKNTLITKHPDRFAY PIPHTTRPPKKDEENGKNYYFVSHDQMMQDISNNEYLEYGSHEDAMYGTKLETIRKIH EQGLIAILDVEPQALKVLRTAEFAPFVVFIAAPTITPGLNEDESLQRLQKESDILQRT YAHYFDLTIINNEIDETIRHLEEAVELVCTAPQWVPVSWVY" BASE COUNT 1230 a 750 c 847 g 1110 t ORIGIN 1 TATCCCCTCC GGACCATGGC CGACGACGAC GTGCTGTTCG AGGATGTGTA 51 CGAGCTGTGC GAGGTGATCG GAAAGGGTCC CTTCAGTGTT GTACGACGAT 101 GTATCAACAG AGAAACTGGG CAACAATTTG CTGTAAAAAT TGTTGATGTA 151 GCCAAGTTCA CATCAAGTCC AGGGTTAAGT ACAGAAGATC TAAAGCGGGA 201 AGCCAGTATC TGTCATATGC TGAAACATCC ACACATTGTA GAGTTATTGG 251 AGACATATAG CTCAGATGGA ATGCTTTACA TGGTTTTCGA ATTTATGGAT 301 GGAGCAGATC TGTGTTTTGA AATCGTAAAG CGAGCTGACG CTGGTTTTGT 351 GTACAGTGAA GCTGTAGCCA GCCATTATAT GAGACAGATA CTGGAAGCTC 401 TACGCTACTG CCATGATAAT AACATAATTC ACAGGGATGT GAAGCCCCAC 451 TGTGTTCTCC TTGCCTCAAA AGAAAACTCG GCACCTGTTA AACTTGGAGG 501 CTTTGGGGTA GCTATTCAAT TAGGGGAGTC TGGACTTGTA GCTGGAGGAC 551 GTGTTGGAAC ACCTCATTTT ATGGCACCAG AAGTGGTCAA AAGAGAGCCT 601 TACGGAAAGC CTGTAGACGT CTGGGGGTGC GGTGTGATCC TTTTTATCCT 651 GCTCAGTGGT TGTTTGCCTT TTTACGGAAC CAAGGAAAGA TTGTTTGAAG 701 GCATTATTAA AGGAAAATAT AAGATGAATC CAAGGCAGTG GAGCCATATC 751 TCTGAAAGTG CCAAAGACCT AGTACGTCGC ATGCTGATGC TGGATCCAGC 801 TGAAAGGATC ACTGTTTATG AAGCACTGAA TCACCCATGG CTTAAGGAGC 851 GGGATCGTTA CGCCTACAAG ATTCATCTTC CAGAAACAGT AGAGCAGCTG 901 AGGAAATTCA ATGCAAGGAG GAAACTAAAG GGTGCAGTAC TAGCCGCTGT 951 GTCAAGTCAC AAATTCAACT CATTCTATGG GGATCCCCCT GAAGAGTTAC 1001 CAGATTTCTC CGAAGACCCT ACCTCCTCAG GAGCAGTCTC ACAGGTGCTG 1051 GACAGCCTGG AAGAGATTCA TGCGCTTACA GACTGCAGTG AAAAGGACCT 1101 AGATTTTCTA CACAGTGTTT TCCAGGATCA GCATCTTCAC ACACTACTAG 1151 ATCTGTATGA CAAAATTAAC ACAAAGTCTT CACCACAAAT CAGGAATCTT 1201 CCAAGCGATG CAGTACAGAG AGCCAAAGAG GTATTGGAAG AAATTTCATG 1251 TTACCCTGAG AATAACGACG CAAAGGAACT AAAGCGTATT TTAACACAAC 1301 CTCATTTCAT GGCCTTACTT CAGACTCACG ACGTAGTGGC ACATGAAGTT 1351 TACAGTGATG AAGCATTGAG GGTCACACCT CCTCCCACCT CTCCCTATTT 1401 AAACGGCGAT TCTCCAGAAA GTGCTAACGG AGACATGGAT ATGGAGAATG 1451 TGACCAGAGT TCGGCTGGTA CAGTTTCAAA AGAACACAGA TGAACCAATG 1501 GGAATCACTT TAAAAATGAA TGAACTAAAT CATTGTATTG TTGCAAGAAT 1551 TATGCATGGG GGCATGATTC ACAGGCAAGG TACACTTCAT GTTGGTGATG 1601 AAATTCGAGA AATCAATGGC ATCAGTGTGG CTAACCAAAC AGTGGAACAA 1651 CTGCAAAAAA TGCTTAGGGA AATGCGGGGG AGTATTACCT TCAAGATTGT 1701 GCCAAGTTAC CGCACTCAGT CTTCGTCCTG TGAGGACTTG CCATCAACTA 1751 CCCAACCAAA AGGACGACAG ATCTATGTAA GAGCACAATT TGAATATGAT 1801 CCAGCCAAGG ATGACCTCAT CCCCTGTAAA GAAGCTGGCA TTCGATTCAG 1851 AGTTGGTGAC ATCATCCAGA TTATTAGTAA GGATGATCAT AATTGGTGGC 1901 AGGGTAAACT GGAAAACTCC AAAAATGGAA CTGCAGGTCT CATTCCTTCT 1951 CCTGAACTTC AGGAATGGCG AGTAGCTTGC ATTGCCATGG AGAAGACCAA 2001 ACAGGAGCAG CAGGCCAGCT GTACTTGGTT TGGCAAGAAA AAGAAGCAGT 2051 ACAAAGATAA ATATTTGGCA AAGCACAATG CAGTGTTTGA TCAATTAGAT 2101 CTTGTCACAT ATGAAGAAGT AGTAAAACTG CCAGCATTCA AGAGGAAAAC 2151 ACTAGTCTTA TTAGGCGCAC ATGGTGTTGG GAGAAGACAC ATAAAAAACA 2201 CTCTCATCAC AAAGCACCCA GACCGGTTTG CGTACCCTAT TCCACATACA 2251 ACCAGACCTC CAAAGAAAGA CGAAGAAAAT GGAAAGAATT ATTACTTTGT 2301 ATCTCATGAC CAAATGATGC AAGACATCTC TAATAACGAG TACTTGGAGT 2351 ACGGCAGCCA CGAGGATGCG ATGTATGGGA CAAAACTGGA GACCATCCGG 2401 AAGATCCACG AGCAGGGGCT GATTGCAATA CTGGACGTGG AGCCTCAGGC 2451 ACTGAAGGTC CTGAGAACTG CAGAGTTTGC TCCTTTTGTT GTTTTCATTG 2501 CTGCACCAAC TATTACTCCA GGTTTAAATG AAGATGAATC TCTTCAGCGT 2551 CTGCAGAAGG AGTCTGACAT CTTACAGAGA ACATATGCAC ACTACTTCGA 2601 TCTCACAATT ATCAACAATG AAATTGATGA GACAATCAGA CATCTGGAGG 2651 AAGCTGTTGA GCTCGTGTGC ACAGCCCCAC AGTGGGTCCC TGTCTCCTGG 2701 GTCTATTAGG CCTCTCCCCA GATATCTGAG CATAACTGGG AGCACCTCAT 2751 TTGTGGAAAA GCCTCTTTGT TATCGGCCTT GTGTCAGCAG GTCATGGTCC 2801 CTAGAGACTA CCTAGTTGTA GTGTGACCTA CATTTATAAT TATTGTCATG 2851 TCCGAATAGA TAGGAGGAGA AAAACAATTA CACACTAATT TAAAGAGACA 2901 GTATCTTTTT TAATCAGTTC TCCTAAACTT TAATAAAATG TATCTTTAAA 2951 TGTATGTATT ATTCAATCCT TTGGAATGTT ATATTTTTGG AAATCATAGC 3001 TTTTTATTTC CAAGGCCCCT AAAAACTGCA CAAAATAGAT GCTGCTTTCT 3051 ATAATCTATT TTAATAATAA TAAACAATGA TTCTGTTACC TTGACTGGGG 3101 GTGGAACACT ACATTCTTTT TAGAGTCTGA TTTTATGGAT TGGAATATTG 3151 GGATATCTTT CTTTCCTTTA TTTATTTGAA ATAATTAGTG TAGTGATTAC 3201 AAAACAGTCA TAAATTTTTA AAGGCCTTTT TTCTCTCTTT TTTTTTTTTT 3251 AGAATAGTAT TTTTTTTTTA AGTCCTTTAT GTAACATCCA GATAATGTGA 3301 TACTGTCTCT TTGAAGCACC CTGTAAACCT TTTAGAGATT TGAAGTTGGG 3351 TCTTGACTCT TAATGCATGT GGACAGTCGC GAGCGTTTAT GCTGTCGGTG 3401 TGTCTGTGTT GGACAAAACA ATCTGTAATC TACAGCAAAG TACATTCTAC 3451 ATTCCGTTCA TGGGGCACAC CCAGGGGAAT AAGAATAAAA TGCTATTATG 3501 ACTAAGTTGT AAACCTATGC ACATCCCTTG CATTTTGGGC AACTTTATAA 3551 AAAAAAAGAA ACTGATTTTT ATTAATAATA ATCATGTAGT GAAATGTGTT 3601 TGTAATTTTG TCTCAATTTA ATTTGTTGTA AGGTGGGGTG GGGGGAATTG 3651 CTGGTTTCAC CATTTCAGAT CTGTGTTGTC TAAGAGTATT AACGTTTTAA 3701 TTAAGCAAAG AAATGATTTT TAATCTGTAT GTAATTGTTT TAAAGCACCC 3751 ATTTTAAGAG AAAATACTGT GCAATGAAGA AACCAGTTTA GGCATTTGCT 3801 ATAAACTGAA ATATTCCAAA AGAATCATCT ATAACAGCCC TGTAAATTCC 3851 TTTAAAATGA TAACTAACAG GACAGTTTGA CCAATTTTTT TTAAATATAC 3901 TTCCTTTTAT GTGTTCAATA AAAAAAAAAA AAAAAAA // LOCUS AB020700 4195 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0893 protein, complete cds. ACCESSION AB020700 NID g4240274 VERSION AB020700.1 GI:4240274 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk08702. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (6), 355-364 (1998) MEDLINE 99156230 REFERENCE 2 (bases 1 to 4195) AUTHORS Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (02-DEC-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4195 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk08702" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 224. .2983 /gene="KIAA0893" CDS 224. .2983 /gene="KIAA0893" /codon_start=1 /product="KIAA0893 protein" /protein_id="BAA74916.1" /db_xref="PID:d1038650" /db_xref="PID:g4240275" /db_xref="GI:4240275" /translation="MTAEETVNVKEVEIIKLILDFLNSKKLHISMLALEKESGVINGL FSDDMLFLRQLILDGQWDEVLQFIQPLECMEKFDKKRFRYIILKQKFLEALCVNNAMS AEDEPQHLEFTMQEAVQCLHALEEYCPSKDDYSKLCLLLTLPRLTNHAEFKDWNPSTA RVHCFEEACVMVAEFIPADRKLSEAGFKASNNRLFQLVMKGLLYECCVEFCQSKATGE EITESEVLLGIDLLCGNGCDDLDLSLLSWLQNLPSSVFSCAFEQKMLNIHVDKLLKPT KAAYADLLTPLISKLSPYPSSPMRRPQSADAYMTRSLNPALDGLTCGLTSHDKRISDL GNKTSPMSHSFANFHYPGVQNLSRSLMLENTECHSIYEESPERDTPVDAQRPIGSEIL GQSSVSEKEPANGAQNPGPAKQEKNELRDSTEQFQEYYRQRLRYQQHLEQKEQQRQIY QQMLLEGGVNQEDGPDQQQNLTEQFLNRSIQKLGELNIGMDGLGNEVSALNQQCNGSK GNGSNGSSVTSFTTPPQDSSQRLTHDASNIHTSTPRNPGSTNHIPFLEESPCGSQISS EHSVIKPPLGDSPGSLSRSKGEEDDKSKKQFVCINILEDTQAVRAVAFHPAGGLYAVG SNSKTLRVCAYPDVIDPSAHETPKQPVVRFKRNKHHKGSIYCVAWSPCGQLLATGSND KYVKVLPFNAETCNATGPDLEFSMHDGTIRDLAFMEGPESGGAILISAGAGDCNIYTT DCQRGQGLHALSGHTGHILALYTWSGWMIASGSQDKTVRFWDLRVPSCVRVVGTTFHG TGSAVASVAVDPSGRLLATGQEDSSCMLYDIRGGRMVQSYHPHSSDVRSVRFSPGAHY LLTGSYDMKIKVTDLQGDLTKQLPIMVVGEHKDKVIQCRWHTQDLSFLSSSADRTVTL WTYNG" BASE COUNT 1205 a 836 c 922 g 1232 t ORIGIN 1 GTCGCTGGGC CGGGAGGGGC GGACGTGAGA AGGACGGATT GACGAACTGA 51 TGGATTGACG CGCGGGCGGT AGGAGGGAGG ACCGACGCCA AACCCAGACC 101 GCCGCCGTCG TGCTCCTGCC GCAGCCCGGA GCCGGCCGCT TCGGGGCCCT 151 GGCCGCCGGC CTCCCAGCCG CGTTCTCCTC CGCCGCTCCT CCGGGCTTGC 201 CCTGGAGCCC TCAGGCTATC AATATGACGG CTGAAGAAAC AGTGAATGTA 251 AAAGAGGTTG AAATCATTAA GCTAATTTTG GACTTCCTGA ATTCAAAGAA 301 GCTTCACATT AGTATGCTGG CCCTGGAGAA GGAAAGTGGA GTCATAAATG 351 GCCTGTTTTC AGATGATATG CTTTTCCTGA GGCAGCTAAT ACTTGATGGT 401 CAATGGGATG AAGTTCTTCA GTTCATTCAG CCTCTAGAAT GTATGGAAAA 451 ATTTGACAAA AAAAGGTTTC GTTATATTAT CCTGAAGCAG AAGTTTTTAG 501 AAGCTTTATG TGTTAACAAC GCGATGTCAG CAGAAGATGA GCCCCAGCAT 551 CTGGAATTTA CCATGCAAGA AGCTGTGCAA TGTTTACATG CTCTAGAAGA 601 ATACTGTCCT TCTAAAGATG ACTATAGTAA GCTCTGTTTG CTTTTGACTT 651 TGCCTCGTCT GACCAATCAT GCCGAGTTTA AGGACTGGAA TCCCAGCACC 701 GCACGAGTTC ACTGTTTTGA AGAGGCTTGT GTCATGGTTG CAGAATTCAT 751 CCCTGCTGAT AGGAAGCTAA GTGAAGCTGG TTTTAAGGCT AGTAACAATC 801 GTTTATTTCA GCTTGTAATG AAAGGCCTGC TTTATGAATG CTGTGTAGAA 851 TTTTGTCAGA GTAAAGCAAC TGGAGAAGAA ATTACAGAAA GCGAAGTGCT 901 TCTTGGCATC GACCTCTTAT GTGGTAATGG TTGTGATGAT TTGGATCTGA 951 GTTTACTGTC ATGGCTTCAG AATCTTCCAT CTTCTGTCTT CTCTTGTGCT 1001 TTTGAACAGA AAATGCTTAA TATTCATGTT GACAAACTTC TGAAACCTAC 1051 AAAAGCTGCA TATGCTGATC TTTTGACTCC TCTTATCAGC AAACTCTCTC 1101 CCTATCCATC ATCCCCAATG AGAAGACCTC AATCAGCTGA TGCCTATATG 1151 ACCCGCTCTC TGAATCCTGC TTTAGATGGC CTCACCTGTG GACTAACCAG 1201 TCATGATAAG AGAATTTCAG ACCTTGGAAA CAAAACTTCT CCAATGTCAC 1251 ACTCCTTTGC TAACTTCCAT TATCCAGGGG TACAAAACCT CAGTAGAAGT 1301 CTCATGCTTG AGAATACAGA ATGTCACAGT ATTTACGAAG AATCCCCTGA 1351 GCGTGATACA CCTGTTGATG CACAGAGGCC TATCGGCAGT GAAATCTTGG 1401 GCCAGAGTTC AGTTTCAGAA AAAGAGCCTG CAAATGGAGC ACAGAATCCA 1451 GGACCAGCTA AACAAGAAAA AAATGAGCTT CGAGATTCAA CAGAACAATT 1501 TCAAGAATAT TATAGGCAAA GATTACGCTA TCAACAGCAT TTAGAACAGA 1551 AGGAGCAACA GCGGCAGATA TACCAACAGA TGTTGCTTGA AGGAGGCGTG 1601 AATCAGGAGG ATGGTCCTGA TCAGCAGCAG AATCTTACTG AACAGTTCCT 1651 TAATAGGTCC ATTCAAAAGC TTGGTGAATT AAATATTGGA ATGGATGGCC 1701 TTGGTAATGA GGTATCAGCA CTCAACCAGC AATGTAATGG GAGCAAAGGC 1751 AATGGATCTA ATGGTTCTTC TGTGACTAGT TTTACTACAC CACCCCAAGA 1801 CTCTAGTCAG AGATTAACAC ATGATGCTTC AAATATTCAT ACAAGCACTC 1851 CTCGTAATCC TGGATCAACA AATCACATAC CTTTTCTGGA GGAATCACCT 1901 TGTGGAAGCC AAATCTCTTC AGAACATTCG GTCATTAAGC CACCTCTTGG 1951 AGATTCTCCA GGGAGTCTTT CAAGGTCGAA AGGGGAAGAG GATGACAAAT 2001 CAAAAAAGCA GTTTGTTTGT ATTAATATCC TAGAAGACAC ACAAGCTGTT 2051 AGAGCAGTGG CTTTTCATCC AGCTGGAGGT TTATATGCTG TTGGTTCAAA 2101 TTCAAAAACT CTGAGAGTAT GTGCCTATCC AGATGTAATT GATCCAAGTG 2151 CACATGAGAC TCCTAAGCAG CCGGTGGTAC GTTTTAAAAG GAATAAACAT 2201 CATAAAGGAT CCATTTACTG TGTGGCCTGG AGTCCTTGTG GGCAGTTATT 2251 AGCAACAGGA TCAAATGACA AATACGTCAA AGTGCTGCCC TTCAATGCAG 2301 AGACTTGTAA CGCAACAGGA CCAGATCTGG AATTTAGTAT GCATGATGGA 2351 ACAATTAGAG ACTTGGCATT TATGGAAGGC CCAGAAAGCG GAGGAGCTAT 2401 TTTAATAAGT GCTGGAGCAG GGGATTGTAA CATTTATACA ACCGATTGTC 2451 AAAGAGGTCA GGGCCTCCAT GCTTTGAGTG GACATACTGG GCATATTTTA 2501 GCACTTTATA CCTGGAGTGG CTGGATGATT GCATCTGGTT CCCAAGATAA 2551 GACTGTTAGA TTTTGGGATC TTCGAGTACC AAGTTGTGTT CGTGTTGTTG 2601 GCACAACATT TCATGGAACT GGCAGTGCAG TGGCATCTGT AGCTGTAGAT 2651 CCCAGTGGTC GTCTCTTAGC CACAGGTCAA GAAGATTCTA GCTGCATGTT 2701 GTATGACATA AGAGGAGGAA GAATGGTACA AAGTTATCAT CCTCATTCCA 2751 GTGATGTTCG CTCTGTTCGA TTCTCCCCTG GAGCTCACTA CTTGCTAACA 2801 GGCTCTTATG ATATGAAAAT AAAGGTGACA GACCTACAAG GGGACCTCAC 2851 CAAGCAGCTT CCTATCATGG TGGTGGGGGA GCACAAGGAC AAAGTGATTC 2901 AGTGCAGATG GCACACCCAG GATCTTTCCT TCCTGTCATC CTCTGCAGAT 2951 AGAACTGTCA CCCTCTGGAC TTACAATGGG TAGAGCACAC CGCATGTCAG 3001 TCTATGCAGC AAAAGCACAG AGACTTAAGA CTACTGAGTT GTGAAAATTA 3051 CAAATCTGAA GAACATAGTG TCCAGGAAAG TGGTTTAGCA CGAAGAGGCC 3101 CCTTATTACC ATGTATCCCA CTGATAGGAG GTGTTGGGTG GTGTTATTCC 3151 GCAGTGCTTT CAGTCTTCCA TGTGAGCTCG TGCTGCTGTG ACCTGCTATA 3201 TGTAGTCTCG TTGCCAAAGT CTGCAGAAGA GCTCTTCAGT TGTTGGTGTG 3251 CACTCCAAGT CAGGATGGAC AATGTGTTTA CGGTTTAGTA TTCAATGCAT 3301 TCCTTGGTCT TTGCCTAAAT AACAGTTTTA TATGCACATT GAAATGGAAT 3351 TATACTTCAA CTATATTATT AAATGTAATG CAACCAAGTT CCTCCCAGAT 3401 TAAACTTCCC AGGTGTTCAG AATTACTTTT GCTCTTCTCA CGATCCCATA 3451 TTGTATTATC ACTTGTCTTC TAGAGGTCAG AATTCCATAA TATATGTCAC 3501 TCAAAAGTTA CATGGTTGCT TTCACTTAAG GATCATTATG GAGTTTAAAG 3551 ATGAATGAAA AACTGCTTCT TAGTTTACTA CATGGTATAG GCCCTTTTTT 3601 CTTAAACCCA GGGATATGAT TATTTTGTCA TATAATTTTG TTTCAGGCTA 3651 AAAGGTAAAT GTGTTTGCTT CAGAAACTTG TTAACTTCAG TTTTTTGAAT 3701 GCAACAGGAT ACCTCCCTTC CAAACTGAAC TGTAGAAGCA GAGCAGCAGC 3751 AGTTATGTGA TGCAACACTT GATGGTACAG TAAATTTACT GGCATTTTTC 3801 TCCTTAAAAA TTAAAATCCT TGACATAGAC CATAGCATGG CTTGAAATGC 3851 TATGTCTGCA TGATAATTTA AAATGGAAGA TTTAAACTTT GCACTCCAAA 3901 AGCTTATTTG GATTTTTTTC TTGCACTGTT TTGTGTAATG CAGAATAATG 3951 ATTTTATTTC TACAGCTTTG TAGATTCTAA CATTTATGTA TCTTTATTTT 4001 CATATTGTAC AGTAATTTTA CTTTAAATTA TTTAAATAGG CTATTTTATT 4051 TATTTCAAAT GCAGTTGTAT TAGTTCTCAT TATTGAACTG TCTGTGCACT 4101 GTATGTAGCA AGCATTTTTC ATCTGTTGTA TACAAGTGGA AAGGGTATTA 4151 GAAGTGTAAC TGTGCTATTA TTTCAATAAA GACCTCTTGA CATTT // LOCUS HSU43653 3426 bp mRNA PRI 14-MAR-1996 DEFINITION Human obese protein (ob) mRNA, complete cds. ACCESSION U43653 NID g1226243 VERSION U43653.1 GI:1226243 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3426) AUTHORS Gong,D.W., Bi,S., Pratley,R.E. and Weintraub,B.D. TITLE Genomic structure and promoter analysis of the human obese gene JOURNAL J. Biol. Chem. 271 (8), 3971-3974 (1996) MEDLINE 96223958 REFERENCE 2 (bases 1 to 3426) AUTHORS Gong,D.-W. TITLE Direct Submission JOURNAL Submitted (19-DEC-1995) Da-Wei Gong, Molecular and Cellular Endocrinology Branch, NIDDK/NIH, Bldg10/Rm8D14, 10 Center Drive MSC1822, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1. .3426 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" gene 57. .560 /gene="ob" CDS 57. .560 /gene="ob" /codon_start=1 /product="obese protein" /protein_id="AAC50400.1" /db_xref="PID:g1226244" /db_xref="GI:1226244" /translation="MHWGTLCGFLWLWPYLFYVQAVPIQKVQDDTKTLIKTIVTRIND ISHTQSVSSKQKVTGLDFIPGLHPILTLSKMDQTLAVYQQILTSMPSRNVIQISNDLE NLRDLLHVLAFSKSCHLPWASGLETLDSLGGVLEASGYSTEVVALSRLQGSLQDMLWQ LDLSPGC" BASE COUNT 887 a 799 c 920 g 820 t ORIGIN 1 GTAGGAATCG CAGCGCCAAC GGTTGCAAGG CCCAAGAAGC CATCCTGGGA 51 AGGAAAATGC ATTGGGGAAC CCTGTGCGGA TTCTTGTGGC TTTGGCCCTA 101 TCTTTTCTAT GTCCAAGCTG TGCCCATCCA AAAAGTCCAA GATGACACCA 151 AAACCCTCAT CAAGACAATT GTCACCAGGA TCAATGACAT TTCACACACG 201 CAGTCAGTCT CCTCCAAACA GAAAGTCACC GGTTTGGACT TCATTCCTGG 251 GCTCCACCCC ATCCTGACCT TATCCAAGAT GGACCAGACA CTGGCAGTCT 301 ACCAACAGAT CCTCACCAGT ATGCCTTCCA GAAACGTGAT CCAAATATCC 351 AACGACCTGG AGAACCTCCG GGATCTTCTT CACGTGCTGG CCTTCTCTAA 401 GAGCTGCCAC TTGCCCTGGG CCAGTGGCCT GGAGACCTTG GACAGCCTGG 451 GGGGTGTCCT GGAAGCTTCA GGCTACTCCA CAGAGGTGGT GGCCCTGAGC 501 AGGCTGCAGG GGTCTCTGCA GGACATGCTG TGGCAGCTGG ACCTCAGCCC 551 TGGGTGCTGA GGCCTTGAAG GTCACTCTTC CTGCAAGGAC TACGTTAAGG 601 GAAGGAACTC TGGCTTCCAG GTATCTCCAG GATTGAAGAG CATTGCATGG 651 ACACCCCTTA TCCAGGACTC TGTCAATTTC CCTGACTCCT CTAAGCCACT 701 CTTCCAAAGG CATAAGACCC TAAGCCTCCT TTTGCTTGAA ACCAAAGATA 751 TATACACAGG ATCCTATTCT CACCAGGAAG GGGGTCCACC CAGCAAAGAG 801 TGGGCTGCAT CTGGGATTCC CACCAAGGTC TTCAGCCATC AACAAGAGTT 851 GTCTTGTCCC CTCTTGACCC ATCTCCCCCT CACTGAATGC CTCAATGTGA 901 CCAGGGGTGA TTTCAGAGAG GGCAGAGGGG TAGGCAGAGC CTTTGGATGA 951 CCAGAACAAG GTTCCCTCTG AGAATTCCAA GGAGTTCCAT GAAGACCACA 1001 TCCACACACG CAGGAACTCC CAGCAACACA AGCTGGAAGC ACATGTTTAT 1051 TTATTCTGCA TTTTATTCTG GATGGATTTG AAGCAAAGCA CCAGCTTCTC 1101 CAGGCTCTTT GGGGTCAGCC AGGGCCAGGG GTCTCCCTGG AGTGCAGTTT 1151 CCAATCCCAT AGATGGGTCT GGCTGAGCTG AACCCATTTT GAGTGACTCG 1201 AGGGTTGGGT TCATCTGAGC AAGAGCTGGC AAAGGTGGCT CTCCAGTTAG 1251 TTCTCTCGTA ACTGGTTTCA TTTCTACTGT GACTGATGTT ACATCACAGT 1301 GTTTGCAATG GTGTTGCCCT GAGTGGATCT CCAAGGACCA GGTTATTTTA 1351 AAAAGATTTG TTTTGTCAAG TGTCATATGT AGGTGTCTGC ACCCAGGGGT 1401 GGGGAATGTT TGGGCAGAAG GGAGAAGGAT CTAGAATGTG TTTTCTGAAT 1451 AACATTTGTG TGGTGGGTTC TTTGGAAGGA GTGAGATCAT TTTCTTATCT 1501 TCTGCAATTG CTTAGGATGT TTTTCATGAA AATAGCTCTT TCAGGGGGGT 1551 TGTGAGGCCT GGCCAGGCAC CCCCTGGAGA GAAGTTTCTG GCCCTGGCTG 1601 ACCCCAAAGA GCCTGGAGAA GCTGATGCTT TGCTTCAAAT CCATCCAGAA 1651 TAAAACGCAA AGGGCTGAAA GCCATTTGTT GGGGCAGTGG TAAGCTCTGG 1701 CTTTCTCCGA CTGCTAGGGA GTGGTCTTTC CTATCATGGA GTGACGGTCC 1751 CACACTGGTG ACTGCGATCT TCAGAGCAGG GGTCCTTGGT GTGACCCTCT 1801 GAATGGGTCC AGGGTTGATC ACACTCTGGG TTTATTACAT GGCAGTGTTC 1851 CTATTTGGGG CTTGCATGCC AAATTGTAGT TCTTGTCTGA TTGGCTCACC 1901 CAAGCAAGGC CAAAATTACC AAAAATCTTG GGGGGTTTTT ACTCCAGTGG 1951 TGAAGAAAAC TCCTTTAGCA GGTGGTCCTG AGACCTGACA AGCACTGCTA 2001 GGCGAGTGCC AGGACTCCCC AGGCCAGGCC ACCAGGATGC CCTTCCCACT 2051 GGAGGTCACA TTCAGGAAGA TGAAAGAGGA GGTTTGGGGT CTGCCACCAT 2101 CCTGCTGCTG TGTTTTTGCT ATCACACAGT GGGTGGTGGA TCTGTCCAAG 2151 GAAACTTGAA TCAAAGCAGT TAACTTTAAG ACTGAGCACC TGCTTCATGC 2201 TCAGCCCTGA CTGGTGCTAT AGGCTGGAGA AGCTCACCCA ATAAACATTA 2251 AGATTGAGGC CTGCCCTCAG GGATCTTGCG TTCCCAGTGG TCAAACCGCA 2301 CTCACCCATG TGCCAAGGTG GGGTATTTAC CACAGCAGCT GAACAGCCAA 2351 ATGCATGGTG CAGTTGACAG CAGGTGGGAA ATGGTATGAG CTGAGGGGGG 2401 CCGTGCCCAG GGGCCCACAG GGAACCCTGC TTGCACTTTG TAACATGTTT 2451 ACTTTTCAGG GCATCTTAGC TTCTATTATA GCCACATCCC TTTGAAACAA 2501 GATAACTGAG AATTTAAAAA TAAGAAAATA CATAAGACCA TAACAGCCAA 2551 CAGGTGGCAG GACCAGGACT ATAGCCCAGG TCCTCTGATA CCCAGAGCAT 2601 TACGTGAGCC AGGTAATGAG GGACTGGAAC CAGGGAGACC GAGCGCTTTC 2651 TGGAAAAGAG GAGTTTCGAG GTAGAGTTTG AAGGAGGTGA GGGATGTGAA 2701 TTGCCTGCAG AGAGAAGCCT GTTTTGTTGG AAGGTTTGGT GTGTGGAGAT 2751 GCAGAGGTAA AAGTGTGAGC AGTGAGTTAC AGCGAGAGGC AGAGAAAGAA 2801 GAGACAGGAG GGCAAGGGCC ATGCTGAAGG GACCTTGAAG GGTAAAGAAG 2851 TTTGATATTA AAGGAGTTAA GAGTAGCAAG TTCTAGAGAA GAGGCTGGTG 2901 CTGTGGCCAG GGTGAGAGCT GCTCTGGAAA ATGTGACCCA GATCCTCACA 2951 ACCACCTAAT CAGGCTGAGG TGTCTTAAGC CTTTTGCTCA CAAAACCTGG 3001 CACAATGGCT AATTCCCAGA GTGTGAAACT TCCTAAGTAT AAATGGTTGT 3051 CTGTTTTTGT AACTTAAAAA AAAAAAAAAA AGTTTGGCCG GGTGCGGTGG 3101 CTCACGCCTG TAATCCCAGC ACTTTGGGAG GCCAAGGTGG GGGGATCACA 3151 AGGTCACTAG ATGGCGAGCA TCCTGGCCAA CATGGTGAAA CCCCGTCTCT 3201 ACTAAAAACA CAAAAGTTAG CTGAGCGTGG TGGCGGGCGC CTGTAGTCCC 3251 AGCCACTCGG GAGGCTGAGA CAGGAGAATC GCTTAAACCT GGGAGGCGGA 3301 GAGTACAGTG AGCCAAGATC GCGCCACTGC ACTCCGGCCT GATGACAGAG 3351 CGAGATTCCG TCTTAAAAAA AAAAAAAAAA AAAGTTTGTT TTTAAAAAAA 3401 TCTAAATAAA ATAACTTTGC CCCCTG // LOCUS AB007942 5747 bp mRNA PRI 13-AUG-1998 DEFINITION Homo sapiens mRNA for KIAA0473 protein, complete cds. ACCESSION AB007942 NID g3413907 VERSION AB007942.1 GI:3413907 KEYWORDS KIAA0473 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0220. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5747) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Seki,N., Ohira,M., Nagase,T., Ishikawa,K., Miyajima,N., Nakajima,D., Nomura,N. and Ohara,O. TITLE Characterization of cDNA clones in size-fractionated cDNA libraries from human brain JOURNAL DNA Res. 4 (5), 345-349 (1997) MEDLINE 98116662 FEATURES Location/Qualifiers source 1. .5747 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="HH0220" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 158. .2899 /gene="KIAA0473" CDS 158. .2899 /gene="KIAA0473" /codon_start=1 /product="KIAA0473 protein" /protein_id="BAA32318.1" /db_xref="PID:d1033280" /db_xref="PID:g3413908" /db_xref="GI:3413908" /translation="MKDSENKGASSPDMEPSYGGGLFDMVKGGAGRLFSNLKDNLKDT LKDTSSRVIQSVTSYTKGDLDFTYVTSRIIVMSFPLDNVDIGFRNQVDDIRSFLDSRH LDHYTVYNLSPKSYRTAKFHSRVSECSWPIRQAPSLHNLFAVCRNMYNWLLQNPKNVC VVHCLDGRAASSILVGAMFIFCNLYSTPGPAIRLLYAKRPGIGLSPSHRRYLGYMCDL LADKPYRPHFKPLTIKSITVSPIPFFNKQRNGCRPYCDVLIGETKIYSTCTDFERMKE YRVQDGKIFIPLNITVQGDVVVSMYHLRSTIGSRLQAKVTNTQIFQLQFHTGFIPLDT TVLKFTKPELDACDVPEKYPQLFQVTLDVELQPHDKVIDLTPPWEHYCTKDVNPSILF SSHQEHQDTLALGGQAPIDIPPDNPRHYGQSGFFASLCWQDQKSEKSFCEEDHAALVN QESEQSDDELLTLSSPHGNANGDKPHGVKKPSKKQQEPAAPPPPEDVDLLGLEGSAMS NSFSPPAAPPTNSELLSDLFGGGGAAGPTQAGQSGVEDVFHPSGPASTQSTPRRSATS TSASPTLRVGEGATFDPFGAPSKPSGQDLLGSFLNTSSASSDPFLQPTRSPSPTVHAS STPAVNIQPDVSGGWDWHAKPGGFGMGSKSAATSPTGSSHGTPTHQSKPQTLDPFADL GTLGSSSFASKPTTPTGLGGGFPPLSSPQKASPQPMGGGWQQGGAYNWQQPQPKPQPS MPHSSPQNRPNYNVSFSAMPGGQNERGKGSSNLEGKQKAADFEDLLSGQGFNAHKDKK GPRTIAEMRKEEMAKEMDPEKLKILEWIEGKERNIRALLSTMHTVLWAGETKWKPVGM ADLVTPEQVKKVYRKAVLVVHPDKATGQPYEQYAKMIFMELNDAWSEFENQGQKPLY" BASE COUNT 1720 a 1231 c 1199 g 1597 t ORIGIN 1 CGCACACCGA CTTGCATGCA ATTATCATAG CCCGAGTGCT CCTCCGTTGA 51 GAGACTTCGC CCCCGAGACC GCTGACTGTG AATGACAAAT CAAAAGTCAG 101 GGTTGCAGAA TCAGCCGGAC TTTCCTGCTC ATTTGCAGCA GAGGGAGGAA 151 GCAGAGAATG AAAGATTCTG AAAATAAAGG TGCCTCATCT CCAGACATGG 201 AGCCCAGCTA TGGGGGAGGT CTCTTTGACA TGGTAAAAGG AGGTGCAGGG 251 AGGCTCTTTA GTAACCTAAA GGACAACTTG AAAGACACCC TCAAAGACAC 301 ATCTTCTAGA GTGATACAAT CTGTGACCAG CTACACAAAG GGAGATTTAG 351 ACTTCACTTA TGTTACCTCC AGAATTATTG TGATGTCCTT TCCTCTGGAC 401 AATGTTGACA TAGGATTCAG GAATCAGGTT GATGACATTC GAAGCTTTTT 451 GGATTCCAGA CATCTTGACC ACTACACAGT ATACAATCTG TCACCTAAGT 501 CTTATCGAAC TGCCAAGTTT CACAGCCGGG TCTCAGAATG CAGTTGGCCC 551 ATTAGGCAGG CTCCCAGTCT GCACAACCTT TTTGCTGTGT GTCGGAATAT 601 GTATAACTGG CTACTGCAGA ATCCCAAAAA TGTCTGTGTT GTCCACTGCT 651 TGGATGGACG GGCGGCATCA TCAATTCTGG TTGGTGCTAT GTTCATTTTC 701 TGTAATCTCT ACTCTACTCC TGGCCCAGCC ATTCGATTGC TATATGCAAA 751 GCGACCAGGA ATTGGACTTT CACCATCCCA TAGGAGATAC CTGGGCTATA 801 TGTGTGACCT ACTGGCAGAC AAGCCCTACC GCCCTCACTT CAAGCCTCTC 851 ACAATTAAGT CGATCACTGT CAGTCCAATA CCCTTTTTCA ACAAACAGAG 901 GAATGGATGT CGCCCTTACT GTGATGTACT CATTGGAGAA ACCAAAATAT 951 ATTCGACTTG CACAGATTTT GAACGAATGA AAGAATATCG TGTCCAAGAT 1001 GGAAAAATCT TCATTCCCTT GAACATCACT GTGCAAGGAG ACGTGGTTGT 1051 TTCCATGTAT CACTTGAGGT CAACCATTGG GAGCCGGCTA CAGGCTAAGG 1101 TGACCAACAC ACAGATATTC CAGCTTCAGT TTCACACTGG ATTCATACCA 1151 CTGGACACAA CAGTTTTAAA GTTCACCAAG CCTGAGTTAG ATGCATGTGA 1201 TGTACCAGAA AAATATCCTC AGCTATTTCA GGTGACACTG GATGTAGAAC 1251 TACAGCCCCA TGACAAAGTA ATAGACTTAA CTCCACCATG GGAACATTAC 1301 TGCACAAAAG ATGTCAATCC CAGCATCCTC TTCTCTTCTC ACCAGGAACA 1351 TCAAGATACG CTGGCCTTAG GAGGACAGGC TCCAATAGAT ATCCCTCCAG 1401 ACAACCCCAG GCATTACGGA CAAAGTGGTT TCTTTGCCTC TCTCTGTTGG 1451 CAAGATCAGA AATCGGAGAA GTCATTCTGT GAGGAAGACC ACGCTGCCCT 1501 AGTGAATCAG GAAAGTGAGC AATCAGATGA TGAACTTCTG ACACTTTCCA 1551 GTCCGCATGG CAATGCCAAT GGTGACAAGC CTCATGGAGT CAAGAAGCCC 1601 AGCAAAAAGC AGCAGGAGCC AGCAGCCCCT CCACCCCCTG AGGATGTGGA 1651 CCTTTTGGGC CTGGAAGGGT CTGCAATGAG TAACAGCTTC TCTCCGCCAG 1701 CGGCTCCTCC CACCAATTCT GAACTACTGA GTGACCTGTT TGGGGGTGGA 1751 GGTGCAGCTG GTCCCACCCA GGCTGGACAG TCAGGAGTGG AAGATGTGTT 1801 TCATCCTAGT GGACCTGCGT CTACCCAGTC AACACCACGC CGCTCTGCCA 1851 CCTCCACCTC TGCGTCTCCA ACCCTAAGAG TGGGAGAAGG TGCCACCTTT 1901 GACCCATTTG GAGCACCTTC TAAACCATCA GGTCAGGATT TGCTGGGTTC 1951 TTTTCTGAAC ACATCCAGTG CTTCCAGTGA CCCCTTTCTC CAGCCCACAA 2001 GAAGTCCTTC GCCCACAGTA CATGCTTCTA GTACGCCTGC TGTGAACATT 2051 CAGCCAGATG TTTCTGGAGG TTGGGACTGG CATGCTAAAC CAGGAGGCTT 2101 TGGAATGGGA AGCAAGTCAG CTGCCACCAG CCCAACCGGA TCCTCGCATG 2151 GTACTCCCAC CCATCAAAGC AAACCCCAGA CTCTGGATCC TTTTGCCGAC 2201 CTTGGGACAC TAGGTAGTTC TTCCTTTGCC AGCAAACCCA CCACACCAAC 2251 TGGATTGGGT GGAGGATTCC CGCCTCTCAG CTCGCCACAG AAGGCGTCTC 2301 CCCAGCCTAT GGGTGGCGGG TGGCAGCAGG GAGGTGCCTA CAACTGGCAG 2351 CAGCCACAGC CTAAGCCTCA GCCCAGCATG CCCCACTCCT CTCCCCAGAA 2401 CCGACCCAAC TACAACGTGA GCTTCTCAGC CATGCCTGGG GGCCAGAACG 2451 AACGTGGGAA AGGATCAAGT AATTTGGAAG GGAAACAAAA AGCAGCTGAT 2501 TTTGAAGACC TACTCTCTGG TCAAGGTTTC AATGCTCACA AAGACAAAAA 2551 GGGGCCTCGG ACAATAGCTG AGATGAGAAA GGAGGAAATG GCCAAGGAAA 2601 TGGATCCTGA GAAATTAAAG ATTCTGGAAT GGATTGAAGG CAAAGAAAGA 2651 AATATCAGAG CCCTTCTTTC CACGATGCAT ACCGTACTAT GGGCTGGGGA 2701 GACCAAGTGG AAACCAGTTG GCATGGCAGA CCTGGTAACA CCAGAGCAGG 2751 TGAAGAAGGT GTACAGGAAG GCTGTCCTGG TGGTGCACCC AGATAAAGCT 2801 ACTGGGCAAC CCTATGAACA ATACGCAAAG ATGATTTTCA TGGAGCTCAA 2851 TGATGCCTGG TCTGAATTTG AAAACCAAGG CCAAAAGCCC TTATATTAAT 2901 TTATGAGCTT TTCCATCTCT GCTGCAGACC TGTGCTAATG CTTAGTGTGT 2951 GTCACAATTC TGAGGTTTTC GCAGATGAAC CAAAAACTCC AGTAACATGT 3001 TTTCAGTACT AAACCGTTAA GTTACTCATG AATTAATTTC TCATTGATAA 3051 GGAATGTGGA TGTTTGGTTT CTCCAAAGTT CCCACCATAA AAGATCCAAA 3101 GCATGAGAGG AACTTCTAGT CAGATGACCT TGCAGAACCA CCGCATTCCA 31