LOCUS A18397 1964 bp mRNA PAT 26-APR-1994 DEFINITION Human uPA cDNA. ACCESSION A18397 NID g512446 VERSION A18397.1 GI:512446 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1964) AUTHORS . TITLE THROMBUS-SPECIFIC ANTIBODY DERIVATIVES JOURNAL Patent: WO 9116353-A 36 31-OCT-1991; FEATURES Location/Qualifiers source 1. .1964 /organism="Homo sapiens" /db_xref="taxon:9606" gene 100. .1395 /gene="uPA" CDS 100. .1395 /gene="uPA" /codon_start=1 /protein_id="CAA01390.1" /db_xref="PID:g512447" /db_xref="GI:512447" /db_xref="SWISS-PROT:P00749" /translation="MRALLARLLLCVLVVSDSKGSNELHQVPSNCDCLNGGTCVSNKY FSNIHWCNCPKKFGGQHCEIDKSKTCYEGNGHFYRGKASTDTMGRPCLPWNSATVLQQ TYHAHRSDALQLGLGKHNYCRNPDNRRRPWCYVQVGLKPLVQECMVHDCADGKKPSSP PEELKFQCGQKTLRPRFKIIGGEFTTIENQPWFAAIYRRHRGGSVTYVCGGSLISPCW VISATHCFIDYPKKEDYIVYLGRSRLNSNTQGEMKFEVENLILHKDYSADTLAHHNDI ALLKIRSKEGRCAQPSRTIQTICLPSMYNDPQFGTSCEITGFGKENSTDYLYPEQLKM TVVKLISHRECQQPHYYGSEVTTKMLCAADPQWKTDSCQGDSGGPLVCSLQGRMTLTG IVSWGRGCALKDKPGVYTRVSHFLPWIRSHTKEENGLAL" BASE COUNT 468 a 524 c 499 g 473 t ORIGIN 1 AAGCTTCGGG CCAGGGTCCA CCTGTCCCCG CAGCGCCGTC GCGCCCTCCT 51 GCCGCAGGCC ACCGAGGCCG CCGCCGTCTA GCGCCCCGAC CTCGCCACCA 101 TGAGAGCCCT GCTGGCGCGC CTGCTTCTCT GCGTCCTGGT CGTGAGCGAC 151 TCCAAAGGCA GCAATGAACT TCATCAAGTT CCATCGAACT GTGACTGTCT 201 AAATGGAGGA ACATGTGTGT CCAACAAGTA CTTCTCCAAC ATTCACTGGT 251 GCAACTGCCC AAAGAAATTC GGAGGGCAGC ACTGTGAAAT AGATAAGTCA 301 AAAACCTGCT ATGAGGGGAA TGGTCACTTT TACCGAGGAA AGGCCAGCAC 351 TGACACCATG GGCCGGCCCT GCCTGCCCTG GAACTCTGCC ACTGTCCTTC 401 AGCAAACGTA CCATGCCCAC AGATCTGATG CTCTTCAGCT GGGCCTGGGG 451 AAACATAATT ACTGCAGGAA CCCAGACAAC CGGAGGCGAC CCTGGTGCTA 501 TGTGCAGGTG GGCCTAAAGC CGCTTGTCCA AGAGTGCATG GTGCATGACT 551 GCGCAGATGG AAAAAAGCCC TCCTCTCCTC CAGAAGAATT AAAATTTCAG 601 TGTGGCCAAA AGACTCTGAG GCCCCGCTTT AAGATTATTG GGGGAGAATT 651 CACCACCATC GAGAACCAGC CCTGGTTTGC GGCCATCTAC AGGAGGCACC 701 GGGGGGGCTC TGTCACCTAC GTGTGTGGAG GCAGCCTCAT CAGCCCTTGC 751 TGGGTGATCA GCGCCACACA CTGCTTCATT GATTACCCAA AGAAGGAGGA 801 CTACATCGTC TACCTGGGTC GCTCAAGGCT TAACTCCAAC ACGCAAGGGG 851 AGATGAAGTT TGAGGTGGAA AACCTCATCC TACACAAGGA CTACAGCGCT 901 GACACGCTTG CTCACCACAA TGACATTGCC TTGCTGAAGA TCCGTTCCAA 951 GGAGGGCAGG TGTGCGCAGC CATCCCGGAC TATACAGACC ATCTGCCTGC 1001 CCTCGATGTA TAACGATCCC CAGTTTGGCA CAAGCTGTGA GATCACTGGC 1051 TTTGGAAAAG AGAATTCTAC CGACTATCTC TATCCGGAGC AGCTGAAAAT 1101 GACTGTTGTG AAGCTGATTT CCCACCGGGA GTGTCAGCAG CCCCACTACT 1151 ACGGCTCTGA AGTCACCACC AAAATGCTGT GTGCTGCTGA CCCACAGTGG 1201 AAAACAGATT CCTGCCAGGG AGACTCAGGG GGACCCCTCG TCTGTTCCCT 1251 CCAAGGCCGC ATGACTTTGA CTGGAATTGT GAGCTGGGGC CGTGGATGTG 1301 CCCTGAAGGA CAAGCCAGGC GTCTACACGA GAGTCTCACA CTTCTTACCC 1351 TGGATCCGCA GTCACACCAA GGAAGAGAAT GGCCTGGCCC TCTGAGGGTC 1401 CCCAGGGAGG AAACGGGCAC CACCCGCTTT CTTGCTGGTT GTCATTTTTG 1451 CAGTAGAGTC ATCTCCATCA GAAGCTTTTG GGGAGCAGAG ACACTAACGA 1501 CTTCAGGGCA GGGCTCTGAT ATTCCATGAA TGTATCAGGA AATATATATG 1551 TGTGTGTATG TTTGCACACT TGTTGTGTGG GCTGTGAGTG TAAGTGTGAG 1601 TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT AAACTGTGTG 1651 GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 1701 CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT 1751 CTGCAGCATG ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA 1801 TGTCCCTTTC TTGGCCAGTT ATCCCTTCCT TTTAGCCTAG TTCATCCAAT 1851 CCTCACTGGG TGGGGTGAGG ACCACTCCTT ACACTGAATA TTTATATTTC 1901 ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG ATCAATAAAA 1951 TGTGATTTTT CTGA // LOCUS A18757 1400 bp mRNA PAT 27-APR-1994 DEFINITION u-PA receptor. ACCESSION A18757 NID g512463 VERSION A18757.1 GI:512463 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1400) AUTHORS . JOURNAL Patent: WO 9207083-A 1 30-APR-1992; FEATURES Location/Qualifiers source 1. .1400 /organism="Homo sapiens" /db_xref="taxon:9606" gene 47. .1054 /gene="u-PA receptor" CDS 47. .1054 /gene="u-PA receptor" /codon_start=1 /protein_id="CAA01421.1" /db_xref="PID:g512464" /db_xref="GI:512464" /db_xref="SWISS-PROT:Q03405" /translation="MGHPPLLPLLLLLHTCVPASWGLRCMQCKTNGDCRVEECALGQD LCRTTIVRLWEEGEELELVEKSCTHSEKTNRTLSYRTGLKITSLTEVVCGLDLCNQGN SGRAVTYSRSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVVTHWIQEGEEGRP KDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKCNEGPILELENLPQNGRQCYS CKGNSTHGCSSEETFLIDCRGPMNQCLVATGTHEPKNQSYMVRGCATASMCQHAHLGD AFSMNHIDVSCCTKSGCNHPDLDVQYRSGAAPQPGPAHLSLTITLLMTARLWGGTLLW T" BASE COUNT 346 a 396 c 366 g 292 t ORIGIN 1 AGAGAAGACG TGCAGGGACC CCGCGCACAG GAGCTGCCCT CGCGACATGG 51 GTCACCCGCC GCTGCTGCCG CTGCTGCTGC TGCTCCACAC CTGCGTCCCA 101 GCCTCTTGGG GCCTGCGGTG CATGCAGTGT AAGACCAACG GGGATTGCCG 151 TGTGGAAGAG TGCGCCCTGG GACAGGACCT CTGCAGGACC ACGATCGTGC 201 GCTTGTGGGA AGAAGGAGAA GAGCTGGAGC TGGTGGAGAA AAGCTGTACC 251 CACTCAGAGA AGACCAACAG GACCCTGAGC TATCGGACTG GCTTGAAGAT 301 CACCAGCCTT ACCGAGGTTG TGTGTGGGTT AGACTTGTGC AACCAGGGCA 351 ACTCTGGCCG GGCTGTCACC TATTCCCGAA GCCGTTACCT CGAATGCATT 401 TCCTGTGGCT CATCAGACAT GAGCTGTGAG AGGGGCCGGC ACCAGAGCCT 451 GCAGTGCCGC AGCCCTGAAG AACAGTGCCT GGATGTGGTG ACCCACTGGA 501 TCCAGGAAGG TGAAGAAGGG CGTCCAAAGG ATGACCGCCA CCTCCGTGGC 551 TGTGGCTACC TTCCCGGCTG CCCGGGCTCC AATGGTTTCC ACAACAACGA 601 CACCTTCCAC TTCCTGAAAT GCTGCAACAC CACCAAATGC AACGAGGGCC 651 CAATCCTGGA GCTTGAAAAT CTGCCGCAGA ATGGCCGCCA GTGTTACAGC 701 TGCAAGGGGA ACAGCACCCA TGGATGCTCC TCTGAAGAGA CTTTCCTCAT 751 TGACTGCCGA GGCCCCATGA ATCAATGTCT GGTAGCCACC GGCACTCACG 801 AACCGAAAAA CCAAAGCTAT ATGGTAAGAG GCTGTGCAAC CGCCTCAATG 851 TGCCAACATG CCCACCTGGG TGACGCCTTC AGCATGAACC ACATTGATGT 901 CTCCTGCTGT ACTAAAAGTG GCTGTAACCA CCCAGACCTG GATGTCCAGT 951 ACCGCAGTGG GGCTGCTCCT CAGCCTGGCC CTGCCCATCT CAGCCTCACC 1001 ATCACCCTGC TAATGACTGC CAGACTGTGG GGAGGCACTC TCCTCTGGAC 1051 CTAAACCTGA AATCCCCCTC TCTGCCCTGG CTGGATCCGG GGGACCCCTT 1101 TGCCCTTCCC TCGGCTCCCA GCCCTACAGA CTTGCTGTGT GACCTCAGGC 1151 CAGTGTGCCG ACCTCTCTGG GCCTCAGTTT TCCCAGCTAT GAAAACAGCT 1201 ATCTCACAAA GTTGTGTGAA GCAGAAGAGA AAAGCTGGAG GAAGGCCGTG 1251 GGCAATGGGA GAGCTCTTGT TATTATTAAT ATTGTTGCCG CTGTTGTGTT 1301 GTTGTTATTA ATTAATATTC ATATTATTTA TTTTATACTT ACATAAAGAT 1351 TTTGTACCAG TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA // LOCUS A21239 1512 bp mRNA PAT 24-OCT-1994 DEFINITION H.sapiens BTA 1916 mRNA for Pai-2. ACCESSION A21239 NID g641358 VERSION A21239.1 GI:641358 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1512) AUTHORS . TITLE VARIANTS OF PAI-2 JOURNAL Patent: WO 9109124-A 2 27-JUN-1991; FEATURES Location/Qualifiers source 1. .1512 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /cell_line="U937" /clone="BTA 1916" CDS 22. .1200 /note="amino aids 74 to 96 inclusive have been deleted" /codon_start=1 /product="plasminogen activator inhibitor type 2 protein" /protein_id="CAA01536.1" /db_xref="PID:g641359" /db_xref="GI:641359" /translation="MEDLCVANTLFALNLFKHLAKASPTQNLFLSPWSISSTMAMVYM GSRGSTEDQMAKVLQFNEVGANAVTPMTPAQAADKIHSSFRSLSSAINASTGNYLLES VNKLFGEKSASFREEYIRLCQKYYSSEPQAVDFLECAEEARKKINSWVKTQTKGKIPN LLPEGSVDGDTRMVLVNAVYFKGKWKTPFEKKLNGLYPFRVNSAQRTPVQMMYLREKL NIGYIEDLKAQILELPYAGDVSMFLLLPDEIADVSTGLELLESEITYDKLNKWTSKDK MAEDEVEVYIPQFKLEEHYELRSILRSMGMEDAFNKGRANFSGMSERNDLFLSEVFHQ AMVDVNEEGTEAAAGTGGVMTGRTGHGGPQFVADHPFLFLIMHKITNCILFFGRFSSP " BASE COUNT 449 a 318 c 324 g 421 t ORIGIN 1 GATCTGTAAG GAGGTATATA AATGGAGGAT CTTTGTGTGG CAAACACACT 51 CTTTGCCCTC AATTTATTCA AGCATCTGGC AAAAGCAAGC CCCACCCAGA 101 ACCTCTTCCT CTCCCCATGG AGCATCTCGT CCACCATGGC CATGGTCTAC 151 ATGGGCTCCA GGGGCAGCAC CGAAGACCAG ATGGCCAAGG TGCTTCAGTT 201 TAATGAAGTG GGAGCCAATG CAGTTACCCC CATGACTCCA GCACAAGCTG 251 CAGATAAAAT CCATTCATCC TTCCGCTCTC TCAGCTCTGC AATCAATGCA 301 TCCACAGGGA ATTATTTACT GGAAAGTGTC AATAAGCTGT TTGGTGAGAA 351 GTCTGCGAGC TTCCGGGAAG AATATATTCG ACTCTGTCAG AAATATTACT 401 CCTCAGAACC CCAGGCAGTA GACTTCCTAG AATGTGCAGA AGAAGCTAGA 451 AAAAAGATTA ATTCCTGGGT CAAGACTCAA ACCAAAGGCA AAATCCCAAA 501 CTTGTTACCT GAAGGTTCTG TAGATGGGGA TACCAGGATG GTCCTGGTGA 551 ATGCTGTCTA CTTCAAAGGA AAGTGGAAAA CTCCATTTGA GAAGAAACTA 601 AATGGGCTTT ATCCTTTCCG TGTAAACTCG GCTCAGCGCA CACCTGTACA 651 GATGATGTAC TTGCGTGAAA AGCTAAACAT TGGATACATA GAAGACCTAA 701 AGGCTCAGAT TCTAGAACTC CCATATGCTG GAGATGTTAG CATGTTCTTG 751 TTGCTTCCAG ATGAAATTGC CGATGTGTCC ACTGGCTTGG AGCTGCTGGA 801 AAGTGAAATA ACCTATGACA AACTCAACAA GTGGACCAGC AAAGACAAAA 851 TGGCTGAAGA TGAAGTTGAG GTATACATAC CCCAGTTCAA ATTAGAAGAG 901 CATTATGAAC TCAGATCCAT TCTGAGAAGC ATGGGCATGG AGGACGCCTT 951 CAACAAGGGA CGGGCCAATT TCTCAGGGAT GTCGGAGAGG AATGACCTGT 1001 TTCTTTCTGA AGTGTTCCAC CAAGCCATGG TGGATGTGAA TGAGGAGGGC 1051 ACTGAAGCAG CCGCTGGCAC AGGAGGTGTT ATGACAGGGA GAACTGGACA 1101 TGGAGGCCCA CAGTTTGTGG CAGATCATCC TTTTCTTTTT CTTATTATGC 1151 ATAAGATAAC CAACTGCATT TTATTTTTCG GCAGATTTTC CTCACCCTAA 1201 AACTAAGCGT GCTGCTTCTG CAAAAGATTT TTGTAGATGA GCTGTGTGCC 1251 TCAGAATTGC TATTTCAAAT TGCCAAAAAT TTAGAGATGT TTTCTACATA 1301 TTTCTGCTCT TCTGAACAAC TTCTGCTACC CACTAAATAA AAACACAGAA 1351 ATAATTAGAC AATTGTCTAT TATAACATGA CAACCCTATT AATCATTTGG 1401 TCTTCTAAAA TGGGATCATG CCCATTTAGA TTTTCCTTAC TATCAGTTTA 1451 TTTTTATAAC ATTAACTTTT ACTTTGTTAT TTATTATTTT ATATAATGGT 1501 GAGTTTTTGG GG // LOCUS A21240 1482 bp mRNA PAT 24-OCT-1994 DEFINITION H.sapiens BTA 1922 mRNA for Pai-2. ACCESSION A21240 NID g641360 VERSION A21240.1 GI:641360 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1482) AUTHORS . TITLE VARIANTS OF PAI-2 JOURNAL Patent: WO 9109124-A 3 27-JUN-1991; FEATURES Location/Qualifiers source 1. .1482 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /cell_line="U937" /clone="BTA 1922" CDS 22. .1170 /note="amino acids 66 to 98 inclusive have been deleted" /codon_start=1 /product="plasminogen activator inhibitor type 2 protein" /protein_id="CAA01537.1" /db_xref="PID:g641361" /db_xref="GI:641361" /translation="MEDLCVANTLFALNLFKHLAKASPTQNLFLSPWSISSTMAMVYM GSRGSTEDQMAKVLQFNEVGAAADKIHSSFRSLSSAINASTGNYLLESVNKLFGEKSA SFREEYIRLCQKYYSSEPQAVDFLECAEEARKKINSWVKTQTKGKIPNLLPEGSVDGD TRMVLVNAVYFKGKWKTPFEKKLNGLYPFRVNSAQRTPVQMMYLREKLNIGYIEDLKA QILELPYAGDVSMFLLLPDEIADVSTGLELLESEITYDKLNKWTSKDKMAEDEVEVYI PQFKLEEHYELRSILRSMGMEDAFNKGRANFSGMSERNDLFLSEVFHQAMVDVNEEGT EAAAGTGGVMTGRTGHGGPQFVADHPFLFLIMHKITNCILFFGRFSSP" BASE COUNT 439 a 307 c 320 g 416 t ORIGIN 1 GATCTGTAAG GAGGTATATA AATGGAGGAT CTTTGTGTGG CAAACACACT 51 CTTTGCCCTC AATTTATTCA AGCATCTGGC AAAAGCAAGC CCCACCCAGA 101 ACCTCTTCCT CTCCCCATGG AGCATCTCGT CCACCATGGC CATGGTCTAC 151 ATGGGCTCCA GGGGCAGCAC CGAAGACCAG ATGGCCAAGG TGCTTCAGTT 201 TAATGAAGTG GGAGCCGCTG CAGATAAAAT CCATTCATCC TTCCGCTCTC 251 TCAGCTCTGC AATCAATGCA TCCACAGGGA ATTATTTACT GGAAAGTGTC 301 AATAAGCTGT TTGGTGAGAA GTCTGCGAGC TTCCGGGAAG AATATATTCG 351 ACTCTGTCAG AAATATTACT CCTCAGAACC CCAGGCAGTA GACTTCCTAG 401 AATGTGCAGA AGAAGCTAGA AAAAAGATTA ATTCCTGGGT CAAGACTCAA 451 ACCAAAGGCA AAATCCCAAA CTTGTTACCT GAAGGTTCTG TAGATGGGGA 501 TACCAGGATG GTCCTGGTGA ATGCTGTCTA CTTCAAAGGA AAGTGGAAAA 551 CTCCATTTGA GAAGAAACTA AATGGGCTTT ATCCTTTCCG TGTAAACTCG 601 GCTCAGCGCA CACCTGTACA GATGATGTAC TTGCGTGAAA AGCTAAACAT 651 TGGATACATA GAAGACCTAA AGGCTCAGAT TCTAGAACTC CCATATGCTG 701 GAGATGTTAG CATGTTCTTG TTGCTTCCAG ATGAAATTGC CGATGTGTCC 751 ACTGGCTTGG AGCTGCTGGA AAGTGAAATA ACCTATGACA AACTCAACAA 801 GTGGACCAGC AAAGACAAAA TGGCTGAAGA TGAAGTTGAG GTATACATAC 851 CCCAGTTCAA ATTAGAAGAG CATTATGAAC TCAGATCCAT TCTGAGAAGC 901 ATGGGCATGG AGGACGCCTT CAACAAGGGA CGGGCCAATT TCTCAGGGAT 951 GTCGGAGAGG AATGACCTGT TTCTTTCTGA AGTGTTCCAC CAAGCCATGG 1001 TGGATGTGAA TGAGGAGGGC ACTGAAGCAG CCGCTGGCAC AGGAGGTGTT 1051 ATGACAGGGA GAACTGGACA TGGAGGCCCA CAGTTTGTGG CAGATCATCC 1101 TTTTCTTTTT CTTATTATGC ATAAGATAAC CAACTGCATT TTATTTTTCG 1151 GCAGATTTTC CTCACCCTAA AACTAAGCGT GCTGCTTCTG CAAAAGATTT 1201 TTGTAGATGA GCTGTGTGCC TCAGAATTGC TATTTCAAAT TGCCAAAAAT 1251 TTAGAGATGT TTTCTACATA TTTCTGCTCT TCTGAACAAC TTCTGCTACC 1301 CACTAAATAA AAACACAGAA ATAATTAGAC AATTGTCTAT TATAACATGA 1351 CAACCCTATT AATCATTTGG TCTTCTAAAA TGGGATCATG CCCATTTAGA 1401 TTTTCCTTAC TATCAGTTTA TTTTTATAAC ATTAACTTTT ACTTTGTTAT 1451 TTATTATTTT ATATAATGGT GAGTTTTTGG GG // LOCUS A26481 2624 bp mRNA PAT 17-OCT-1995 DEFINITION Human NPY receptor Y1 gene cDNA. ACCESSION A26481 NID g1247452 VERSION A26481.1 GI:1247452 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2624) AUTHORS . TITLE HUMAN NEUROPEPTIDE Y-Y1 RECEPTOR JOURNAL Patent: WO 9309227-A 3 13-MAY-1993; FEATURES Location/Qualifiers source 1. .2624 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 152. .1306 /codon_start=1 /product="neuropeptide Y Y1 receptor" /protein_id="CAA01819.1" /db_xref="PID:e205126" /db_xref="PID:g1247453" /db_xref="GI:1247453" /db_xref="SWISS-PROT:P25929" /translation="MNSTLFSQVENHSVHSNFSEKNAQLLAFENDDCHLPLAMIFTLA LAYGAVIILGVSGNLALIIIILKQKEMRNVTNILIVNLSFSDLLVAIMCLPFTFVYTL MDHWVFGEAMCKLNPFVQCVSITVSIFSLVLIAVERHQLIINPRGWRPNNRHAYVGIA VIWVLAVASSLPFLIYQVMTDEPFQNVTLDAYKDKYVCFDQFPSDSHRLSYTTLLLVL QYFGPLCFIFICYFKIYIRLKRRNNMMDKMRDNKYRSSETKRINIMLLSIVVAFAVCW LPLTIFNTVFDWNHQIIATCNHNLLFLLCHLTAMISTCVNPIFYGFLNKNFQRDLQFF FNFCDFRSRDDDYETIAMSTMHTDVSKTSLKQASPVAFKKINNNDDNEKI" BASE COUNT 791 a 479 c 473 g 878 t 3 others ORIGIN 1 ATTGTTCAGT TCAAGGGAAT GAAGAATTCA GAATAATTTT GGTAAATGGA 51 TTCCAATATC GGGAATAAGA ATAAGCTGAA CAGTTGACCT GCTTTGAAGA 101 AACATACTGT CCATTTGTCT AAAATAATCT ATAACAACCA AACCAATCAA 151 AATGAATTCA ACATTATTTT CCCAGGTTGA AAATCATTCA GTCCACTCTA 201 ATTTCTCAGA GAAGAATGCC CAGCTTCTGG CTTTTGAAAA TGATGATTGT 251 CATCTGCCCT TGGCCATGAT ATTTACCTTA GCTCTTGCTT ATGGAGCTGT 301 GATCATTCTT GGTGTCTCTG GAAACCTGGC CTTGATCATA ATCATCTTGA 351 AACAAAAGGA GATGAGAAAT GTTACCAACA TCCTGATTGT GAACCTTTCC 401 TTCTCAGACT TGCTTGTTGC CATCATGTGT CTCCCCTTTA CATTTGTCTA 451 CACATTAATG GACCACTGGG TCTTTGGTGA GGCGATGTGT AAGTTGAATC 501 CTTTTGTGCA ATGTGTTTCA ATCACTGTGT CCATTTTCTC TCTGGTTCTC 551 ATTGCTGTGG AACGACATCA GCTGATAATC AACCCTCGAG GGTGGAGACC 601 AAATAATAGA CATGCTTATG TAGGTATTGC TGTGATTTGG GTCCTTGCTG 651 TGGCTTCTTC TTTGCCTTTC CTGATCTACC AAGTAATGAC TGATGAGCCG 701 TTCCAAAATG TAACACTTGA TGCGTACAAA GACAAATACG TGTGCTTTGA 751 TCAATTTCCA TCGGACTCTC ATAGGTTGTC TTATACCACT CTCCTCTTGG 801 TGCTGCAGTA TTTTGGTCCA CTTTGTTTTA TATTTATTTG CTACTTCAAG 851 ATATATATAC GCCTAAAAAG GAGAAACAAC ATGATGGACA AGATGAGAGA 901 CAATAAGTAC AGGTCCAGTG AAACCAAAAG AATCAATATC ATGCTGCTCT 951 CCATTGTGGT AGCATTTGCA GTCTGCTGGC TCCCTCTTAC CATCTTTAAC 1001 ACTGTGTTTG ATTGGAATCA TCAGATCATT GCTACCTGCA ACCACAATCT 1051 GTTATTCCTG CTCTGCCACC TCACAGCAAT GATATCCACT TGTGTCAACC 1101 CCATATTTTA TGGGTTCCTG AACAAAAACT TCCAGAGAGA CTTGCAGTTC 1151 TTCTTCAACT TTTGTGATTT CCGGTCTCGG GATGATGATT ATGAAACAAT 1201 AGCCATGTCC ACGATGCACA CAGATGTTTC CAAAACTTCT TTGAAGCAAG 1251 CAAGCCCAGT CGCATTTAAA AAAATCAACA ACAATGATGA TAATGAAAAA 1301 ATCTGAAACT ACTTATAGCC TATGGTCCCG GATGACATCT GTTTAAAAAC 1351 AAGCACAACC TGCAACATAC TTTGATTACC TGTTCTCCCA AGGAATGGGG 1401 TTGAAATCAT TTGAAAATGA CTAAGATTTT CTTGTCTTGC TTTTTTACTG 1451 CTTTTGTTGT AGTGTCATAA TTACATTTGG AACAAAAGGT GTGGGCTTTG 1501 GGGTCTTCTG GAAATAGTTT TGACCAGACA TCTTTGAAGT GCTTTTTGTG 1551 AATTTATGCA TATAATATAA AGACTTTTAT ACTGTACTTA TTGGAATGAA 1601 ATTTCTTTAA AGTATTACGA TNNNCTGACT TCAGAAGTAC CTGCCATCCA 1651 ATACGGTCAT TAGATTGGGT CATCTTGATT AGATTAGATT AGATTAGATT 1701 GTCAACAGAT TGGGCCATCC TTACTTTATG ATAGGCATCA TTTTAGTGTG 1751 TTACAATAGT AACAGTATGC AAAAGCAGCA TTCAGGAGCC GAAAGATAGT 1801 CTTGAAGTCA TTCAGAAGTG GTTTGAGGTT TCTGTTTTTT GGTGGTTTTT 1851 GTTTGTTTTT TTTTTTTTTC ACCTTAAGGG AGGCTTTCAT TTCCTCCCGA 1901 CTGATTGTCA CTTAAATCAA AATTTAAAAA TGAATAAAAA GACATACTTC 1951 TCAGCTGCAA ATATTATGGA GAATTGGGCA CCCACAGGAA TGAAGAGAGA 2001 AAGCAGCTCC CCAACTTCAA AACCATTTTG GTACCTGACA ACAAGAGCAT 2051 TTTAGAGTAA TTAATTTAAT AAAGTAAATT AGTATTGCTG CAAATAGCTA 2101 AATTATATTT ATTTGAATTG ATGGTCAAGA GATTTTCCAT TTTTTTTACA 2151 GACTGTTCAG TGTTTGTCAA GCTTCTGGTC TAATATGTAC TCGAAAGACT 2201 TTCCGCTTAC AATTTGTAGA AACACAAATA TCGTTTTCCA TACAGCAGTG 2251 CCTATATAGT GACTGATTTT AACTTTCAAT GTCCATCTTT CAAAGGAAGT 2301 AACACCAAGG TACAATGTTA AAGGAATATT CACTTTACCT AGCAGGGAAA 2351 AATACACAAA AACTGCAGAT ACTTCATATA GCCCATTTTA ACTTGTATAA 2401 ACTGTGTGAC TTGTGGCGTC TTATAAATAA TGCACTGTAA AGATTACTGA 2451 ATAGTTGTGT CATGTTAATG TGCCTAATTT CATGTATCTT GTAATCATGA 2501 TTGAGCCTCA GAATCATTTG GAGAAACTAT ATTTTAAAGA ACAAGACATA 2551 CTTCAATGTA TTATACAGAT AAAGTATTAC ATGTGTTTGA TTTTAAAAGG 2601 GCGGACATTT TATTAAAATC AAGG // LOCUS A30262 1065 bp mRNA PAT 09-OCT-1995 DEFINITION H.sapiens beta-casein cDNA. ACCESSION A30262 NID g1247534 VERSION A30262.1 GI:1247534 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1065) AUTHORS . TITLE HUMAN BETA-CASEIN, PROCESS FOR PRODUCING IT AND USE THEREOF JOURNAL Patent: WO 9304171-A 16 04-MAR-1993; FEATURES Location/Qualifiers source 1. .1065 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 4. .681 /codon_start=1 /product="beta-casein" /protein_id="CAA02017.1" /db_xref="PID:e204047" /db_xref="PID:g1247535" /db_xref="GI:1247535" /db_xref="SWISS-PROT:P05814" /translation="MKVLILACLVALALARETIESLSSSEESITEYKKVEKVKHEDQQ QGEDEHQDKIYPSFQPQPLIYPFVEPIPYGFLPQNILPLAQPAVVLPVPQPEIMEVPK AKDTVYTKGRVMPVLKSPTIPFFDPQIPKLTDLENLHLPLPLLQPLMQQVPQPIPQTL ALPPQPLWSVPQPKVLPIPQQVVPYPQRAVPVQALLLNQELLLNPTHQIYPVTQPLAP VHNPISV" BASE COUNT 296 a 277 c 165 g 327 t ORIGIN 1 CGGATGAAGG TCCTCATCCT CGCCTGCCTG GTGGCTCTTG CTCTTGCAAG 51 GGAGACCATA GAAAGCCTTT CAAGCAGTGA GGAATCTATT ACAGAATACA 101 AGAAAGTTGA GAAGGTTAAA CATGAGGACC AGCAGCAAGG AGAGGATGAA 151 CACCAGGATA AAATCTACCC CTCTTTCCAG CCACAGCCTC TGATCTATCC 201 ATTCGTTGAA CCTATCCCCT ATGGTTTTCT TCCACAAAAC ATTCTGCCTC 251 TTGCTCAGCC TGCTGTGGTG CTGCCTGTCC CTCAGCCTGA AATAATGGAA 301 GTCCCTAAAG CTAAAGACAC TGTCTACACT AAGGGCAGAG TGATGCCTGT 351 CCTTAAATCT CCAACGATAC CCTTTTTTGA CCCTCAAATC CCAAAACTCA 401 CTGATCTTGA AAATCTGCAT CTTCCTCTGC CTCTGCTCCA GCCCTTGATG 451 CAGCAGGTCC CTCAGCCTAT TCCTCAGACT CTTGCACTTC CCCCTCAGCC 501 CCTGTGGTCT GTTCCTCAGC CCAAAGTCCT GCCTATCCCC CAGCAAGTGG 551 TGCCCTACCC TCAGAGAGCT GTGCCTGTTC AAGCCCTTCT GCTCAACCAA 601 GAACTTCTAC TTAACCCCAC CCACCAGATC TACCCTGTGA CTCAGCCACT 651 TGCCCCAGTT CATAACCCCA TTAGTGTCTA AGAAGATTTC AAAGTTAATT 701 TTCCCTCCTT ATTTTTGAAT TGACTGAGAC TGGAAATATG ATGCCTTTTC 751 CGTCTTTGTA TCACGTTACC CCAAATTAAG TATGTTTGAA TGAGTTTATA 801 TGGAAAAAAT GAACTTTGTC CCTTTATTTA TTTTATATAT TATGTCATTC 851 ATTTAATTTG AAATTTGACT CATGAACTAT TTACATTTTC CAAATCTTAA 901 TTCAACTAGT ACCACAGAAG TTCAATACTC ATTTGGAAAT GCTACAAACA 951 TATCAAACAT ATGTATACAA ATTGTTTCTG GAATTGTGCT TATTTTTATT 1001 TCTTTAAGAA TCTATTTCCT TTCCAGTCAT TTCAATAAAT TATTCTTAAG 1051 CATAAAAAAA AAAAA // LOCUS A35395 2296 bp mRNA PAT 11-DEC-1996 DEFINITION H.sapiens u-PA cDNA sequence. ACCESSION A35395 NID g1926844 VERSION A35395.1 GI:1926844 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2296) AUTHORS Meyhack,B., Heim,J. and Buergi,R. TITLE Process for the production of proteins JOURNAL Patent: EP 0288435-A 2 26-OCT-1988; CIBA-GEIGY AG FEATURES Location/Qualifiers source 1. .2296 /organism="Homo sapiens" /db_xref="taxon:9606" gene 70. .1365 /gene="u-PA" CDS 70. .1365 /gene="u-PA" /codon_start=1 /protein_id="CAA02215.1" /db_xref="PID:e285030" /db_xref="PID:g1926845" /db_xref="GI:1926845" /translation="MRALLARLLLCVLVVSDSKGSNELHQVPSNCDCLNGGTCVSNKY FSNIHWCNCPKKFGGQHCEIDKSKTCYEGNGHFYRGKASTDTMGRPCLPWNSATVLQQ TYHAHRSDALQLGLGKHNYCRNPDNRRRPWCYVQVGLKPLVQECMVHDCADGKKPSSP PEELKFQCGQKTLRPRFKIIGGEFTTIENQPWFAAIYRRHRGGSVTYVCGGSLISPCW VISATHCFIDYPKKEDYIVYLGRSRLNSNTQGEMKFEVENLILHKDYSADTLAHHNDI ALLKIRSKEGRCAQPSRTIQTICLPSMYNDPQFGTSCEITGFGKENSTDYLYPEQLKM TVVKLISHRECQQPHYYGSEVTTKMLCAADPQWKTDSCQGDSGGPLVCSLQGRMTLTG IVSWGRGCALKDKPGVYTRVSHFLPWIRSHTKEENGLAL" BASE COUNT 547 a 594 c 607 g 548 t ORIGIN 1 CCCGGGCTCC GGGCTGCGGT CTCCTGCCGC AGCCACCGAG CCGCCGTCTA 51 GCGCCCCGAC CTCGCCACCA TGAGAGCCCT GCTGGCGCGC CTGCTTCTCT 101 GCGTCCTGGT CGTGAGCGAC TCCAAAGGCA GCAATGAACT TCATCAAGTT 151 CCATCGAACT GTGACTGTCT AAATGGAGGA ACATGTGTGT CCAACAAGTA 201 CTTCTCCAAC ATTCACTGGT GCAACTGCCC AAAGAAATTC GGAGGGCAGC 251 ACTGTGAAAT AGATAAGTCA AAAACCTGCT ATGAGGGGAA TGGTCACTTT 301 TACCGAGGAA AGGCCAGCAC TGACACCATG GGCCGGCCCT GCCTGCCCTG 351 GAACTCTGCC ACTGTCCTTC AGCAAACGTA CCATGCCCAC AGATCTGATG 401 CTCTTCAGCT GGGCCTGGGG AAACATAATT ACTGCAGGAA CCCAGACAAC 451 CGGAGGCGAC CCTGGTGCTA TGTGCAGGTG GGCCTAAAGC CGCTTGTCCA 501 AGAGTGCATG GTGCATGACT GCGCAGATGG AAAAAAGCCC TCCTCTCCTC 551 CAGAAGAATT AAAATTTCAG TGTGGCCAAA AGACTCTGAG GCCCCGCTTT 601 AAGATTATTG GGGGAGAATT CACCACCATC GAGAACCAGC CCTGGTTTGC 651 GGCCATCTAC AGGAGGCACC GGGGGGGCTC TGTCACCTAC GTGTGTGGAG 701 GCAGCCTCAT CAGCCCTTGC TGGGTGATCA GCGCCACACA CTGCTTCATT 751 GATTACCCAA AGAAGGAGGA CTACATCGTC TACCTGGGTC GCTCAAGGCT 801 TAACTCCAAC ACGCAAGGGG AGATGAAGTT TGAGGTGGAA AACCTCATCC 851 TACACAAGGA CTACAGCGCT GACACGCTTG CTCACCACAA CGACATTGCC 901 TTGCTGAAGA TCCGTTCCAA GGAGGGCAGG TGTGCGCAGC CATCCCGGAC 951 TATACAGACC ATCTGCCTGC CCTCGATGTA TAACGATCCC CAGTTTGGCA 1001 CAAGCTGTGA GATCACTGGC TTTGGAAAAG AGAATTCTAC CGACTATCTC 1051 TATCCGGAGC AGCTGAAAAT GACTGTTGTG AAGCTGATTT CCCACCGGGA 1101 GTGTCAGCAG CCCCACTACT ACGGCTCTGA AGTCACCACC AAAATGCTGT 1151 GTGCTGCTGA CCCACAGTGG AAAACAGATT CCTGCCAGGG AGACTCAGGG 1201 GGACCCCTCG TCTGTTCCCT CCAAGGCCGC ATGACTTTGA CTGGAATTGT 1251 GAGCTGGGGC CGTGGATGTG CCCTGAAGGA CAAGCCAGGC GTCTACACGA 1301 GAGTCTCACA CTTCTTACCC TGGATCCGCA GTCACACCAA GGAAGAGAAT 1351 GGCCTGGCCC TCTGAGGGTC CCCAGGGAGG AAACGGGCAC CACCCGCTTT 1401 CTTGCTGGTT GTCATTTTTG CAGTAGAGTC ATCTCCATCA GCTGTAAGAA 1451 GAGACTGGGA AGATAGGCTC TGCACAGATG GATTTGCCTG TGGCACCACC 1501 AGGGTGAACG ACAATAGCTT TACCCTCACG GATAGGCCTG GGTGCTGGCT 1551 GCCCAGACCC TCTGGCCAGG ATGGAGGGGT GGTCCTGACT CAACATGTTA 1601 CTGACCAGCA ACTTGTCTTT TTCTGGACTG AAGCCTGCAG GAGTTAAAAA 1651 GGGCAGGGCA TCTCCTGTGC ATGGGCTCGA AGGGAGAGCC AGCTCCCCCG 1701 ACCGGTGGGC ATTTGTGAGG CCCATGGTTG AGAAATGAAT AATTTCCCAA 1751 TTAGGAAGTG TAAGCAGCTG AGGTCTCTTG AGGGAGCTTA GCCAATGTGG 1801 GAGCAGCGGT TTGGGGAGCA GAGACACTAA CGACTTCAGG GCAGGGCTCT 1851 GATATTCCAT GAATGTATCA GGAAATATAT ATGTGTGTGT ATGTTTGCAC 1901 ACTTGTTGTG TGGGCTGTGA GTGTAAGTGT GAGTAAGAGC TGGTGTCTGA 1951 TTGTTAAGTC TAAATATTTC CTTAAACTGT GTGGACTGTG ATGCCACACA 2001 GAGTGGTCTT TCTGGAGAGG TTATAGGTCA CTCCTGGGGC CTCTTGGGTC 2051 CCCCACGTGA CAGTGCCTGG GAATGTACTT ATTCTGCAGC ATGACCTGTG 2101 ACCAGCACTG TCTCAGTTTC ACTTTCACAT AGATGTCCCT TTCTTGGCCA 2151 GTTATCCCTT CCTTTTAGCC TAGTTCATCC AATCCTCACT GGGTGGGGTG 2201 AGGACCACTC CTTACACTGA ATATTTATAT TTCACTATTT TTATTTATAT 2251 TTTTGTAATT TTAAATAAAA GTGATCAATA AAATGTGATT TTTCTG // LOCUS AB000509 3993 bp mRNA PRI 25-MAR-1998 DEFINITION Homo sapiens mRNA for TRAF5, complete cds. ACCESSION AB000509 NID g2982670 VERSION AB000509.1 GI:2982670 KEYWORDS TRAF5. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3993) AUTHORS Mizushima,S. TITLE Direct Submission JOURNAL Submitted (16-JAN-1997) to the DDBJ/EMBL/GenBank databases. Seiichi Mizushima, Mochida Pharmaceutical Co.,LTD, Biosciences Research Laboratory; 1-1, Kamiya 1-chome, Kita-ku, Tokyo 115, Japan (E-mail:smizushi@mochida.co.jp, Tel:03-3913-6261) REFERENCE 2 (sites) AUTHORS Mizushima,S., Fujita,M., Ishida,T., Azuma,S., Kato,K., Hirai,M., Otsuka,M., Yamamoto,T. and Inoue,J. TITLE Cloning and characterization of a cDNA encoding the human homolog of tumor necrosis factor receptor-associated factor 5 (TRAF5) JOURNAL Gene 207 (2), 135-140 (1998) MEDLINE 98172745 FEATURES Location/Qualifiers source 1. .3993 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 55. .1728 /codon_start=1 /product="TRAF5" /protein_id="BAA25262.1" /db_xref="PID:d1026190" /db_xref="PID:g2982671" /db_xref="GI:2982671" /translation="MAYSEEHKGMPCGFIRQNSGNSISLDFEPSIEYQFVERLEERYK CAFCHSVLHNPHQTGCGHRFCQHCILSLRELNTVPICPVDKEVIKSQEVFKDNCCKRE VLNLYVYCSNAPGCNAKVILGRYQDHLQQCLFQPVQCSNEKCREPVLRKDLKEHLSAS CQFRKEKCLYCKKDVVVINLQNHEENLCPEYPVFCPNNCAKIILKTEVDEHLAVCPEA EQDCPFKHYGCAVTDKRRNLQQHEHSALREHMRLVLEKNVQLEEQISDLHKSLEQKES KIQQLAETIKKLEKEFKQFAQLFGKNGSFLPNIQVFASHIDKSAWLEAQVHQLLQMVN QQQNKFDLRPLMEAVDTVKQKITLLENNDQRLAVLEEETNKHDTHINIHKAQLSKNEE RFKLLEGTCYNGKLIWKVTDYKMKKREAVDGHTVSIFSQSFYTSRCGYRLCARAYLNG DGSGRGSHLSLYFVVMRGEFDSLLQWPFRQRVTLMLLDQSGKKNIMETFKPDPNSSSF KRPDGEMNIASGCPRFVAHSVLENAKNAYIKDDTLFLKVAVDLTDLEDL" BASE COUNT 1198 a 797 c 866 g 1132 t ORIGIN 1 GCAGCAGCCG CGCCTGCAGA CCGGCCTCGC GGAGCCCGCG CGCCGAGCCC 51 CACAATGGCT TATTCAGAAG AGCATAAAGG TATGCCCTGT GGTTTCATCC 101 GCCAGAATTC CGGCAACTCC ATTTCCTTGG ACTTTGAGCC CAGTATAGAG 151 TACCAGTTTG TGGAGCGGTT GGAAGAGCGC TACAAATGTG CCTTCTGCCA 201 CTCGGTGCTT CACAACCCCC ACCAGACAGG ATGTGGGCAC CGCTTCTGCC 251 AGCACTGCAT CCTGTCCCTG AGAGAATTAA ACACAGTGCC AATCTGCCCT 301 GTAGATAAAG AGGTCATCAA ATCTCAGGAG GTTTTTAAAG ACAATTGTTG 351 CAAAAGAGAA GTCCTCAACT TATATGTATA TTGCAGCAAT GCTCCTGGAT 401 GTAATGCCAA GGTTATTCTG GGCCGGTACC AGGATCACCT TCAGCAGTGC 451 TTATTTCAAC CTGTGCAGTG TTCTAATGAG AAGTGCCGGG AGCCAGTCCT 501 ACGGAAAGAC CTGAAAGAGC ATTTGAGTGC ATCCTGTCAG TTTCGAAAGG 551 AAAAATGCCT TTATTGCAAA AAGGATGTGG TAGTCATCAA TCTACAGAAT 601 CATGAGGAAA ACTTGTGTCC TGAATACCCA GTATTTTGTC CCAACAATTG 651 TGCGAAGATT ATTCTAAAAA CTGAGGTAGA TGAACACCTG GCTGTATGTC 701 CTGAAGCTGA GCAAGACTGT CCTTTTAAGC ACTATGGCTG TGCTGTAACG 751 GATAAACGGA GGAACCTGCA GCAACATGAG CATTCAGCCT TACGGGAGCA 801 CATGCGTTTG GTTTTAGAAA AGAATGTCCA ATTAGAAGAA CAGATTTCTG 851 ACTTACACAA GAGCCTAGAA CAGAAAGAAA GTAAAATCCA GCAGCTAGCA 901 GAAACTATAA AGAAACTTGA AAAGGAGTTC AAGCAGTTTG CACAGTTGTT 951 TGGCAAAAAT GGAAGCTTCC TCCCAAACAT CCAGGTTTTT GCCAGTCACA 1001 TTGACAAGTC AGCTTGGCTA GAAGCTCAAG TGCATCAATT ATTACAAATG 1051 GTTAACCAGC AACAAAATAA ATTTGACCTG AGACCTTTGA TGGAAGCAGT 1101 TGATACAGTG AAACAGAAAA TTACCCTGCT AGAAAACAAT GATCAAAGAT 1151 TAGCCGTTTT AGAAGAGGAA ACTAACAAAC ATGATACCCA CATTAATATT 1201 CATAAAGCAC AGCTGAGTAA AAATGAAGAG CGATTTAAAC TGCTGGAGGG 1251 TACTTGCTAT AATGGAAAGC TCATTTGGAA GGTGACAGAT TACAAGATGA 1301 AGAAGAGAGA GGCGGTGGAT GGGCACACAG TGTCCATCTT CAGCCAGTCC 1351 TTCTACACCA GCCGCTGTGG CTACCGGCTC TGTGCTAGAG CATACCTGAA 1401 TGGGGATGGG TCAGGGAGGG GGTCACACCT GTCCCTATAC TTTGTGGTCA 1451 TGCGAGGAGA GTTTGACTCA CTGTTGCAGT GGCCATTCAG GCAGAGGGTG 1501 ACCCTGATGC TTCTGGACCA GAGTGGCAAA AAGAACATTA TGGAGACCTT 1551 CAAACCTGAC CCCAATAGCA GCAGCTTTAA AAGACCTGAT GGGGAGATGA 1601 ACATTGCATC TGGCTGTCCC CGCTTTGTGG CTCATTCTGT TTTGGAGAAT 1651 GCCAAGAACG CCTACATTAA AGATGACACT CTGTTCTTGA AAGTGGCCGT 1701 GGACTTAACT GACCTGGAGG ATCTCTAGTC ACTGTTATGG GGTGATAAGA 1751 GGACTTCTTG GGGCCAGAAC TGTGGAGGAG AGCACATTTG ATTATCATAT 1801 TGACCTGGAT TTAGACTCAA AGCACATTTG TATTTGCCTT TTTCCTTAAC 1851 GTTTGAAGTC AGTTTAAAAC TTCTGAAGTG CTGTCTTTTT ACATTTTACT 1901 CTGTCCCAGT TTGAAACTTA AAACTCTTAG AATATTCTCT TATTATTTAT 1951 ATTTTTATAT TTCTTGAAAG ATGGTAAGTT TCTTGAAGTT TTTGGGGCGT 2001 TTCTCTTTTA CTGGTGCTTA GCGCAGTGTC TCGGGCACTC TAAATATTGA 2051 GTGTTATGGA GGACACAGAG GTAGCAGAAT CCCAGTTGAA AATGTTTTGA 2101 TATTTTATTG TTTGGCCTAT TGATTCTAGA CCTGGCCTTA AGTCTGCAAA 2151 AGCCATCTTT ATAAGGTAGG CTGTTCCAGT TAAGAAGTGG GTGATGTAGT 2201 TACAAAGATA ATATGCTCAG TTTGGACCTT TTTTTCAGTT AAATGCTAAA 2251 TATATGAAAA TTACTATACC TCTAAGTATT TTCATGAAAT TCACCAGCAG 2301 TTTGCAAGCA CAGTTTTGCA AGGCTGCATA AGAACTGGTG AATGGGGTAA 2351 GCATTTTCAT TCTTCCTGCT GAAGTAAAGC AGAAAGTACT GCATAGTATA 2401 TGAGATATAG CCAGCTAGCT AAAGTTCAGA TTTTGTTAGG TTCAACCCTA 2451 TGAAAAAAAC TATTTTCATA GGTCAAAAAT GGTAAAAAAT TAGCAGTTTC 2501 ATAAGATTCA ACCAAATAAA TATATATATA CACACACACA TACATATACA 2551 CCTATATATG TGTGTATACA AACAGTTCGA ATGTATTTTG GTGACAGTAA 2601 TAAATCAATG TGAGGATGGA TAGAATTTAG TATATGATAG AGAAAATGTC 2651 ATAAATGGAT AAAAGGAATT TACAACTTGA GGAGAAAACC TTTACAATTT 2701 CCTATGGGTG TCAGAAGTAC TCTCAGCGAA AACTGATGGC TAAAACAGTA 2751 TCTACTATTC TCTGATAACT TTTTTTTTGA GACAGAGTTT CATTGTCACC 2801 CAGGCTGGAG TACAGTGGCA TGATCTCAGC TCACTGCAAA CTCTGCCTCC 2851 CGAATTCAAG TGATTCTCCT GCCTCAGCCT CCTGAGTAGC TGGGATTACA 2901 GGCGCCCGTC ACCACACCCA GGTAATTTTT GTATTTTTAG TAGAGACGGA 2951 GTTTTGCCAT GTTGGCCAAG CTGATCTCAA ACTCCTGACC TCAAGTGATC 3001 TGCCCGCCTC GGCCTCCCAA AGTGCTGAGA TTACAGGCAT GACCCACCGC 3051 GTCAAGCCTC TGACAACTAT TGAATTTGTA AGCTGCTATG CAAATGGGCA 3101 TTTATATAAA CTTGTGATGT TTCTTGTCAG AATTCTGAGT ACTCTGTGAA 3151 GAACAGAAAT GATCATATTC TTATGCATCT ATCTGTATGG GTCTGAAGGT 3201 GTATATACAA ACTGAGATGA GTCCTTATGA CTCTTGATAA GCCTGAGTTT 3251 AACAACAACA AAAATGCCAA GTTGTCCTGA GCCCTTCTGC GTTGTTATGC 3301 CACTTCCCTA CTGCTCATAT GCACGCTGGC TCCCCTGGGC ACGCAAGGAT 3351 GAGTATGGGC CATGGGCCCC TGTAGAGCTG CTTACCTGGT GATGACCATG 3401 CACCTTACAA TTTCTGAACA GTTAACCCTA TAGAAGCATG CTTTATATGA 3451 GTGTCTTCTG GGAAGAGGAA CCTTCTTAAT CTCTTCTGTG GGATTTTCAA 3501 AATGCTAAAG ACTCACACTG CAGCAATCAT CCCAGATGAT TAAATTCAAA 3551 GAAATAGGTT CACAACAGGA ATATACTGAA GAACTAGAGT GTCACTGCTG 3601 GTGAACTGTG GCACGGTTGC TCAACACATC ACCTCGGACA AATTCAGGAA 3651 GCATTTCTTT AGCCCACAAG TCCAGACCCA GGTGCTCTGT ATGTTTGTTT 3701 TTAATATTCA TCATATCCAA GTTCACTCTG TCTTCCTGAG CAGTGGAAGA 3751 TCATATTGCT GTAACTTCTT TTAAGTAGTT GATGTGGAAA ACATTTTAAA 3801 GTGAATTTGT CAAAATGCTG GTTTTGTGTT TTATCCAACT TTTGTGCATA 3851 TATATAAAGT ATGTCATGGC ATGGTTTGCT TAGGAGTTCA GAGTTCCTTC 3901 ATCATCGAAA TAGTGATTAA GTGATCCCAG AACAAGGAAT ACTAGAGTAA 3951 AAAGCACCTC TTTTTCAGAA AAAAAAAAAA AAAAAAAAAA AAA // LOCUS AB001466 3114 bp mRNA PRI 04-FEB-1998 DEFINITION Homo sapiens mRNA for Efs1, complete cds. ACCESSION AB001466 NID g2829301 VERSION AB001466.1 GI:2829301 KEYWORDS Efs1. SOURCE Homo sapiens 2 years old female hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3114) AUTHORS Ishino,M. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) to the DDBJ/EMBL/GenBank databases. Masaho Ishino, Sapporo Medical University, Department of Biochemistry, Cancer Research Institute; S1, W17, Chuo-ku, Sapporo, Hokkaido 060, Japan (E-mail:ishino@cc.sapmed.ac.jp, Tel:011-611-2111, Fax:011-612-5861) REFERENCE 2 (sites) AUTHORS Ishino,M., Ohba,T., Inazawa,J., Sasaki,H., Ariyama,Y. and Sasaki,T. TITLE Identification of an Efs isoform that lacks the SH3 domain and chromosomal mapping of human Efs JOURNAL Oncogene 15 (14), 1741-1745 (1997) MEDLINE 98007665 FEATURES Location/Qualifiers source 1. .3114 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /tissue_type="hippocampus" CDS 609. .2294 /codon_start=1 /product="Efs1" /protein_id="BAA24588.1" /db_xref="PID:d1025508" /db_xref="PID:g2829302" /db_xref="GI:2829302" /translation="MAIATSTQLARALYDNTAESPQELSFRRGDVLRVLQREGAGGLD GWCLCSLHGQQGIVPANRVKLLPAGPAPKPSLSPASPAQPGSPYPAPDHSNEDQEVYV VPPPARPCPTSGPPAGPCPPSPDLIYKIPRASGTQLAAPRDALEVYDVPPTALRVPSS GPYDCPASFSHPLTRVAPQPPGEDDAPYDVPLTPKPPAELEPDLEWEGGREPGPPIYA APSNLKRASALLNLYEAPEELLADGEGGGTDEGIYDVPLLGPEAPPSPEPPGALASHD QDTLAQLLARSPPPPHRPRLPSAESLSRRPLPALPVPEAPSPSPVPSPAPGRKGSIQD RPLPPPPPRLPGYGGPKVEGDPEGREMEDDPAGHHNEYEGIPMAEEYDYVHLKGMDKA QGSRPPDQACTGDPELPERGMPAPQEALSPGEPLVVSTGDLQLLYFYAGQCQSHYSAL QAAVAALMSSTQANQPPRLFVPHSKRVVVAAHRLVFVGDTLGRLAASAPLRAQVRAAG TALGQALRATVLAVKGAALGYPSSPAIQEMVQCVTELAGQALQFTTLLTSLAP" polyA_site 3114 /note="10 A nucleotides" BASE COUNT 570 a 1055 c 893 g 596 t ORIGIN 1 CTCCAGGCAA CTTGGGGCAA GCGTCTCAGT TCTCGCTCTC CCTTCCTCCC 51 AGCGGGGTCG CCGCAGACCC CAGCCCTGGG AGCACCGCTC TGCAGCGCGG 101 CCGGCGGGTG GAGACGGTTG GCCCCTAAAC TCGCTCGTCC AGCCCAACCG 151 CCCCGGCGGC TTCTCCCAGC CCTCGAGGCT CTCCTGAGCG GCCTGGAGAG 201 GCGTCGAGCG CAGCCCAGCG CCCGCCTGCT CACCCGCCCC GGCCCGGGAA 251 GGGAATTTTC GGATCCTGCG AGCCCGGGGC GCCCCCGCGG CCTAGGGCGG 301 GCAGCTCCCG GGGCCTGGCC GAGCCGGTGG CGCCCGGGAG GCCGCGGGGA 351 CAGCACGCAG CGCGCGCCCT TGGATGCCGT CCCGCAGCGA CGCCCCGGCC 401 CGCCCCGCTC CTCCTCCTGC CTGGCTAGCC TGCCTCTCAT TTGGGAAGTT 451 TTGTGGGTTT TCTTTCTCCT CCTCCAACCT TGGCGGAGGC CACGACTCAG 501 GCGCCACAGC TGGGGGCTAG AGGCCGCGGA CCATGGTGCG GGGCAGCCAC 551 CGCTGAAGTC AGCAAAACCG AGCCTGGCCT GAGGCAGGCT GCGCGGGAGG 601 CCAAAGCCAT GGCCATTGCC ACGTCGACCC AGCTGGCCCG GGCACTGTAT 651 GACAACACCG CTGAGTCCCC CCAGGAGCTG TCCTTCCGCC GAGGGGATGT 701 CCTACGGGTC CTGCAGAGAG AGGGCGCTGG TGGACTGGAC GGCTGGTGCC 751 TCTGCTCCCT ACACGGCCAG CAGGGCATTG TGCCCGCCAA CAGGGTGAAG 801 CTCTTGCCTG CTGGCCCAGC ACCCAAGCCC AGCCTCTCTC CTGCGTCCCC 851 AGCCCAGCCT GGCTCACCAT ATCCAGCCCC AGATCACAGC AATGAGGACC 901 AGGAGGTGTA TGTGGTGCCG CCCCCAGCTC GGCCCTGTCC AACCTCAGGA 951 CCTCCAGCTG GACCTTGCCC ACCCTCTCCT GACCTCATCT ACAAAATCCC 1001 CAGAGCTAGT GGGACCCAGC TGGCTGCTCC CAGAGATGCC TTGGAGGTCT 1051 ACGATGTGCC CCCCACCGCC CTCCGGGTGC CCTCCAGTGG CCCCTATGAC 1101 TGCCCTGCCT CCTTTTCCCA CCCTCTGACC CGGGTTGCCC CGCAGCCCCC 1151 TGGAGAGGAT GATGCTCCCT ATGATGTGCC TCTGACCCCA AAGCCACCTG 1201 CAGAGCTGGA ACCAGATCTG GAGTGGGAAG GAGGCCGGGA GCCGGGGCCC 1251 CCCATCTATG CTGCCCCCTC CAACCTGAAA CGAGCGTCAG CCTTACTCAA 1301 TTTGTATGAA GCACCCGAGG AACTGCTGGC AGACGGGGAG GGCGGGGGCA 1351 CTGATGAGGG GATCTACGAT GTGCCTCTGC TGGGGCCAGA GGCTCCCCCT 1401 TCTCCAGAGC CCCCTGGAGC CTTGGCCTCC CATGACCAGG ACACCCTGGC 1451 CCAGCTTCTG GCCAGAAGCC CCCCACCCCC ACACAGGCCC CGGCTCCCCT 1501 CAGCTGAGAG CCTGTCCCGC CGCCCTCTGC CTGCCCTGCC TGTCCCTGAG 1551 GCCCCCAGCC CCTCCCCAGT GCCCTCTCCT GCCCCAGGCC GGAAGGGCAG 1601 CATCCAGGAC CGGCCTCTGC CCCCACCCCC ACCCCGCCTG CCTGGTTATG 1651 GAGGCCCCAA GGTCGAGGGG GATCCAGAGG GCAGGGAGAT GGAGGATGAC 1701 CCAGCAGGAC ACCACAATGA GTACGAGGGC ATTCCGATGG CCGAGGAGTA 1751 TGACTATGTC CACCTGAAGG GCATGGACAA AGCTCAGGGA TCTAGGCCCC 1801 CGGATCAGGC CTGCACAGGG GATCCTGAAC TGCCCGAGAG GGGGATGCCG 1851 GCGCCGCAGG AGGCCCTGTC CCCAGGGGAG CCACTGGTTG TGTCCACCGG 1901 AGATCTGCAG CTCCTGTACT TCTATGCTGG GCAATGCCAG AGCCACTACT 1951 CAGCCCTGCA GGCAGCCGTG GCAGCCCTGA TGTCCAGTAC CCAGGCTAAT 2001 CAGCCCCCGC GCCTTTTCGT GCCCCACAGC AAGAGGGTGG TGGTGGCTGC 2051 TCATCGCCTG GTGTTTGTTG GGGACACCCT GGGCCGGCTG GCAGCCTCTG 2101 CCCCTCTGAG AGCACAGGTC AGGGCTGCAG GTACAGCACT GGGCCAGGCA 2151 TTGCGGGCCA CTGTGCTGGC TGTCAAGGGA GCTGCCCTGG GCTACCCATC 2201 CAGCCCTGCC ATCCAAGAGA TGGTGCAGTG TGTAACAGAA CTGGCAGGGC 2251 AGGCCCTGCA ATTCACTACC CTGCTCACTA GCCTGGCTCC ATGAAGGTCC 2301 TTTGGCACAG CTCTGCTCCT CCCCTGCCTG CCAAAGCCCC CCTTTAGGCC 2351 TTGGGTGGCT GGAAGGCTTT GTTAAGGGAC TAGGAGAAAT GGGGGTATCT 2401 TTCCCCTTTC CTGCCCTTTC TGCTCATCTC AACCTCTCAC AGAGGTGTCT 2451 TCTCCCCCTA ACCTACAGCT TTTTGTACAA GCCATTTTGT GTAAATTATT 2501 TATATTTAAT ATTATTCCCT GCTTTGTCAG GAGCAGGTAC TAGGCTCTGG 2551 GGCAGTGAGG AACTAGATCC TTCTCTCCTC AGCCTAGGGT GGAGGTCACT 2601 GCACTACCAC CCACCTCTGG AAGACTGGCT GTGAAAAGTC AGGTGGCAGA 2651 AACCTGGGGC CACATAGAGC CTCTCTCTTT TCCTGTTTCT TGGCTCTAGA 2701 AGATCAGCAC TGCACTGTTA GCTGAGAGTG CGGGCAAGAC ATAAACTGTC 2751 CAGAGTTTGA AGGTTCTCGG AAAGACCGGA GGGCTTCTCC CCACAGAAGG 2801 CGGAGAGAGC TGGGGCTCAG ACATGGGTGT GCACCTTAAT AAACCCTGCT 2851 GTCTGCCTCC CTGACTCTGC TTCTTGGGAG CATGGTGAGC AGCCCTGGTG 2901 CTCAGCAGCC ATACCTATGG GACACACACT ACGAAAAGGA TGCCTTTAGG 2951 GTTTGGGGGA GATTTTACTC CTTTCTTCAA CAACTATTCA CTGGACAAGT 3001 TCTCTGCTCC CATGACGCGC CAGGCACAGT TCTGCAAGTA TATTGTGAAT 3051 GTATTGTTCT AGTGGGATAC ACAAATAAGT CAGTTAAAAT ACATAAATAA 3101 AAACATAAAC CTGC // LOCUS AB001467 2578 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens mRNA for Efs2, complete cds. ACCESSION AB001467 NID g2829303 VERSION AB001467.1 GI:2829303 KEYWORDS Efs2. SOURCE Homo sapiens 2 years old female hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2578) AUTHORS Ishino,M. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) to the DDBJ/EMBL/GenBank databases. Masaho Ishino, Sapporo Medical University, Department of Biochemistry, Cancer Research Institute; S1, W17, Chuo-ku, Sapporo, Hokkaido 060, Japan (E-mail:ishino@cc.sapmed.ac.jp, Tel:011-611-2111, Fax:011-612-5861) REFERENCE 2 (sites) AUTHORS Ishino,M., Ohba,T., Inazawa,J., Sasaki,H., Ariyama,Y. and Sasaki,T. TITLE Identification of an Efs isoform that lacks the SH3 domain and chromosomal mapping of human Efs JOURNAL Oncogene 15 (14), 1741-1745 (1997) MEDLINE 98007665 FEATURES Location/Qualifiers source 1. .2578 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /tissue_type="hippocampus" CDS 609. .2015 /codon_start=1 /product="Efs2" /protein_id="BAA24589.1" /db_xref="PID:d1025509" /db_xref="PID:g2829304" /db_xref="GI:2829304" /translation="MAIATSVYVVPPPARPCPTSGPPAGPCPPSPDLIYKIPRASGTQ LAAPRDALEVYDVPPTALRVPSSGPYDCPASFSHPLTRVAPQPPGEDDAPYDVPLTPK PPAELEPDLEWEGGREPGPPIYAAPSNLKRASALLNLYEAPEELLADGEGGGTDEGIY DVPLLGPEAPPSPEPPGALASHDQDTLAQLLARSPPPPHRPRLPSAESLSRRPLPALP VPEAPSPSPVPSPAPGRKGSIQDRPLPPPPPRLPGYGGPKVEGDPEGREMEDDPAGHH NEYEGIPMAEEYDYVHLKGMDKAQGSRPPDQACTGDPELPERGMPAPQEALSPGEPLV VSTGDLQLLYFYAGQCQSHYSALQAAVAALMSSTQANQPPRLFVPHSKRVVVAAHRLV FVGDTLGRLAASAPLRAQVRAAGTALGQALRATVLAVKGAALGYPSSPAIQEMVQCVT ELAGQALQFTTLLTSLAP" BASE COUNT 446 a 893 c 757 g 482 t ORIGIN 1 CTCCAGGCAA CTTGGGGCAA GCGTCTCAGT TCTCGCTCTC CCTTCCTCCC 51 AGCGGGGTCG CCGCAGACCC CAGCCCTGGG AGCACCGCTC TGCAGCGCGG 101 CCGGCGGGTG GAGACGGTTG GCCCCTAAAC TCGCTCGTCC AGCCCAACCG 151 CCCCGGCGGC TTCTCCCAGC CCTCGAGGCT CTCCTGAGCG GCCTGGAGAG 201 GCGTCGAGCG CAGCCCAGCG CCCGCCTGCT CACCCGCCCC GGCCCGGGAA 251 GGGAATTTTC GGATCCTGCG AGCCCGGGGC GCCCCCGCGG CCTAGGGCGG 301 GCAGCTCCCG GGGCCTGGCC GAGCCGGTGG CGCCCGGGAG GCCGCGGGGA 351 CAGCACGCAG CGCGCGCCCT TGGATGCCGT CCCGCAGCGA CGCCCCGGCC 401 CGCCCCGCTC CTCCTCCTGC CTGGCTAGCC TGCCTCTCAT TTGGGAAGTT 451 TTGTGGGTTT TCTTTCTCCT CCTCCAACCT TGGCGGAGGC CACGACTCAG 501 GCGCCACAGC TGGGGGCTAG AGGCCGCGGA CCATGGTGCG GGGCAGCCAC 551 CGCTGAAGTC AGCAAAACCG AGCCTGGCCT GAGGCAGGCT GCGCGGGAGG 601 CCAAAGCCAT GGCCATTGCC ACGTCGGTGT ATGTGGTGCC GCCCCCAGCT 651 CGGCCCTGTC CAACCTCAGG ACCTCCAGCT GGACCTTGCC CACCCTCTCC 701 TGACCTCATC TACAAAATCC CCAGAGCTAG TGGGACCCAG CTGGCTGCTC 751 CCAGAGATGC CTTGGAGGTC TACGATGTGC CCCCCACCGC CCTCCGGGTG 801 CCCTCCAGTG GCCCCTATGA CTGCCCTGCC TCCTTTTCCC ACCCTCTGAC 851 CCGGGTTGCC CCGCAGCCCC CTGGAGAGGA TGATGCTCCC TATGATGTGC 901 CTCTGACCCC AAAGCCACCT GCAGAGCTGG AACCAGATCT GGAGTGGGAA 951 GGAGGCCGGG AGCCGGGGCC CCCCATCTAT GCTGCCCCCT CCAACCTGAA 1001 ACGAGCGTCA GCCTTACTCA ATTTGTATGA AGCACCCGAG GAACTGCTGG 1051 CAGACGGGGA GGGCGGGGGC ACTGATGAGG GGATCTACGA TGTGCCTCTG 1101 CTGGGGCCAG AGGCTCCCCC TTCTCCAGAG CCCCCTGGAG CCTTGGCCTC 1151 CCATGACCAG GACACCCTGG CCCAGCTTCT GGCCAGAAGC CCCCCACCCC 1201 CACACAGGCC CCGGCTCCCC TCAGCTGAGA GCCTGTCCCG CCGCCCTCTG 1251 CCTGCCCTGC CTGTCCCTGA GGCCCCCAGC CCCTCCCCAG TGCCCTCTCC 1301 TGCCCCAGGC CGGAAGGGCA GCATCCAGGA CCGGCCTCTG CCCCCACCCC 1351 CACCCCGCCT GCCTGGTTAT GGAGGCCCCA AGGTCGAGGG GGATCCAGAG 1401 GGCAGGGAGA TGGAGGATGA CCCAGCAGGA CACCACAATG AGTACGAGGG 1451 CATTCCGATG GCCGAGGAGT ATGACTATGT CCACCTGAAG GGCATGGACA 1501 AAGCTCAGGG ATCTAGGCCC CCGGATCAGG CCTGCACAGG GGATCCTGAA 1551 CTGCCCGAGA GGGGGATGCC GGCGCCGCAG GAGGCCCTGT CCCCAGGGGA 1601 GCCACTGGTT GTGTCCACCG GAGATCTGCA GCTCCTGTAC TTCTATGCTG 1651 GGCAATGCCA GAGCCACTAC TCAGCCCTGC AGGCAGCCGT GGCAGCCCTG 1701 ATGTCCAGTA CCCAGGCTAA TCAGCCCCCG CGCCTTTTCG TGCCCCACAG 1751 CAAGAGGGTG GTGGTGGCTG CTCATCGCCT GGTGTTTGTT GGGGACACCC 1801 TGGGCCGGCT GGCAGCCTCT GCCCCTCTGA GAGCACAGGT CAGGGCTGCA 1851 GGTACAGCAC TGGGCCAGGC ATTGCGGGCC ACTGTGCTGG CTGTCAAGGG 1901 AGCTGCCCTG GGCTACCCAT CCAGCCCTGC CATCCAAGAG ATGGTGCAGT 1951 GTGTAACAGA ACTGGCAGGG CAGGCCCTGC AATTCACTAC CCTGCTCACT 2001 AGCCTGGCTC CATGAAGGTC CTTTGGCACA GCTCTGCTCC TCCCCTGCCT 2051 GCCAAAGCCC CCCTTTAGGC CTTGGGTGGC TGGAAGGCTT TGTTAAGGGA 2101 CTAGGAGAAA TGGGGGTATC TTTCCCCTTT CCTGCCCTTT CTGCTCATCT 2151 CAACCTCTCA CAGAGGTGTC TTCTCCCCCT AACCTACAGC TTTTTGTACA 2201 AGCCATTTTG TGTAAATTAT TTATATTTAA TATTATTCCC TGCTTTGTCA 2251 GGAGCAGGTA CTAGGCTCTG GGGCAGTGAG GAACTAGATC CTTCTCTCCT 2301 CAGCCTAGGG TGGAGGTCAC TGCACTACCA CCCACCTCTG GAAGACTGGC 2351 TGTGAAAAGT CAGGTGGCAG AAACCTGGGG CCACATAGAG CCTCTCTCTT 2401 TTCCTGTTTC TTGGCTCTAG AAGATCAGCA CTGCACTGTT AGCTGAGAGT 2451 GCGGGCAAGA CATAAACTGT CCAGAGTTTG AAGGTTCTCG GAAAGACCGG 2501 AGGGCTTCTC CCCACAGAAG GCGGAGAGAG CTGGGGCTCA GACATGGGTG 2551 TGCACCTTAA TAAACCCTGC TGTCTGCC // LOCUS AB002292 8467 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0294 gene, complete cds. ACCESSION AB002292 NID g2224528 VERSION AB002292.1 GI:2224528 KEYWORDS KIAA0294. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HF0223. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8467) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .8467 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HF0223" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 3732. .7097 /gene="KIAA0294" CDS 3732. .7097 /gene="KIAA0294" /codon_start=1 /protein_id="BAA20754.1" /db_xref="PID:d1021590" /db_xref="PID:g2224529" /db_xref="GI:2224529" /translation="MHSDEMIYDDVENGDEGGNSSLEYGWSSSEFESYEEQSDSECKN GIPRSFLRSNHKKQLSHDLTRLKEHYEKKMRDLMASTVGVVEIQQLRQKHELKMQKLV KAAKDGTKDGLERTRAAVKRGRSFIRTKSLIAQDHRSSLEEEQNLFIDVDCKHPEAIL TPMPEGLSQQQVVRRYILGSVVDSEKNYVDALKRILEQYEKPLSEMEPKVLSERKLKT VFYRVKEILQCHSLFQIALASRVSEWDSVEMIGDVFVASFSKSMVLDAYSEYVNNFST AVAVLKKTCATKPAFLEFLKQEQEASPDRTTLYSLMMKPIQRFPQFILLLQDMLKNTS KGHPDRLPLQMALTELETLAEKLNERKRDADQRCEVKQIAKAINERYLNKLLSSGSRY LIRSDDMIETVYNDRGEIVKTKERRVFMLNDVLMCATVSSRPSHDSRVMSSQRYLLKW SVPLGHVDAIEYGSSAGTGEHSRHLAVHPPESLAVVANAKPNKVYMGPGQLYQDLQNL LHDLNVIGQITQLIGNLKGNYQNLNQSVAHDWTSGLQRLILKKEDEIRAADCCRIQLQ LPGKQDKSGRPTFFTAVFNTFTPAIKESWVNSLQMAKLALEEENHMGWFCVEDDGNHI KKEKHPLLVGHMPVMVAKQQEFKIECAAYNPEPYLNNESQPDSFSTAHGFLWIGSCTH QMGQIAIVSFQNSTPKVIECFNVESRILCMLYVPVEEKRREPGAPPDPETPAVRASDV PTICVGTEEGSISIYKSSQGSKKVRLQHFFTPEKSTVMSLACTSQSLYAGLVNGAVAS YARAPDGSWDSEPQKVIKLGVLPVRSLLMMEDTLWAASGGQVFIISVETHAVEGQLEA HQEEGMVISHMAVSGVGIWIAFTSGSTLRLFHTETLKHLQDINIATPVHNMLPGHQRL SVTSLLVCHGLLMVGTSLGVLVALPVPRLQGIPKVTGRGMVSYHAHNSPVKFIVLATA LHEKDKDKSRDSLAPGPEPQDEDQKDALPSGGAGSSLSQGDPDAAIWLGDSLGSMTQK SDLSSSSGSLSLSHGSSSLEHRSEDSTIYDLLKDPVSLRSKARRAKKAKASSALVVCG GQGHRRVHRKARQPHQEELAPTVMVWQIPLLNI" BASE COUNT 2240 a 1923 c 2236 g 2068 t ORIGIN 1 TTCCCCAAAT TGATGGACAT AAACCCATAT GCTTATCTCA GCATGTGTTT 51 AAAAAGCACT TGCTGAGATT CAGTGACCAT CCAACATTAA AAACTGCTGA 101 TAGAAGGAAA CTCACTTAGC TGAATTAAGG ACGTGTTCTT AAAATCTACC 151 GCCAACGTAA TGGGGAGGAG CTCACGGCGT TCGTTAATTT ATTCATTCCG 201 CAAATATGTT TTGGGCAGTT ACATACCGAA TAGTAGATGG AGGTGTGCCT 251 GCTGTCATGG AGATAGGGTG ATTTCATCCT GTTGATCAGG AAAACTCCTA 301 GGTGCTTGCA GGTAAATGTG CCACAAAGAA AGTGAGGACC AAAGGTTAGT 351 TGATGTAAAA ACAAGTTTGA AATGCATTTT GGGGTAATTT ATCCGGTCGC 401 TTCGGGCATT CCTCGCGGAA GGCGTGGTCT GGTGACTCAG AAGCCAACAC 451 ACTGCGGGAG TCCAGCCGTC GGCCCCCTGC CGTGTGGCGA GGCCCAGTGT 501 GTCCCCTTTG TAAGGACAGC ACAAGCAGGA GTTAATGGAC CGGCCATCCA 551 TAGCGGTGGT GGGGCAGGGA GCCAGTTTCC GAAAGAAACT CACGCCGCCG 601 CAGGAGGGCC CTGTGGGATG CTCTGTGCAG AGCTGTTGTG CGGACCGGGA 651 GACGGGAAAG CCTGGTGGCT GCAGGAGGGC ACCGTGCAGA AGTATCCAGT 701 AAACCACCCA CAGCACGGCA GCAGAAAAAC GAGGAAATTA TATGTGTGTA 751 TGTTTATAAG AACTCAGAAG CAATGGTGAG CAAAAAGCAA AAGCAAGAGG 801 AGAAAGTCAC AGTGTGCTGG CATCAAGTTG CATTGAGAGG AGCCCGCGGG 851 GTAGTGCACC CGCATTTCCT CGTTGCGTTG AGAGGCGCCC GCGGGGTAGT 901 GCACCCGCAT TTCCTCGTTT GAGAGGCGCC CGCGGGGTAG TGCACCCGCA 951 TTTCCTAGTT GCCTTGAGAG GTGCCGCGGG GTAGTGCACC CGCATTTCCT 1001 AGTTGCCTTG AGAGGTGCCC GCGGGGTAGT GCACCCGCAT TTCCTAGTTG 1051 CCTTGAGAGG TGCCGCGGGG TAGTGCACCC GCATTTCCTC GTTTCATTGA 1101 GCGGTGCCCG CGGGGTAGTG CACTCGCATT TCCTAGTTGC CTTGAGAGGT 1151 GCCGCGGGGT AGTGCACCCA CATTTCTTCA CTCGTTTAGA GTTCGGGGCT 1201 CTCAGAACAC AGGGAGAATA TGGGAGAATT CCTTACTAGA TGTTACAGAG 1251 GCCACACAGG GCCACTTTTT TCTTTTTTTT TTATTGTGTC AGGTATACGT 1301 AAATATTCCT TTCGGTCAGT TCAGCACGTA GGACTGAGTG GCATTAGGTA 1351 CGCTCACTGT ACAGCCAATG CCTCCATAGG CCACTTTTTA AATACGCGGG 1401 GCTAAGGGCC AACGACAAGA TTGTCATCGA GGACAATAAG TCGATGGCGC 1451 TGCCTGGTCA CTGGCTTGGT CAGAAAACAC ATGCCAGGGT GACTGGATTT 1501 AACGTTCAGT TTTAGAACCA CAAATCTGCC AGCCCAGGCA TTCAAGAGGA 1551 AGTGAGTAAT TCACTGAATT GATGGTTAGC AAGACCCTTC AAAGTCCTTG 1601 GAAGTTCCGT GTTTGCTGGG GGTCACAACA GCAATTGCGT TTCTAAAACA 1651 TTGAAAACCA CCCGTTTTTC ACACATCTGA ATAGCCTGAG TTGTAACAGA 1701 CCTAAGTAAA GGCGTCCAAA CGTGCCTGAT CCTGTGGCTG GGTCCCAGGA 1751 GCCTTAACAA GGCATTGAGA GAGCTGGATT GATTGATTAG ACTCTTGCCA 1801 ACTGCTGTGC GGAATACAGA AAATGCAGCT CCAGCTCTCA AAGGGCTGAA 1851 AATCTAATTG AGGTGAAAAA GGTAACATGT GTGAAAACCA TGTCTGTGTA 1901 GACTTGTTTT GTATGGACCA GAAAAGTTGG AAGTCCAGAG ATGGATGCGA 1951 GGAAGTAGGG GATGGAGTTT CCTTATAACC TCTGGAGGGA CAGTCCAGAT 2001 ACATACCGGG GTGTGTAGCG GGTGGAGGTT CTAAGACAGG CTGGAGGAAC 2051 AGTGCAGACA CACACCAGGG TGTGTAGGGG GTAGAGGTTC TAAGACAGTC 2101 TGGAGGGAGA GTGCAGACAC ACACCGGGGT GTGTAAGCAG TGGAGGTTCT 2151 AAGACAGGCT GGAGGAACAG TGGAGACACA CACCAGGGCG TGTAGGGGGT 2201 AGAGGTTCTA AGACAGGCTG GAGGAACAGT GGAGACACAC ACCACGGCGT 2251 GTAGGGGGTA GAGGTTCTAA GACAGTCTGG TGGGAGAGTG CAGACACACC 2301 CGGGGCCGTG TAGGGGGTAC AGGTTCTAAG ACAGTCTGGT GGGAGAGTGC 2351 AGACACACAC GGGGCTGTGT CGGGGATAGA GGTTCTAAGA CAGTCTGGTG 2401 GGAGAGTGCA GACACACACC AGGGCGTGTA GGGGGTGGAG GTTCTAAGAC 2451 AGGCTGGAGG AACAGTGGAG ACACACACTG GGGAGCATAG GGGTGCTTTT 2501 CTCTGAGTCC CCTAGTACAT GGTAGAGGCT GTAGACCCTC CGCTCTTGGG 2551 CACGTGGGTA GGCTCTCAGG ATGACTCTTG GCTCTTGGGC ATGTGGGTGA 2601 GCTCTCAGGA TGATGCCCAG CCCCAATTTT CAGGCAATTG TGCAAGGACT 2651 TGACCCATTC ATCCGCTGAG CTGAGTTGCT GAGACTGCTG GGTGCCCGGG 2701 TGGTGATCAT TGTCCGTGGC ATACAGAACA CACTGCAGCT TCTGCAAAGT 2751 GAGCTCATTT CACGCATTTT ATGGCTTTGC CAGGCTGCTG TTGACCTGCC 2801 AGAACTTTTA ATCAGACATT TGGAGGACCT GTTTTGTAGT CAGTGGAGAA 2851 ATATTACAAG GATAGGGTAA TTTGAAATAT CTAAGGATTG TAAGTGACAA 2901 GTTCATGTCT AATTTTGCAT TTCCAGTGAA AGCAAGTGTT GGCTTTGAAT 2951 GTTACTTATG TGCTGAGATG TGTATATTCC TCAGTGCTTA ATTACTAAGG 3001 ATTTTTAGGG CCAAGTTTTG TTACAGTGAA TGATTGTGGA TGCATAAAGA 3051 ATAAATTTAA TATTTTTAAG GCATGGAGAT TATTTGTATC TAAGAAACCA 3101 GGTAAAATAA AGAAACATTT ATGCTTGTGT GACTGATAAA AGAGTTAGAG 3151 AGACACTCAT ATTCTGGGAG TTTGAAGAAT GTCATTTTCA TTCTCTAAAA 3201 GTCTTGTTAG TGTCACAGCA TTGAAAATTT AAAAATCCGT GTGTATTTTC 3251 TTGCTAGTGC TGGTACTTGA ATATCTGTAT CATCCACCTA TCCATCCACC 3301 TACCCATATT TCTATAATCC ACCGTCCCTC GACATGCCTA TCATCTGTCC 3351 ACCATTTCTC TCTGTCTAAT TTTCAAAACA TCCTGTAAGT TTATATAAAG 3401 GAAGATTTTT CTTCTTGTGA AGTTCTCTAA GGCTGACAAG TTACCTGGCA 3451 TGACTGTGGC GGATGCCCAT AGCCAGGTGG TCCTCGGGGT ACAGATGGGG 3501 CAGGGGCACT TGTGAGAAAC ACCTGAAGTG CTTTTCCCCA GCCTCCCCGG 3551 CCCTGCCGGG TGGTGGAGGC GCTGCACGGT GCCTTCCATG GAGCAAGCCC 3601 GGGGCTCCGC AGGGTCCTCA GCATGATTCA GATTTCCTTC CACCCCCAGC 3651 TCTAGATGAT TTGGTAAAAC CACAAACAGG CACAAAACAG CCCACATGGA 3701 ATTCTAAAGT TTTAATTTCA TTTTGGAATT TATGCACTCA GATGAAATGA 3751 TTTATGATGA TGTTGAGAAT GGGGATGAAG GTGGAAACAG CTCCTTGGAA 3801 TACGGATGGA GTTCGAGTGA ATTTGAAAGT TACGAAGAGC AGAGTGACTC 3851 GGAGTGCAAG AATGGGATTC CCAGGTCCTT CCTGCGCAGC AACCACAAAA 3901 AGCAACTTTC TCATGACCTA ACCCGTTTAA AGGAGCACTA TGAGAAAAAG 3951 ATGAGAGATT TGATGGCAAG CACGGTGGGC GTGGTGGAGA TTCAGCAGCT 4001 CAGGCAGAAG CATGAACTGA AGATGCAGAA GCTCGTGAAG GCCGCGAAGG 4051 ACGGCACCAA GGACGGGCTG GAGAGGACCA GGGCAGCCGT GAAGAGGGGC 4101 CGCTCCTTCA TCAGGACCAA GTCTCTCATC GCACAGGATC ACAGATCTTC 4151 TCTTGAGGAA GAACAGAATT TGTTCATTGA TGTTGACTGC AAGCACCCGG 4201 AAGCCATCTT GACCCCGATG CCCGAGGGTT TATCTCAGCA GCAGGTTGTA 4251 AGAAGATATA TACTGGGTTC AGTTGTCGAC AGTGAAAAGA ACTACGTAGA 4301 TGCTCTTAAG AGGATTTTGG AGCAATATGA GAAGCCGCTG TCTGAGATGG 4351 AGCCAAAGGT TCTGAGTGAG AGGAAGCTGA AGACGGTGTT CTACCGAGTC 4401 AAAGAGATCC TGCAGTGCCA CTCGCTATTT CAGATCGCGC TGGCCAGCCG 4451 CGTTTCCGAG TGGGACTCCG TGGAAATGAT AGGCGATGTC TTCGTGGCTT 4501 CGTTTTCTAA GTCCATGGTG CTGGATGCAT ACAGTGAATA TGTGAACAAT 4551 TTCAGCACAG CCGTGGCAGT CCTCAAGAAA ACATGTGCCA CAAAGCCCGC 4601 TTTTCTTGAA TTTTTAAAGC AGGAACAGGA GGCCAGCCCC GATCGAACCA 4651 CGCTCTACAG CCTGATGATG AAGCCCATCC AGAGGTTCCC ACAGTTCATC 4701 CTCCTGCTCC AGGACATGCT GAAGAACACC TCCAAAGGCC ACCCCGACAG 4751 GCTGCCTCTT CAGATGGCCC TGACAGAGCT CGAAACACTA GCAGAGAAGT 4801 TAAATGAAAG AAAGAGAGAT GCTGATCAAC GCTGTGAAGT GAAGCAAATA 4851 GCCAAAGCCA TAAACGAAAG ATACCTGAAC AAGCTTCTCA GCAGTGGAAG 4901 CCGATACCTC ATTCGATCAG ATGATATGAT AGAAACAGTT TACAACGACA 4951 GAGGAGAGAT TGTTAAAACC AAAGAACGCC GAGTCTTCAT GTTAAATGAT 5001 GTGTTAATGT GTGCCACCGT CAGCTCACGC CCCTCTCATG ACAGCCGTGT 5051 GATGAGCAGC CAGAGGTACT TGCTGAAGTG GAGCGTTCCA CTGGGACATG 5101 TGGACGCCAT CGAGTATGGC AGCAGCGCAG GCACGGGCGA GCACAGCAGG 5151 CACCTTGCCG TTCACCCGCC GGAGAGCCTG GCCGTGGTTG CTAACGCGAA 5201 ACCAAACAAA GTTTACATGG GGCCAGGACA ACTGTATCAA GATTTACAAA 5251 ACTTGTTGCA TGACTTAAAT GTAATTGGCC AAATCACTCA GCTGATAGGA 5301 AACCTTAAAG GAAACTATCA GAACTTAAAC CAGTCAGTAG CCCATGACTG 5351 GACATCAGGT TTACAAAGGC TTATTTTGAA GAAAGAAGAT GAAATCAGAG 5401 CTGCGGACTG CTGCAGAATT CAGTTACAGC TTCCCGGGAA GCAGGACAAA 5451 TCTGGGCGAC CGACGTTCTT TACAGCTGTG TTCAATACGT TCACCCCTGC 5501 CATCAAGGAG TCCTGGGTCA ACAGCTTACA GATGGCCAAG CTCGCCCTAG 5551 AAGAGGAGAA CCACATGGGC TGGTTCTGTG TGGAAGACGA TGGGAATCAC 5601 ATTAAAAAGG AGAAGCATCC TCTCCTCGTC GGACACATGC CCGTGATGGT 5651 GGCCAAGCAG CAGGAGTTCA AGATTGAATG TGCTGCTTAT AACCCTGAAC 5701 CTTACCTAAA TAATGAAAGC CAGCCAGATT CATTTTCCAC GGCACATGGT 5751 TTCCTGTGGA TCGGAAGTTG CACCCATCAA ATGGGTCAGA TTGCCATCGT 5801 CTCGTTTCAA AATTCCACTC CCAAAGTCAT TGAGTGCTTC AACGTGGAAT 5851 CTCGCATCCT GTGCATGCTG TACGTTCCCG TCGAGGAGAA GCGCAGAGAG 5901 CCTGGGGCAC CCCCGGACCC CGAGACCCCG GCCGTGAGAG CTTCTGATGT 5951 CCCCACGATC TGTGTAGGGA CGGAGGAGGG AAGCATTTCC ATTTATAAAA 6001 GCAGTCAAGG CTCCAAGAAA GTGAGACTTC AGCACTTTTT CACTCCTGAG 6051 AAGTCCACAG TCATGAGCCT GGCTTGCACG TCTCAGAGCC TGTACGCTGG 6101 CCTGGTCAAC GGGGCAGTCG CCAGCTACGC CAGAGCCCCA GATGGATCCT 6151 GGGATTCAGA ACCTCAAAAA GTGATCAAGT TAGGCGTCCT ACCAGTTAGA 6201 AGTCTACTCA TGATGGAAGA CACGTTGTGG GCGGCTTCCG GAGGTCAAGT 6251 CTTCATCATC AGTGTGGAGA CTCATGCTGT AGAGGGTCAG CTGGAGGCCC 6301 ACCAGGAGGA AGGCATGGTG ATCTCCCACA TGGCCGTGTC CGGCGTCGGG 6351 ATCTGGATTG CCTTCACCTC AGGGTCCACG CTCCGCCTTT TTCACACGGA 6401 AACTCTCAAG CACCTGCAGG ACATCAACAT CGCCACCCCT GTTCACAACA 6451 TGCTGCCAGG GCACCAGCGG CTGTCGGTGA CGAGCCTGCT CGTCTGCCAC 6501 GGATTGCTGA TGGTCGGCAC CAGCCTGGGA GTCCTCGTGG CCCTGCCGGT 6551 CCCACGTCTG CAAGGGATTC CCAAAGTGAC CGGAAGAGGC ATGGTCTCCT 6601 ACCATGCACA CAACAGTCCT GTCAAATTCA TCGTCCTGGC CACGGCTCTG 6651 CACGAGAAAG ACAAGGACAA ATCCAGGGAC AGCCTGGCTC CTGGCCCCGA 6701 GCCTCAGGAC GAAGACCAGA AGGACGCACT TCCGAGTGGA GGAGCTGGTT 6751 CATCTCTGAG CCAGGGTGAC CCTGACGCAG CCATCTGGTT GGGAGATTCG 6801 CTGGGATCGA TGACTCAGAA AAGCGACCTG TCCTCCTCAT CTGGGTCCCT 6851 GAGCTTGTCT CACGGCTCCA GCTCTCTAGA GCACAGATCA GAGGACAGCA 6901 CCATCTATGA TCTCCTGAAG GATCCTGTCT CGCTGAGAAG CAAAGCACGC 6951 CGGGCCAAGA AAGCCAAGGC CAGCTCGGCG CTGGTGGTCT GTGGAGGGCA 7001 GGGCCACCGC CGGGTGCACA GGAAGGCCCG GCAGCCCCAC CAGGAAGAGC 7051 TGGCGCCGAC CGTCATGGTC TGGCAGATCC CTCTGCTGAA TATATAAGCA 7101 GGACGGCCGC CTTCTGCTGT CAGAATTTGC AATCAAGGGT GACTTCTCAG 7151 CTAATCCTAC AGCCTGAGTG GTTAAGCTGT GTCTACACTG GTTGGGAATA 7201 AATTAAAAAC AGTATTTGGG GGAGAAACGT GCAATAGCGT AATGGTGGTG 7251 TCCCTGCCAA TTCCTTCCTT CTCTTCTGTA CAGCAGAAGT AATTACAAGC 7301 ACTTCTCACG AAGGCAGAAG ACTGATGCAA TTTTCGAGTA ATTGAGTGCA 7351 GTTCTGGGAA AATACCACAT TCTTTTTGAC TGCTGTAGTC CATATATGAA 7401 TACTAAATGT TAAACTTCAT CAGCGTCAGA CCTATTGTAT CATATTAGAG 7451 AATTTGCAGA CTAAGAATTT ATGAGAAAAT ATATGTATTC AGTAGTGCAG 7501 GCATTTATTA ACAATTCTTA AAAGTTTTAC CTGATTCAGA TTCACGACTT 7551 TTATTTATAT TCTATATTTT TGAATTTCAG AGTAAAATTT GTTAACAATT 7601 TTAAAAGCCA GGTAACACCT ACCAGTCCAG TTAGCATGAT TTGCTTTCAG 7651 AAGTGAGCTG GGTTTTCCAA AGTGGTATAA TGTGTGTACT GTATATTTTA 7701 ACAAAGTAAT ATTTTTGTAT TGCATTTTTC TATTAAAAAA TTAACAGTTA 7751 ATGTTTCAGT CAATGTATTA TCTGTAGCAT TTCACAAATA ATGTTTGCTT 7801 TGAACCAAAA TGCTCAGTGC CTATCAACAT TTGGACTCAA GCATCAACAC 7851 CAAATTATTC CTCCCTTCTC GTATAAATAG AGTGACTATC CACAGGAGAA 7901 AAGTGTGTGC TTTAGTATTA GAGGAGATAG GCAGAGAAGT CTTGCTTAGT 7951 TCCTTCGTGC AGCTTCTTGC CCCTGTTGAC GTGGAATGCT GTGTCTGCTT 8001 TAGCACGCAC GCTCCGAATG ACTCCTGGTG CTAGGCCATG CTGGCTGCTG 8051 TCACTGAGCG GGACTCAGGC CAAGAGGCGT GACCTCGGGC CAGCCTGTCT 8101 GTTGTGCAGA CGCCTCCTCT GCAGAACGCA TCAGTTTCTA TTCTGCAGTT 8151 GCAGAGCCAG CCCCGCGTGA GAACGTGCAT AATGAGTGCA CACCATCATG 8201 TCAAGGTGCA TACTTAGTGA GCGCCATCCT GCTGAACGTG TATTTCAGTG 8251 TTTCACTTAC TGGACGGATA ACAAGAAAAA AATCCTAACA CAGGCAGTCA 8301 CCAGAAATAA ATGTCTCAGC ACTTTACAGA TGACTAAAAA TGTTAATTTT 8351 ATGACTTAGC CAAATATGTT CTAGGTTGCA TATATCCCCC ATGTGAAAGT 8401 GATTTCTTCC CAAGCTTCTC AAACTGTTAG CTGCTGTCTG ACTTCATCAA 8451 TAAAGTATTT TTATTTT // LOCUS AB002303 6632 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0305 gene, complete cds. ACCESSION AB002303 NID g2224550 VERSION AB002303.1 GI:2224550 KEYWORDS KIAA0305. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0042. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6632) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .6632 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0042" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 249. .4868 /gene="KIAA0305" CDS 249. .4868 /gene="KIAA0305" /codon_start=1 /protein_id="BAA20764.1" /db_xref="PID:d1021601" /db_xref="PID:g2224551" /db_xref="GI:2224551" /translation="MDSYFKAAVSDLDKLLDDFEQNPDEQDYLQDVQNAYDSNHCSVS SELASSQRTSLLPKDQECVNSCASSETSYGTNESSLNEKTLKGLTSIQNEKNVTGLDL LSSVDGGTSDEIQPLYMGRCSKPICDLISDMGNLVHATNSEEDIKKLLPDDFKSNADS LIGLDLSSVSDTPCVSSTDHDSDTVREQQNDTSSELQNREIGGIKELGIKVDTTLSDS YNYSGTENLKDKKIFNQLESIVDFNMSSALTRQSSKMFHAKDKLQHKSQPCGLLKDVG LVKEEVDVAVITAAECLKEEGKTSALTCSLPKNEDLCLNDSNSRDENFKLPDFSFQED KTVIKQSAQEDSKSLDLKDNDVIQDSSSALHVSSKDVPSSLSCLPASGSMCGSLIESK ARGDFLPQHEHKDNIQDAVTIHEEIQNSVVLGGEPFKENDLLKQEKCKSILLQSLIEG MEDRKIDPDQTVIRAESLDGGDTSSTVVESQEGLSGTHVPESSDCCEGFINTFSSNDM DGQDLDYFNIDEGAKSGPLISDAELDAFLTEQYLQTTNIKSFEENVNDSKSQMNQIDM KGLDDGNINNIYFNAEAGAIGESHGINIICETVDKQNTIENGLSLGEKSTIPVQQGLP TSKSEITNQLSVSDINSQSVGGARPKQLFSLPSRTRSSKDLNKPDVPDTIESEPSTAD TVVPITCAIDSTADPQVSFNSNYIDIESNSEGGSSFVTANEDSVPENTCKEGLVLGQK QPTWVPDSEAPNCMNCQVKFTFTKRRHHCRACGKVFCGVCCNRKCKLQYLEKEARVCV VCYETISKAQAFERMMSPTGSNLKSNHSDECTTVQPPQENQTSSIPSPATLPVSALKQ PGVEGLCSKEQKRVWFADGILPNGEVADTTKLSSGSKRCSEDFSPLSPDVPMTVNTVD HSHSTTVEKPNNETGDITRNEIIQSPISQVPSVEKLSMNTGNEGLPTSGSFTLDDDVF AETEEPSSPTGVLVNSNLPIASISDYRLLCDINKYVCNKISLLPNDEDSLPPLLVASG EKGSVPVVEEHPSHEQIILLLEGEGFHPVTFVLNANLLVNVKFIFYSSDKYWYFSTNG LHGLGQAEIIILLLCLPNEDTIPKDIFRLFITIYKDALKGKYIENLDNITFTESFLSS KDHGGFLFITPTFQKLDDLSLPSNPFLCGILIQKLEIPWAKVFPMRLMLRLGAEYKAY PAPLTSIRGRKPLFGEIGHTIMNLLVDLRNYQYTLHNIDQLLIHMEMGKSCIKIPRKK YSDVMKVLNSSNEHVISIGASFSTEADSHLVCIQNDGIYETQANSATGHPRKVTGASF VVFNGALKTSSGFLAKSSIVEDGLMVQITPETMNGLRLALREQKDFKITCGKVDAVDL REYVDICWVDAEEKGNKGVISSVDGISLQGFPSEKIKLEADFETDEKIVKCTEVFYFL KDQDLSILSTSYQFAKEIAMACSAALCPHLKTLKSNGMNKIGLRVSIDTDMVEFQAGS EGQLLPQHYLNDLDSALIPVIHGGTSNSSLPLEIELVFFIIEHLF" BASE COUNT 2198 a 1124 c 1252 g 2058 t ORIGIN 1 ACTCCCGGCC GGGGTAGCTC TTCACTCCTC AGCGCGACGT CGTGTCGAGT 51 TCCCAAAAAG CTCCGCAGGG GCTGTAGGGA GGTGATCTCA TCCATTAACA 101 GCTGTGTGTT GCCAGTTCCC AAATCTTTAT CTATCTCAGA CTTCTCTCCT 151 GCATTCCAGA TTCTTATATT CAGCTGCCTT TTGGATATCT CTCCCAGGAT 201 GTTCTCAAGG CATACAAGAA TTAAATTCTG AATAAGTCTG CAGGTAGGAT 251 GGACAGTTAT TTTAAAGCAG CTGTCAGTGA CTTGGACAAA CTCCTTGATG 301 ATTTTGAACA GAACCCAGAT GAACAAGATT ATCTCCAAGA TGTACAAAAT 351 GCATATGATT CTAACCACTG CTCAGTTTCT TCAGAGTTGG CTTCCTCACA 401 GCGAACTTCA TTGCTCCCAA AAGACCAAGA GTGCGTTAAT AGTTGTGCCT 451 CATCAGAAAC AAGCTATGGA ACAAATGAGA GTTCCCTGAA TGAAAAAACA 501 CTCAAGGGAC TTACTTCTAT ACAAAATGAA AAAAATGTAA CAGGACTTGA 551 TCTTCTTTCT TCTGTGGATG GTGGTACTTC AGATGAAATC CAGCCGTTAT 601 ATATGGGACG ATGTAGTAAA CCTATCTGTG ATCTGATAAG TGACATGGGT 651 AACTTAGTTC ATGCAACCAA TAGTGAAGAA GATATTAAAA AATTATTGCC 701 AGATGATTTT AAGTCTAATG CAGATTCCTT GATTGGATTG GATTTATCTT 751 CAGTGTCAGA TACTCCCTGT GTTTCTTCAA CAGACCATGA TAGTGATACT 801 GTCAGAGAAC AACAGAATGA TACCAGTTCT GAATTACAAA ATAGAGAAAT 851 CGGAGGAATC AAAGAATTGG GTATAAAAGT AGATACAACA CTTTCAGATT 901 CCTATAATTA CAGTGGAACA GAAAATTTAA AAGATAAAAA GATCTTTAAT 951 CAGTTAGAAT CAATTGTTGA TTTTAACATG TCATCTGCTT TGACTCGACA 1001 AAGTTCCAAA ATGTTTCATG CCAAAGACAA GCTACAACAC AAGAGCCAGC 1051 CATGTGGATT ACTAAAAGAT GTTGGCTTAG TAAAAGAGGA AGTAGATGTG 1101 GCAGTCATAA CTGCCGCAGA ATGTTTAAAA GAAGAGGGCA AGACAAGTGC 1151 TTTGACCTGC AGCCTTCCGA AAAATGAAGA TTTATGCTTA AATGATTCAA 1201 ATTCAAGAGA TGAAAATTTC AAATTACCTG ACTTTTCCTT TCAGGAAGAT 1251 AAGACTGTTA TAAAACAATC TGCACAAGAA GACTCAAAAA GTTTAGACCT 1301 TAAGGATAAT GATGTAATCC AAGATTCCTC TTCAGCTTTA CATGTTTCCA 1351 GTAAAGATGT GCCGTCCTCA TTGTCCTGTC TTCCTGCGTC TGGGTCTATG 1401 TGTGGATCAT TAATTGAAAG TAAAGCACGG GGTGATTTTT TACCTCAGCA 1451 TGAACATAAA GATAATATAC AAGATGCAGT GACTATACAT GAAGAAATAC 1501 AGAACAGTGT TGTTCTAGGT GGGGAACCAT TCAAAGAGAA TGATCTTTTG 1551 AAACAGGAAA AATGTAAAAG CATACTCCTT CAGTCATTAA TTGAAGGGAT 1601 GGAAGACAGA AAGATAGATC CTGACCAGAC AGTAATCAGA GCTGAGTCTT 1651 TGGATGGTGG TGACACCAGT TCTACAGTTG TAGAATCTCA AGAGGGGCTT 1701 TCTGGCACTC ATGTCCCAGA GTCTTCTGAT TGTTGTGAAG GTTTTATTAA 1751 TACTTTTTCA AGCAATGATA TGGATGGGCA AGACTTAGAT TACTTTAATA 1801 TTGATGAAGG CGCAAAAAGT GGCCCACTAA TTAGTGATGC TGAACTTGAT 1851 GCCTTTCTGA CAGAACAGTA TCTTCAGACC ACTAACATAA AGTCTTTTGA 1901 AGAAAATGTA AATGACTCTA AATCGCAAAT GAATCAGATA GATATGAAAG 1951 GCTTAGATGA TGGAAACATC AATAATATAT ATTTCAATGC AGAAGCAGGA 2001 GCTATTGGGG AAAGTCATGG TATTAATATA ATTTGTGAAA CAGTTGATAA 2051 ACAAAATACA ATAGAAAATG GCCTTTCTTT AGGAGAAAAA AGCACTATTC 2101 CAGTTCAACA AGGGTTACCT ACCAGTAAGT CTGAGATTAC AAATCAATTA 2151 TCAGTCTCTG ATATTAACAG TCAATCTGTT GGAGGGGCCA GACCTAAGCA 2201 ATTGTTTAGC CTTCCATCAA GAACAAGGAG TTCAAAGGAC CTGAATAAGC 2251 CAGATGTTCC AGATACAATA GAAAGTGAAC CCAGCACAGC AGATACCGTT 2301 GTTCCAATCA CTTGTGCTAT AGATTCTACA GCTGATCCAC AGGTTAGCTT 2351 CAACTCTAAT TACATTGATA TAGAAAGTAA TTCTGAAGGT GGATCTAGTT 2401 TCGTAACTGC AAATGAAGAT TCTGTACCTG AAAACACTTG CAAAGAAGGC 2451 TTGGTTTTGG GCCAGAAACA GCCTACTTGG GTTCCTGATT CAGAAGCTCC 2501 AAACTGTATG AACTGCCAAG TCAAATTTAC TTTTACCAAA CGGCGACACC 2551 ATTGCCGAGC ATGTGGGAAA GTATTTTGTG GTGTCTGTTG TAATAGGAAG 2601 TGTAAACTGC AATATCTAGA AAAGGAAGCA AGAGTATGTG TAGTCTGCTA 2651 TGAAACTATT AGTAAAGCTC AGGCATTTGA AAGGATGATG AGTCCAACTG 2701 GTTCTAATCT TAAGTCTAAT CATTCTGATG AATGTACTAC TGTCCAGCCT 2751 CCTCAGGAGA ACCAAACATC CAGTATACCT TCACCAGCAA CTTTGCCAGT 2801 CTCAGCACTT AAACAACCAG GTGTTGAAGG ACTATGTTCC AAAGAACAGA 2851 AGAGAGTATG GTTTGCAGAT GGTATATTGC CCAATGGTGA AGTTGCAGAT 2901 ACAACAAAAT TATCATCTGG AAGTAAAAGA TGTTCTGAAG ACTTTAGTCC 2951 TCTCTCACCT GATGTGCCTA TGACAGTAAA CACAGTGGAT CATTCCCATT 3001 CTACTACAGT GGAAAAGCCA AACAATGAGA CAGGAGATAT TACAAGAAAT 3051 GAGATAATTC AGAGTCCTAT TTCTCAGGTT CCATCAGTGG AAAAATTGTC 3101 TATGAACACA GGAAATGAGG GGTTACCTAC TTCTGGTTCA TTTACACTAG 3151 ATGATGATGT TTTTGCAGAA ACTGAAGAAC CATCTAGTCC TACTGGTGTC 3201 TTAGTTAACA GCAATTTACC TATTGCTAGT ATTTCAGATT ATAGGTTACT 3251 GTGTGATATT AACAAGTATG TCTGCAATAA GATTAGTCTT CTACCTAATG 3301 ATGAGGACAG TTTGCCCCCA CTTCTGGTTG CATCTGGAGA AAAGGGATCA 3351 GTGCCTGTAG TAGAAGAACA TCCATCTCAT GAGCAGATCA TTTTGCTTCT 3401 TGAAGGTGAA GGCTTTCATC CTGTTACATT TGTCCTAAAT GCTAATCTAC 3451 TCGTGAATGT CAAATTCATA TTTTATTCCT CAGACAAATA TTGGTACTTT 3501 TCAACCAATG GATTGCATGG CTTGGGACAG GCAGAAATTA TTATTCTATT 3551 GTTATGTTTG CCAAATGAAG ATACTATTCC TAAGGACATC TTCAGACTAT 3601 TTATCACCAT ATATAAGGAT GCTCTAAAAG GAAAATACAT AGAAAACTTG 3651 GACAATATTA CCTTTACTGA GAGTTTTCTC AGTAGCAAGG ATCACGGAGG 3701 ATTCCTGTTT ATTACACCTA CTTTTCAGAA ACTTGATGAT CTCTCATTAC 3751 CAAGTAATCC TTTTCTTTGT GGAATTCTTA TCCAGAAGCT TGAGATTCCC 3801 TGGGCAAAGG TTTTTCCTAT GCGTTTAATG TTGAGATTGG GTGCAGAATA 3851 TAAAGCATAT CCTGCTCCTC TAACAAGCAT CAGAGGCCGA AAACCTCTTT 3901 TTGGAGAAAT AGGACACACT ATTATGAACT TACTTGTTGA CCTTCGAAAT 3951 TACCAGTATA CCTTGCATAA TATAGATCAA CTGTTGATTC ATATGGAAAT 4001 GGGAAAAAGC TGCATAAAAA TACCACGGAA AAAGTACAGT GATGTAATGA 4051 AAGTACTAAA TTCTTCCAAT GAGCATGTCA TTAGCATTGG AGCAAGTTTC 4101 AGTACAGAAG CAGATTCTCA TCTAGTCTGT ATACAGAATG ATGGAATTTA 4151 TGAAACACAG GCCAACAGTG CCACTGGCCA TCCTAGAAAA GTGACAGGTG 4201 CAAGTTTTGT GGTATTCAAT GGAGCTCTAA AAACATCTTC AGGATTTCTT 4251 GCTAAGTCCA GCATAGTTGA AGATGGCTTA ATGGTACAAA TAACTCCAGA 4301 GACCATGAAT GGCTTGCGGC TAGCTTTACG AGAACAGAAA GACTTTAAAA 4351 TTACATGTGG GAAAGTTGAT GCAGTAGACC TGAGAGAATA CGTGGATATC 4401 TGCTGGGTAG ATGCTGAAGA AAAAGGAAAC AAAGGAGTTA TCAGTTCAGT 4451 GGATGGAATA TCATTACAAG GATTTCCAAG TGAAAAAATA AAACTGGAAG 4501 CAGATTTTGA AACCGATGAG AAGATTGTAA AATGTACCGA GGTGTTCTAC 4551 TTTCTAAAGG ACCAGGATTT ATCTATTTTA TCAACTTCTT ATCAGTTTGC 4601 AAAAGAAATA GCCATGGCTT GTAGTGCTGC GCTGTGCCCT CACCTGAAAA 4651 CTCTAAAAAG TAATGGGATG AATAAAATTG GACTCAGAGT TTCCATTGAC 4701 ACTGATATGG TTGAATTTCA GGCAGGATCT GAAGGCCAAC TTCTGCCTCA 4751 GCATTATCTA AATGATCTTG ATAGTGCTCT GATACCTGTG ATCCATGGTG 4801 GGACCTCCAA CTCTAGTTTA CCATTAGAAA TAGAATTAGT GTTTTTCATT 4851 ATAGAACATC TTTTTTAGTG AAAGAATGTG CCATATTACA TATTGCAACC 4901 TAATTTGTTA AAACTAACTC CAGCACTAAA GCTGAAATGC CACAAACACT 4951 AAAAGTATAA ATATGTCTGA TTTTTGAAAC ACATAAGCTT TGCTCTTTAG 5001 GCAGGAATGA TCTTTTCAAA TCATTAGCAC AATATTTAAA TATCTAAAAA 5051 TTTAAGAGAT CCATACTTTC TGTAGCTTTA CAATTAATTT AAGTACTAAA 5101 AAGACAAGGA TTTCTTTTAA GAAATTTATA GCATTTACTG TGTTATTTAA 5151 ATGCTAAGCC AAAGTATCTG CACTTAGGTA TACCTCTTTA TGCCAATAAT 5201 GATTTTAATG AAGGCTCTTT TCAGATGTAA CCTTATGAAG GAAATATCTG 5251 CTTTGTGTAT ATGCCAGTTA GAATACTGGT TTCTAAAGTC TGTCAAATTG 5301 TATTTCAGTG GCACAAAAAC CAGTTTTGAG GTCTTAGACT TATAATTCTT 5351 TGAATAAAAC TGATAACTTA TTTGTATAAT TGGAGTGGAG ACCTACCTCC 5401 ATAATTAGAT AAACTCTTTT TGGATTATAA TCAGAATTTT GCCTTTTTTC 5451 TTCTCAAATT ATTACATATG TATGTATTAT ATATCCACAT ATATAGTTTT 5501 CCCTGATTAA ATGGATATTA AAATAATTGC GGGTGCTTCA GGACTTTTTG 5551 CTTCTATATT TAAGTATATT GTTTTTATAG CAAGAACATA TTCTGAATGT 5601 TTTATAAATC TTTAATAATT TATATGTAGG TAATATTTTT GTATCACAAT 5651 GCATTATTTT TTTTCCTCCT TTCCTTCCAA ACTATACCAC TGTATTTACC 5701 ACTTCTAAGA GTGACTGACG ACGGGCCAGA TGACCCTTGA AGTAGTCATT 5751 ATGTAGCAAT AAATGAAGCC TGAAACAGGT TTTTTTACTT CCACTTTAAT 5801 CCTTAGAAAT TTCTTGGCAA CTTCGCATAT TTTCATTGAC ACTGGTGTAT 5851 AAGTATAAAT TTAAATGAAC TAATTACTTT TGCATATTTT AAATTCTTTA 5901 TATGGTAGTT ATTTTTTATA ACAGGATATT AACATAAGTT AAATCCTATG 5951 TATTTGAAAT TGTTACAGAG CTTTCCTCTT TACTTCAAAC AGCAAAAAAG 6001 TGGGGGGCAT ATTGTAGTCC TGTCATTTAA GTTATGTAAA AAATTTAATC 6051 ATTATTTTGA TGCTTTAAAC ATTCTCATGT GTAATATATG TTTTTGTATC 6101 AAAAACACTC ATATATTTCA AGAAAAAGAA ATTATGTTAA ATAGCCCTGT 6151 TTTAAGAAAA ATATTTATGA AGCATCTCAA CTTGAAGATC AAGTCAAAGT 6201 TATAACTCAG GATCTGAGGT CTCAAGCTAG GAGAGACTGA GAATTTTAAT 6251 CAGTTTGGGC ATATAGTTTG GACTGAATCA CATCTGTAGT ACTTAGCCAA 6301 AGACAATTTG GAGGAGAATA TCAGCCTTCT GGAAGTAGCT ACTTCCTGAA 6351 CAATGTAAAG TGTCGCAGAT ATTCAATAAA ATGGCAACCT GTTATAATTT 6401 GTGAAATTTA TTGAAATGGT GTAAGATGAA AACAATTGCA TATCAAACCC 6451 AATTTATGTT TTCTAAATAT AGTGTATGTA TTCTGCCATG TAAGTAATTG 6501 AACAGTCTTA AAATAACCAA ATGGTAGAGG GCTGTTCCAT GATGGGACAG 6551 CTTTGGATTT GTTTTCATAA AATCTCTACA TTCAATAAAA ATTGGAATTA 6601 TGTGCCTGAA GTTTGGAGGC ACATTTTGAA GT // LOCUS AB002371 5967 bp mRNA PRI 13-FEB-1999 DEFINITION Human mRNA for KIAA0373 gene, complete cds. ACCESSION AB002371 NID g2224686 VERSION AB002371.1 GI:2224686 KEYWORDS KIAA0373. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0281. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5967) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1. .5967 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0281" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1181. .5800 /gene="KIAA0373" CDS 1181. .5800 /gene="KIAA0373" /codon_start=1 /protein_id="BAA20828.1" /db_xref="PID:d1021669" /db_xref="PID:g2224687" /db_xref="GI:2224687" /translation="MAIFKIAALQKVVDNSVSLSELELANKQYNELTAKYRDILQKDN MLVQRTSNLEHLECENISLKEQVESINKELEITKEKLHTIEQAWEQETKLGNESSMDK AKKSITNSDIVSISKKITMLEMKELNERQRAEHCQKMYEHLRTSLKQMEERNFELETK FAELTKINLDAQKVEQMLRDELADSVSKAVSDADRQRILELEKNEMELKVEVSKLREI SDIARRQVEILNAQQQSRDKEVESLRMQLLDYQAQSDEKSLIAKLHQHNVSLQLSEAT ALGKLESITSKLQKMEAYNLRLEQKLDEKEQALYYARLEGRNRAKHLRQTIQSLRRQF SGALPLAQQEKFSKTMIQLQNDKLKIMQEMKNSQQEHRNMENKTLEMELKLKGLEELI STLKDTKGAQKVINWHMKIEELRLQELKLNRELVKDKEEIKYLNNIISEYERTISSLE EEIVQQNKFHEERQMAWDQREVDLERQLDIFDRQQNEILNAAQKFEEATGSIPDPSLP LPNQLEIALRKIKENIRIILETRATCKSLEEKLKEKESALRLAEQNILSRDKVINELR LRLPATAEREKLIAELGRKEMEPKSHHTLKIAHQTIANMQARLNQKEEVLKKYQRLLE KAREEQREIVKKHEEDLHILHHRLELQADSSLNKFKQTAWDLMKQSPTPVPTNKHFIR LAEMEQTVAEQDDSLSSLLVKLKKVSQDLERQREITELKVKEFENIKLQLQENHEDEV KKVKAEVEDLKYLLDQSQKESQCLKSELQAQKEANSRAPTTTMRNLVERLKSQLALKE KQQKALSRALLELRAEMTAAAEERIISATSQKEAHLNVQQIVDRHTRELKTQVEDLNE NLLKLKEALKTSKNRENSLTDNLNDLNNELQKKQKAYNKILREKEEIDQENDELKRQI KRLTSGLQGKPLTDNKQSLIEELQRKVKKLENQLEGKVEEVDLKPMKEKNAKEELIRW EEGKKWQAKIEGIRNKLKEKEGEVFTLTKQLNTLKDLFAKADKEKLTLQRKLKTTGMT VDQVLGIRALESEKELEELKKRNLDLENDILYMRAHQALPRDSVVEDLHLQNRYLQEK LHALEKQFSKDTYSKPSISGIESDDHCQREQELQKENLKLSSENIELKFQLEQANKDL PRLKNQVRDLKEMCEFLKKEKAEVQRKLGHVRGSGRSGKTIPELEKTIGLMKKVVEKV QRENEQLKKASGILTSEKMANIEQENEKLKAELEKLKAHLGHQLSMHYESKTKGTEKI IAENERLRKELKKETDAAEKLRIAKNNLEILNEKMTVQLEETGKRLQFAESRGPQLEG ADSKSWKSIVVTRMYETKLKELETDIAKKNQSITDLKQLVKEATEREQKVNKYNEDLE QQIKILKHVPEGAETEQGLKRELQVLRLANHQLDKEKAELIHQIEANKDQSGAESTIP DADQLKEKIKDLETQLKMSDLEKQHLKEEIKKLKKELENFDPSFFEEIEDLKYNYKEE VKKNILLEEKVKKLSEQLGVELTSPVAASEEFEDEEESPVNFPIY" BASE COUNT 2370 a 903 c 1148 g 1546 t ORIGIN 1 AAGCTTAATA CTGAGCATCA AGAAATTCTT TAATAAATAT AAGTGATATT 51 TATTAAGACG TGTAATAAGG AAATGTTCAT GTCTTATTTT TGTGTTAGAT 101 TTTTTTAGAA TCTACTTTTG TTAGAGTTTT ATAAATACAG TTAGTGTTTG 151 AGATAGAAAG AGAAAAGAAT TAGTTTTCTT CCTCTTCTAC CTGCTCATGA 201 ACTTGATTTT TTTCTCCCAA CAATTGAAGA GCCAAGAAAA AGGGAGATTC 251 TTAAGAGATG GGAAATAGAA TCTCATCTAC CCCTGTTTCC CTCAGAACAG 301 TGAAACTGAA TCTTAAGGGT AAGATAGAAT AGTGTGTACT TAACTTAGAT 351 GGAGAAGAAA GGCTGCCAAA ATGAGATCTG AAGCGCTATT ACAAATATTT 401 CCATCATTAC TGTACTTCAG AATGAATTAC AACCGTAAGT TTTTTTACTT 451 CCTCATTCAT AAATTTGATT ATTCCTTATA CCACTTCTCA GCTTTCATCA 501 TTCTTTATTG TACTTTTCTA TGTAATGTTT GCCTATTATA CAGCAACTTA 551 AGAGAACTGT AAGTTTGGAC ATTTCATTTT GGTGTTGATA ATAGAATATC 601 TTTGAATAGT TCTATAGTTG ATGAGTAGAA CCATGAACCA AGTAACTTAA 651 AGTCCTTGAT GTTATTTATT ACAGAGAACT ATAATAGAAG CTCTCCCGCT 701 AATGTTTCCA TCATGTGTAC AAAAAGTTTT CTTGTTATTA AAGCCAGTCC 751 GTTTAACTTA CAATAAGCAT AAATAGCTAA GCTGTGAAAG TTACCTGTGA 801 TAATGCTAAT TTTCCCATTT ATTAAAAGGC AAGTTGTTTT CCGATCATAA 851 GAAATTTAGA AAAGCCATCC AAAGATAAAT TCCGAGTGAT ATATTCCTGC 901 TGTTTGTTAT GTTTTCTCAA ATTAATTGAG TTTTATTTTA CAATGACAGG 951 AGTTATTAAA GTATTTTATT TTTATTATGA TTAAGATTTT CAAAGTAACA 1001 TTTCTTATAT GAAAGAAATT ATGTTAATGC ATGTTTTTCT TACATGGGAA 1051 ATCATATATT TTAAAAATGA TTTTAAAATT CGTTTTACTT TAAGTTGTAT 1101 TATCTTTCTC AAAAGTGGCT AGTGCTTCAC CAGAAAAAAA GACACCAGCA 1151 TAACTCAGTG TATCTTTATT TACATAGGAA ATGGCCATTT TCAAGATTGC 1201 AGCTCTCCAA AAAGTTGTAG ATAATAGTGT TTCTTTGTCT GAACTAGAAC 1251 TGGCTAATAA ACAGTACAAT GAACTGACTG CTAAGTACAG GGACATCTTG 1301 CAAAAAGATA ATATGCTTGT TCAAAGAACA AGTAACTTGG AACACCTGGA 1351 GTGTGAAAAC ATCTCCTTAA AAGAACAAGT GGAGTCTATA AATAAAGAAC 1401 TGGAGATTAC CAAGGAAAAA CTTCACACTA TTGAACAAGC CTGGGAACAG 1451 GAAACTAAAT TAGGTAATGA ATCTAGCATG GATAAGGCAA AGAAATCAAT 1501 AACCAACAGT GACATTGTTT CCATTTCAAA AAAAATAACT ATGCTGGAAA 1551 TGAAGGAATT AAATGAAAGG CAGCGGGCTG AACATTGTCA AAAAATGTAT 1601 GAACACTTAC GGACTTCGTT AAAGCAAATG GAGGAACGTA ATTTTGAATT 1651 GGAAACCAAA TTTGCTGAGC TTACCAAAAT CAATTTGGAT GCACAGAAGG 1701 TGGAACAGAT GTTAAGAGAT GAATTAGCTG ATAGTGTGAG CAAGGCAGTA 1751 AGTGATGCTG ATAGGCAACG GATTCTAGAA TTAGAGAAGA ATGAAATGGA 1801 ACTAAAAGTT GAAGTGTCAA AACTGAGAGA GATTTCTGAT ATTGCCAGAA 1851 GACAAGTTGA AATTTTGAAT GCACAACAAC AATCTAGGGA CAAGGAAGTA 1901 GAGTCCCTCA GAATGCAACT GCTAGACTAT CAGGCACAGT CTGATGAAAA 1951 GTCGCTCATT GCCAAGTTGC ACCAACATAA TGTCTCTCTT CAACTGAGTG 2001 AGGCTACTGC TCTTGGTAAG TTGGAGTCAA TTACATCTAA ACTGCAGAAG 2051 ATGGAGGCCT ACAACTTGCG CTTAGAGCAG AAACTTGATG AAAAAGAACA 2101 GGCTCTCTAT TATGCTCGTT TGGAGGGAAG AAACAGAGCA AAACATCTGC 2151 GCCAAACAAT TCAGTCTCTA CGACGACAGT TTAGTGGAGC TTTACCCTTG 2201 GCACAACAGG AAAAGTTCTC CAAAACAATG ATTCAACTAC AAAATGACAA 2251 ACTTAAGATA ATGCAAGAAA TGAAAAATTC TCAACAAGAA CATAGAAATA 2301 TGGAGAACAA AACATTGGAG ATGGAATTAA AATTAAAGGG CCTGGAAGAG 2351 TTAATAAGCA CTTTAAAGGA TACCAAAGGA GCCCAAAAGG TAATCAACTG 2401 GCATATGAAA ATAGAAGAAC TTCGTCTTCA AGAACTTAAA CTAAATCGGG 2451 AATTAGTCAA GGATAAAGAA GAAATAAAAT ATTTGAATAA CATAATTTCT 2501 GAATATGAAC GTACAATCAG CAGTCTTGAA GAAGAAATTG TGCAACAGAA 2551 CAAGTTTCAT GAAGAAAGAC AAATGGCCTG GGATCAAAGA GAAGTTGACC 2601 TGGAACGCCA ACTAGACATT TTTGACCGTC AGCAAAATGA AATACTAAAT 2651 GCGGCACAAA AGTTTGAAGA AGCTACAGGA TCAATCCCTG ACCCTAGTTT 2701 GCCCCTTCCA AATCAACTTG AGATCGCTCT AAGGAAAATT AAGGAGAACA 2751 TTCGAATAAT TCTAGAAACA CGGGCAACTT GCAAATCACT AGAAGAGAAA 2801 CTAAAAGAGA AAGAATCTGC TTTAAGGTTA GCAGAACAAA ATATACTGTC 2851 AAGAGACAAA GTAATCAATG AACTGAGGCT TCGATTGCCT GCCACTGCAG 2901 AAAGAGAAAA GCTCATAGCT GAGCTAGGCA GAAAAGAGAT GGAACCAAAA 2951 TCTCACCACA CATTGAAAAT TGCTCATCAA ACCATTGCAA ACATGCAAGC 3001 AAGGTTAAAT CAAAAAGAAG AAGTATTAAA GAAGTATCAA CGTCTTCTAG 3051 AAAAAGCCAG AGAGGAGCAA AGAGAAATTG TGAAGAAACA TGAGGAAGAC 3101 CTTCATATTC TTCATCACAG ATTAGAACTA CAGGCTGATA GTTCACTAAA 3151 TAAATTCAAA CAAACGGCTT GGGATTTAAT GAAACAGTCT CCCACTCCAG 3201 TTCCTACCAA CAAGCATTTT ATTCGTCTGG CTGAGATGGA ACAGACAGTA 3251 GCAGAACAAG ATGACTCTCT TTCCTCACTC TTGGTCAAAC TAAAGAAAGT 3301 ATCACAAGAT TTGGAGAGAC AAAGAGAAAT CACTGAATTA AAAGTAAAAG 3351 AATTTGAAAA TATCAAATTA CAGCTTCAAG AAAACCATGA AGATGAAGTG 3401 AAAAAAGTAA AAGCGGAAGT AGAGGATTTA AAGTATCTTC TGGACCAGTC 3451 ACAAAAGGAG TCACAGTGTT TAAAATCTGA ACTTCAGGCT CAAAAAGAAG 3501 CAAATTCAAG AGCTCCAACA ACTACAATGA GAAATCTAGT AGAACGGCTA 3551 AAGAGCCAAT TAGCCTTGAA GGAGAAACAA CAGAAAGCAC TTAGTCGGGC 3601 ACTTTTAGAA CTCCGGGCAG AAATGACAGC AGCTGCTGAA GAACGTATTA 3651 TTTCTGCAAC TTCTCAAAAA GAGGCCCATC TCAATGTTCA ACAAATCGTT 3701 GATCGACATA CTAGAGAGCT AAAGACACAA GTTGAAGATT TAAATGAAAA 3751 TCTTTTAAAA TTGAAAGAAG CACTTAAAAC AAGTAAAAAC AGAGAAAACT 3801 CACTAACTGA TAATTTGAAT GACTTAAATA ATGAACTGCA AAAGAAACAA 3851 AAAGCCTATA ATAAAATACT TAGAGAGAAA GAGGAAATTG ATCAAGAGAA 3901 TGATGAACTG AAAAGGCAAA TTAAAAGACT AACCAGTGGA TTACAGGGCA 3951 AACCCCTGAC AGATAATAAA CAAAGTCTAA TTGAAGAACT CCAAAGGAAA 4001 GTTAAAAAAC TAGAGAACCA ATTAGAGGGA AAGGTGGAGG AAGTAGACCT 4051 AAAACCTATG AAAGAAAAGA ATGCTAAAGA AGAATTAATT AGGTGGGAAG 4101 AAGGTAAAAA GTGGCAAGCC AAAATAGAAG GAATTCGAAA CAAGTTAAAA 4151 GAGAAAGAGG GGGAAGTCTT TACTTTAACA AAGCAGTTGA ATACTTTGAA 4201 GGATCTTTTT GCCAAAGCCG ATAAAGAGAA ACTTACTTTG CAGAGGAAAC 4251 TAAAAACAAC TGGCATGACT GTTGATCAGG TTTTGGGAAT ACGAGCTTTG 4301 GAGTCAGAAA AAGAATTGGA AGAATTAAAA AAGAGAAATC TTGACTTAGA 4351 AAATGATATA TTGTATATGA GGGCCCACCA AGCTCTTCCT CGAGATTCTG 4401 TTGTAGAAGA TTTACATTTA CAAAATAGAT ACCTCCAAGA AAAACTTCAT 4451 GCTTTAGAAA AACAGTTTTC AAAGGATACA TATTCTAAGC CTTCAATTTC 4501 AGGAATAGAG TCAGATGATC ATTGTCAGAG AGAACAGGAG CTTCAGAAGG 4551 AAAACTTGAA GTTGTCATCT GAAAATATTG AACTGAAATT TCAGCTTGAA 4601 CAAGCAAATA AAGATTTGCC AAGATTAAAG AATCAAGTCA GAGATTTGAA 4651 GGAAATGTGT GAATTTCTTA AGAAAGAAAA AGCAGAAGTT CAGCGGAAAC 4701 TTGGCCATGT TAGAGGGTCT GGTAGAAGTG GAAAGACAAT CCCAGAACTG 4751 GAAAAAACCA TTGGTTTAAT GAAAAAAGTA GTTGAAAAAG TCCAGAGAGA 4801 AAATGAACAG TTGAAAAAAG CATCAGGAAT ATTGACTAGT GAAAAAATGG 4851 CTAATATTGA GCAGGAAAAT GAAAAATTGA AGGCTGAATT AGAAAAACTT 4901 AAAGCTCATC TTGGGCATCA GTTGAGCATG CACTATGAAT CCAAGACCAA 4951 AGGCACAGAA AAAATTATTG CTGAAAATGA AAGGCTTCGT AAAGAACTTA 5001 AAAAAGAAAC TGATGCTGCA GAGAAATTAC GGATAGCAAA GAATAATTTA 5051 GAGATATTAA ATGAGAAGAT GACAGTTCAA CTAGAAGAGA CTGGTAAGAG 5101 ATTGCAGTTT GCAGAAAGCA GAGGTCCACA GCTTGAAGGT GCTGACAGTA 5151 AGAGCTGGAA ATCCATTGTG GTTACAAGAA TGTATGAAAC CAAGTTAAAA 5201 GAATTGGAAA CTGATATTGC CAAAAAAAAT CAAAGCATTA CTGACCTTAA 5251 ACAGCTTGTA AAAGAAGCAA CAGAGAGAGA ACAAAAAGTT AACAAATACA 5301 ATGAAGACCT TGAACAACAG ATTAAGATTC TTAAACATGT TCCTGAAGGT 5351 GCTGAGACAG AGCAAGGCCT TAAACGGGAG CTTCAAGTTC TTAGATTAGC 5401 TAATCATCAG CTGGATAAAG AGAAAGCAGA ATTAATCCAT CAGATAGAAG 5451 CTAACAAGGA CCAAAGTGGA GCTGAAAGCA CCATACCTGA TGCTGATCAA 5501 CTAAAGGAAA AAATAAAAGA TCTAGAGACA CAGCTCAAAA TGTCAGATCT 5551 AGAAAAGCAG CATTTGAAGG AGGAAATAAA GAAGCTGAAA AAAGAACTGG 5601 AAAATTTTGA TCCTTCATTT TTTGAAGAAA TTGAAGATCT TAAGTATAAT 5651 TACAAGGAAG AAGTGAAGAA GAATATTCTC TTAGAAGAGA AGGTAAAAAA 5701 ACTTTCAGAA CAATTGGGAG TTGAATTAAC TAGCCCTGTT GCTGCTTCTG 5751 AAGAGTTTGA AGATGAAGAA GAAAGTCCTG TTAATTTCCC CATTTACTAA 5801 AGGTCACCTA TAAACTTTGT TTCATTTAAC TATTTATTAA CTTTATAAGT 5851 TAAATATACT TGGAAATAAG CAGTTCTCCG AACTGTAGTA TTTCCTTCTC 5901 ACTACCTTGT ACCTTTATAC TTAGATTGGA ATTCTTAATA AATAAAATTA 5951 TATGAAATTT TCAACTT // LOCUS AB002375 5544 bp mRNA PRI 24-JUL-1997 DEFINITION Human mRNA for KIAA0377 gene, complete cds. ACCESSION AB002375 NID g2280486 VERSION AB002375.1 GI:2280486 KEYWORDS KIAA0377. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0412. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5544) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 COMMENT On Jul 25, 1997 this sequence version replaced gi:2224694. Sequence updated (22-Jul-1997). FEATURES Location/Qualifiers source 1. .5544 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0412" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 127. .4347 /gene="KIAA0377" CDS 127. .4347 /gene="KIAA0377" /codon_start=1 /protein_id="BAA20831.1" /db_xref="PID:d1021673" /db_xref="PID:g2224695" /db_xref="GI:2224695" /translation="MWSLTASEGESTTAHFFLGAGDEGLGTRGIGMRPEESDSELLED EEDEVPPEPQIIVGICAMTKKSKSKPMTQILERLCRFDYLTVVILGEDVILNEPVENW PSCHCLISFHSKGFPLDKAVAYSKLRNPFLINDLAMQYYIQDRREVYRILQEEGIDLP RYAVLNRDPARPEECNLIEGEDQVEVNGAVFPKPFVEKPVSAEDHNVYIYYPSSAGGG SQRLFRKIGSRSSVYSPESSVRKTGSYIYEEFMPTDGTDVKVYTVGPDYAHAEARKSP ALDGKVERDSEGKEIRYPVMLTAMEKLVARKVCVAFKQTVCGFDLLRANGHSFVCDVN GFSFVKNSMKYYDDCAKILGNTIMRELAPQFQIPWSIPTEAEDIPIVPTTSGTMMELR CVIAIIRHGDRTPKQKMKMEVKHPRFFALFEKHGGYKTGKLKLKRPEQLQEVLDITRL LLAELEKEPGGEIEEKTGKLEQLKSVLEMYGHFSGINRKVQLTYYPHGVKASNEGQDP QRETLAPSLLLVLKWGGELTPAGRVQAEELGRAFRCMYPGGQGDYAGFPGCGLLRLHS TFRHDLKIYASDEGRVQMTAAAFAKGLLALEGELTPILVQMVKSANMNGLLDSDGDSL SSCQHRVKARLHHILQQDAPFGPEDYDQLAPTRSTSLLNSMTIIQNPVKVCDQVFALI ENLTHQIRERMQDPRSVDLQLYHSETLELMLQRWSKLERDFRQKSGRYDISKIPDIYD CVKYDVQHNGSLGLQGTAELLRLSKALADVVIPQEYGISREEKLEIAVGFCLPLLRKI LLDLQRTHEDESVNKLHPLYSRGVLSPGRHVRTRLYFTSESHVHSLLSVFRYGGLLDE TQDAQWQRALDYLSAISELNYMTQIVIMLYEDNTQDPLSEERFHVELHFSPGVKGVEE EGSAPAGCGFRPASSENEEMKTNQGSMENLCPGKASDEPDRALQTSPQPPEGPGLPRR SPLIRNRKAGSMEVLSETSSSRPGGYRLFSSSRPPTEMKQSGLGFEGCSMVPTIYPLE TLHNALSLRQVSEFLSRVCQRHTDAQAQASAALFDSMHSSQASDNPFSPPRTLHSPPL QLQQRSEKPPWLETRFCHVGQAGLELLTSSDLPASASQSAGITGVSHRTQPDSSGPSS TVSSAGPSSPTTVDGNSQFGFSDQPSLNSHVAEEHQGLGLLQETPGSGAQELSIEGEQ ELFEPNQSPQVPPMETSQPYEEVSQPCQEVPDISQPCQDISEALSQPCQKVPDISQQC QENHDNGNHTCQEVPHISQPCQKSSQLCQKVSEEVCQLCLENSEEVSQPCQGVSVEVG KLVHKFHVGVGSLVQETLVEVGSPAEEIPEEVIQPYQEFSVEVGRLAQETSAINLLSQ GIPEIDKPSQEFPEEIDLQAQEVPEEIN" BASE COUNT 1373 a 1383 c 1462 g 1326 t ORIGIN 1 GCGGGACTCA AGAGTAGCCT TCCTCGAGGA CCTGCCTTTC CCATTTGCTG 51 CCTGAAGTTA ATGTTTCTTG CTGGCCAAAT CAGGGACATG CCGGCATTAG 101 CGGGATGAGT GGGTGTTCCG GCAGGGATGT GGTCATTGAC GGCCAGTGAG 151 GGCGAGAGTA CCACGGCCCA CTTCTTCCTT GGAGCTGGAG ATGAGGGGCT 201 GGGCACCCGT GGAATAGGCA TGAGGCCAGA AGAGAGTGAC AGCGAGCTCC 251 TTGAGGATGA GGAGGATGAA GTGCCTCCTG AACCTCAGAT CATTGTTGGC 301 ATCTGTGCCA TGACCAAGAA ATCCAAGTCC AAGCCAATGA CTCAAATCCT 351 AGAGCGACTC TGCAGATTTG ACTACCTGAC TGTTGTCATT CTGGGAGAAG 401 ATGTAATCCT TAATGAACCT GTGGAAAACT GGCCATCCTG CCACTGCCTC 451 ATCTCTTTCC ACTCCAAAGG CTTTCCTCTG GACAAAGCTG TTGCTTACTC 501 CAAGCTTCGA AACCCCTTTC TTATCAATGA TCTGGCCATG CAGTATTACA 551 TCCAAGATAG GAGGGAGGTG TACCGGATCC TGCAGGAAGA GGGTATTGAT 601 CTGCCTCGAT ATGCTGTGCT CAACCGTGAT CCTGCCCGGC CTGAGGAATG 651 CAACCTGATA GAAGGTGAAG ACCAAGTAGA GGTCAATGGA GCTGTCTTTC 701 CCAAGCCCTT TGTGGAGAAG CCAGTGAGTG CAGAAGACCA CAATGTTTAC 751 ATCTACTACC CCAGCTCAGC TGGAGGAGGA AGCCAGCGTC TCTTTCGTAA 801 GATTGGCAGC CGAAGCAGTG TTTACTCTCC TGAGAGCAGC GTCCGAAAGA 851 CGGGGTCGTA CATCTATGAG GAGTTTATGC CAACAGATGG CACAGATGTC 901 AAGGTGTATA CAGTGGGGCC AGATTATGCC CATGCTGAAG CTAGAAAATC 951 TCCAGCTTTG GATGGGAAGG TTGAACGAGA CAGTGAGGGG AAAGAGATTC 1001 GATATCCAGT CATGCTGACT GCCATGGAAA AGCTGGTGGC CAGGAAAGTC 1051 TGCGTAGCTT TCAAGCAAAC AGTTTGTGGA TTTGACCTTC TTCGTGCCAA 1101 TGGTCATTCC TTTGTGTGTG ATGTCAATGG CTTTAGTTTT GTCAAGAACT 1151 CGATGAAATA CTACGATGAC TGTGCCAAGA TTCTGGGGAA CACCATAATG 1201 CGGGAGCTTG CCCCACAGTT CCAGATTCCA TGGTCCATCC CCACGGAGGC 1251 TGAGGACATT CCCATTGTTC CCACCACATC TGGCACTATG ATGGAACTTC 1301 GTTGTGTCAT TGCAATTATT CGTCATGGGG ATCGTACTCC CAAGCAGAAG 1351 ATGAAGATGG AAGTGAAACA CCCAAGGTTT TTTGCTCTGT TTGAAAAACA 1401 TGGTGGCTAC AAGACAGGGA AATTAAAACT CAAGCGACCT GAGCAGCTCC 1451 AGGAGGTGCT GGATATCACA AGGCTGTTGT TGGCTGAACT GGAGAAAGAA 1501 CCAGGTGGTG AGATCGAGGA GAAGACTGGA AAACTAGAGC AGCTGAAGTC 1551 TGTACTGGAG ATGTATGGTC ACTTCTCAGG TATAAACCGG AAGGTACAAT 1601 TGACTTACTA CCCTCATGGA GTAAAAGCTT CTAATGAGGG GCAAGATCCA 1651 CAGAGGGAAA CTCTGGCCCC ATCTCTGTTG CTGGTACTGA AGTGGGGTGG 1701 AGAACTGACT CCTGCTGGCC GTGTTCAGGC TGAGGAGCTG GGGCGAGCTT 1751 TTCGCTGCAT GTACCCTGGA GGACAGGGTG ACTATGCTGG CTTCCCTGGT 1801 TGTGGGCTGC TTCGTCTCCA TAGCACTTTC CGCCACGATC TCAAGATCTA 1851 TGCCTCTGAT GAGGGTCGTG TTCAGATGAC TGCTGCTGCC TTCGCCAAGG 1901 GCCTTCTGGC TCTAGAAGGG GAGCTGACAC CCATTTTGGT GCAAATGGTG 1951 AAGAGTGCCA ACATGAATGG GCTACTGGAC AGCGATGGGG ATTCCTTGAG 2001 CAGCTGCCAG CACCGGGTGA AGGCTCGGCT GCACCATATT CTACAGCAGG 2051 ATGCACCCTT TGGCCCTGAG GACTACGATC AGCTGGCTCC CACCAGAAGT 2101 ACTTCCCTGC TCAACTCCAT GACTATCATC CAGAATCCTG TGAAGGTCTG 2151 TGATCAGGTA TTTGCCCTGA TCGAAAACCT CACCCACCAG ATCCGGGAAC 2201 GAATGCAGGA CCCCAGGTCT GTAGACCTGC AGCTCTACCA CAGTGAGACA 2251 CTAGAGCTAA TGCTACAGCG TTGGAGCAAG CTGGAGCGTG ACTTTCGACA 2301 GAAGAGTGGG CGCTATGATA TCAGTAAGAT CCCTGACATC TATGACTGTG 2351 TCAAGTATGA TGTGCAGCAC AATGGGAGTC TGGGACTTCA AGGCACAGCA 2401 GAGTTGCTCC GTCTCTCTAA GGCACTGGCT GATGTGGTCA TTCCCCAGGA 2451 GTACGGGATC AGTCGGGAGG AGAAACTGGA AATTGCTGTG GGCTTCTGTC 2501 TTCCACTGTT GCGGAAGATA CTACTTGACC TGCAGAGAAC CCACGAGGAT 2551 GAGTCTGTCA ACAAGCTGCA TCCCCTGTAC TCCCGAGGCG TGCTCTCCCC 2601 AGGTCGCCAC GTTCGAACGC GTCTCTATTT CACCAGTGAG AGCCATGTCC 2651 ACTCCCTGCT CAGTGTCTTC CGTTATGGAG GACTTCTTGA TGAGACCCAG 2701 GATGCACAAT GGCAGCGAGC TTTGGATTAT CTTAGTGCCA TCTCAGAGCT 2751 TAACTACATG ACCCAGATTG TCATCATGCT TTATGAGGAC AACACACAGG 2801 ATCCCTTATC AGAGGAACGG TTCCATGTGG AGCTACACTT CAGCCCCGGA 2851 GTGAAAGGTG TTGAGGAAGA AGGCAGTGCC CCGGCTGGCT GTGGATTCCG 2901 TCCAGCCTCT TCTGAGAATG AGGAGATGAA AACCAACCAA GGCAGTATGG 2951 AGAACCTGTG TCCAGGAAAG GCATCAGATG AACCAGACCG GGCATTGCAG 3001 ACTTCACCCC AGCCTCCTGA GGGCCCTGGC CTTCCGAGGA GATCACCCCT 3051 CATTCGTAAC CGAAAAGCTG GTTCCATGGA GGTACTTTCT GAGACTTCAT 3101 CCTCGAGGCC TGGTGGCTAC CGGCTCTTTT CATCTTCACG GCCACCCACA 3151 GAAATGAAGC AGAGTGGCCT AGGGTTTGAA GGGTGTTCCA TGGTGCCTAC 3201 CATCTACCCT CTGGAAACAC TGCATAATGC CCTTTCCCTA CGTCAAGTGA 3251 GTGAATTCTT GAGTAGAGTC TGCCAGCGCC ACACTGATGC CCAGGCACAG 3301 GCATCTGCAG CCCTCTTTGA TTCCATGCAC AGCAGCCAGG CCTCAGATAA 3351 CCCATTTTCT CCACCACGTA CTCTTCATTC ACCTCCCCTG CAACTCCAGC 3401 AGCGCTCTGA GAAGCCCCCT TGGTTAGAGA CAAGGTTTTG CCATGTTGGC 3451 CAGGCTGGTT TAGAGCTCCT GACCTCAAGT GATCTGCCTG CCTCGGCCTC 3501 CCAAAGTGCT GGGATTACAG GCGTGAGCCA CCGCACCCAG CCAGACAGCA 3551 GTGGCCCTTC TAGCACTGTG TCCAGTGCTG GTCCTTCTTC CCCTACTACA 3601 GTAGATGGTA ACTCCCAATT TGGCTTCAGT GATCAACCCT CCCTAAATTC 3651 ACACGTGGCT GAAGAACATC AAGGCCTTGG GCTGCTCCAG GAGACCCCTG 3701 GGAGTGGAGC ACAAGAGCTC TCCATAGAAG GGGAGCAAGA GCTTTTTGAA 3751 CCAAATCAGT CCCCACAGGT GCCACCTATG GAAACCAGCC AGCCATACGA 3801 GGAGGTCAGC CAGCCATGTC AGGAGGTCCC TGACATCAGC CAGCCATGCC 3851 AGGACATTTC TGAGGCGCTC AGCCAGCCAT GTCAGAAGGT CCCTGACATC 3901 AGCCAGCAAT GCCAGGAGAA CCATGACAAT GGTAACCACA CATGCCAGGA 3951 GGTCCCTCAC ATCAGCCAGC CATGCCAGAA GTCCAGCCAA CTGTGCCAGA 4001 AAGTCTCTGA GGAAGTTTGC CAGCTATGTC TGGAGAACTC CGAGGAGGTC 4051 AGCCAGCCAT GCCAGGGGGT CTCTGTGGAG GTTGGCAAGC TGGTCCATAA 4101 GTTCCATGTA GGGGTTGGTA GCTTGGTCCA GGAAACCCTT GTAGAAGTTG 4151 GCAGCCCAGC TGAAGAGATC CCTGAGGAGG TCATCCAGCC ATACCAGGAG 4201 TTCTCTGTGG AGGTTGGCAG GCTGGCCCAG GAGACTTCTG CGATCAATCT 4251 GTTATCTCAG GGCATCCCTG AGATTGATAA ACCATCCCAG GAGTTCCCTG 4301 AGGAGATTGA TCTGCAGGCC CAGGAGGTCC CTGAGGAGAT AAATTAGAAG 4351 TCCTGGGTGG TCCCTGAAGT GATTGATCAG CTGCCTGGAG AGGGTATTCC 4401 TCAAGCCCAG CATCCATCTG GTGATCCAAA CCCTCAGAGC CAGTCTCTAG 4451 CCCATGACCA GCACTCACCC CTTCCACCAG CAACATGTGA TTAATTTTCT 4501 CATTAGTGGT ATCACACTAT ACCAGCCATT TGAGCCAGCA ACCTTTTCTG 4551 TTGGCTAACT CACTGGCCAG CTCTCACCAG CGGTGTCTGG GGAAGTAGTT 4601 CTCTTTGTAT GAAGCATACC TGTGCCAGAG CTGTGGTTGA GGAGGAGCCA 4651 GTTTTAGGTT CGAAGAAGCC ATTGGCTCCT CACTTAGCCA TTAGACTTGA 4701 ATAGGATTTC CTTGGGGTGG GTTGGTCTGT CATTACCCAG CCTTCTCTGA 4751 GCATCCTAGG AAATCACAGA TTGTTAAAGG AAATGCCGCT TCACTGCTGA 4801 AGACACCATC TGGCGACAGC AAATGCAAAA GAGGGGACTC TAGGGTCTTC 4851 ACTTTTCTGG GGAAATGTTC ACGACTTCTC AAGGTACGCT TAGACCATAT 4901 GTGCATTCAG GGGACTCTTG TCTTTGCCAG CACTCCTCAG GTGAGCCCGG 4951 GATGGCTATC TGAAATGTCT GGAAAATAGA GTCCCTTCCC CAAACATCTG 5001 ACATGGCCAC TAAATCTTTA GACTTGCCTT ATAGATCCTT CCATATCATC 5051 CCCAAGTGCC CCTTCTGCCT CAGTCCTGTT CCTCTGGGCA ATAGCTCTAG 5101 GGAAAAGTAG GTATTAGACT GCTGTGCAAA ATTCAGAGCA ACTACTTAAA 5151 ATGCTCTAGA GGATTCTTAG CCAGTCTGTG AAAGTGGGCC TCCTCTGTGG 5201 TGGGGCTGTT TGGCTTAGGA GATGCTGTAG TAGTAGAGAT AATCAGGTTC 5251 TATTTTAAGC AATCAGCAGA GATCAATATT GTACAGATAC AAGCAAAGGT 5301 TTAATAAATA TATATATATA TATTTATTTC TGTAGTTGTG CCAGGGACAG 5351 ACTGGCCAAA GACCAAACAC TGCAGGGTCC CCAAGAAATT TGTCCTTATA 5401 TCATTGTGTC TGAGGGCCAG ATATGATTAT GAAGCTTTTT CCAAAGATCC 5451 AGGGATGGGA ATGGGAGTGG GTAGGAGGGG TAATGCGGTC ATTGGAGTCG 5501 GGGGCTGGAA CATTATGAGT GCTCAATAAA TATAAACTAA TGAG // LOCUS AB003698 3187 bp mRNA PRI 05-FEB-1999 DEFINITION Homo sapiens mRNA for Cdc7-related kinase, complete cds. ACCESSION AB003698 NID g2102636 VERSION AB003698.1 GI:2102636 KEYWORDS Cdc7-related kinase. SOURCE Homo sapiens adult, fetal testis, liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3187) AUTHORS Sato,N. TITLE Direct Submission JOURNAL Submitted (08-MAY-1997) to the DDBJ/EMBL/GenBank databases. Noriko Sato, Institute of Medical Science, University of Tokyo, Molecular and Developmental Biology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:nrksato@hgc.ims.u-tokyo.ac.jp, Tel:81-3-5449-5661, Fax:81-3-5449-5424) REFERENCE 2 (sites) AUTHORS Sato,N., Arai,K. and Masai,H. TITLE Human and Xenopus cDNAs encoding budding yeast Cdc7-related kinases: in vitro phosphorylation of MCM subunits by a putative human homologue of Cdc7 JOURNAL EMBO J. 16 (14), 4340-4351 (1997) MEDLINE 97392464 FEATURES Location/Qualifiers source 1. .3187 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1p22" /dev_stage="adult, fetal" /tissue_type="testis, liver" CDS 133. .1857 /codon_start=1 /product="Cdc7-related kinase" /protein_id="BAA19962.1" /db_xref="PID:d1020752" /db_xref="PID:g2102637" /db_xref="GI:2102637" /translation="MEASLGIQMDEPMAFSPQRDRFQAEGSLKKNEQNFKLAGVKKDI EKLYEAVPQLSNVFKIEDKIGEGTFSSVYLATAQLQVGPEEKIALKHLIPTSHPIRIA AELQCLTVAGGQDNVMGVKYCFRKNDHVVIAMPYLEHESFLDILNSLSFQEVREYMLN LFKALKRIHQFGIVHRDVKPSNFLYNRRLKKYALVDFGLAQGTHDTKIELLKFVQSEA QQERCSQNKSHIITGNKIPLSGPVPKELDQQSTTKASVKRPYTNAQIQIKQGKDGKEG SVGLSVQRSVFGERNFNIHSSISHESPAVKLMKQSKTVDVLSRKLATKKKAISTKVMN SAVMRKTASSCPASLTCDCYATDKVCSICLSRRQQVAPRAGTPGFRAPEVLTKCPNQT TAIDMWSAGVIFLSLLSGRYPFYKASDDLTALAQIMTIRGSRETIQAAKTFGKSILCS KEVPAQDLRKLCERLRGMDSSTPKLTSDIQGHASHQPAISEKTDHKASCLVQTPPGQY SGNSFKKGDSNSCEHCFDEYNTNLEGWNEVPDEAYDLLDKLLDLNPASRITAEEALLH PFFKDMSL" polyA_site 3187 /note="20 A nucleotides" BASE COUNT 1003 a 549 c 664 g 971 t ORIGIN 1 GAATTCGGCA CGAGTTGGAG ACGGCGACCC AGGCATCTGG GGAGCACAGA 51 AGTCGTACTC CCTTAAACCC TGCTTTGCTC CCCCTGTGGA TGTAACCCCT 101 TAGCTGGCAT TTTGCATCTC AATTGGCTTG TGATGGAGGC GTCTTTGGGG 151 ATTCAGATGG ATGAGCCAAT GGCTTTTTCT CCCCAGCGTG ACCGGTTTCA 201 GGCTGAAGGC TCTTTAAAAA AAAACGAGCA GAATTTTAAA CTTGCAGGTG 251 TTAAAAAAGA TATTGAGAAG CTTTATGAAG CTGTACCACA GCTTAGTAAT 301 GTGTTTAAGA TTGAGGACAA AATTGGAGAA GGCACTTTCA GCTCTGTTTA 351 TTTGGCCACA GCACAGTTAC AAGTAGGACC TGAAGAGAAA ATTGCTCTAA 401 AACACTTGAT TCCAACAAGT CATCCTATAA GAATTGCAGC TGAACTTCAG 451 TGCCTAACAG TGGCTGGGGG GCAAGATAAT GTCATGGGAG TTAAATACTG 501 CTTTAGGAAG AATGATCATG TAGTTATTGC TATGCCATAT CTGGAGCATG 551 AGTCGTTTTT GGACATTCTG AATTCTCTTT CCTTTCAAGA AGTACGGGAA 601 TATATGCTTA ATCTGTTCAA AGCTTTGAAA CGCATTCATC AGTTTGGTAT 651 TGTTCACCGT GATGTTAAGC CCAGCAATTT TTTATATAAT AGGCGCCTGA 701 AAAAGTATGC CTTGGTAGAC TTTGGTTTGG CCCAAGGAAC CCATGATACG 751 AAAATAGAGC TTCTTAAATT TGTCCAGTCT GAAGCTCAGC AGGAAAGGTG 801 TTCACAAAAC AAATCCCACA TAATCACAGG AAACAAGATT CCACTGAGTG 851 GCCCAGTACC TAAGGAGCTG GATCAGCAGT CCACCACAAA AGCTTCTGTT 901 AAAAGACCCT ACACAAATGC ACAAATTCAG ATTAAACAAG GAAAAGACGG 951 AAAGGAGGGA TCTGTAGGCC TTTCTGTCCA GCGCTCTGTT TTTGGAGAAA 1001 GAAATTTCAA TATACACAGC TCCATTTCAC ATGAGAGCCC TGCAGTGAAA 1051 CTCATGAAGC AGTCAAAGAC TGTGGATGTA CTGTCTAGAA AGTTAGCAAC 1101 AAAAAAGAAG GCTATTTCTA CGAAAGTTAT GAATAGTGCT GTGATGAGGA 1151 AAACTGCCAG TTCTTGCCCA GCTAGCCTGA CCTGTGACTG CTATGCAACA 1201 GATAAAGTTT GTAGTATTTG CCTTTCAAGG CGTCAGCAGG TTGCCCCTAG 1251 GGCAGGTACA CCAGGATTCA GAGCACCAGA GGTCTTGACA AAGTGCCCCA 1301 ATCAAACTAC AGCAATTGAC ATGTGGTCTG CAGGTGTCAT ATTTCTTTCT 1351 TTGCTTAGTG GACGATATCC ATTTTATAAA GCAAGTGATG ATTTAACTGC 1401 TTTGGCCCAA ATTATGACAA TTAGGGGATC CAGAGAAACT ATCCAAGCTG 1451 CTAAAACTTT TGGGAAATCA ATATTATGTA GCAAAGAAGT TCCAGCACAA 1501 GACTTGAGAA AACTCTGTGA GAGACTCAGG GGTATGGATT CTAGCACTCC 1551 CAAGTTAACA AGTGATATAC AAGGGCATGC TTCTCATCAA CCAGCTATTT 1601 CAGAGAAGAC TGACCATAAA GCTTCTTGCC TCGTTCAAAC ACCTCCAGGA 1651 CAATACTCAG GGAATTCATT TAAAAAGGGG GATAGTAATA GCTGTGAGCA 1701 TTGTTTTGAT GAGTATAATA CCAATTTAGA AGGCTGGAAT GAGGTACCTG 1751 ATGAAGCTTA TGACCTGCTT GATAAACTTC TAGATCTAAA TCCAGCTTCA 1801 AGAATAACAG CAGAAGAAGC TTTGTTGCAT CCATTTTTTA AAGATATGAG 1851 CTTGTGATAA TGGATCTTCA TTTAATGTTT ACTGTTATGA GGTAGAATAA 1901 AAAAGAATAC TTTGTAATAG CCACAAGTTC TTGTTTAGAG ACCAGAGCAG 1951 GATTAATAAT TTATTTTAAC ATTTTAGTGT TTGGTGGCAC ATTCTAAAAT 2001 ATAGATTAAG AATACTTAAA ATGCCTGGGA TAGTTCTTGG GACTAACAAC 2051 ATGATCTTCT TTGAGTTAAA CCTACCTAAG TAGATTTTAG GTGGGTTCCT 2101 ATTAGGTCAG ATTTTTAGCT TCCCTAATTA CCTTTCACTG ACATATACAG 2151 AAAAAGGAGC AGTTTTAGTT TTAATTAATT AAAATTAACA GATGTGATGA 2201 GGATTAAATG AATCAAAAGA CTTAATTTGT AGATTCTTTT AGAGTTATGA 2251 GCTAGGTATA GTTTGGGGAA ACTCAACCTG GTGCTGGTGC TCTTAACAAT 2301 TTTGTAAATA AAGAAGATAA TTTCCTTTTC TAGAGGTACA TATTAGGCCT 2351 TTTATGAACA CTAAAACAAT GAGGAAATGT TGGTCATGGG GCAAAGTATC 2401 ACTTAAAATT GAATTCATCC ATTTTTAAAA AACACTTCAT GAAAGCATTC 2451 TGGTGTGAAT TGCCATTTTT TTCTTACTGG CTTCTCAATT TTCTTCCTTC 2501 TCTGCCCCTA CCTAAAACAT TCTCCTCGGA AATTACATGG TGCTGACCAC 2551 AAAGTTTCTG GATGTTTTAT TAAATATTGT ACGTCTTTAC AGTTGGGAAT 2601 TTAAAATAAT ACATACACTG GTTGATAAAG GGAAGCTGCA GGACCAAGGT 2651 GAAGATTGAT AGTCCAAATG CTTTTCTTTT TTGAGTTGTA TATTTTTGGA 2701 CACCATCTTA GATATAATTA GGTAGCTGCT GAAAGGAAAA GTGAATACAG 2751 AATTGACGGT ATTATTGGAG ATTTTTCCTC TGCGTAGAGC CATCCAGATC 2801 TCTGTATCCT GTTTTGACTA AGTCTTAGGT GGGTTGGGAA GACAGATAAT 2851 GAAGTGTAGG CAAAGAGAAA AGGACCCAAG ATAGAGGTTT ATATTCAGAA 2901 ATGGTATATA TCAATGACAG CATATCAAAC TTCCTATGGG AAAAAGTCTG 2951 GTGGGTGGTC AGCTGACAGA TTTCCCATTT AGTAGTCATA GAATACAGAA 3001 ATAGTTTAGG GACATGTATT CATTTTGTTA TTTTGAGCAT TGATAGGTCA 3051 GTATATCTAC CTAATCTGTT TGGTAAGTAT AGGATATATA AACCATTACC 3101 ATTGATCTGT CTTATGCCAT AATCTTAAAA AAAAATTGAA TGCTCTTGAA 3151 TTTGTATATT CAATAAAGTT ATCCTTTTAT ATTTTTT // LOCUS AB006626 8459 bp mRNA PRI 05-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0288 gene, complete cds. ACCESSION AB006626 NID g2564323 VERSION AB006626.1 GI:2564323 KEYWORDS KIAA0288. SOURCE Homo sapiens Brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6116. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8459) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (20-AUG-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene StructureI; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (bases 1 to 8459) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes JOURNAL Published Only in DataBase (1997) In press FEATURES Location/Qualifiers source 1. .8459 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6116" /clone_lib="pSPORT 1" /tissue_type="Brain" 5'UTR 1. .1143 /gene="KIAA0288" gene 1. .8459 /gene="KIAA0288" CDS 1144. .4047 /gene="KIAA0288" /codon_start=1 /protein_id="BAA22957.1" /db_xref="PID:d1023833" /db_xref="PID:g2564324" /db_xref="GI:2564324" /translation="MLAMKHQQELLEHQRKLERHRQEQELEKQHREQKLQQLKNKEKG KESAVASTEVKMKLQEFVLNKKKALAHRNLNHCISSDPRYWYGKTQHSSLDQSSPPQS GVSTSYNHPVLGMYDAKDDFPLRKTASEPNLKLRSRLKQKVAERRSSPLLRRKDGPVV TALKKRPLDVTDSACSSAPGSGPSSPNNSSGSVSAENGIAPAVPSIPAETSLAHRLVA REGSAAPLPLYTSPSLPNITLGLPATGPSAGTAGQQDTERLTLPALQQRLSLFPGTHL TPYLSTSPLERDGGAAHSPLLQHMVLLEQPPAQAPLVTGLGALPLHAQSLVGADRVSP SIHKLRQHRPLGRTQSAPLPQNAQALQHLVIQQQHQQFLEKHKQQFQQQQLQMNKIIP KPSEPARQPESHPEETEEELREHQALLDEPYLDRLPGQKEAHAQAGVQVKQEPIESDE EEAEPPREVEPGQRQPSEQELLFRQQALLLEQQRIHQLRNYQASMEAAGIPVSFGGHR PLSRAQSSPASATFPVSVQEPPTKPRFTTGLVYDTLMLKHQCTCGSSSSHPEHAGRIQ SIWSRLQETGLRGKCECIRGRKATLEELQTVHSEAHTLLYGTNPLNRQKLDSKKLLGS LASVFVRLPCGGVGVDSDTIWNEVHSAGAARLAVGCVVELVFKVATGELKNGFAVVRP PGHHAEESTPMGFCYFNSVAVAAKLLQQRLSVSKILIVDWDVHHGNGTQQAFYSDPSV LYMSLHRYDDGNFFPGSGAPDEVGTGPGVGFNVNMAFTGGLDPPMGDAEYLAAFRTVV MPIASEFAPDVVLVSSGFDAVEGHPTPLGGYNLSARCFGYLTKQLMGLAGGRIVLALE GGHDLTAICDASEACVSALLGNELDPLPEKVLQQRPNANAVRSMEKVMEIHSKYWRCL QRTTSTAGRSLIEAQTCENEEAETVTAMASLSVGVKPAEKRPDEEPMEEEPPL" 3'UTR 4048. .8459 /gene="KIAA0288" BASE COUNT 1852 a 2374 c 2338 g 1895 t ORIGIN 1 GGAGGTTGTG GGGCCGCCGC CGCGGAGCAC CGTCCCCGCC GCCGCCCGAG 51 CCCGAGCCCG AGCCCGCGCA CCCGCCCGCG CCGCCGCCGC CGCCGCCCGA 101 ACAGCCTCCC AGCCTGGGCC CCCGGCGGCG CCGTGGCCGC GTCCCGGCTG 151 TCGCCGCCCG AGCCCGAGCC CGCGCGCCGG CGGGTGGCGG CGCAGGCTGA 201 GGAGATGCGG CGCGGAGCGC CGGAGCAGGG CTAGAGCCGG CCGCCGCCGC 251 CCGCCGCGGT AAGCGCAGCC CCGGCCCGGC GCCCGCGGGC CATTGTCCGC 301 CGCCCGCCCC GCGCCCCGCG CAGCCTGCAG GCCTTGGAGC CCGCGGCAGG 351 TGGACGCCGC CGGTCCACAC CCGCCCCGCG CGCGGCCGTG GGAGGCGGGG 401 GCCAGCGCTG GCCGCGCGCC GTGGGACCCG CCGGTCCCCA GGGCCGCCCG 451 GCCCCTTCTG GACCTTTCCA CCCGCGCCGC GAGGCGGCTT CGCCCGCCGG 501 GGCGGGGGCG CGGGGGTGGG CACGGCAGGC AGCGGCGCCG TCTCCCGGTG 551 CGGGGCCCGC GCCCCCCGAG CAGGTTCATC TGCAGAAGCC AGCGGACGCC 601 TCTGTTCAAC TTGTGGGTTA CCTGGCTCAT GAGACCTTGC CGGCGAGGCT 651 CGGCGCTTGA ACGTCTGTGA CCCAGCCCTC ACCGTCCCGG TACTTGTATG 701 TGTTGGTGGG AGTTTGGAGC TCGTTGGAGC TATCGTTTCC GTGGAAATTT 751 TGAGCCATTT CGAATCACTT AAAGGAGTGG ACATTGCTAG CAATGAGCTC 801 CCAAAGCCAT CCAGATGGAC TTTCTGGCCG AGACCAGCCA GTGGAGCTGC 851 TGAATCCTGC CCGCGTGAAC CACATGCCCA GCACGGTGGA TGTGGCCACG 901 GCGCTGCCTC TGCAAGTGGC CCCCTCGGCA GTGCCCATGG ACCTGCGCCT 951 GGACCACCAG TTCTCACTGC CTGTGGCAGA GCCGGCCCTG CGGGAGCAGC 1001 AGCTGCAGCA GGAGCTCCTG GCGCTCAAGC AGAAGCAGCA GATCCAGAGG 1051 CAGATCCTCA TCGCTGAGTT CCAGAGGCAG CACGAGCAGC TCTCCCGGCA 1101 GCACGAGGCG CAGCTCCACG AGCACATCAA GCAATAACAG GAGATGCTGG 1151 CCATGAAGCA CCAGCAGGAG CTGCTGGAAC ACCAGCGGAA GCTGGAGAGG 1201 CACCGCCAGG AGCAGGAGCT GGAGAAGCAG CACCGGGAGC AGAAGCTGCA 1251 GCAGCTCAAG AACAAGGAGA AGGGCAAAGA GAGTGCCGTG GCCAGCACAG 1301 AAGTGAAGAT GAAGTTACAA GAATTTGTCC TCAATAAAAA GAAGGCGCTG 1351 GCCCACCGGA ATCTGAACCA CTGCATTTCC AGCGACCCTC GCTACTGGTA 1401 CGGGAAAACG CAGCACAGTT CCCTTGACCA GAGTTCTCCA CCCCAGAGCG 1451 GAGTGTCGAC CTCCTATAAC CACCCGGTCC TGGGAATGTA CGACGCCAAA 1501 GATGACTTCC CTCTTAGGAA AACAGCTTCT GAACCGAATC TGAAATTACG 1551 GTCCAGGCTA AAGCAGAAAG TGGCCGAAAG ACGGAGCAGC CCCCTGTTAC 1601 GCAGGAAAGA CGGGCCAGTG GTCACTGCTC TAAAAAAGCG TCCGTTGGAT 1651 GTCACAGACT CCGCGTGCAG CAGCGCCCCA GGCTCCGGAC CCAGCTCACC 1701 CAACAACAGC TCCGGGAGCG TCAGCGCGGA GAACGGTATC GCGCCCGCCG 1751 TCCCCAGCAT CCCGGCGGAG ACGAGTTTGG CGCACAGACT TGTGGCACGA 1801 GAAGGCTCGG CCGCTCCACT TCCCCTCTAC ACATCGCCAT CCTTGCCCAA 1851 CATCACGCTG GGCCTGCCTG CCACCGGCCC CTCTGCGGGC ACGGCGGGCC 1901 AGCAGGACAC CGAGAGACTC ACCCTTCCCG CCCTCCAGCA GAGGCTCTCC 1951 CTTTTCCCCG GCACCCACCT CACTCCCTAC CTGAGCACCT CGCCCTTGGA 2001 GCGGGACGGA GGGGCAGCGC ACAGCCCTCT TCTGCAGCAC ATGGTCTTAC 2051 TGGAGCAGCC ACCGGCACAA GCACCCCTCG TCACAGGCCT GGGAGCACTG 2101 CCCCTCCACG CACAGTCCTT GGTTGGTGCA GACCGGGTGT CCCCCTCCAT 2151 CCACAAGCTG CGGCAGCACC GCCCACTGGG GCGGACCCAG TCGGCCCCGC 2201 TGCCCCAGAA CGCCCAGGCT CTGCAGCACC TGGTCATCCA GCAGCAGCAT 2251 CAGCAGTTTC TGGAGAAACA CAAGCAGCAG TTCCAGCAGC AGCAACTGCA 2301 GATGAACAAG ATCATCCCCA AGCCAAGCGA GCCAGCCCGG CAGCCGGAGA 2351 GCCACCCGGA GGAGACGGAG GAGGAGCTCC GTGAGCACCA GGCTCTGCTG 2401 GACGAGCCCT ACCTGGACCG GCTGCCGGGG CAGAAGGAGG CGCACGCACA 2451 GGCCGGCGTG CAGGTGAAGC AGGAGCCCAT TGAGAGCGAT GAGGAAGAGG 2501 CAGAGCCCCC ACGGGAGGTG GAGCCGGGCC AGCGCCAGCC CAGTGAGCAG 2551 GAGCTGCTCT TCAGACAGCA AGCCCTCCTG CTGGAGCAGC AGCGGATCCA 2601 CCAGCTGAGG AACTACCAGG CGTCCATGGA GGCCGCCGGC ATCCCCGTGT 2651 CCTTCGGCGG CCACAGGCCT CTGTCCCGGG CGCAGTCCTC ACCCGCGTCT 2701 GCCACCTTCC CCGTGTCTGT GCAGGAGCCC CCCACCAAGC CGAGGTTCAC 2751 GACAGGCCTC GTGTATGACA CGCTGATGCT GAAGCACCAG TGCACCTGCG 2801 GGAGTAGCAG CAGCCACCCC GAGCACGCCG GGAGGATCCA GAGCATCTGG 2851 TCCCGCCTGC AGGAGACGGG CCTCCGGGGC AAATGCGAGT GCATCCGCGG 2901 ACGCAAGGCC ACCCTGGAGG AGCTACAGAC GGTGCACTCG GAAGCCCACA 2951 CCCTCCTGTA TGGCACGAAC CCCCTCAACC GGCAGAAACT GGACAGTAAG 3001 AAACTTCTAG GCTCGCTCGC CTCCGTGTTC GTCCGGCTCC CTTGCGGTGG 3051 TGTTGGGGTG GACAGTGACA CCATATGGAA CGAGGTGCAC TCGGCGGGGG 3101 CAGCCCGCCT GGCTGTGGGC TGCGTGGTAG AGCTGGTCTT CAAGGTGGCC 3151 ACAGGGGAGC TGAAGAATGG CTTTGCTGTG GTCCGCCCCC CTGGACACCA 3201 TGCGGAGGAG AGCACGCCCA TGGGCTTTTG CTACTTCAAC TCCGTGGCCG 3251 TGGCAGCCAA GCTTCTGCAG CAGAGGTTGA GCGTGAGCAA GATCCTCATC 3301 GTGGACTGGG ACGTGCACCA TGGAAACGGG ACCCAGCAGG CTTTCTACAG 3351 CGACCCTAGC GTCCTGTACA TGTCCCTCCA CCGCTACGAC GATGGGAACT 3401 TCTTCCCAGG CAGCGGGGCT CCTGATGAGG TGGGCACAGG GCCCGGCGTG 3451 GGTTTCAACG TCAACATGGC TTTCACCGGC GGCCTGGACC CCCCCATGGG 3501 AGACGCTGAG TACTTGGCGG CCTTCAGAAC GGTGGTCATG CCGATCGCCA 3551 GCGAGTTTGC CCCGGATGTG GTGCTGGTGT CATCAGGCTT CGATGCCGTG 3601 GAGGGCCACC CCACCCCTCT TGGGGGCTAC AACCTCTCCG CCAGATGCTT 3651 CGGGTACCTG ACGAAGCAGC TGATGGGCCT GGCTGGCGGC CGGATTGTCC 3701 TGGCCCTCGA GGGAGGCCAC GACCTGACCG CCATTTGCGA CGCCTCGGAA 3751 GCATGTGTTT CTGCCTTGCT GGGAAACGAG CTTGATCCTC TCCCAGAAAA 3801 GGTTTTACAG CAAAGACCCA ATGCAAACGC TGTCCGTTCC ATGGAGAAAG 3851 TCATGGAGAT CCACAGCAAG TACTGGCGCT GCCTGCAGCG CACAACCTCC 3901 ACAGCGGGGC GTTCTCTGAT CGAGGCTCAG ACTTGCGAGA ACGAAGAAGC 3951 CGAGACGGTC ACCGCCATGG CCTCGCTGTC CGTGGGCGTG AAGCCCGCCG 4001 AAAAGAGACC AGATGAGGAG CCCATGGAAG AGGAGCCGCC CCTGTAGCAC 4051 TCCCTCGAAG CTGCTGTTCT CTTGTCTGTC TGTCTCTGTC TTGAAGCTCA 4101 GCCAAGAAAC TTTCCCGTGT CACGCCTGCG TCCCACCGTG GGGCTCTCTT 4151 GGAGCACCCA GGGACACCCA GCGTGCAACA GCCACGGGAA GCCTTTCTGC 4201 CGCCCAGGCC CACAGGTCTC GAGACGCACA TGCACGCCTG GGCGTGGCAG 4251 CCTCACAGGG AACACGGGAC AGACGCCGGC GACGCGCAGA CACACGGACA 4301 CGCGGAAGCC AAGCACACTC TGGCGGGTCC CGCAAGGGAC GCCGTGGAAG 4351 AAAGGAGCCT GTGGCAACAG GCGGCCGAGC TGCCGAATTC AGTTGACACG 4401 AGGCACAGAA AACAAATATC AAAGATCTAA TAATACAAAA CAAACTTGAT 4451 TAAAACTGGT GCTTAAAGTT TATTACCCAC AACTCCACAG TCTCTGTGTA 4501 AACCACTCGA CTCATCTTGT AGCTTATTTT TTTTTTAAAG AGGACGTTTT 4551 CTACGGCTGT GGCCCGCCTC TGTGAACCAT AGCGGTGTGC GGCGGGGGGT 4601 CTGCACCCGG GTGGGGGACA GAGGGACCTT TAAAGAAAAC AAAACTGGAC 4651 AGAAACAGGA ATGTGAGCTG GGGGAGCTGG CTTGAGTTTC TCAAAAGCCA 4701 TCGGAAGATG CGAGTTTGTG CCTTTTTTTT TATTGCTCTG GTGGATTTTT 4751 GTGGCTGGGT TTTCTGAAGT CTGAGGAACA ATGCCTTAAG AAAAAACAAA 4801 CAGCAGGAAT CGGTGGGACA GTTTCCTGTG GCCAGCCGAG CCTGGCAGTG 4851 CTGGCACCGC GAGCTGGCCT GACGCCTCAA GCACGGGCAC CAGCCGTCAT 4901 CTCCGGGGCC AGGGGCTGCA GCCCGGCGGT CCCTGTTTTG CTTTATTGCT 4951 GTTTAAGAAA AATGGAGGTA GTTCCAAAAA AGTGGCAAAT CCCGTTGGAG 5001 GTTTTGAAGT CCAACAAATT TTAAACGAAT CCAAAGTGTT CTCACACGTC 5051 ACATACGATT GAGCATCTCC ATCTGGTCGT GAAGCATGTG GTAGGCACAC 5101 TTGCAGTGTT ACGATCGGAA TGCTTTTTAT TAAAAGCAAG TAGCATGAAG 5151 TATTGCTTAA ATTTTAGGTA TAAATAAATA TATATATGTA TAATATATAT 5201 TCCAATGTAT TCCAAGCTAA GAAACTTACT TGATTCTTAT GAAATCTTGA 5251 TAAAATATTT ATAATGCATT TATAGAAAAA GTATATATAT ATATATAAAA 5301 TGAATGCAGA TTGCGAAGGT CCCTGCAAAT GGATGGCTTG TGAATTTGCT 5351 CTCAAGGTGC TTATGGAAAG GGATCCTGAT TGATTGAAAT TCATGTTTTC 5401 TCAAGCTCCA GATTGGCTAG ATTTCAGATC GCCAACACAT TCGCCACTGG 5451 GCAACTACCC TACAAGTTTG TACTTTCATT TTAATTATTT TCTAACAGAA 5501 CCGCTCCCGT CTCCAAGCCT TCATGCACAT ATGTACCTAA TGAGTTTTTA 5551 TAGCAAAGAA TATAAATTTG CTGTTGATTT TTGTATGAAT TTTTTCACAA 5601 AAAGATCCTG AATAAGCATT GTTTTATGAA TTTTACATTT TTCCTCACCA 5651 TTTAGCAATT TTCTGAATGG TAATAATGTC TAAATCTTTT TCCTTTCTGA 5701 ATTCTTGCTT GTACATTTTT TTTTACCTTT CAAAGGTTTT TAATTATTTT 5751 TGTTTTTATT TTTGTACGAT GAGTTTTCTG CAGCGTACAG AATTGTTGCT 5801 GTCAGATTCT ATTTTCAGAA AGTGAGAGGA GGGACCGTAG GTCTTTTCGG 5851 AGTGACACCA ACGATTGTGT CTTTCCTGGT CTGTCCTAGG AGCTGTATAA 5901 AGAAGCCCAG GGGCTCTTTT TAACTTTCAA CACTAGTAGT ATTACGAGGG 5951 GTGGTGTGTT TTTCCCCTCC GTGGCAAGGG CAGGGAGGGT TGCTTAGGAT 6001 GCCCGGCCAC CCTGGGAGGC TTGCCAGATG CCGGGGGCAG TCAGCATTAA 6051 TGAAACTCAT GTTTAAACTT CTCTGACCAC ATCGTCAGGA TAGAATTCTA 6101 ACTTGAGTTT TCCAAAGACC TTTTGAGCAT GTCAGCAATG CATGGGGCAC 6151 ACGTGGGGCT CTTTACCCAC TTGGGTTTTT CCACTGCAGC CACGTGGCCA 6201 GCCCTGGATT TTGGAGCCTG TGGCTGCAAG GAACCCAGGG ACCCTTGTTG 6251 CCTGGTGAAC CTGCAGGGAG GGTATGATTG CCTGACCAGG ACAGCCAGTC 6301 TTTACTCTTT TTCTCTTCAA CAGTAACTGA CAGTCACGTT TTACTGGTAA 6351 CTTATTTTCC AGCACATGAA GCCACCAGTT TCATTCCAAA GTGTATATTG 6401 GGTTCAGACT TGGGGGCAGA AGTTCAGACA CACCGTGCTC AGGAGGGACC 6451 CAGAGCCGAG TTTCGGAGTT TGGTAAAGTT TACAGGGTAG CTTCTGAAAT 6501 TAACTCAAAC TTTTGACCAA ATGAGTGCAG ATTCTTGGAT TCACTTGGTC 6551 ACTGGGCTGC TGATGGTCAG CTCTGAGACA GTGGTTTGAG AGCAGGCAGA 6601 ACGGTCTTGG GACTTGTTTG ACTTTCCCCT CCCTGGTGGC CACTCTTTGC 6651 TCTGAAGCCC AGATTGGCAA GAGGAGCTGG TCCATTCCCC ATTCATGGCA 6701 CAGAGCAGTG GCAGGGCCCA GCTAGCAGGC TCTTCTGGCC TCCTTGGCCT 6751 CATTCTCTGC ATAGCCCTCT GGGGATCCTG CCACCTGCCC TCTTACCCCG 6801 CCGTGGCTTA TGGGGAGGAA TGCATCATCT CACTTTTTTT TTTTAAGCAG 6851 ATGATGGGAT AACATGGACT GCTCAGTGGC CAGGTTATCA GTGGGGGGAC 6901 TTAATTCTAA TCTCATTCAA ATGGAGACGC CCTCTGCAAA GGCCTGGCAG 6951 GGGGAGGCAC GTTTCATCTG TCAGCTCACT CCAGCTTCAC AAATGTGCTG 7001 AGAGCATTAC TGTGTAGCCT TTTCTTTGAA GACACACTCG GCTCTTCTCC 7051 ACAGCAAGCG TCCAGGGCAG ATGGCAGAGG ATCTGCCTCG GCGTCTGCAG 7101 GCGGGACCAC GTCAGGGAGG GTTCCTTCAT GTGTTCTCCC TGTGGGTCCT 7151 TGGACCTTTA GCCTTTTTCT TCCTTTGCAA AGGCCTTGGG GGCACTGGCT 7201 GGGAGTCAGC AAGCGAGCAC TTTATATCCC TTTGAGGGAA ACCCTGATGA 7251 CGCCACTGGG CCTCTTGGCG TCTGCCCTGC CCTCGCGGCT TCCCGCCGTG 7301 CCGCAGCGTG CCCACGTGCC CACGCCCCAC CAGCAGGCGG CTGTCCCGGA 7351 GGCCGTGGCC CGCTGGGACT GGCCGCCCCT CCCCAGCGTC CCAGGGCTCT 7401 GGTTCTGGAG GGCCACTTTG TCAAGGTGTT TCAGTTTTTC TTTACTTCTT 7451 TTGAAAATCT GTTTGCAAGG GGAAGGACCA TTTCGTAATG GTCTGACACA 7501 AAAGCAAGTT TGATTTTTGC AGCACTAGCA ATGGACTTTG TTGTTTTTCT 7551 TTTTGATCAG AACATTCCTT CTTTACTGGT CACAGCCACG TGCTCATTCC 7601 ATTCTTCTTT TTGTAGACTT TGGGCCCACG TGTTTTATGG GCATTGATAC 7651 ATATATAAAT ATATAGATAT AAATATATAT GAATATATTT TTTTAAGTTT 7701 CCTACACCTG GAGGTTGCAT GGACTGTACG ACCGGCATGA CTTTATATTG 7751 TATACAGATT TTGCACGCCA AACTCGGCAG CTTTGGGGAA GAAGAAAAAT 7801 GCCTTTCTGT TCCCCTCTCA TGACATTTGC AGATACAAAA GATGGAAATT 7851 TTTCTGTAAA ACAAAACCTT GAAGGAGAGG AGGGCGGGGA AGTTTGCGTC 7901 TTATTGAACT TATTCTTAAG AAATTGTACT TTTTATTGTA AGAAAAATAA 7951 AAAGGACTAC TTAAACATTT GTCATATTAA GAAAAAAAGT TTATCTAGCA 8001 CTTGTGACAT ACCAATAATA GAGTTTATTG TATTTATGTG GAAACAGTGT 8051 TTTAGGGAAA CTACTCAGAA TTCACAGTGA ACTGCCTGTC TCTCTCGAGT 8101 TGATTTGGAG GAATTTTGTT TTGTTTTGTT TTGTTTGTTT CCTTTTATCT 8151 CCTTCCACGG GCCAGGCGAG CGCCGCCCGC CCTCACTGGC CTTGTGACGG 8201 TTTATTCTGA TTGAGAACTG GGCGGACTCG AAAGAGTCCC CTTTTCCGCA 8251 CAGCTGTGTT GACTTTTTAA TTACTTTTAG GTGATGTATG GCTAAGATTT 8301 CACTTTAAGC AGTCGTGAAC TGTGCGAGCA CTGTGGTTTA CAATTATACT 8351 TTGCATCGAA AGGAAACCAT TTCTTCATTG TAACGAAGCT GAGCGTGTTC 8401 TTAGCTCGGC CTCACTTTGT CTCTGGCATT GATTAAAAGT CTGCTATTGA 8451 AAGAAAAAG // LOCUS AB007454 1503 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens mRNA for chemokine LEC precursor, complete cds. ACCESSION AB007454 NID g2723285 VERSION AB007454.1 GI:2723285 KEYWORDS chemokine LEC precursor. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1503) AUTHORS Nomiyama,H. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) to the DDBJ/EMBL/GenBank databases. Hisayuki Nomiyama, Kumamoto University Medical School, Department of Biochemistry; Honjo 2-2-1, Kumamoto, Kumamoto 860-0811, Japan (E-mail:nomiyama@gpo.kumamoto-u.ac.jp, Tel:81-96-373-5063, Fax:81-96-372-6140) REFERENCE 2 (sites) AUTHORS Shoudai,K., Hieshima,K., Fukuda,S., Iio,M., Miura,R., Imai,T., Yoshie,O. and Nomiyama,H. TITLE Isolation of cDNA encoding a novel human CC chemokine NCC-4/LEC JOURNAL Biochim. Biophys. Acta 1396 (3), 273-277 (1998) MEDLINE 98207719 FEATURES Location/Qualifiers source 1. .1503 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" sig_peptide 77. .145 CDS 77. .439 /codon_start=1 /product="chemokine LEC precursor" /protein_id="BAA24057.1" /db_xref="PID:d1024963" /db_xref="PID:g2723286" /db_xref="GI:2723286" /translation="MKVSEAALSLLVLILIITSASRSQPKVPEWVNTPSTCCLKYYEK VLPRRLVVGYRKALNCHLPAIIFVTKRNREVCTNPNDDWVQEYIKDPNLPLLPTRNLS TVKIITAKNGQPQLLNSQ" mat_peptide 146. .436 polyA_signal 560. .565 polyA_signal 1485. .1490 BASE COUNT 417 a 374 c 312 g 400 t ORIGIN 1 GTTGGCAAGC GGACCACCAG CAACAGACAA CATCTTCATT CGGCTCTCCC 51 TGAAGCTGTA CTGCCTCGCT GAGAGGATGA AGGTCTCCGA GGCTGCCCTG 101 TCTCTCCTTG TCCTCATCCT TATCATTACT TCGGCTTCTC GCAGCCAGCC 151 AAAAGTTCCT GAGTGGGTGA ACACCCCATC CACCTGCTGC CTGAAGTATT 201 ATGAGAAAGT GTTGCCAAGG AGACTAGTGG TGGGATACAG AAAGGCCCTC 251 AACTGTCACC TGCCAGCAAT CATCTTCGTC ACCAAGAGGA ACCGAGAAGT 301 CTGCACCAAC CCCAATGACG ACTGGGTCCA AGAGTACATC AAGGATCCCA 351 ACCTACCTTT GCTGCCTACC AGGAACTTGT CCACGGTTAA AATTATTACA 401 GCAAAGAATG GTCAACCCCA GCTCCTCAAC TCCCAGTGAT GACCAGGCTT 451 TAGTGGAAGC CCTTGTTTAC AGAAGAGAGG GGTAAACCTA TGAAAACAGG 501 GGAAGCCTTA TTAGGCTGAA ACTAGCCAGT CACATTGAGA GAAGCAGAAC 551 AATGATCAAA ATAAAGGAGA AGTATTTCGA ATATTTTCTC AATCTTAGGA 601 GGAAATACCA AAGTTAAGGG ACGTGGGCAG AGGTACGCTC TTTTATTTTT 651 ATATTTATAT TTTTATTTTT TTGAGATAGG GTCTTACTCT GTCACCCAGG 701 CTGGAGTGCA GTGGTGTGAT CTTGGCTCAC TTGATCTTGG CTCACTGTAA 751 CCTCCACCTC CCAGGCTCAA GTGATCCTCC CACCCCAGCC TCCCGAGTAG 801 CTGGGACTAC AGGCTTGCGC CACCACACCT GGCTAATTTT TGTATTTTTG 851 GTAGAGACGG GATTCTACCA TGTTGCCCAG GCTGGTCTCA AACTCGTGTG 901 CCCAAGCAAT CCACCTGCCT CAGCCTTCCA AAAGTGCTGG GATTACAGGC 951 GTGAGCCACC ACATCCGGCC AGTGCACTCT TAATACACAG AAAAAATATA 1001 TTCACATCCT TCTCCTGCTC TCTTTCAATT CCTCACTTCA CACCAGTACA 1051 CAAGCCATTC TAAATACTTA GCCAGTTTCC AGCCTTCCAG ATGATCTTTG 1101 CCCTCTGGGT CTTGACCCAT TAAGAGCCCC ATAGAACTCT TGATTTTTCC 1151 TGTCCATCTT TATGGATTTT TCTGGATCTA TATTTTCTTC AATTATTCTT 1201 TCATTTTATA ATGCAACTTT TTCATAGGAA GTCCGGATGG GAATATTCAC 1251 ATTAATCATT TTTGCAGAGA CTTTGCTAGA TCCTCTCATA TTTTGTCTTC 1301 CTCAGGGTGG CAGGGGTACA GAGATGTCCT GATTGGAAAA AAAAAAAAAA 1351 GAGAGAGAGA GAGAAGAAGA AGAAGAAGAG ACACAAATCT CTACCTCCCA 1401 TGTTAAGCTT TGCAGGACAG GGAAAGAAAG GGTATGAGAC ACGGCTAGGG 1451 GTAAACTCTT AGTCCAAAAC CCAAGCATGC AATAAATAAA ACTCCCTTAT 1501 TTG // LOCUS AB007860 5711 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens KIAA0400 mRNA, complete cds. ACCESSION AB007860 NID g2662080 VERSION AB007860.1 GI:2662080 KEYWORDS KIAA0400. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1091. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5711) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. 78 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 4 (5), 307-313 (1997) MEDLINE 98116655 FEATURES Location/Qualifiers source 1. .5711 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1091" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 341. .3361 /gene="KIAA0400" CDS 341. .3361 /gene="KIAA0400" /codon_start=1 /protein_id="BAA23696.1" /db_xref="PID:d1024577" /db_xref="PID:g2662081" /db_xref="GI:2662081" /translation="MPDQISVSEFVAETHEDYKAPTASSFTTRTAQCRNTVAAIEEAL DVDRMVLYKMKKSVKAINSSGLAHVENEEQYTQALEKFGGNCVCRDDPDLGSAFLKFS VFTKELTALFKNLIQNMNNIISFPLDSLLKGDLKGVKGDLKKPFDKAWKDYETKITKI EKEKKEHAKLHGMIRTEISGAEIAEEMEKERRFFQLQMCEYLLKVNEIKIKKGVDLLQ NLIKYFHAQCNFFQDGLKAVESLKPSIETLSTDLHTIKQAQDEERRQLIQLRDILKSA LQVEQKEDSQIRQSTAYSLHQPQGNKEHGTERNGSLYKKSDGIRKVWQKRKCSVKNGF LTISHGTANRPPAKLNLLTCQVKTNPEEKKCFDLISHDRTYHFQAEDEQECQIWMSVL QNSKEEALNNAFKGDDNTGENNIVQELTKEIISEVQRMTGNDVCCDCGAPDPTWLSTN LGILTCIECSGIHRELGVHYSRMQSLTLDVLGTSELLLAKNIGNAGFNEIMECCLPAE DSVKPNPGSDMNARKDYITAKYIERRYARKKHADNAAKLHSLCEAVKTRDIFGLLQAY ADGVDLTEKIPLANGHEPDETALHLAVRSVDRTSLHIVDFLVQNSGNLDKQTGKGSTA LHYCCLTDNAECLKLLLRGKASIEIANESGETPLDIAKRLKHEHCEELLTQALSGRFN SHVHVEYEWRLLHEDLDESDDDMDEKLQPSPNRREDRPISFYQLGSNQLQSNAVSLAR DAANLAKEKQRAFMPSILQNETYGALLSGSPPPAQPAAPSTTSAPPLPPRNVGKVQTA SSANTLWKTNSVSVDGGSRQRSSSDPPAVHPPLPPLRVTSTNPLTPTPPPPVAKTPSV MEALSQPSKPAPPGISQIRPPPLPPQPPSRLPQKKPAPGADKSTPLTNKGQPRGPVDL SATEALGPLSNAMVLQPPAPMPRKSQATKLKPKRVKALYNCVADNPDELTFSEGDVII VDGEEDQEWWIGHIDGDPGRKGAFPVSFVHFIAD" BASE COUNT 1569 a 1337 c 1332 g 1473 t ORIGIN 1 CGCCCCTCTC CGCGGGAGGC GTCGGGGCCC GCGGCTGGGT GGCGGGTAGT 51 TCCCGCCGGC TGTGCGCGCC CGCCTCGGGG CTCGCTCTGA GAGGACGCGG 101 CGGCAGCGGA CTCGGAGCCC TCGGCGCGCA GGCGGGCGGA CCGGCCGAGC 151 TGCGCGGGGC TGCGCGCCGC CCCTGCTCCG CCGCCAGGCC CCGCGCGGCT 201 CCCGCGCCCG GCGCTCCCCT TTGTCCGCGG GCCGGAGCGG CGGCGGCAGC 251 GGCGGTGTCC GAGCGGCGGT CGGAGCCTGC TGCGGCAGTT GAGGCGGCGG 301 CGCCCCTGCG GCTGTGCGCC AGCGCCCTCG CGCCGAGGCG ATGCCGGACC 351 AGATCTCCGT GTCGGAATTC GTGGCCGAGA CCCATGAGGA CTACAAGGCG 401 CCCACGGCCT CCAGCTTCAC CACCCGCACG GCGCAGTGCC GGAACACTGT 451 GGCGGCCATC GAGGAGGCTT TGGACGTGGA CCGGATGGTT CTTTACAAAA 501 TGAAGAAATC CGTGAAAGCA ATCAACAGCT CTGGGCTGGC TCACGTGGAA 551 AATGAAGAGC AGTACACCCA GGCTCTGGAG AAGTTTGGCG GCAACTGTGT 601 ATGCAGAGAT GACCCAGATT TAGGAAGTGC GTTCCTGAAG TTCTCAGTGT 651 TTACAAAGGA GTTGACAGCA CTTTTCAAAA ACCTGATTCA GAATATGAAC 701 AACATAATCT CCTTCCCTTT GGACAGTTTG CTGAAGGGGG ACCTGAAAGG 751 AGTGAAAGGG GATCTGAAAA AGCCTTTTGA TAAAGCTTGG AAGGACTATG 801 AAACAAAAAT AACCAAGATA GAAAAGGAGA AAAAGGAACA CGCCAAGCTC 851 CATGGGATGA TTCGGACTGA AATAAGCGGA GCGGAAATTG CCGAAGAGAT 901 GGAAAAGGAG AGGCGCTTCT TCCAGCTACA GATGTGCGAG TATCTGCTGA 951 AGGTCAACGA AATCAAGATT AAAAAGGGAG TAGATTTACT TCAGAATCTG 1001 ATCAAATACT TTCATGCCCA ATGCAATTTT TTTCAGGATG GACTCAAAGC 1051 CGTGGAAAGC CTCAAACCTT CCATTGAAAC GCTGTCTACG GATCTTCACA 1101 CGATCAAACA GGCCCAGGAT GAAGAAAGAA GGCAGTTGAT ACAGCTTCGA 1151 GATATTTTGA AATCCGCATT GCAGGTTGAA CAGAAAGAGG ACTCCCAAAT 1201 TCGTCAGAGC ACAGCTTATA GCTTACATCA GCCTCAGGGA AACAAGGAAC 1251 ATGGGACCGA GCGGAACGGC AGCCTCTACA AGAAGAGTGA CGGGATCCGA 1301 AAAGTGTGGC AGAAAAGGAA ATGTTCAGTT AAAAATGGTT TTCTGACCAT 1351 ATCCCATGGT ACCGCTAACC GGCCTCCTGC AAAGCTCAAC CTGCTAACCT 1401 GCCAGGTGAA GACCAACCCT GAGGAGAAGA AGTGCTTTGA CCTTATTTCA 1451 CATGACAGAA CTTACCACTT TCAAGCTGAA GATGAACAGG AATGTCAAAT 1501 ATGGATGTCT GTGCTGCAAA ATAGCAAAGA AGAAGCTTTA AACAATGCAT 1551 TTAAGGGGGA TGACAATACT GGAGAAAATA ACATCGTCCA AGAACTGACA 1601 AAGGAGATCA TCTCAGAAGT GCAGAGGATG ACGGGCAATG ACGTCTGCTG 1651 TGACTGTGGG GCGCCAGATC CTACATGGCT TTCCACCAAC CTGGGCATCC 1701 TGACCTGCAT CGAGTGTTCC GGAATCCACC GAGAGCTGGG GGTTCATTAT 1751 TCCAGGATGC AGTCCCTGAC CTTAGATGTA CTGGGAACAT CTGAGCTGCT 1801 GCTCGCCAAG AATATTGGGA ATGCAGGCTT TAATGAGATC ATGGAATGTT 1851 GCCTACCAGC TGAGGACTCA GTCAAACCCA ACCCAGGCAG CGACATGAAT 1901 GCAAGAAAGG ACTACATCAC AGCCAAGTAC ATCGAGAGGA GATACGCAAG 1951 GAAGAAGCAC GCGGATAACG CGGCGAAGCT TCACAGTCTT TGCGAGGCCG 2001 TCAAAACGAG AGATATTTTT GGATTGCTCC AAGCTTATGC TGATGGTGTG 2051 GATCTTACGG AAAAAATCCC ACTGGCCAAC GGACATGAGC CGGATGAAAC 2101 GGCCCTCCAC CTTGCAGTCA GATCCGTGGA TCGAACCTCT CTTCACATTG 2151 TAGACTTTTT AGTTCAGAAC AGTGGGAACC TGGATAAACA GACAGGGAAA 2201 GGCAGCACAG CCCTGCACTA CTGCTGCCTG ACCGACAATG CCGAGTGCCT 2251 CAAGTTGCTC CTGCGGGGGA AGGCCTCCAT CGAGATAGCA AACGAGTCAG 2301 GAGAGACTCC GCTGGACATT GCCAAGCGCC TCAAGCACGA GCACTGTGAG 2351 GAGCTGCTGA CCCAAGCCTT ATCTGGAAGA TTTAATTCTC ACGTTCACGT 2401 TGAATATGAA TGGCGACTAC TCCACGAAGA CCTGGATGAA AGTGATGACG 2451 ACATGGATGA GAAATTGCAG CCCAGTCCCA ACCGGCGGGA AGACCGGCCC 2501 ATCAGCTTCT ACCAGCTGGG CTCCAACCAG CTTCAGTCTA ACGCTGTATC 2551 TTTGGCCAGA GATGCTGCAA ACCTTGCCAA GGAGAAGCAG AGGGCTTTCA 2601 TGCCCAGCAT CTTGCAGAAT GAGACTTACG GAGCCCTCCT GAGTGGCAGC 2651 CCACCTCCCG CCCAGCCTGC AGCCCCCAGC ACCACCAGCG CCCCCCCGCT 2701 TCCTCCACGG AATGTTGGCA AAGTTCAGAC AGCCTCCTCT GCTAACACCC 2751 TGTGGAAGAC AAACTCTGTA AGTGTGGACG GTGGAAGCCG GCAGCGATCT 2801 TCGTCAGATC CGCCAGCTGT CCATCCACCG CTGCCCCCTC TTCGCGTGAC 2851 ATCTACCAAT CCCCTGACCC CCACGCCGCC CCCACCCGTT GCCAAGACGC 2901 CCAGCGTAAT GGAAGCCTTG AGCCAGCCGA GCAAGCCTGC CCCGCCTGGG 2951 ATCTCACAGA TCAGGCCCCC ACCTCTGCCC CCACAGCCGC CCAGCCGCCT 3001 CCCGCAGAAG AAGCCTGCGC CGGGGGCTGA CAAGTCCACC CCACTGACCA 3051 ACAAAGGCCA ACCGAGAGGA CCTGTGGATC TCTCTGCAAC GGAAGCTCTG 3101 GGTCCTCTGT CCAATGCTAT GGTCCTGCAG CCCCCTGCAC CCATGCCTAG 3151 GAAGTCGCAG GCAACCAAGT TGAAGCCTAA GCGGGTGAAA GCGCTCTATA 3201 ACTGTGTGGC TGACAACCCC GATGAGCTCA CCTTCTCCGA GGGGGATGTG 3251 ATCATCGTGG ACGGGGAGGA GGACCAGGAG TGGTGGATTG GCCACATTGA 3301 TGGAGATCCT GGTCGCAAAG GCGCATTCCC GGTGTCATTT GTGCACTTTA 3351 TCGCTGACTG AATTGCTACT GAACAAAAGC ATTAACAGTT ATGTTCCTGT 3401 TTCGTTATTG GTACCAAAAC TCTTGCCAGA TAACCAGTTT CATGAACTGT 3451 TTGTATGGCA GCCCATGTTC TCTAATGCCA CTGCTCTGTT TTAAAAACTC 3501 AGAGGCAATT TTTACATATC AGTAATTGTT TTTATAATTT GTGGTTTTCA 3551 TGAAACATTG CTATGCATTT ATTAGGAAAA ACTGAATTTC CCAACAGGTG 3601 AACTGAAAAG TTATTTTAAC TATTATACAT AATCAAGATC CTGCCTCTAC 3651 GGAATTAGCT AAACCTAAAA ATGTTTGCAT TAATGAATAA ATTCTTCCTG 3701 CATTCCTTGG CCCAGTTCTG GAGTTGGTGA CCTTTATCAC AATTATTATT 3751 TTAGGCGGCC AGTGAACTGC TGCTTCAGAA GTCCATAGCC CAGCTCTGAA 3801 CTTTCTCGAT AAAATGCCAT CAGTTCACCT TTAAAGACAC ACATTCCTTT 3851 GAAATCCACC CAGTGTTTAA AAAGCAACTT GGAAATTTAC ACATTAGCAT 3901 TGTACTTTCT AGCCCTAATT TGTGAGGTTG CAGCTATCAT TATATTCTGC 3951 ATGTATGTAT AACCTGTTGT GAACAATCAT ACTTAACAAA ACTACTGATG 4001 GTTTATGACA ACGTAGGGTA ACTACAGTTC ATTCTGTTCC AGGTTATATA 4051 AAACTGCATT TCCTGAATTT GGTTAAAAAC TAAGGATGAT GGATTGCAAA 4101 ACAGTTCTTT TAAATTAGTT TATATGCTTT AGGTGTTTTG GAATTTGCCT 4151 TCTTGAACTT CCTGAGTCAC ACAGAAAGCA ACTGTACACA GTAGAATTCT 4201 GTGGCGCAGA CCATGCTGTA TTAACACATC ACTTGCTGTT TCCTACTGAG 4251 TGTACCACTG CCTTCCCTTC TAGCCCAGGA GAATGTTTAC TCAGTTTAGT 4301 GTCTTGTATT TCTATAATAC TCCAACAGGA ATGGTAGTCA CACTGTCTTG 4351 AAATTGAATC TGTCCATCTG TTTATAATCA AGAACATATC AGAAATATAT 4401 AGGTCCCAGG TAATACTCCC AAACATCCCA CTTTTTACTG TTTCAGGCCA 4451 TCATATCATT CTTAAGCTAC TTGGGGTGGT AGTAGAGGAT TAGGTTGTCT 4501 ATTATAAAAC CAAAACTCAT TCGTTTAATG AACTTGACTG TCATACCTCT 4551 ATTTAGTAAT TGCGAGGGTA AGATTCATAG TAGGAATATT GGAAATTTTG 4601 GCACTCTGAG AATAAATAGG CATATGATAC CCACTTGGAC TTTTAACAAA 4651 AGTAAAGGAA TAAATTTGCA TATAGGCTTG GAAAGTGAGG CAGCAATGCT 4701 GTTAACTGCA TTTGTTGTGA TGGTGCATTT GATTGAAGCA GCTTGTCTTT 4751 ATTATGCAAG ACTGTGTAGA GTTTTTTTTT TTTTTGGCAT TGTACTTTTT 4801 GTTTTTGTTA TAAAGGAAGA CAGAACAAAC TGGAATGTTT TATGATGTTG 4851 TATAGCAATC GCTTTTTACC TTTCAAAGTT CCGGGTAAAA ATGTGTTATA 4901 TCTGTAGTTT TTTGTTTTTG TTTTTTTTTA AAGCACTACA TCTGTTTTCA 4951 CTAATTGTTA ATTTCTGTTT GAACCCTTCA TTTAATTTTC TCATAGATTT 5001 AAGTAAACAG ATGTATTTTG CACAGTGCAC TTATGTCTAT TTTAACAATC 5051 CTCCTGCATC TGTATTTTAT AGTCAGCCTT TTGACCACCT GGTGCCAGCT 5101 ATATAAGGAA TAAAGTTGAT TCATATCAAC ATTAGAACTC CAGTCCCAAA 5151 CTAATCTGTC AGGTTCACTG GTACATAAAT ACCTAGGAAA TATTTTTCCA 5201 GTCTACAATT TGGTGCTATT GTGCAGTAAC TAATAGTACT CTTACCAGAG 5251 GAGAAATTAT ATTAACGACC CTGCTAATAT CCTTTCTTAG TTATTTGCTC 5301 CTTGCAAATT AAAAAAGCAA CTAAGAGAAA GAAAAACATT GTAGATATCT 5351 ATTTATATTT AAAGTTTATG TTTCATGAAC TGCAGCTGCA GGATTCTGGC 5401 ATTTTGCATG CCATTCTCCA TCAGATCTGG GATGATGGCT CAGAACATGT 5451 ACACAGACTA AGAGTAACTG TGTGATCTGT TAAGGGGTGG ATAACATAAT 5501 ATGCAGCTTA GGATGCTATT TTGAGATGTA TGATATTCAG TTCATTCACC 5551 TGATTACTTT GGTTGCAGCA CAACTGTATA TATTGTATAA CCGAAATTGA 5601 TTATTTTCAT TGTCCTTAAT GCAGTGATTT ATAATTAGAG CATGTTTAAT 5651 AAGTTTACTC TTCTTGTTAA CTAGTCATTT GACTGGAAAA AAATAAAATA 5701 CTTTTAAATG G // LOCUS AB007866 7323 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens KIAA0406 mRNA, complete cds. ACCESSION AB007866 NID g2662092 VERSION AB007866.1 GI:2662092 KEYWORDS KIAA0406. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1335. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7323) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. 78 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 4 (5), 307-313 (1997) MEDLINE 98116655 FEATURES Location/Qualifiers source 1. .7323 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1335" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 196. .2595 /gene="KIAA0406" CDS 196. .2595 /gene="KIAA0406" /codon_start=1 /protein_id="BAA23702.1" /db_xref="PID:d1024583" /db_xref="PID:g2662093" /db_xref="GI:2662093" /translation="MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSA LQELQQYILFPLRFTLKTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSA CLYSPSSQKPAAVSEELKLAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLG LAEQEKSKQIKIAALKCLQVLLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALT RLITGDFKQGHSIVVSSLKIFYKTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYRE ADWVKKTGDKLTILIKKIIECVSVHPHWKVRLELVELVEDLLLKCSQSLVECAGPLLK ALVGLVNDESPEIQAQCNKVLRHFADQKVVVGNKALADILSESLHSLATSLPRLMNSQ DDQGKFSTLSLLLGYLKLLGPKINFVLNSVAHLQRLSKALIQVLELDVADIKIVEERR WNSDDLNASPKTSATQPWNRIQRRYFRFFTDERIFMLLRQVCQLLGYYGNLYLLVDHF MELYHQSVVYRKQAAMILNELVTGAAGLEVEDLHEKHIKTNPEELREIVTSILEEYTS QENWYLVTCLETEEMGEELMMEHPGLQAITSGEHTCQVTSFLAFSKPSPTICSMNSNI WQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKAGDQTLLISQVATSTMMDVCRAC GYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLEVMLRNSDANLLPLVADVVQD VLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGAPPRAKFRRRGKSFEPKTSS S" BASE COUNT 1931 a 1700 c 1576 g 2116 t ORIGIN 1 GGCTGGAAGA CGAGACCTGC TCACTCTGTC ACCGAGGCTA GAGTACAGTG 51 GCACAATCAC AGCTCATTGC AGCCTCAACC TCCCAGGCTC AATCGATCCT 101 TCCAACTGAG CCTCCCTAGC AGCTGGGACT ATTAGTGCAC ACCACCACAC 151 CTAGCCTGCA GGATGTTTCC TCAATGAGGG GAAGGCTGCT GCACAATGGC 201 AGTTTTTGAT ACTCCTGAGG AGGCCTTTGG TGTCTTACGT CCAGTCTGTG 251 TTCAGCTCAC AAAGACCCAG ACAGTGGAGA ATGTGGAGCA TCTGCAGACA 301 CGACTACAAG CTGTGAGTGA CAGTGCCCTT CAGGAACTTC AGCAGTACAT 351 CCTCTTCCCT CTGCGATTTA CCCTGAAGAC CCCAGGTCCC AAAAGAGAGC 401 GTTTGATCCA AAGTGTGGTG GAATGCCTCA CATTTGTCCT TTCTTCAACA 451 TGTGTGAAAG AACAGGAGCT TCTCCAGGAA CTCTTTTCAG AACTCTCTGC 501 TTGTCTGTAT TCACCCAGCT CCCAAAAACC TGCGGCTGTG TCCGAGGAGT 551 TGAAATTGGC TGTGATCCAG GGACTTAGCA CATTAATGCA CTCAGCTTAT 601 GGGGACATCA TTCTGACTTT TTATGAGCCC TCCATTCTGC CACGTTTAGG 651 ATTTGCTGTA TCTTTACTGT TAGGCCTTGC AGAACAGGAG AAATCAAAGC 701 AAATTAAAAT TGCTGCCTTA AAATGTTTAC AGGTTCTACT CTTGCAGTGT 751 GATTGTCAGG ACCATCCAAG GTCATTGGAT GAACTTGAAC AAAAGCAGCT 801 GGGGGATTTG TTTGCCTCTT TTTTACCTGG AATCTCAACT GCACTGACCA 851 GGCTTATCAC AGGAGACTTT AAACAAGGTC ACAGCATTGT CGTATCTTCC 901 CTAAAGATCT TTTACAAGAC AGTGAGCTTC ATTATGGCTG ATGAACAGCT 951 CAAAAGAATC TCAAAGGTCC AAGCAAAACC TGCAGTTGAG CACAGAGTAG 1001 CAGAGCTGAT GGTTTACAGG GAAGCAGATT GGGTAAAAAA GACTGGCGAC 1051 AAGTTGACTA TCCTTATTAA AAAGATAATT GAGTGTGTTT CTGTTCACCC 1101 ACACTGGAAG GTGAGACTGG AACTGGTAGA ACTTGTGGAG GACCTTCTTT 1151 TGAAGTGCAG TCAATCATTG GTCGAATGTG CTGGTCCCCT TCTGAAGGCC 1201 TTAGTGGGAC TAGTAAATGA TGAGAGTCCT GAAATCCAAG CCCAGTGCAA 1251 TAAAGTTCTG AGACATTTTG CAGATCAAAA AGTAGTGGTG GGCAACAAAG 1301 CCCTCGCTGA CATCTTGTCA GAAAGCCTGC ATTCCCTTGC CACATCTCTT 1351 CCTCGCCTAA TGAACTCCCA AGATGACCAG GGCAAATTCT CTACTCTTTC 1401 CTTGTTACTT GGTTATCTGA AACTCTTGGG CCCAAAAATA AACTTTGTCC 1451 TCAACTCTGT GGCCCATCTC CAGCGGCTTT CCAAAGCACT CATCCAAGTT 1501 CTAGAGCTAG ACGTGGCTGA CATCAAGATT GTTGAGGAAC GGCGTTGGAA 1551 CTCTGATGAT CTGAATGCTT CTCCAAAGAC CTCAGCCACA CAGCCTTGGA 1601 ACCGCATCCA GAGGAGATAT TTCCGCTTCT TCACTGATGA GAGAATCTTC 1651 ATGCTCTTGA GGCAGGTTTG TCAGCTACTT GGTTATTATG GGAATCTTTA 1701 TTTGCTTGTG GATCACTTTA TGGAACTTTA CCATCAATCT GTGGTTTACC 1751 GGAAGCAAGC TGCCATGATC CTTAATGAAC TGGTTACAGG GGCTGCTGGG 1801 CTGGAGGTTG AGGATCTTCA CGAAAAACAT ATTAAAACAA ACCCAGAAGA 1851 ACTGAGAGAG ATTGTGACAT CTATACTTGA AGAATACACA AGTCAAGAAA 1901 ATTGGTATTT GGTTACCTGT CTTGAAACTG AGGAAATGGG AGAGGAGCTG 1951 ATGATGGAGC ACCCAGGCCT CCAAGCCATC ACGTCTGGTG AACACACCTG 2001 CCAAGTTACA TCTTTTCTAG CCTTCTCAAA GCCAAGTCCC ACTATTTGCT 2051 CCATGAACAG TAACATCTGG CAAATATGCA TTCAGTTGGA AGGAATTGGC 2101 CAGTTTGCAT ATGCACTAGG AAAAGACTTC TGTTTGCTCT TGATGTCAGC 2151 CCTTTATCCA GTACTGGAGA AGGCTGGAGA CCAAACCCTA CTCATTAGTC 2201 AGGTGGCTAC CAGCACCATG ATGGACGTTT GCCGTGCTTG TGGCTACGAC 2251 TCCCTGCAGC ACCTGATCAA TCAAAATTCA GACTATTTAG TGAATGGGAT 2301 CTCTTTAAAT CTGCGTCATC TGGCTCTGCA TCCTCATACC CCAAAGGTCC 2351 TGGAAGTCAT GCTGCGGAAC TCAGATGCTA ACCTGCTTCC TTTGGTGGCA 2401 GATGTGGTTC AAGATGTCTT GGCCACCCTG GACCAATTTT ACGATAAGAG 2451 AGCTGCTTCC TTTGTCAGCG TTCTGCATGC TCTGATGGCA GCATTAGCCC 2501 AGTGGTTCCC AGACACAGGT AATCTTGGGG CACCTCCAAG AGCAAAGTTT 2551 AGGAGAAGAG GGAAGTCATT TGAACCAAAG ACCAGCAGCT CTTGAGAAGA 2601 GCACCACCAC AGCTGAAGAC ATCGAACAGT TTTTGCTGAA CTACCTCAAA 2651 GAGAAGGATG TGGCAGATGG AAATGTCTCG GATTTTGATA ATGAAGAAGG 2701 TAACTTGTTT ATTTTGGCTT AGCTGGTTTT TTCTTTCTGA AACAGATTAT 2751 TGTAAAAATG GTGAAACATC ATTACAGAAA GGGTAGAAAT AAGAAAAGTT 2801 ACTCATAACC TTACCAGGCT AACATAGCTA TTCTCCTGTT TTGTGTAATC 2851 TTTTTCCATT CTTTTCTGTT ACAAAACTGT TTTTCCGTGG TTGCAGCTGT 2901 GTACCCTTTT TGTGTTTAGT TATAAGCAAC TCTGCTTATT GCCGTTGAAT 2951 TTTGATCATT AACGTTTTAA TAGCTGTTAC TAAGTGTCCA TCCCCTGTAG 3001 GACATTTAGG TTGCTTCTGG CTTAATAATA ACTCTAATAA ACAGTGCTGC 3051 GGTGGATGAT TCTGTGCATG TGACCTTTTT TTTTTTTATT TTTTTTATTT 3101 TTTTATTTTT TATTTTTGCA TATTTCTTTA AATTCCCAGA AGTAGGATTT 3151 CTGGGTCAAG GATATGAACA TAATTTAATG CTTGCCAAAT TGCCTTTCAA 3201 AAAGGTTGTG TCAATTTATA CTTTTCCTTC GGCAGTGCAG GATGAATACT 3251 GGTTTCACCA CAGCCTTACC AACATTGGCT ATTTCCAGTT TTCTTCCTAA 3301 ATTAATAGGT GAAAAATGGG TCTTGTTATC TACCTTGCAT TTCTTTGATT 3351 ACCAGTGAGG TTGAATGTCT TTATAAGCTT CTTTCCTAAC AGGTTTTTTT 3401 TCCTTATTCC CATTGTCTAT TTATATGCTT TGTCCATTTG TTTGTTGGTG 3451 GGAGGAGATT GCAGTCTTTT TCTTACCAAT TTATATGATA AAGAAGAAGG 3501 GAGTTCAGGC TAGTTGAAGC TCTGGCCTGT TGTTTTATTC ACAAGCTCAA 3551 TCTGGAGCTT CAGGTCACGG AAGGATTAAT AAATTATTAG GATCTCCTCT 3601 GCAAATATAA AATGCCAAGT CATAATGAGC TTGGTGGTCT CAGAACCATC 3651 CTAAATCGAA CCGAGTCCCA AATTAGTTTG TGAGGTTGGA ATAAAACGTT 3701 TCTTTTTCTT TTTCTTTTCT TTTCCTTTTT TTGTTTTTTT CCTCCTTTTG 3751 GTGATCTCCA CTGTGAGATT CTGGTGAACT GAAGCCAGTA CTTCCAGCAG 3801 TGTAACAGGA AATAGTAGCT TGATGCCACT CACTACAACA AATTCCTTCT 3851 AAATAGCAGA AAAGGCATCA CAGGGCCCAA ATAATGATTT ATGCAGAATT 3901 GAGTCATTGC TCTCCCCGAG GACAGAGTTT TCTGATCAGA AATCTATCAG 3951 GCTTTTTCTT CTCAGATTTG TTTCTCGAGC CAATCCAGTC CTTTTTGTGA 4001 ACTGCACCCT TCACCCAAAC CTGGAAATGC TGAAGCAGGG GAGGCATTCT 4051 GATCCCTCAT AATCCAGATT TGCCATTCTT GTTTAAAATT TGAGCACTGT 4101 CAGTGAATCC ATTCCTTCAT GATTAGGATC TTCTGGTGTT AGTTGATGTT 4151 CACGTAGAGC AGACTGAAAG AGTCAAACCC TTCTTCCAAT AACAGGAAAA 4201 TCCACATCCC TCAAATAAGA TTCTGCAATA GGTGAATTTC AAACAACAAA 4251 TTCCCCTCTG GGGAAAAAAA GCACAGGCCT ATCATGCTTA TCATTCATCA 4301 AGTACACACC ACACACTTGG CATTCTAAGG AATGTTTGCC TCAGTGCTGG 4351 CTTGACATGA GTTTTTTGTT TTAAGCACAT AAAAGCACCT TGTATATCTG 4401 AGGTCCCTTT CTCCAAGAAA TCCAAGCATT CTACAAATGC ATTTTAATTT 4451 ATCTTTTCAG CATCTGTTAG AGACAGGTGC AGCCTCTACT GTAAGAGTCT 4501 GTCTTTCCAG CAGAGGGAAC TGAAATGGGA AAAAGTTAAG GGAGCAGTGC 4551 TCTGCTCAGT GGAAATGAGC ACAGCTGGTA ATCAGGGTTG GCATGAGGCT 4601 GGGGGCTGTA AGTAGATCAG GAAAATTTTT CAGGGTAAGA ATTTTGACCT 4651 GGGTTTCATG CAATTCTCAA ATCTTGTGGT GATGTTTCCG TTTACAAACT 4701 TACATCTATC TCATGCAACG CAGCACTCTA GACAGTGAGA ATGATAATGA 4751 CTGGTAATAT CTCATTTAAT GCTCTCAGTA GCATGTGAGG TATACATGCT 4801 CTATCCCTTT TTTTCCAGAT GAGGTAATGT AGGATGAAAG AAGTTGCTAA 4851 TAGGTTTTAG AGGCAAGACT TGGTCCAACC TCCTCTAACC AAACTCCATG 4901 CCCTATGAAC TACACCAAGC TGCCCTTGTT ACCTCACCTA CAAAGATAAG 4951 TAAGGTTTGA TTCTGGCCAT CAGAAGTTCT TACAGGCTGG TGGAGAAGAC 5001 AGATGTGCAC CCCCCTTTTA ATCAAAGGAT GAATCAAGAG GAGACAAAAA 5051 AGTGAAGGAC AGAGCAAAGG CCAGCCATGG AACTTCCCAG CTTCAAGGCT 5101 CCTCCACACG AGAACCCTGC TATCTCTCCC TTTTTTTTTT TTTTTTGAGA 5151 CAGAGTTTTG CTGTTGTTGC CCAGGCTGGA GTGCAGTGGC ACGATCTCGG 5201 CTCATTGCAA CCTCTGCCTC CTGGGTTCAA GCGATTCTCC CGCCTCACCC 5251 TCCCAAGTAG TTAAGATTAC AGGCGCCCGC CACTATGCCC GGCTAATTTT 5301 TGTAATTTAG TAGAGATGGG GTTTTGCCAT GTTGGTCAGG CTGGTCTCAA 5351 ACTGCTGACC TCAGGTGATC CACCTGCCTC AACCTCCCAA ATATCTTGTT 5401 TTTATCTTAA ATATTCATTA ATGGAAATTT AGAAAAGACC AAAGGCATAG 5451 GGAAAAAAAC TGAAGTCATC TGTTATCTCA TTACCCTGTC ACTTTTAATA 5501 TTTTGCTGTA CTTCCTCTCA TGCAAAAACG TATGTAGTAG TGTTCATGCT 5551 GCATATGCAA TTTTGTGTTG TGCTTTTTGT CTTTTTAAGA CAATATTATT 5601 TTATAAGCAT TTCTCTTGTC ATTAAAACCC ATGCTTAGCC TGGTGTAGTG 5651 GCTCATACCT GTAATCCCAG CACTATGAGA GGCCACGGCG GGGGGATTGC 5701 TTGAGCCCAG GAATTGGAGA CCAGCCTGGG CAACATAGTG AGACCCCCAT 5751 TTCTACAAAA AAATTTAAAA ATTAGTTGGG CATGGTGACA TGCACCTGTA 5801 GTCCTGGCCA CTCAGGAGGC TGAGGTGGGA AGATCACTCG ACCCCATAAG 5851 TTTGAGATTG CAGTGAGCCG TGTTCACACC ACTGTACTCC AGCTTGGACA 5901 GAGCAAGGCC CTGTGCCTAA AAAAAATAGG GACCTCATAA AATGCCATTA 5951 TATGGCTATA TCATGGTTTC TGTACCTATT CCCCCTTGAG TGGAGGTGTC 6001 TCAGTCCATT TGTGCTGCTG TAACCAAATA CCTAAGACTG AGTAATTTAT 6051 AAAGAACAGA AATTTAGGTA TCTCTGTTTT ATCACTCATA CAACACTGCC 6101 CCCAGTTGTG CTCTTTTTCC AGAGGAACAG TCAGTCCCTC CCAAAGTGGA 6151 TGAGAATGAC ACCCGTCCAG ATGTGGAGCC ACCACTGCCA TTGCAGATCC 6201 AAATAGCCAT GGACGTGATG GAACGCTGCA TCCACTTGTT GTCAGATAAA 6251 AATCTGCAAA TCCGCCTGAA GGTCTTGGAT GTGCTGGATC TGTGTGTGGT 6301 TGTTCTTCAG TCCCACAAAA ACCAGCTGCT TCCCTTGGCT CATCAGGCCT 6351 GGCCCTCGCT CGTTCACCGA CTCACACGGG ACGCCCCCCT GGCAGTGCTT 6401 AGAGCCTTCA AGGTTTTACG TACCCTGGGA AGCAAGTGTG GTGACTTTCT 6451 TCGCAGCCGG TTCTGCAAAG ATGTCCTGCC AAAGCTGGCT GGCTCCCTAG 6501 TCACCCAGGC CCCCATCAGT GCCAGGGCTG GACCAGTTTA CTCGCACACG 6551 CTGGCCTTCA AGTTGCAGCT GGCTGTCTTA CAGGGCCTGG GCCCCCTCTG 6601 TGAGAGACTG GACCTAGGTG AGGGTGACCT GAATAAAGTG GCTGATGCCT 6651 GCTTGATTTA CCTCAGTGTC AAACAGCCCG TGAAATTACA AGAGGCTGCC 6701 AGGAGCGTCT TCCTCCACTT GATGAAGGTG GACCCAGACT CCACCTGGTT 6751 CCTCCTGAAC GAGCTTTACT GCCCCGTGCA GTTCACACCT CCCCACCCCA 6801 GCCTCCACCC TGTGCAGCTG CACGGGGCCA GCGGGCAGCA GAACCCCTAC 6851 ACGACCAACG TGCTCCAGCT GCTCAAGGAG CTGCAGTGAC CCTGCTCCCC 6901 CACCACAGAG GCCACCGATC CCTCCCCTAC TGCCAGCCAG AAGCTGGGCT 6951 GACCCCACCC CGGCCATAGG CGGTGGCAGC GGCAGCAGAG AAGGTGAATT 7001 AGTTAGCCAA TCGATTTATA AATTGATCGA TCACACAACT GCTTAGAAAT 7051 GGATTGAAGG AAAGTAGCTG ACTATTATTT ATATTTCATA CCTTGTGTTT 7101 TCAAGTGACA TTGTCTGGTG GCTCTAAGGG TTTAACCCCT TAGCCTACCA 7151 TCTCTATAGC CCCAGCTCCC TCACAGGCCA CACACACACA CACACAAGAG 7201 GTCAGTTCCC CTCCATCTGC ATACACCTCC CTGTCTTCAA ATAATGAGAT 7251 GGAACTAATT TGTTTTACCT AACCTGATCT TTGGGAAACA AACGGAAATA 7301 AAGACACTTC TTGGATGAAA AGT // LOCUS AB007874 5725 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens KIAA0414 mRNA, partial cds. ACCESSION AB007874 NID g2887448 VERSION AB007874.1 GI:2887448 KEYWORDS KIAA0414. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0161. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5725) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. 78 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 4 (5), 307-313 (1997) MEDLINE 98116655 COMMENT On Feb 14, 1998 this sequence version replaced gi:2662108. Sequence updated (05-Jan-1998). FEATURES Location/Qualifiers source 1. .5725 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0161" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1133. .2536 /gene="KIAA0414" CDS 1133. .2536 /gene="KIAA0414" /codon_start=1 /protein_id="BAA24844.1" /db_xref="PID:d1025766" /db_xref="PID:g2887449" /db_xref="GI:2887449" /translation="MEPGTNSFRVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIF RAHKAVLAASSPYFCDQVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVS YLTAASFLQMWHVVDKCTEVLEGNPTVLCQKLNHGSDHQSPSSSSYNGLVESFELGSG GHTDFPKAQELRDGENEEESTKDELSSQLTEHEYLPSNSSTEHDRLSTEMASQDGEEG ASDSAEFHYTRPMYSKPSIMAHKRWIHVKPERLEQACEGMDVHATYDEHQVTESINTV QTEHTVQPSGVEEDFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDGNL IGHRQEAALAAGYSENIEMVTGIKEEASHLGFSATDKLYPCQCGKSFTHKSQRDRHMS MHLGLRPYGCGVCGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSC TKSYEAAKAEQNTTEAN" BASE COUNT 1648 a 1155 c 1361 g 1561 t ORIGIN 1 GAGAAGGTAA ACGAGCAAGA CTAATACTAA TGAGAGGATC ACCTGAGCCC 51 AGGAGTTTGA GACTACAGTA GCTATGATGG TGCCACTACA CTCCAGCCTG 101 GGTAAAAGAG TGAGACCTCA TCTCTTTAAA AAAAAATTCT GCCAGGAGCG 151 GTGGTTCATG CCTATAATCC CAACACTTTA GGAGGCTGAG GCTGGTGGAT 201 CACTTGAAGC CAGGAATTCA AGACGAACCT GGGCAAAAAG TGAGACCCCT 251 GTCTCTACAA AAAAAATTAA TTTTTTAAAA AAACTAGCTG GGCAGCTGGC 301 GTGGTGGCTC ACACCTGTAA TCCCAGCACT TTGGGAGGCT GAGGTGGGTG 351 GATCATGCGG TCAGGAGTTC GAGACCAGCC TGGCCAACAT AGTGAAACCC 401 CATCTCTACT AAAAATACAA AAAAAAAAAA TTAACCAGGT GTAGTGGCGG 451 GCGCCTATAA TCCCAGCTAC TTGGGAGGCC AAGGCAGGAG AATCGCTCGA 501 AACCAGAAGG TGGAGGTGGC AGTGAGCCGA GATCACACCA TTGCACTCCA 551 GCCTGGGCAA CAAGAGCAAA ACTCTATCTC AAAAAAAAAA AAAAAAAACT 601 AGCTGGACAT GGTGGCATGT ACATACAGTC CCAGCTACTC AGGAGGCTGA 651 AGCAGGATGA TAGCTTGATT ACCCAAGAGT TCAAGGCTGC AGTGAACTAT 701 GATTGTGCCA CTGCACTCCA GCCTGGGCAA CAGAGCAAGA CCCCATCTCG 751 TTTAAAAAAA AATTATATTA TTTGAATGGG TATCTTGTTT GTTTTAACTG 801 TAAATATTTA CAGTACTTGG TTTTACTTGG GAGATTTGTA TAGCTTACAA 851 ATCTTGTCCC TATGGATTCA GTTCTCTTAC CCACAGACAC AGAACCAAAA 901 TCAGGAGTTC CACAAGGAGT GGTGGTGAGT CCTGCCTTTG GCAAACACGC 951 ACCTGCAAGC TTTGCTCTAG GGATTGAATC CCTTACCCAG TGGGGCCCAT 1001 AGGCTAGAGG ACTTTGTGGA ACTTTGGAAC TTGTCTTAAC AATGGAGGAG 1051 GGGATAAAAT AAGTTTATCT TTGGGCTGGA ATTTGTACTA ATTTTTCTTT 1101 CTCTTGTAGC ACCGAACAAG ATTTGTGATG AAATGGAGCC TGGAACAAAC 1151 TCTTTTCGGG TAGAATTTCC TGATTTTTCC AGCACCATTC TACAGAAACT 1201 GAACCAGCAG CGCCAGCAAG GACAATTATG TGACGTCTCC ATTGTTGTCC 1251 AAGGCCACAT TTTCCGGGCA CACAAAGCCG TTCTTGCTGC CAGTTCACCC 1301 TACTTTTGTG ACCAGGTACT CCTGAAAAAC AGCAGGAGAA TTGTTTTGCC 1351 TGATGTGATG AACCCAAGAG TGTTTGAGAA CATTCTCCTA TCTAGTTATA 1401 CAGGACGTCT AGTAATGCCC GCTCCAGAAA TTGTTAGTTA CTTGACAGCG 1451 GCAAGCTTCC TCCAGATGTG GCATGTGGTA GACAAATGCA CTGAAGTTTT 1501 AGAGGGAAAC CCTACAGTCC TTTGTCAGAA GCTAAATCAT GGCAGTGACC 1551 ACCAGTCACC AAGCAGCAGT AGTTATAATG GCCTGGTAGA GAGCTTTGAG 1601 CTGGGCTCTG GGGGTCATAC TGATTTTCCC AAAGCCCAAG AACTGAGAGA 1651 TGGTGAAAAT GAAGAGGAGA GCACCAAAGA CGAGCTGTCA TCCCAGCTCA 1701 CCGAGCACGA ATACCTGCCC AGCAACTCGT CCACAGAGCA TGACCGCCTG 1751 AGCACGGAAA TGGCAAGCCA GGATGGGGAG GAGGGCGCCA GCGACAGCGC 1801 CGAGTTCCAC TACACCCGGC CCATGTACAG CAAGCCCAGC ATCATGGCTC 1851 ACAAACGCTG GATCCACGTG AAGCCCGAGC GCTTAGAACA GGCTTGCGAG 1901 GGCATGGATG TGCACGCGAC CTACGACGAG CACCAGGTCA CAGAGTCCAT 1951 CAACACCGTG CAGACAGAGC ACACGGTGCA GCCTTCGGGA GTGGAGGAGG 2001 ACTTCCACAT CGGGGAGAAG AAAGTGGAAG CTGAGTTTGA TGAACAGGCT 2051 GATGAAAGCA ATTATGATGA GCAGGTGGAT TTCTATGGCT CTTCCATGGA 2101 AGAGTTTTCC GGAGAGAGGT CAGATGGGAA TCTAATTGGG CACAGACAGG 2151 AGGCTGCCCT CGCAGCAGGT TACAGTGAGA ATATTGAAAT GGTAACAGGG 2201 ATTAAAGAAG AAGCTTCCCA CTTAGGATTC TCAGCCACTG ACAAGCTGTA 2251 TCCTTGTCAG TGTGGGAAAA GTTTCACTCA CAAGAGTCAG AGAGATCGGC 2301 ACATGAGCAT GCACCTCGGT CTTCGGCCTT ACGGCTGTGG GGTCTGCGGT 2351 AAGAAATTCA AAATGAAGCA CCATCTCGTG GGCCACATGA AAATTCACAC 2401 AGGCATAAAG CCGTATGAGT GTAATATCTG TGCAAAGAGG TTTATGTGGA 2451 GGGACAGTTT CCACCGGCAT GTGACTTCTT GTACTAAGTC CTACGAAGCT 2501 GCAAAGGCTG AGCAGAATAC AACTGAGGCT AACTAAAAAT AGGATCTGGC 2551 CCTTGAGTGG CATGCACAAA AATAAACTAT GGTAATTAAT GCAAATCTGG 2601 GCACAGATGA TGCGTGCTAC TTGCTATTAT GAGAGAAGCT TAAAAAAAAA 2651 AAGGAAGATA TTTCTGAAAG ACCAGCTCTA AGTAGGCCAA TTAAAAAAAT 2701 CTAATTCCTC AAATTTGTGT GTTCCAGTCC TGGCCTGGAA TGGGTAATGG 2751 GGTGAGTTAA CCCACCGCCC AGCTGGCAAG GGAAACCTTC TGACTGGTTG 2801 TGATCGAAAC AGGTGGACAG AGACACCTGC ACTTGGAACT GGACTCCACC 2851 CACCAGTTCC ATTTTGGGTG GCAGCAGCTT TGGATCACTC ATTATTACAA 2901 GGTCATGCTG AAATTTTATT TTGCTCTTGC TATAGATACT TAGGTAATGT 2951 GGATTTGTTT TGGTAGCTAT TTCACTGAAG GAAGTGCTAC TTATATAAAA 3001 GCTAGAAATA ATGTGATTCC TAGGATGAGA AATTGGTTAA CAGAGCTCTG 3051 TTGTCTGGTT TTAGTCTTTC TAAAGGATAT TTTAACTAAA ACTATGGAGA 3101 TGCTAAGAGA GTGACTTTCT AAATATGAAA CAGATAATTT ACGGTACAAG 3151 GCTGACATAG TGCCCTTGTC AGTTTCTGTA AGATGCCACT ACTGTCACAA 3201 GGTGTTTCAG ACTCTTGATA AGGCAGTGTT TTGTATTTTA GTTCTAACAT 3251 TGAGTTTGGA CAATTTTATC TAATTGTAAT TCTCTAGGGT GCCAGAGATA 3301 GGTATTTCTC ATTGGTTTGC TTTCCCAAAT CCTGTTTGGT TAATTGTAGC 3351 CTCCATACAG TGGGGTCTTC TCTGTGGCTG GTAGACACCA AGCTGCTGTG 3401 ACTGACCACG GTACCACGGG CTGCCACAGC CCCTGCTCTG TCTTAGATTA 3451 TGGTGCTTTA CAAAGAGAGT GGTCCATGAC CACACTTAGT GAGAAGGAGC 3501 CACAAGTTGT GGCTGAGAGT TCCCTGTGAA CTTGAGTAGT TCATAGAGTG 3551 CTAAGGTGAC ACTCCACCAA CCAGAGTGAG AGGGCAGATA GGCAGGATTC 3601 TATGAGTGGT TATACTTAAG GGGGACAAAA CTGCCCAAGA AGAATCTTGA 3651 GAATACACTC TTTCAAGGTG GGGGAGATAC TCTTTAGAGG GTACACTGAG 3701 CTAATACTAC CAGTTCTTTA TGAGCACTGG AATGTGTTTG TAAAAGGAGT 3751 CCTAAGTTTA GCAAGGTAGT CTACAGAACC ATGCTCCCAC ATTATAGATA 3801 AAGCTGCTTA AACTTAAAAG TCCACAAAGC TACGCCGACC AGAAAAAAAA 3851 AATTAAAAAA ACAAACAAGA AAAGCAACTA TTCTGAGACC TTTTCTGCCC 3901 ATCAGTTAGA TGATTTAGGT TAAAAAGAAA GGTAATATTG CACATGCTTT 3951 TAAGCTGTGT AACATACCTG AGGTTATCAC CAGGGTAGGA CAGGGTGCTA 4001 CTACCATGTC ATCTTTTCCA CAATCGTACT GGGTTATTTA CTTCTAAATA 4051 GAAACTTTTT TTCTTAAAAT AAAAATAATT TTTCTTGGAT TTGGGGTGAA 4101 ATTTTATTTG AAAAGTTTGG CTTTGCTGTA ATGTAATAGA CATTGCTGGC 4151 AATGGCCTCT GATTCTCAAG CTCCTAACAC CAGGGTGTTT ACTTGTTGAA 4201 CATTGTCTGG AAAGAGGAAA GAAAATACTT ATTTACCAGT TAACTCTTGT 4251 AAGCAAGATT ACAAACAGGG ATTTATTCAC AACACTGTAT CATTCTCGAT 4301 ATATAAAAAG CACTTTGTAT TTAAAACTTT ATTATAAATA TATATATATA 4351 TTGTTTTTTT TTAAACCTAG AAACTAGATA TTACCTCTTG GTTGTTTGCC 4401 ACATTAATAG CTTCTCTTAG TATTGAAACG TTACTGGTTA GCAGTCTTTT 4451 CTCTGTGTAC CTGACACACG TATACTGAGG GGATTGTACA ATCAAGCCTA 4501 TTGTCTCCTT TTCTTTCACT CATGGTAGAG GCCAGTGGGT TTTAGGTATG 4551 ATCCTAGCCA TTATATTTGA GGAGAAATTG TTCTATTACT CCTACTAATT 4601 TCAGTACTAA GGTGGTGATG CCATTTTGTT CTGCCAAAAA CTGATCACCC 4651 TCTCCCATGG TATTAGCAGA GCATTTTCTG CCTGTTTGGA AGGTTTGATG 4701 TCCTGTTTCT CATTGAAGAC TATTTACATG ATCATTAGGA CATTGCAGGA 4751 GAAGTCTGAG AGGTAAAAAT ACAGATATTC TGGGAGAGTC GTGGTCCTTC 4801 AGTTCTGCTG AAATCAGCAT AGTGCCCTTG TCATGAGGAA GAGTTTCTGT 4851 TCAGCCAAGA GTGGTGGCAC GCTTGGGTGG TAGTTTTGGA AGCAGTCAGT 4901 TGTGCTAGGA CTTATTTAAT ATGTTGTGAA GAGAAGAGTG TCTTCTTGAA 4951 AGCCTTATGT GTCCATCAGC TACTAAATGT AGAACTTAAA TAAGTTGCTC 5001 ACATCTGTTC TTTTAGTGTT TTGTGGTATT TGAGGTTTTG GCAAAAATTG 5051 GGATTTTTTT ATCAGGCAGC CAGAGCCTGG GAGGTGGTAG GGTGTCTGAA 5101 ATGCTGGCCA TGTTCAGAGA GGCAGGAGAA GGGGGTTGCT TTCATTTGAA 5151 TATTAAAAGT GAATTTTTGT AAACTCTGGT TTTTACCTTT TTTTCATCCC 5201 CACGTTGAGT TGGAGGAATA GTCTCTTCTC TTCACCTCAA TATAGCTTTA 5251 GAAAATCTTT ATCCTTCCTA ATAAGTTTGG ATGGTTGTGG GTAACATTGT 5301 TCAAACAATC TTTCAGGGAT CACGTCAATG GCCTACAACC AAGCTATTTG 5351 TCCCCTACTT TGAGTCTTAA CTGTGGTTTT TCTTCAATCC CCATGGGAAA 5401 GGGCTTCAAG GCACCACCAG TGGTATTTAA TATTCATACT TGGGGCCAGG 5451 CATGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC CGAGATGGGT 5501 GGATCACCTG AGGTCAGGAG TTTGTGACCA GCCTGACCAA CATAGTGAAA 5551 CCCCATCTCT ACTAAAAATA CAAAAATTAG CCAGGCGTGG TGGCGCACAC 5601 CTGTAATCCC AGCTACTCGG GAGGCGGAGG CAGGAGAATC ACTTGAACCT 5651 GGGAGGCGGA GGTTGCAGTG AGCCGCGATT GCACCACCAC ACTCCAGCCT 5701 GCGCGACAGA TCGAGACTCT GTCTC // LOCUS AB007945 5525 bp mRNA PRI 13-AUG-1998 DEFINITION Homo sapiens mRNA for KIAA0476 protein, complete cds. ACCESSION AB007945 NID g3413913 VERSION AB007945.1 GI:3413913 KEYWORDS KIAA0476 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0487. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5525) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Seki,N., Ohira,M., Nagase,T., Ishikawa,K., Miyajima,N., Nakajima,D., Nomura,N. and Ohara,O. TITLE Characterization of cDNA clones in size-fractionated cDNA libraries from human brain JOURNAL DNA Res. 4 (5), 345-349 (1997) MEDLINE 98116662 FEATURES Location/Qualifiers source 1. .5525 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="HH0487" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 569. .4729 /gene="KIAA0476" CDS 569. .4729 /gene="KIAA0476" /codon_start=1 /product="KIAA0476 protein" /protein_id="BAA32321.1" /db_xref="PID:d1033283" /db_xref="PID:g3413914" /db_xref="GI:3413914" /translation="MWQSSLGHWARKCPRATHASRLLLGATPWNSVLDFWVELNPSSA TAGAVTSPPSLSWGCCMRGRNVPSLASKCLTRHPTATQQTWPLQAPGTPAPTSLTGGQ QRGQGCMPWASLTSAWCCPVRARALLILTAGCPATSTLACGAQQCTCAIRWAWRRPTR WCTRQFYEAFPRARLSERQARALGLLSAVERGRALGGRAVRSRRAIAVLSRWPAFPAF RAFLTFLYRYSVSGPHRLPLEAHISHFIHNVPFPSPQRPRILVQMSPYDNLLLCQPVS SPLPLSGASFLQLLQSLGPELAITLLLAVLTEHKLLVHSLRPDLLTSVCEALVSMIFP LHWQCPYIPLCPLVLADVLSAPVPFIVGIHSSYFDLHDPPADVICVDLDTNTLFQTEE KKLLSPRTLPRRPYKVLLATLTNLYQQLDQTYTGPEEEASLEFLLTDYEAVCGRRARL EREVQGAFLRFMACLLKGYRVFLRPLTQAPSEGARDVDNLFFLQGFLKSRERSSHKLY SQLLHTQMFSQFIEECSFGSARHAALEFFDSCVEKVHPEQEKPEPTPLVELEELSGSE LTVFITPPEEPALPEGSESTPQYCYDGFPELRAELFESLQEQPGALPVPGPSRSAPSS PAPRRTKQEMKVAQRMAQKSAAVPELWARCLLGHCYGLWFLCLPAYVRSAPSRVQALH TAYHVLRQMESGKVVLPDEVCYRVLMQLCSHYGQPVLSVRVMLEMRQAGIVPNTITYG YYNKAVLESKWPSGTPGGRLRWAKLRNVVLGAAQFRQPLRERQQQQQQQQQQQQQQQQ QQEQVSAHQEAGSSQAEPYLERPSPTRPLQRQTTWAGRSLRDPASPPGRLVKSGSLGS ARGAQPTVEAGVAHMIEALGVLEPRGSPVPWHDGSLSDLSLTGEEPLPGGSPGGSGSA LSAQSTEALEGLSGRGPKAGGRQDEAGTPRRGLGARLQQLLTPSRHSPASRIPPPELP PDLPPPARRSPMDSLLHPRERPGSTASESSASLGSEWDLSESSLSNLSLRRSSERLSD TPGSFQSPSLEILLSSCSLCRACDSLVYDEEIMAGWAPDDSNLNTTCPFCACPFVPLL SVQTLDSRPSVPSPKSAGASGSKDAPVPGGPGPVLSDRRLCLALDEPQLCNGHMGGAS RRVESGAWAYLSPLVLRKELESLVENEGSEVLALPELPSAHPIIFWNLLWYFQRLRLP SILPGLVLASCDGPSHSQAPSPWLTPDPASVQVRLLWDVLTPDPNSCPPLYVLWRVHS QIPQRVVWPGPVPASLSLALLESVLRHVGLNEVHKAVGLLLETLGPPPTGLHLQRGIY REILFLTMAALGKDHVDIVAFDKKYKSAFNKLASSMGKEELRHRRAQMPTPKAIDCRK CFGAPPEC" BASE COUNT 1004 a 1768 c 1623 g 1130 t ORIGIN 1 CGCCGGCCGA GGAGGAGGGG TGGGGGTAGC GGCGGCGCCC GCGGCCCGGA 51 GCGGGGGTTG GGGGAGTAGA GAAAGCGGGG CGCGCGGAGG AACGCTGGGT 101 CCCCGGCGCC GCGGGAGCTG GGAGGACCGA GCCGGCCGAG CGAGCAGCGC 151 GGCAGCACAG TCCCCGCGTG GCGCAGCGCG GCGGGGACGC GGGGACCGCC 201 CGGATCTCCT TCCACTGCGC CCCGCGCTCT GCGGTCCTCG GCCGCCTCTT 251 CTTCCTTCAC TCACTGCCCG GGCGGGAGCG GCGCCCAAGT CGGGTGCGCC 301 ATGTCTGGGG CCGGGTAGCC CCGCCGCCCG CCGGCCCGCC AGCTCGCCCT 351 CCGAGCCACC CGCCAGCGGG CCGGCCGGCC GGGAAGCGCG GGACAGGCAG 401 ATGCAGTGAG TGAGGGGGGG GCCATGGCGG AGGAGCGGCC CCCCCGGCTG 451 GTGGATTACT TCGTGGTAGC TGGGCTTGCA GGGAACGGAG CACCCATCCC 501 TGAGGAAACG TGGGTTCCTG AACCCAGTGG GCCCCTGCGC CCTCCCCGGC 551 CAGCTGAGCC CATCACAGAT GTGGCAGTCA TCGCTAGGGC ACTGGGCGAG 601 GAAGTGCCCC AGGGCTACAC ATGCATCCAG GCTTCTGCTG GGGGCCACCC 651 CTTGGAACTC AGTGCTGGAC TTCTGGGTGG AACTCAACCC GTCATCTGCT 701 ACCGCAGGGG CCGTGACAAG CCCCCCCTCG TTGAGCTGGG GGTGTTGTAT 751 GAGGGGAAGG AACGTCCCAA GCCTGGCTTC CAAGTGCTTG ACACGACACC 801 CTACAGCCAC TCAGCAAACC TGGCCCCTCC AGGCCCCGGG CACCCCCGCA 851 CCTACCTCAC TTACCGGCGG GCAGCAGAGG GGGCAGGGCT GCATGCCCTG 901 GGCATCACTG ACCTCTGCCT GGTGCTGCCC AGTAAGGGCG AGGGCACTCC 951 TCATACTTAC TGCCGGCTGC CCCGCAACCT CAACCCTGGC ATGTGGGGCC 1001 CAGCAGTGTA CCTGTGCTAT AAGGTGGGCC TGGCGAAGGC CAACACGCTG 1051 GTGTACGAGG CAGTTCTACG AGGCGTTCCC AAGGGCCAGG CTATCAGAGC 1101 GACAGGCACG GGCACTGGGC CTGCTGAGCG CCGTGGAGCG GGGTCGGGCA 1151 CTGGGGGGCA GAGCTGTGCG CAGCCGGCGT GCCATCGCTG TGCTGTCCCG 1201 CTGGCCTGCC TTCCCTGCCT TCCGCGCCTT CCTCACCTTC CTTTACCGCT 1251 ACTCCGTCTC AGGCCCCCAC CGCCTACCCT TGGAAGCGCA CATCTCCCAC 1301 TTCATTCACA ACGTTCCCTT CCCTTCCCCA CAGAGACCCC GCATCCTAGT 1351 GCAGATGTCT CCCTATGACA ACTTGCTCCT CTGTCAGCCT GTATCCTCAC 1401 CCCTGCCCCT CAGTGGTGCC AGCTTCCTGC AGCTGCTGCA GAGCCTGGGC 1451 CCTGAGCTGG CTATCACACT GCTGCTGGCT GTGCTCACAG AGCACAAGCT 1501 GCTAGTCCAC TCGCTGCGGC CAGACCTGCT CACCAGCGTC TGTGAGGCCC 1551 TCGTCTCGAT GATCTTCCCA CTGCACTGGC AGTGCCCCTA CATTCCTCTG 1601 TGCCCGCTGG TGCTGGCAGA TGTGCTGAGT GCCCCAGTGC CCTTCATTGT 1651 GGGTATCCAC TCCAGCTACT TTGATCTGCA TGACCCGCCT GCTGATGTCA 1701 TCTGTGTAGA CCTTGATACC AACACGCTCT TCCAGACTGA GGAAAAGAAG 1751 CTCCTCTCCC CTCGGACCCT GCCCCGCAGA CCCTACAAGG TTCTGCTGGC 1801 CACACTGACA AACCTGTACC AGCAGCTGGA CCAGACATAC ACTGGACCTG 1851 AGGAGGAAGC ATCCCTGGAG TTCCTACTGA CAGACTACGA GGCAGTGTGT 1901 GGCCGCAGGG CCCGGCTGGA GCGCGAAGTC CAAGGAGCCT TCCTCCGCTT 1951 CATGGCCTGT CTGCTCAAGG GCTACCGGGT CTTCCTGCGC CCACTCACCC 2001 AGGCCCCCTC CGAGGGAGCT CGTGATGTTG ACAACCTTTT CTTCCTGCAG 2051 GGCTTCCTCA AATCCCGGGA ACGCTCCAGC CACAAACTTT ACTCTCAGCT 2101 GCTGCACACA CAGATGTTCT CACAGTTCAT TGAGGAGTGC TCTTTTGGCT 2151 CTGCTCGCCA TGCTGCCCTT GAATTCTTTG ACTCTTGTGT TGAAAAGGTC 2201 CACCCAGAGC AGGAGAAGCC TGAGCCGACA CCCTTAGTGG AGCTAGAGGA 2251 GCTGTCAGGA AGTGAGCTCA CTGTCTTTAT CACACCTCCC GAGGAGCCTG 2301 CCTTACCAGA GGGCAGTGAA TCCACTCCCC AGTACTGCTA TGATGGATTC 2351 CCAGAGCTAC GGGCTGAGTT GTTTGAGTCT CTTCAAGAGC AACCTGGGGC 2401 CCTGCCTGTG CCAGGCCCTT CCCGTAGCGC CCCCAGCAGT CCTGCTCCTC 2451 GCCGTACCAA ACAGGAGATG AAAGTTGCAC AGCGGATGGC ACAGAAGTCA 2501 GCAGCTGTGC CTGAGCTGTG GGCCCGGTGC CTGCTGGGGC ACTGCTATGG 2551 GCTGTGGTTC CTGTGTCTGC CTGCCTATGT GCGGTCGGCA CCCTCCCGAG 2601 TGCAGGCACT GCACACAGCC TACCATGTGC TGCGCCAGAT GGAGAGCGGC 2651 AAGGTGGTGC TCCCTGATGA GGTGTGTTAC CGGGTACTGA TGCAGCTCTG 2701 CTCACACTAT GGGCAGCCTG TGCTGTCTGT GCGGGTCATG CTGGAGATGC 2751 GTCAGGCAGG CATTGTGCCC AACACCATCA CCTATGGCTA CTACAATAAG 2801 GCTGTGTTGG AAAGCAAGTG GCCGTCTGGC ACACCAGGTG GGCGTCTGCG 2851 CTGGGCCAAG CTCCGGAATG TTGTCCTGGG GGCTGCTCAG TTCCGCCAGC 2901 CCTTGAGAGA ACGGCAACAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 2951 CAGCAGCAGC AGCAGCAGGA GCAGGTGTCA GCACATCAAG AGGCAGGCAG 3001 CTCCCAGGCA GAGCCCTATT TGGAGCGCCC TTCCCCTACT CGCCCTCTTC 3051 AGCGCCAGAC TACTTGGGCT GGGCGAAGTC TGAGAGACCC AGCCTCACCC 3101 CCTGGACGCC TGGTGAAGAG TGGTAGCCTG GGCAGTGCCC GAGGGGCACA 3151 GCCCACTGTG GAGGCCGGTG TGGCCCACAT GATAGAGGCC TTGGGGGTCC 3201 TGGAACCCCG GGGATCACCT GTGCCCTGGC ACGATGGAAG TCTCTCAGAC 3251 CTGAGCCTGA CAGGGGAGGA GCCGCTCCCT GGAGGCAGCC CAGGGGGCTC 3301 AGGCTCAGCC CTGAGTGCCC AGTCCACTGA GGCCCTGGAA GGGCTAAGTG 3351 GGCGGGGACC CAAGGCTGGT GGGCGACAGG ATGAGGCAGG CACCCCCCGA 3401 CGAGGGCTGG GTGCCCGCCT CCAACAGCTG CTCACTCCTT CCCGCCACTC 3451 CCCTGCCTCC CGCATTCCCC CACCTGAGCT GCCTCCTGAC CTGCCACCCC 3501 CAGCCCGCCG CAGCCCCATG GACAGTCTTC TGCACCCCCG GGAGCGCCCT 3551 GGATCCACTG CCTCCGAGAG CTCAGCCTCT CTGGGCAGTG AGTGGGACCT 3601 CTCAGAATCT TCTCTCAGCA ACCTGAGTCT TCGCCGTTCC TCAGAGCGCC 3651 TCAGTGACAC CCCTGGATCC TTCCAGTCAC CTTCCCTGGA AATTCTGCTG 3701 TCCAGCTGCT CCCTGTGCCG TGCCTGTGAT TCGCTGGTGT ATGATGAGGA 3751 AATCATGGCT GGCTGGGCAC CTGATGACTC TAACCTCAAC ACAACCTGCC 3801 CCTTCTGCGC CTGCCCCTTT GTGCCCCTGC TCAGTGTCCA GACCCTTGAT 3851 TCCCGGCCCA GTGTCCCCAG CCCCAAATCT GCTGGTGCCA GTGGCAGCAA 3901 AGATGCTCCT GTCCCTGGTG GTCCTGGCCC TGTGCTCAGT GACCGAAGGC 3951 TCTGCCTTGC TCTGGATGAG CCCCAGCTCT GCAACGGGCA CATGGGGGGA 4001 GCCTCCCGGC GGGTTGAGAG TGGGGCATGG GCATACCTGA GCCCCCTGGT 4051 GCTGCGTAAG GAGCTGGAGT CGCTGGTAGA GAACGAGGGC AGTGAGGTGC 4101 TGGCGTTGCC TGAACTGCCC TCTGCCCACC CCATCATCTT CTGGAACCTT 4151 TTGTGGTATT TCCAACGGCT ACGCCTGCCC AGTATTCTAC CAGGCCTGGT 4201 GCTGGCCTCC TGTGATGGGC CTTCGCACTC CCAGGCCCCA TCTCCTTGGC 4251 TAACCCCTGA TCCAGCCTCT GTTCAGGTAC GGCTGCTGTG GGATGTACTG 4301 ACCCCTGACC CCAATAGCTG CCCACCTCTC TATGTGCTCT GGAGGGTCCA 4351 CAGCCAGATC CCCCAGCGGG TGGTATGGCC AGGCCCTGTA CCTGCATCCC 4401 TTAGTTTGGC ACTGTTGGAG TCAGTGCTGC GCCATGTTGG ACTCAATGAA 4451 GTGCACAAGG CTGTGGGGCT CCTGCTGGAA ACTCTAGGGC CCCCACCCAC 4501 TGGCCTGCAC CTGCAGAGGG GAATCTACCG TGAGATATTA TTCCTGACAA 4551 TGGCTGCTCT GGGCAAGGAC CACGTGGACA TAGTGGCCTT CGATAAGAAG 4601 TACAAGTCTG CCTTTAACAA GCTGGCCAGC AGCATGGGCA AGGAGGAGCT 4651 GAGGCACCGG CGGGCGCAGA TGCCCACTCC CAAGGCCATT GACTGCCGAA 4701 AATGTTTTGG AGCACCTCCA GAATGCTAGA GACCTTAAGC TTCCCTCTCC 4751 AGCCTAGGGT GGGGAAGTGA GGAAGAAGGG ATTCTAGAGT TAAACTGCTT 4801 CCCTGTTGCC TTCATGGAGT TGGGAACAGG CTGGGAAGGA TGCCCAGTCA 4851 AAGGCTCCAA GCGAGGACAA CAGGAAGAGG GATCCACTGT TACCAAAAGT 4901 CCTGATTCCC CCATCACCAA CCTACCCAGT TTGTTCGTGC TGATGTTGGG 4951 GGAGATCTGG GGGGAGTTGG TACAGCTCTG TTCTTCCCTT GTCCTATACC 5001 GGGAACTCCC CTCCAGGGTA CCCACAGATC TGCATTGCCC TGGTCATTTT 5051 AGAAGTTTTT GTTTTAAAAA ACAACTGGAA AGATGCAGAG CTACTGAGCC 5101 TTTGCCCTGA ATGGGAGGTA GGGATGTCAT TCTCCACCAA TAATGGTCCC 5151 TCTTCCCTGA CGTTGCTGAA GGAGCCCAAG GCTCTCCATG CCTTTCTACC 5201 TAAGTGTTTG TATTTTATTT TAAATTATTT ATTCTGGAGC CACAGCCCCC 5251 TTGCTTATGA GGTTCTTATG GAGAGTGAGA AAGGGAAGGG AAATAGGGCA 5301 CCATGGTCCG GTGGTTTGTA GTTCCTTCAA AGTCAGGCAC TGGGAGCTAG 5351 AGGAGTCTCA AGCTCCCCTT AGGAAGAACT GGTGCCCCCT CCAGTCCTAA 5401 TTTTTCTTGC CTGCCCCGCC TTGGGGAATG CCTCACCCAC CCAGGTCCTG 5451 ACCTGTGCAA TAAGGATTGT TCCCTGCGAA GTTTTGTTGG ATGTAAATAT 5501 AGTAAAAGCT GCTTCTGTCT TTTTC // LOCUS AB008226 7349 bp mRNA PRI 13-FEB-1999 DEFINITION Homo sapiens FCMD mRNA for fukutin, complete cds. ACCESSION AB008226 NID g3370992 VERSION AB008226.1 GI:3370992 KEYWORDS FCMD; fukutin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7349) AUTHORS Kobayashi,K. and Toda,T. TITLE Direct Submission JOURNAL Submitted (20-OCT-1997) to the DDBJ/EMBL/GenBank databases. Kazuhiro Kobayashi, University of Tokyo, Institute of Medical Science, Human Genome Center, Laboratory of Genome Medicine; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan (E-mail:kazukob@ims.u-tokyo.ac.jp, Tel:+81-3-5449-5238, Fax:+81-3-5449-5406) REFERENCE 2 (sites) AUTHORS Kobayashi,K., Nakahori,Y., Miyake,M., Matsumura,K., Kondo-Iida,E., Nomura,Y., Segawa,M., Yoshioka,M., Saito,K., Osawa,M., Hamano,K., Sakakihara,Y., Nonaka,I., Nakagome,Y., Kanazawa,I., Nakamura,Y., Tokunaga,K. and Toda,T. TITLE An ancient retrotransposal insertion causes Fukuyama-type congenital muscular dystrophy JOURNAL Nature 394 (6691), 388-392 (1998) MEDLINE 98352786 FEATURES Location/Qualifiers source 1. .7349 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q31" gene 112. .1497 /gene="FCMD" CDS 112. .1497 /gene="FCMD" /standard_name="Fukuyama-type congenital muscular dystrophy" /codon_start=1 /product="fukutin" /protein_id="BAA32000.1" /db_xref="PID:d1032962" /db_xref="PID:g3370993" /db_xref="GI:3370993" /translation="MSRINKNVVLALLTLTSSAFLLFQLYYYKHYLSTKNGAGLSKSK GSRIGFDSTQWRAVKKFIMLTSNQNVPVFLIDPLILELINKNFEQVKNTSHGSTSQCK FFCVPRDFTAFALQYHLWKNEEGWFRIAENMGFQCLKIESKDPRLDGIDSLSGTEIPL HYICKLATHAIHLVVFHERSGNYLWHGHLRLKEHIDRKFVPFQKLQFGRYPGAFDRPE LQQVTVDGLEVLIPKDPMHFVEEVPHSRFIECRYKEARAFFQQYLDDNTVEAVAFRKS AKELLQLAAKTLNKLGVPFWLSSGTCLGWYRQCNIIPYSKDVDLGIFIQDYKSDIILA FQDAGLPLKHKFGKVEDSLELSFQGKDDVKLDVFFFYEETDHMWNGGTQAKTGKKFKY LFPKFTLCWTEFVDMKVHVPCETLEYIEANYGKTWKIPVKTWDWKRSPPNVQPNGIWP ISEWDEVIQLY" polyA_site 7349 /note="40 a nucleotides" BASE COUNT 2259 a 1437 c 1346 g 2307 t ORIGIN 1 GCTGCAGCCT GCTGTTGAGT GAGAAAACAA AATTATCTTC CTTTCCAAAT 51 CCAAAAAGAT GAAAACGACT GAGATACTTT CAAAAGACAA CCAAGTGAGC 101 AGCACAGACT AATGAGTAGA ATCAATAAGA ACGTGGTTTT GGCCCTTTTA 151 ACGCTGACAA GTTCTGCATT TCTGCTGTTT CAGTTGTACT ACTACAAGCA 201 CTATTTATCA ACAAAGAATG GAGCTGGTTT GTCAAAATCC AAAGGAAGCC 251 GAATTGGATT TGATAGCACA CAGTGGCGTG CAGTTAAAAA ATTTATTATG 301 TTAACATCCA ACCAAAATGT ACCAGTGTTT CTTATTGATC CTTTGATACT 351 GGAATTGATT AATAAGAACT TTGAACAAGT CAAAAATACT TCTCATGGCT 401 CTACTTCACA ATGCAAGTTT TTCTGTGTTC CAAGAGACTT TACTGCATTT 451 GCACTGCAGT ATCACCTATG GAAGAATGAG GAAGGCTGGT TTCGGATAGC 501 TGAGAATATG GGATTTCAGT GCCTAAAGAT TGAGAGTAAA GATCCCCGGC 551 TAGACGGGAT AGACTCACTC TCTGGAACTG AAATCCCCCT GCACTATATC 601 TGCAAACTGG CCACTCATGC GATCCACTTG GTAGTCTTTC ATGAGAGGAG 651 TGGCAACTAC CTCTGGCACG GCCACTTGAG ACTTAAAGAA CACATTGACA 701 GGAAATTTGT TCCCTTCCAA AAGTTACAGT TTGGTCGTTA TCCAGGAGCT 751 TTTGACAGGC CAGAGTTACA GCAAGTTACT GTTGATGGAC TGGAAGTTCT 801 CATTCCAAAG GATCCAATGC ACTTTGTAGA AGAAGTACCA CACTCTAGGT 851 TTATTGAGTG TAGGTATAAA GAAGCTCGAG CATTCTTTCA GCAGTACCTT 901 GATGATAACA CTGTGGAAGC TGTGGCCTTT CGGAAGAGTG CAAAGGAATT 951 ACTGCAACTA GCAGCGAAAA CATTAAACAA ATTGGGAGTA CCATTCTGGC 1001 TGAGCAGTGG AACTTGTCTA GGATGGTATC GACAATGCAA CATTATTCCT 1051 TATAGCAAAG ATGTTGACCT AGGAATTTTT ATACAAGATT ACAAATCTGA 1101 TATTATTTTA GCATTTCAGG ATGCAGGACT TCCGCTCAAA CACAAATTTG 1151 GGAAGGTAGA AGACAGCTTG GAACTATCCT TCCAGGGAAA AGATGATGTA 1201 AAACTTGATG TTTTTTTCTT CTATGAAGAA ACTGATCACA TGTGGAATGG 1251 AGGCACTCAG GCCAAAACAG GAAAAAAATT CAAATACCTG TTTCCGAAGT 1301 TTACACTGTG CTGGACTGAG TTTGTAGACA TGAAGGTCCA TGTACCCTGT 1351 GAAACCCTCG AATACATTGA AGCCAACTAT GGTAAGACCT GGAAGATTCC 1401 TGTAAAGACG TGGGACTGGA AGCGCTCTCC TCCCAATGTG CAACCCAATG 1451 GAATCTGGCC TATTTCTGAG TGGGATGAGG TTATCCAGTT ATATTGAGAT 1501 AGTAGGTTGA AATGGGAGAA TTTCTCTTTT GGAAAAAAAG GTAGATAACT 1551 GTTTAAAAAA TACATGTCTA TTTGTCAAAC ATAAGTGGGA ACCAAAGAAA 1601 AAATGTGACA AGTTTGAAGA CACAGAAAGA GTCATCTGAT GTAATTCTCT 1651 CACTTAGTAC TGAGGAATTT TCATGTGCCA CATACAATGC TAGGTTACAG 1701 TGGAGAAGCC AAGATGAATG AGACAAATAC CTACTTCTTT TATTCCTCCT 1751 TTTGGTAAAC AACTCAATTT TCCTTTGAGG GAACCCTCCC CCACCCTTTG 1801 AAGAGTTCAA GTTCTGTACA GGTTTTTAAA ACGTGAAGTA ATGTTTGAAC 1851 TGGAAGATGA GCTCACCAGG CAAAGCTAAG GAAGGATATA CTAGTTGAAA 1901 AGAATAACCC ACTCCTCTTC TGGGCATTTA AGCTGGTATG TTAGTGCTAC 1951 TTTTAAGATT GTGGAGTCTG AGTTAATATT CAAGTGATCA GACTTTGAGT 2001 GACATCAAGA AAAGATGATA TCAGGTTCAT TTTTCAACTA ATCTTATGTG 2051 GAATTGAATT AGAGAACAAG GCATTATTCT TTTAGGGAAG GTGAGAGCTT 2101 ATTTGTATCA GAGCTTATTA CTTGTCAGGA TAAGTAAATT TCTGTACATG 2151 TACTGTTTTC ATATGTGAAG TGAGAAGAAA CTTTATGCTT GTTTAATGCT 2201 TAAATTTCCA TCCATTGTGA GAATATTTTC ACTGACCTCT GATGGCACTT 2251 GTTGACAAAT CATTCAAGTG AGACCATGTT ACTAGACATG ATCTTGAAAG 2301 AGGCCATGAT TTCACAAAAC TCATTTTTAT TTTATTCTCA GACAGTCTGT 2351 TAGGTAAAAA TATGAGAAAT TCATGTACAT TTTATATTTT CTGAATTTAT 2401 AATCTGTGCA CTCCCAATTT TAATGACACT AAAATATTAA TAGAATTTTT 2451 TAAATATACT CTAATTTTTA AAACATACTC TTTATTATCT TCATTTATCA 2501 ATTACAGCTT CGTATCTCTA ATTTATGGTC TATATACCAA TTTAAATGGC 2551 ATGTAAACCT GCCTGTTTCT TCTCCTCTTC TAATATATCA GATCTCCAAA 2601 ATGGAAGCTA AATGGTGGAC TTGACAACTA TTCACCCTAC CTCAGATCAT 2651 AGAGTTTGTA AAGTTTGTGT CCCTCCCCAG TTTTTCAGTC TGTTGAATGT 2701 CGATGGGGCA AGAACTCAGA CTTCTACTTT ACCAAGTACC ACACACTCTG 2751 GAATGCTTTA GTTTCCTTTT CCCCTCAAGA TGTCTGAGTC AGCTAGGATG 2801 CTGTTTACCC CATCTCTCTC TTATATCACT TGAATGATAT ATTGTAAGTG 2851 AGAGGTAAAG GAAAATGTAG GCACAATAAG CACTGCTATT TTTCTCTTTG 2901 TCTAGGAAAG GAAGCTGAAT CTTATATCTT ATCTATGCTA TTTAGGACTA 2951 CTTTCTGGAG CTTGGCAGAT TTTCCTCTGA CACCAGAGAA CAATAGTAGT 3001 CTCAAGAATG GAAACCTGAA TGTCTGAGGG AATGGGCTGG TAGACTTTTT 3051 CGAAAACAAA TTAGAGAAAG TAACTTACAA CCACCCATTC CGGATTTGTA 3101 AAGCAACATG AAAACCTTTG ATAAATGATA ACCAACAGTC TTCTGTCCTA 3151 ATTTGCATTC TCAATGCAGA ATTATTGGGT CTTTCATAAA CAACATGAGT 3201 GGTTTCTGGA GACATTTAAG ATTGTCAGCA AACTTGTCCC AAAATGGCAC 3251 ATCATCATAA TCCATTTTCT CTTTGCTAGA AAATCAGCCC ATAGAGTGCA 3301 TTCCCAAATT CTAAATAGCT GACCCTAAAT TCATTTTTCA TGCTTAAAAA 3351 TAATAGAACA AATAATAATA CTATTTTTGG GCAAAATCCA CCCTACTTGA 3401 TGAGGACCAT TTGCTTGTGC TCAGTATTTA GAAATGCATA CACTCCACAT 3451 TTCTCCCCAT TTCCAATTGG GACTCCCATT CTTTGCCAGG AGATCTTACC 3501 CATCCCTTTT CAGATTAATC TTTTATCCTT TCCTGAAGTA CAAAGTCTTA 3551 AGAGTGTCTG AAATCCAAGA CATCTTGGCC CAATTGACAC AGGTTCTATA 3601 TTCTTCCCTA CATAGACCAC TGGCTGCTGA AATACTTTTC TTGTCTTCTG 3651 GTTTCACAGG CTTCAGCTCA TGGGTTCCAC CCTAACTTGG GGAAACAAAA 3701 GCCTTCTCTA GAATCTGAAG GCCAGGCTTA CCTCTGATTC TGATTCATCC 3751 ATACTTCATC TTATGTATTT TAACAGTTGG GTTCTGTGGA GTGTCCAGAG 3801 ACCTTGGGTT AGGTATATCC ATCTTCAGTA CCTCATTGGA TCACTTTTCT 3851 TTCATCACTT GAGTATTCTA GCAGTCATTC TCCTAATCTG ACCACTTTCA 3901 GGTTTAGTTT ACCTGGCTTA CCAGGGGAAG TTGACAACTT GTTGGTAGTT 3951 AGGCACCCAT GAATGTCTCC AGAGACATCC TGAGAGGCAA GATTCCTCTT 4001 AATTGATAAC CAGGACAACC AGGTAGTCAC CCAGTCCTCT CTAAGCAGGG 4051 AGCACTTGTC CTTCTCTCCT CTGCTGCAGC TACTGATATC TGGCCCCTGG 4101 AATAAAACCA TAGTTCCTAA AATTGAGCAT CCCTAAGAGT AGCTGCTTGG 4151 TGGGACAGCT GATTTCTTTG ATTCCCTAGC TGCAAAACCT GAAATTGACC 4201 ACTAGGTGAC AGAATGTTTG CCAGTATCCC CACAAAACAA GTTAAAACTT 4251 AAGTGAAAAT CATTCTGCTT TTGAATCTTA AAAGCTAGAA AAACCATAAT 4301 GTAATATTCT TTTTAAACCC TCTTATTTTA TAACTAAATA AATAATCACA 4351 GAAAAGAATG ATCTGAACAA GAACAGCTGA GGCTAAAACC CAAGACTGCC 4401 GTGACTCCTA GTCCAATGTC TGTTTCGCAT CACAACACAG TATCTTTAAC 4451 AGTGCTTGAG ATGCTTAACA GAGAAGGCAT CCAGTGCTTC ATTGAGGCTA 4501 AGTCTCAGGG TGTTTCTGCC GCTTAGTATC TTTTTGTTCA GATTGAGAAC 4551 CTAAGGCGAC AGAGAGGTCA AGGGGAAGTA TTTTCTAGTT CAGAATTCAG 4601 AATTCTTAGG CAACCTTCAT ACCTTTATCT GGAAAATGCC TTCTCTCCAG 4651 ACAATAAGAC AGTTCACATC TGGAGACATC AGCCTTTGGT ACCTAATTCT 4701 CTAAGCTCAA GATGCATCCC ATTGCCACTT TACCATTCTA TTTCCCAAGG 4751 GAACCATCAG CACCAACCTG CTAAATGCCT GTATTTGAAG TCTCTCCCCT 4801 TCCTGAAGTG ACCTTTTATG AGGCTCTTTC CTTCCTAAGG GACCTTTCCC 4851 TGGTAGTAGT GGTCTCATTC ACAAAAAAGA ATAATAGATG TAACTGGAGT 4901 CACTTTGCAG TTTTCAAGAT TTTTTTTTTA TCTGCTTGGC TGGAAAGGAG 4951 ACTAGAGTAT CTAGTTTTTA GTATTTAAAC TACTTTTAAT TATTTATATT 5001 GATACTCTCT GATAAATGCA GGTAGTTAAG AATAGATTGT ACTCATTCCT 5051 TTTTGAGTTT GTGACCTCAA ACCATTGTTT TTATTCTTAC ACGACTACAG 5101 TACTTCAAAT GGCACATAGA TAGGCAGGAT ATTACATAGG TAAGCAGGAA 5151 TTTATACATT GGGTACCAGA GCTAAATCTG GAAAACACAT TTGGAATGAG 5201 CTTCCACACT TCAGTCTAAA ATCTCACCCA AACTTATGGC TACATATTTT 5251 TCTAAGTTTC CCCCCTAGTT TCACTTTCTG GAAAGTGAAC AGTTTTTTAA 5301 ATCCTAAATG TTATAAATCC TTAGGAGAAA TATTCCCTCA AAACCTAGTC 5351 AAGAAAGTGC CACATTCCCA CCTTCTAACA GGTAATTATT AGTTGTAATA 5401 GCTTTAATGC ATGGAAAAAC TTCAGTTACT GACCAGTCAA GAGAATTTCT 5451 CAAGAAAAAT CCCAGCAACG TTCCCTCTTT CCTCCTTGTC TTCCACTATC 5501 ATTAAGACCT GGGCTAGATC ACCTCTAACA TCTCACTCAG GGAAGGCAAT 5551 GTGTTGCCAT AAAAAGACCA CAAGCTTTTG AGTTAGGCCT GAATTTGAAT 5601 TTCAGCTTAA TCATGTGGGA TTTATGGTGG GGGCCCTTTC AGTCTCAATT 5651 TCCACATTAG TGAACTGGGG AAATGATACA TACTTCTAAG GGTTTTTAGG 5701 GGGATTAAAT GAAGTATACA GTTCTTAGCC TCAGGCCTAG GATTAAGTAA 5751 GTACTCAAGA AATGTGCAAT TTTCTAGTTC CATGTTTCTT TTCTAATCTT 5801 CAAATGGAAT TATAGAAACC ATGGGTCAGA ACATATTCCT TTACTCAAAA 5851 GATTGCATGA CTGAATTTGC TTAAGAAAAA AAAAATTGTA TCAAGTCATT 5901 AATACAATTA TACATTAATT ATATTACATT AATACAATAT ATGGTTTGTG 5951 AATTCAGAGA CATTACCAGT TTGCCTCCTT CTCTCAATAG AACTTGTATT 6001 TTCATTTTCT TGGTTAAGCA GTTGTCTCCT AATATTATCC CATATGCTAC 6051 CTAGTTTGCT GGTCCCAAGC AGTTTACTGT ACTTCACTAG ATTTGGTACC 6101 TGCTCTCCCC TGGACTTCTT TTTCAATATT CTAGCCTTTC CTAGATGTAA 6151 ATCTTTACCT CCTTGTTAGT GAAATTAGAT ATAAGCCATG ATTTGGAGAG 6201 GGAAGAAATC TGGAATACTT AATTTCATTT AATTATCTAT GCTGATGAAT 6251 GCCTGTATCA TTGTTAATAA AGGAGAATTG AAAATACTCA TTTCTACTTT 6301 CTGCCCTCAA ATTTCTGTTT CTATCTCAAC TAGGCAAGAA TCAGCAGGGT 6351 GCATGATGCC ATTTTAAGCT GCTTCACATC AGACTGAAAT CCTAATTACA 6401 GTTCATAAGT GAAACAGACT AATTCAATGG CAATACCTTT TGTATAGGTC 6451 CTGTGCTTAA AGGAGGCAAG TATAAATTTT CTAATAAGAA ATCCCTGCTT 6501 CTTTTGGGTG CAGTGGCTCA CACCTGTAAT ACCAGCAGTT TGAGAAGCTA 6551 AGGCAGGTGG ATCACTTGAG GCCAGGAGTT CGAGACCAGC CTGGCCAACA 6601 TAGTGAAACC CCGTCTCTTC TAAAAATACA AAAAAATTAG CTGGGCATGG 6651 TGGTGCACGC CTATAGTCCC AGCTACACAG GAGGCTGAGG CAGGAGAATC 6701 ACTTGAACCC GGGAAGCAGA GTCTGCAGTG AGCCAAGATC GCGGCATTGC 6751 ACTTCAGCCT GGGCGATAGA GCAAGACTCT GTCTCAGAAA AAAAAAAAAG 6801 GAAAGAAAGA AATCTCTGCT TCCGTACTAT CAAAACTTCT ACCCTAGAAT 6851 ACCCCCTGGC ACCTTCTAAC CCAACCTAAT ACAGAGATTT TGTAGGGACC 6901 CATTAACAAG CTCCTATATA ATTATAAGCA GCTCTCACAG TAGGGTTGAA 6951 AGAGAATATA AGGAAAATTA GAAGTACTGT ATTTTTGCTA TTGGAAATGA 7001 AATATTGTTT CATGCACCTT TGAAAAAAAT AACAGGATTT TACAGCCTTC 7051 TGATGATTCA TTCAAAGCAT GGGGAATAGT TCATATGTTT GTTAAATGAA 7101 AATCTTATGT AGCAATTTGT GTGTCCCTCC CACTTCATAA CTACAAAAAC 7151 ATATATATGA ATTTCTAAAC AAAGTGATAT TTTAAAGATG AATTGATTCA 7201 ATGTGTACTT ACCAGTTTAC TGTGTGGTTT TATCTTCAGG TACAGATAGT 7251 TTGTGGTACT ACTTGAAAAT ACCTTTAATA TTATTTTCAT CAGAATTTGT 7301 AATATATAAT CCAGTTTGGG ATGATGAATA AAATTTTTCT AATCTCTTG // LOCUS AB011122 5749 bp mRNA PRI 10-APR-1998 DEFINITION Homo sapiens mRNA for KIAA0550 protein, complete cds. ACCESSION AB011122 NID g3043623 VERSION AB011122.1 GI:3043623 KEYWORDS KIAA0550 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0851. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5749) AUTHORS Ohara,O., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (1), 31-39 (1998) MEDLINE 98290545 FEATURES Location/Qualifiers source 1. .5749 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0851" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 730. .3684 /gene="KIAA0550" CDS 730. .3684 /gene="KIAA0550" /codon_start=1 /product="KIAA0550 protein" /protein_id="BAA25476.1" /db_xref="PID:d1026406" /db_xref="PID:g3043624" /db_xref="GI:3043624" /translation="MKAVRNLLIYIFSTYLLVMFGFNAAQDFWCSTLVKGVIYGSYSV SEMFPKNFTNCTWTLENPDPTKYSIYLKFSKKDLSCSNFSLLAYQFDHFSHEKIKDLL RKNHSIMQLCNSKNAFVFLQYDKNFIQIRRVFPTNFPGLQKKGEEDQKSFFEFLVLNK VSPSQFGCHVLCTWLESCLKSENGRTESCGIMYTKCTCPQHLGEWGIDDQSLILLNNV VLPLNEQTEGCLTQELQTTQVCNLTREAKRPPKEEFGMMGDHTIKSQRPRSVHEKRVP QEQADAAKFMAQTGESGVEEWSQWSTCSVTCGQGSQVRTRTCVSPYGTHCSGPLRESR VCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCTPPQYGGRPCEGPETHHKPC NIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCTAAAHGGSECRGPWAESRECYNP ECTANGQWNQWGHWSGCSKSCDGGWERRIRTCQGAVITGQQCEGTGEEVRRCSEQRCP APYEICPEDYLMSMVWKRTPAGDLAFNQCPLNATGTTSRRCSLSLHGVAFWEQPSFAR CISNEYRHLQHSIKEHLAKGQRMLAGDGMSQVTKTLLDLTQRKNFYAGDLLMSVEILR NVTDTFKRASYIPASDGVQIYPGSIELMQVIEDFIHIVGMGMMDFQNSYLMTGNVVAS IQKLPAASVLTDINFPMKGRKGMVDWARNSEDRVVIPKSIFTPVSSKELDESSVFVLG AVLYKNLDLILPTLRNYTVINSKIIVVTIRPEPKTTDSFLEIELAHLANGTLNPYCVL WDDSKTNESLGTWSTQGCKTVLTDASHTKCLCDRLSTFAILAQQPREIIMESSGTPSV TLIVGSGLSCLALITLAVVYAALWRYIRSERSIILINFCLSIISSNILILVGQTQTHN KSICTTTTAFLHFFFLASFCWVLTEAWQSYMAVTGKIRTRLIRKRFLCLGWGKHIDIP FHALLKMTLNTH" BASE COUNT 1778 a 1107 c 1304 g 1560 t ORIGIN 1 GGAAAGCGGA AAGAGGAAAA AGCATAAGCT TGAGCCTTCC GATCCGACCA 51 CGAATACTCC TGTAATAAAC CCACCGCCCC AACAAATCTG CCATAGCAGC 101 CGCCGCCGCC GCCGGTCACT TCTCGTCTCA GCGCTTTCTT TGCTTCTTGG 151 TTTGTTGGGG GTAGCTTTTA TGAAACAAAT CTTTGCTATT AAGCCACTTA 201 CATTTTGGGG GGTTCCTTAG AGTCTCCCTT GGGGGGGCTT CTCCCTCCCT 251 TTAGCCCCCC TCGGTTTGGA GGTTGGATTC AGTTGGATAC GGCGCAAGGT 301 TCTGGGCTCC TGCTGGCTTT TTTTTCCTCT CTCTCATCGA CCCCCCTTTG 351 GTTCCCACCC CCCACCTTTT GCTTTTCGTA TGTATGCATT TTTAAAAATA 401 AATCCTGATT TTGGAAGCTG AGCCGGGGAA AATGGGCAAC GGTGATTGGG 451 ACCGAAGGGG AGTCTCTCCG TCACTGTTGC TGGGACGCGT GCCTGTGCTG 501 GTGTCTTAGA GCAAGAGCCT CCCTGAGCTT TCGGAGTGGA AGAACAGTGG 551 AAGAGACTGC AGCCTAAAGA CTTTTAAAAT TAACTTGGCA TCACTTTTAT 601 CAGCTCAAAG GCTAAACAAA CAAACAAAAG CAGTGTCATT TATTCTAAGA 651 AATAACTTCT TAAAGGTTAA AGCTGAAAAA TATTCAAGTT ATTTTTGGAT 701 AACAACTTAC AGAGGCCAAA TGACATAGGA TGAAGGCTGT TCGTAACCTG 751 CTGATTTATA TATTTTCCAC CTATCTCCTG GTTATGTTTG GATTTAATGC 801 TGCCCAAGAC TTCTGGTGTT CAACTTTGGT GAAGGGAGTC ATTTATGGAT 851 CGTATTCTGT AAGTGAAATG TTTCCTAAAA ACTTTACAAA CTGCACTTGG 901 ACGCTGGAAA ATCCAGATCC AACCAAATAT AGCATTTACC TGAAATTTTC 951 CAAAAAGGAC CTTAGCTGCT CTAACTTTTC ACTCCTGGCT TATCAGTTTG 1001 ATCATTTTTC CCATGAAAAA ATAAAGGATC TTTTAAGAAA GAATCATTCT 1051 ATAATGCAAC TCTGCAATTC CAAGAATGCT TTCGTTTTTC TACAGTATGA 1101 TAAAAATTTT ATTCAAATAC GTCGAGTATT TCCAACTAAT TTCCCAGGAT 1151 TACAGAAAAA AGGGGAAGAA GATCAGAAAT CTTTTTTTGA GTTTTTGGTA 1201 TTGAACAAGG TCAGCCCAAG CCAGTTTGGT TGCCATGTAT TATGTACTTG 1251 GTTGGAGAGC TGCTTAAAAT CAGAAAATGG GAGAACAGAA TCATGTGGGA 1301 TCATGTATAC AAAATGCACC TGCCCTCAGC ATTTGGGAGA GTGGGGGATC 1351 GACGACCAGT CGCTGATTTT GTTAAATAAC GTGGTGTTAC CCCTGAATGA 1401 GCAGACAGAG GGCTGCCTGA CCCAGGAGCT GCAAACCACC CAAGTCTGCA 1451 ATCTTACCAG GGAGGCCAAG CGACCACCCA AAGAAGAATT TGGAATGATG 1501 GGAGATCATA CAATTAAAAG TCAGCGACCT CGATCTGTTC ATGAAAAAAG 1551 GGTCCCTCAG GAACAAGCTG ATGCTGCTAA ATTTATGGCA CAAACTGGTG 1601 AATCTGGTGT GGAAGAGTGG TCCCAGTGGA GCACATGTTC GGTTACTTGT 1651 GGTCAAGGGT CGCAGGTGCG AACCAGAACT TGTGTATCAC CTTACGGGAC 1701 ACACTGCAGC GGCCCATTAA GAGAATCAAG GGTTTGCAAT AACACTGCCC 1751 TCTGTCCAGT ACACGGAGTA TGGGAGGAAT GGTCACCATG GAGTTTATGT 1801 TCATTTACAT GTGGTCGAGG CCAAAGAACA AGAACAAGGT CATGCACACC 1851 TCCTCAGTAT GGAGGAAGGC CGTGTGAAGG ACCTGAAACA CATCATAAGC 1901 CTTGTAATAT TGCTCTTTGC CCAGTTGATG GACAGTGGCA AGAGTGGAGT 1951 TCGTGGAGCC AGTGCTCAGT AACGTGCTCG AATGGGACTC AGCAGAGAAG 2001 CCGGCAGTGC ACTGCAGCTG CCCATGGAGG CTCCGAATGC AGAGGGCCAT 2051 GGGCAGAAAG CAGAGAGTGC TATAACCCTG AATGTACAGC CAATGGTCAA 2101 TGGAATCAGT GGGGTCATTG GAGTGGTTGT TCCAAGTCCT GTGATGGCGG 2151 CTGGGAAAGG CGAATAAGGA CCTGTCAGGG TGCAGTGATA ACAGGGCAGC 2201 AATGTGAAGG AACGGGCGAA GAAGTGAGAA GATGCAGTGA GCAGCGATGC 2251 CCTGCACCTT ATGAAATATG CCCTGAGGAT TATCTGATGT CGATGGTGTG 2301 GAAAAGAACT CCAGCAGGCG ACTTGGCATT CAATCAATGT CCCCTGAATG 2351 CCACAGGCAC CACTAGCAGA CGCTGCTCTC TCAGTCTTCA TGGAGTGGCC 2401 TTCTGGGAAC AGCCGAGCTT TGCAAGATGC ATATCAAATG AGTACAGACA 2451 CTTGCAGCAT TCAATTAAAG AGCACCTTGC TAAGGGGCAG CGAATGCTGG 2501 CAGGTGATGG AATGTCCCAG GTGACCAAGA CACTGTTGGA TTTAACTCAG 2551 AGAAAAAATT TCTATGCAGG CGATCTTCTG ATGTCTGTGG AGATCCTGAG 2601 AAATGTGACA GACACATTTA AAAGGGCAAG TTACATCCCT GCATCTGATG 2651 GTGTCCAGAT TTATCCAGGG TCAATAGAGT TAATGCAGGT GATTGAAGAT 2701 TTTATACACA TTGTTGGAAT GGGGATGATG GACTTTCAGA ATTCATACTT 2751 AATGACTGGA AATGTAGTGG CTAGTATTCA GAAGCTTCCT GCAGCCTCTG 2801 TTCTAACAGA CATCAACTTT CCAATGAAAG GACGGAAGGG AATGGTTGAC 2851 TGGGCAAGAA ACTCAGAAGA TAGGGTAGTA ATTCCAAAAA GCATTTTCAC 2901 TCCGGTGTCA TCAAAAGAAT TAGATGAATC ATCTGTATTT GTTCTTGGCG 2951 CAGTCCTATA CAAAAACTTA GATCTAATTT TGCCCACTTT GAGAAATTAT 3001 ACTGTCATTA ATTCCAAAAT CATCGTGGTC ACAATAAGGC CTGAACCCAA 3051 AACAACCGAT TCGTTTCTGG AGATAGAACT AGCTCATTTG GCTAATGGTA 3101 CTTTGAATCC CTATTGTGTA TTGTGGGATG ACTCCAAAAC GAACGAGTCT 3151 TTGGGAACGT GGTCCACCCA GGGATGTAAA ACTGTGCTTA CCGATGCATC 3201 CCATACGAAA TGCTTATGTG ATCGTCTCTC TACCTTCGCC ATTTTGGCTC 3251 AGCAACCTAG AGAAATAATC ATGGAATCCT CTGGCACACC TTCAGTTACC 3301 CTAATAGTAG GCAGTGGTCT TTCTTGCTTG GCCTTGATTA CCCTAGCAGT 3351 TGTCTATGCA GCATTATGGA GGTACATACG CTCTGAGAGA TCCATAATAC 3401 TAATTAACTT CTGCCTGTCT ATCATCTCAT CCAATATCCT CATACTGGTT 3451 GGACAGACTC AGACACATAA TAAGAGTATC TGCACAACCA CCACTGCATT 3501 TTTGCACTTT TTCTTCCTGG CTTCATTCTG TTGGGTTTTG ACTGAGGCGT 3551 GGCAATCATA TATGGCTGTA ACTGGAAAAA TTAGGACACG GCTTATAAGA 3601 AAACGCTTTT TGTGCCTTGG ATGGGGTAAG CATATTGATA TACCGTTTCA 3651 TGCTCTTCTC AAAATGACGT TGAACACACA TTAGAAAGCA GTCATGAGTG 3701 ATTAGACACA GGCTACTTTG TGTCTAATTT AATCTATGGA AGTGAAAATA 3751 CATGAGCTGG TCAGTTTTGA ACATTCATTG GTCATTTGGA ACTTTAAAAG 3801 GAAGTAAGTA TTGAATGCTC ATTTAGCTAG TCAGTTAACA TTCAACAGTG 3851 TCTAGATAGT ATGAAATGAG ACCCCGAGAT GCCTACACAC AGAAAAACAG 3901 TGCTCTCTGT TAATATTTTC TGAAAGTGCA AAATACCTTA AAATTTTCAA 3951 GGCCTAATGT GTGATGGTTC ACTAGGCATG TACTCCCACC AAGAAAACTT 4001 AGAAGATTTC ATTTCAAGAA ATCTCAAAGC AATTAAAGAA TAAAAGCGAT 4051 TCATTTCATA GGGAGAACAC CATCTAGAGA ATTAATGAAA CCTCACAGCT 4101 TGTTGACCTG GTCCTCAAAA GCAGAAACAG AATTGCTGAC AGACTGAGAA 4151 CTAATTCTTT ACTTGTGTTT ATTAAGAAGT TTCTCTCAAA TTGCCTCATG 4201 ACATGGACAT CTCAAAGATC TATATTATAG GGCCAATTCT AATGATAGCC 4251 TAGTTAATTT AAGAAGCTAC TTTTAGAAAA AGCCCAAATA TACAATAATA 4301 TCTACTGTAT TAGAAGACTG GCATATGGGA TGCTAGGAGG AACCTGGGAA 4351 ATTACAAATA AGTGTGCTTA TAACAATTCC AGAATTATTT AGGCTGGAAA 4401 AATATGATCA AGAACACGTA AATATTATTC ATTAGGTTTC AGCAAGGTCT 4451 ATTATGTCTA GCTAATAAAT TAGGACTTTA TCCACAGACA AATGGAAAAG 4501 CAATTAATAA GAAGTTGAAG AGTAGGCCAG ACATGGTGGC TCACGCCTGT 4551 AATCCTAGCA CTTTGGGAAA CCAAGGCGGG TGGACTACCT GAGCGTGGGT 4601 GGACTACCTG AGCGCGGGTG GACTACCTGA GCACGGGAGT TCGAGATTAG 4651 CCTGAGCAAT ATGGCAAAAC CCCATCTCTA CCAAAAATGC AAAAAATTAG 4701 CCGGGAGTGA TGGCACATAC CTGTAGTCCC AGCTACTCGG GAGGCTAAGG 4751 TGGGAGGATT GCATGAGCCT GGGAAGTGGA GGTTGCAGTG AGCCAAGATC 4801 ACACCACGAC ACTCCAACTT GTGTCACAGA GTAAGACCCT ATCTCACACC 4851 AACACAAAAG TTGAAGACTT TGTTCTACTT AGAATTTCAT CAAATTTTTG 4901 TCTAAATTTC CTGACAAAGG CCTTCTAAAG TTGAGATAGT ATTTAAATCA 4951 AGGGACACTT TTGCCATGAA TTAGTACCAT TCTAAGAAAT ACAGAATACA 5001 GGTAAAAGAA CACATTTTTT GATGAAGAAC AAAACATGGT GATTTTCAAG 5051 ATTAGTGACT ACCTTGTTTA AAATTATTAC TAAAGATTTT GAGGAGAGGG 5101 TTCACAGACA GTCTCCGTAT TTACAGCTAA TATTAAACTA CTCTAGGTAG 5151 CAAAAACCTG AACTGATGGT GCTAAAGTAT CAGAAAGTTT ATGGGTTGGC 5201 AGAATAGTGG TGTGTGTGTT TCATTATGAA CAAGTACAAT AAAATGAATC 5251 TAGAAAAAAT TTAATCTAAA TTGTATGAAA TAAATACTAT TAATTCTTCA 5301 GTTATAACCC ATGAGGAATT TTTTTTTCCT AATGAACTTG GTCCAGTCAA 5351 TCAAAAAAAA TCAACAAATG ACATGTGTGG AGGAAGAGGA GAAGGAGGGA 5401 CAAGAAGAGG AGAAACAGAA GGAGGAAAGG GAAGAGGAGG AAAGGGAGGA 5451 TAAGGATGAG GAGGGGACTA TATATTTATA ATTTTATATA CATATATGTA 5501 TAGTCACCAG TGTTTGTTTA ACACTATGGT GTGTCCTTCT GAGATGTTTT 5551 CCATAGTTCT TGTCATTAAA TCTCATGAAG GAATGTGATG CCACTAGAGA 5601 AGGCTCACAG AAGAGAATAG CAGGAGGTAT GTGAAATGAT AGTAAGAAAG 5651 AAGACAGACA GAGGACATAC AATAAAATGA TTAGATGACG GATTTTTCAA 5701 CCGGAAAAGG CAAACATGAT CTTCCTGAAA AGAAGGCATA ACCAAAACT // LOCUS AB011143 6052 bp mRNA PRI 10-APR-1998 DEFINITION Homo sapiens mRNA for KIAA0571 protein, complete cds. ACCESSION AB011143 NID g3043665 VERSION AB011143.1 GI:3043665 KEYWORDS KIAA0571 protein. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH2388. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6052) AUTHORS Ohara,O., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (1), 31-39 (1998) MEDLINE 98290545 FEATURES Location/Qualifiers source 1. .6052 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH2388" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 161. .2077 /gene="KIAA0571" CDS 161. .2077 /gene="KIAA0571" /codon_start=1 /product="KIAA0571 protein" /protein_id="BAA25497.1" /db_xref="PID:d1026427" /db_xref="PID:g3043666" /db_xref="GI:3043666" /translation="MSGDPDVLEYYKNDHSKKPLRIINLNFCEQVDAGLTFNKKELQD SFVFDIKTSERTFYLVAETEEDMNKWVQSICQICGFNQAEESTDSLRNVSSAGHGPRS SPAELSSSSQHLLRERKSSAPSHSSQPTLFTFEPPVSNHMQPTLSTSAPQEYLYLHQC ISRRAENARSASFSQGTRASFLMRSDTAVQKLAQGNGHCVNGISGQVHGFYSLPKPSR HNTEFRDSTYDLPRSLASHGHTKGSLTGSETDNEDVYTFKTPSNTLCREFGDLLVDNM DVPATPLSAYQIPRTFTLDKNHNAMTVATPGDSAIAPPPRPPKPSQAETPRWGSPQQR PPISENSRSVAATIPRRNTLPAMDNSRLHRASSCETYEYPQRGGESAGRSAESMSDGV GSFLPGKMIVGRSDSTNSEDNYVPMNPGSSTLLAMERAGDNSQSVYIPMSPGAHHFDS LGYPSTTLPVHRGPSRGSEIQPPPVNRNLKPDRKAKPTPLDLRNNTVIDELPFKSPIT KSWSRANHTFNSSSSQYCRPISTQSITSTDSGDSEENYVPMQNPVSASPVPSGTNSPA PKKSTGSVDYLALDFQPSSPSPHRKPSTSSVTSDEKVDYVQVDKEKTQALQNTMQEWT DVRQSSEPSKGAKL" BASE COUNT 1491 a 1629 c 1624 g 1308 t ORIGIN 1 GAAATTTTGA GAAGTGGTGG GATGAAGAAG ATCCCTGAAG AGAGGAGTAA 51 CTGAGACAAA GGCGAGGCTA GCAATAGAGA GACAAGTGAG GGAATTAGAC 101 CTTTGGTATC ATCATGAAAA GGCCTGGAAG AAACGCTGGT TTATCCTGCG 151 GAGTGGCCGG ATGAGCGGTG ACCCAGATGT TCTGGAATAC TACAAGAACG 201 ATCACTCCAA GAAGCCTCTG CGGATCATCA ACCTGAACTT CTGTGAGCAG 251 GTAGATGCAG GCCTGACCTT TAACAAGAAG GAGCTGCAGG ATAGTTTTGT 301 GTTTGACATC AAGACCAGTG AACGCACCTT TTACCTGGTG GCTGAGACAG 351 AAGAGGACAT GAATAAGTGG GTCCAGAGCA TCTGCCAGAT CTGTGGCTTC 401 AATCAGGCTG AGGAGAGCAC AGACTCCCTG AGAAATGTTT CCTCAGCCGG 451 TCATGGCCCC CGCTCTTCTC CAGCTGAGCT CAGCAGCTCT AGCCAGCACC 501 TTCTCCGAGA GCGCAAGTCC TCAGCCCCAT CACACTCCAG CCAGCCAACT 551 CTGTTCACGT TTGAACCCCC TGTGTCAAAC CACATGCAGC CCACCTTGTC 601 CACCAGCGCA CCTCAGGAGT ATCTCTACTT GCACCAGTGC ATAAGCCGAA 651 GAGCAGAAAA TGCAAGGAGT GCCAGCTTCT CTCAGGGCAC CAGAGCCTCT 701 TTTCTCATGA GGAGTGACAC AGCTGTACAA AAACTTGCCC AGGGCAATGG 751 ACACTGTGTC AACGGGATCA GTGGTCAAGT CCATGGCTTC TATAGCCTTC 801 CCAAGCCGAG CCGGCACAAT ACAGAATTCA GAGACAGTAC CTACGACCTC 851 CCCCGCAGCC TGGCCTCCCA TGGCCACACC AAGGGCAGCC TCACAGGCTC 901 CGAGACAGAT AATGAGGATG TGTACACCTT CAAGACGCCC AGCAACACCC 951 TGTGCAGGGA GTTCGGGGAC CTCCTGGTAG ACAATATGGA TGTTCCGGCC 1001 ACCCCACTCT CAGCCTACCA GATCCCTAGG ACATTCACTC TGGACAAAAA 1051 CCACAATGCC ATGACAGTGG CCACTCCTGG GGACTCAGCC ATAGCTCCCC 1101 CACCCCGCCC CCCCAAGCCA AGTCAGGCAG AAACACCTCG ATGGGGCAGT 1151 CCTCAGCAGA GACCGCCAAT CAGTGAAAAT AGCAGATCTG TCGCTGCCAC 1201 CATCCCCAGA CGCAACACCC TCCCTGCAAT GGACAACAGC CGACTTCACC 1251 GAGCTTCTTC CTGTGAGACC TACGAGTACC CACAGCGTGG TGGAGAGAGT 1301 GCAGGCCGGT CTGCTGAATC CATGAGTGAT GGAGTTGGCT CTTTCCTGCC 1351 AGGGAAAATG ATTGTGGGCC GATCGGACAG CACCAATTCT GAAGACAACT 1401 ATGTGCCCAT GAATCCAGGT TCTTCCACCC TGTTGGCCAT GGAACGAGCA 1451 GGTGATAATT CCCAGAGCGT CTACATCCCA ATGAGCCCAG GGGCCCATCA 1501 CTTTGACTCA CTTGGCTACC CATCAACAAC CCTTCCTGTG CACCGAGGCC 1551 CCAGCAGAGG AAGTGAGATT CAGCCACCCC CTGTCAACCG CAACCTCAAA 1601 CCTGATCGGA AAGCAAAGCC AACACCACTT GACCTGAGGA ACAACACCGT 1651 CATCGATGAA CTCCCCTTCA AGTCACCTAT CACCAAGTCT TGGTCTAGGG 1701 CCAACCACAC CTTCAACTCC AGCTCCTCCC AGTACTGCCG CCCCATCTCC 1751 ACCCAGAGCA TCACCAGCAC AGACTCAGGA GACAGCGAAG AGAACTATGT 1801 CCCTATGCAA AACCCAGTGT CTGCATCTCC CGTTCCCAGT GGCACGAACA 1851 GTCCTGCCCC TAAGAAGAGC ACCGGCAGCG TTGATTATCT GGCCCTGGAC 1901 TTCCAGCCGA GCTCCCCAAG CCCCCACCGC AAGCCATCTA CTTCATCCGT 1951 CACCTCTGAT GAGAAGGTGG ACTACGTTCA GGTGGACAAG GAGAAGACCC 2001 AGGCCCTGCA GAACACCATG CAGGAGTGGA CAGACGTGCG GCAGTCCTCA 2051 GAGCCTTCCA AGGGTGCCAA GCTGTGATGA GAGGGCCACC GCAGAGCCCA 2101 GGAGGCAGCA TCTCCAGAGC TGGCCCTTCC CATCTCCCCT CTCCCCTCTC 2151 CCGTTCTTCC TCCCATCCAC CTCCTCTCTA CTCTGCCAGT CTCAGCCTTC 2201 AAAGCACTTG ACATCAGGGA CCCTGAACCC TTCCCCTGGG AGGTGAGGGC 2251 CTGATCAAGG CACCTCCTCT GCCCACTCGG GGCCCAGCTG TGATTTTTAT 2301 CAGTAATGGC CATGCCTCCA CCCACCTTAG TTAGGAGCTA CTTCCAAAAA 2351 GCATCCTTCA GCCTCTTCCT GTCCTTTAGA CCTGACTCTC TACCAGATGT 2401 TTGGAGGGAA GGGCTGGGGC TCTGAGCCAG ATTCCACACC TCACGTTCAG 2451 TCACAGCCCT CAGCTATCTT CCCTCCGGCC ACTGGGCTAC CTCTCCTTCA 2501 GTCCCAGAAG ACAAGTCTCA CCAACCCAGG GAGTCAAGGA CCAGCAAACC 2551 AAAGTGGATA ATGGACTTTT TCATTCCTGT TTTTCTTGGC AGGAGAGAAG 2601 CAAGGCCACT AAAAGAGGAG ATGGTGGAGA CGGAGGCTCA GCAGTGGTCT 2651 TGAGGGGTAA AGGACTTAGA TGCCCAGATG AAGAGGGAAA GCTGACATCT 2701 GCAGGGAACC CACTTTGAGG CTGAGGCCAT GGCAGGACAG CTGCTGTGGG 2751 GTGCAGAGGC AGAAGATGAA GGACAAAAGG AAGGGAAAAC TGATGGCCAA 2801 CCTAGAGCAG CAAGGAGCAG GGCTTGGAGC TCGGGTGGTG GAGATGACAA 2851 GGACACTGTG GGGTCTGGGT CCCCAGAACT CTGGAGCTAC AGGCCACTCT 2901 AGGCCCAAGG GCTAGTCCTC TTCCCCAGTC CCCTCAGAGG CCCCCGCCAG 2951 CCCCACCTTG AAAGCAGCAT ACAGGGGAAG GCTTGGACCA AGCTGGGCGA 3001 CCAAGCACAT GGGGCAGGAA CACATGGTAA AGGGGTGGGG AATATGGGAG 3051 GGAGTGTGGT GTGGATGGGG GTGATGCAGG GACTGAGGGG AACCCTGGGA 3101 CAGGCACAGG CTGGGCAGAG GCACAGGGCA GTGCAGGGGA CTCTGCAGTG 3151 GGGTCGGGAA GTGAGTTTCT TTGCAGTGAG CAGTGCAGTG GAAGTCGGGC 3201 ACAGAGGTAG CAGACAGATG TGAAGCAGTG GTGAAGGCCA TGTAGCAAGT 3251 GGGGAAATAC ATCCAAAGGG CCTGGGAGTT GGGGGGTGCC CAACGCAATC 3301 CTTGGGGGTG CAGGGTGGAG CAGAAAGTGA AGGAGGGACA CGTGCAAGAG 3351 TGGTGTGCAT GGTGGTGTGA CATGAGGACC GTTCCTAGGA TGGGACAGTG 3401 GGTCAGGCAG GACAAGGAGA AAGCAGGGCA GAATGATGCC TAGAGGACCA 3451 CATCAGGCAT GGCTGACAGC TTGTGCCCAT GGGCTGTGGC GTATGTCAGA 3501 TCGCAGGGTA GGAACGAGTC TGGCCTGGTG CCGGCCCAGT GTTTCCTCAG 3551 CTCATCCGCC CTCTGTTGCT CCCTAGCATT CCAGGAGCCA TCTTGGACTC 3601 TCCTCCCCAG GTTTGAAAGG CCATCAGATT AGCAGGGACG GGGTGTAGGG 3651 CATCACCCAA GGTTCCTTCT CTTAAACTAA GGGTGGGGGA TCTGAATGTT 3701 TTTATGTTGA CTGTTCTTGA CTAAATTTTC AAGAGTTTCA GAAGCAACAG 3751 GACAGACCAG ACGTTTCATT CTACCCTGGG GCGAACAGAA CTTCTTCCTC 3801 CCAAACAATG ACTTCCTGCC ATGTTTGATG GGGACAGCTA CCACTGTCCT 3851 CTGCCCCCAT TCCCCTTTCA GCTCCCATGA GCATGCATAG TTCACCAGAC 3901 CAATGGCCTA GCCATTCTCT AAGTCCCATC CTGGAAGAAG TTATTTCTTC 3951 AAGAGCTGCA CCTCTCCTCC TAGCATTAGT TTAGATCAAC TCAAGGAGTA 4001 TTTATTAATG GCTGCTGTCT CCAGTTTCTG GGGTTAAGCA CTAAGGACAC 4051 AAGAATCAAT CAGACCTTCT CCCTGAACTT AAGATAGCCA CAATCAGAAA 4101 AAGGACAAGG ACATGAGACA GTGGTGATGG CCATCAGACA GAGACTTCAA 4151 ATGCTGATGG AGGGCAGAGG AAGTACTTAG GGAGGTTGGT GTCAGAGGCA 4201 GGAGTGGGGG ATCAGGGAAG GTGGATTCTA GGAAAAGGGA GTGCCTGAGG 4251 TAGGCCTTAG AAGGGGATGA GTCAGATTTT TACAGAGGAG GAGGGCAGGG 4301 CTTGGGTCCA GTGGAGGAAG AAGGAAGGAG AGGCTTGGAA AGCCTTGTGT 4351 CTTGGGAAAA AAAGGCCTTT GAGCATATGG GTCCAGCCAC TCAGAAGTGC 4401 AGGGGCCATG CCTTGGTGTT CCAATAAGTG AATGGAAGCA GTGGTGGTAG 4451 CTACACTGGG CAGAGTTGGC AGGGTGCTGG TTCACTCTGC CCAGCCCTGA 4501 ATGTGTGCCT TAAAGGCCCC CTACAAGGGG CCCCATACGA CAGAGCTTTT 4551 AACTGGTGCC TTCCCTGTAC CCGCAGCAGC CACAAGTGGG CCCAGACTAT 4601 TGCAGCCTCC CATAAACATG TGAGCATGTT CTGAGTGTGC CATGATGTGA 4651 GTGGACCTGG CTGGAATCTT CGGAGAGCGA CTGAGGTGTT CAAATCGAAT 4701 CTCCCAGGAG GCTTCCTTCC AGCCCCCTAT TCTGGTAACT ACCAGGAGGC 4751 TTCCTTCCAG CCCCCTATTC TGGTAACTAC CAAAATCCCT CGGGTGCAAG 4801 TGTAGGGGTA GAGATGGAAG GATGAGAGGT GAAATTGACC TTTTGAAAGC 4851 AAAGCTCTGG CTCACAGGCC CCAAACTACC AGCCGTATCT AGCATATCCC 4901 CACCCTCCAC CCACTACCTC CTCCAACAAA GGAGTCAACT CAGTTGAAAA 4951 AACTGGTCCT TTGGCCTATC CATGGGTCAA AGTCCACCTC TCCTGGGGGC 5001 CTGGAGAGGA CTGAGCCTAC GGAAAGGGGA TACCTTCCCA CTCAGCACTG 5051 CTTCACACAG GCCCCCTGCC TGGGGCTCTC CAAGGAGCCT TCTTCACCCA 5101 CTTCCAGCTC CACTTCTGCA AGGTTAAGTC AAGTGAGAAC GATGAGAAAT 5151 AGGGAGATGG TGTCTCCTTA AGTCCTTGAT CTGCCTGTCT GTGGAATGGG 5201 AGGTTGGATT AGCTGCGCTG AGGTCCCATC CACAGCTGGT GCTCAGCTGC 5251 TTGAAGGGGA GACTCCCTCC TCTGTAACTT CTTTCTGGGG GATTGGGGTG 5301 GGCAGTACCT ATCCCCAGTC CCCTCCTAGC TTGACTTTAG TGGTTTCCAA 5351 TGTAGAAGTT AACAAAGTAT GCCCCATTCC TGTGACAAAA GCACAACCAT 5401 TCTGAAGTTA CTGGAGCATG GGCTCAGCTC ATCCTCCCTC TGGCCCCTTC 5451 TCCCATGGGG ACATCTCGGC CCAGCACCCC TATCCCATTT CCAGAGTTCT 5501 TCCTTCCCCA TCTGGGCCTT CATAAAATGC AGGGGAAGCC AGACTGGTCT 5551 CAGGAGCGCT AAAGCCCTTC CGTGGGGGGT CGTCTTTCTG GGACTAGCCC 5601 TGCTGTTTAG GACCTGGGAC CACAATGGGG TACCTGCCGA GGGGGTCCCC 5651 AAGAGATCCA GGCTGTCATG TGATTTATGG TGGCATGTGT TGTGTATTTG 5701 TTGGCTACTT GTGTCTTGAA ATCTAGAATT ATTTCACGCA GAATTGTCAC 5751 TGTTTGTCAG GAAGAGAAAA TGGGCTAGTG GAAGCCCAGT CTTGAGTTCT 5801 TGTCTTGTTA CCATTTAAAA TTGACATTTA ATTTTCAAAT CACTGTTGGT 5851 GCCTAATCAC TTAAGTTATT AATTTATTCT GTTGTATTCT TTTTTTTAAA 5901 TTGTAACATA TTTATCCGGT GGGTGGGACA GGAGTGTGTT CAAGTGGGTC 5951 ATGTTTTTGC TGTGGTGACA CATGGTACAG GCTTGGAGCT TGCAGGTCCC 6001 TTTCTACTGT GGTGTTGGAG CAGGACAATA AAGTCCACTA GAAATGCACC 6051 CC // LOCUS AB011420 2641 bp mRNA PRI 05-NOV-1998 DEFINITION Homo sapiens mRNA for DRAK1, complete cds. ACCESSION AB011420 NID g3834353 VERSION AB011420.1 GI:3834353 KEYWORDS DRAK1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2641) AUTHORS Akira,S., Sanjo,H. and Kawai,T. TITLE Direct Submission JOURNAL Submitted (23-FEB-1998) to the DDBJ/EMBL/GenBank databases. Shizuo Akira, Hyogo College of Medicine, Department of Biochemistry; Mukogawa-cho 1-1, Nishinomiya, Hyogo 663-8501, Japan (E-mail:akira@hyo-med.ac.jp, Tel:+81-798-45-6357, Fax:+81-798-46-3164) REFERENCE 2 (sites) AUTHORS Sanjo,H., Kawai,T. and Akira,S. TITLE DRAKs, novel serine/threonine kinases related to death-associated protein kinase that trigger apoptosis JOURNAL J. Biol. Chem. 273 (44), 29066-29071 (1998) MEDLINE 99003259 FEATURES Location/Qualifiers source 1. .2641 /organism="Homo sapiens" /db_xref="taxon:9606" gene 118. .1362 /gene="DRAK1" CDS 118. .1362 /gene="DRAK1" /codon_start=1 /product="DRAK1" /protein_id="BAA34126.1" /db_xref="PID:d1035103" /db_xref="PID:g3834354" /db_xref="GI:3834354" /translation="MIPLEKPGSGGSSPGATSGSGRAGRGLSGPCRPPPPPQARGLLT EIRAVVRTEPFQDGYSLCPGRELGRGKFAVVRKCIKKDSGKEFAAKFMRKRRKGQDCR MEIIHEIAVLELAQDNPWVINLHEVYETASEMILVLEYAAGGEIFDQCVADREEAFKE KDVQRLMRQILEGVHFLHTRDVVHLDLKPQNILLTSESPLGDIKIVDFGLSRILKNSE ELREIMGTPEYVAPEILSYDPISMATDMWSIGVLTYVMLTGISPFLGNDKQETFLNIS QMNLSYSEEEFDVLSESAVDFIRTLLVKKPEDRATAEECLKHPWLTQSSIQEPSFRME KALEEANALQEGHSVPEINSDTDKSETEESIVTEELIVVTSYTLGQCRQSEKEKMEQK AISKRFKFEEPLLQEIPGEFIY" polyA_site 2641 /note="19 a nucleotides" BASE COUNT 791 a 509 c 567 g 774 t ORIGIN 1 GGGGAGAGCG GGTGTTTGAA GGCTCCGCGG ACCGGCACTA GGAGCCGGGG 51 GCGGGTCCGT GACCCTCCGG CTGCTCGGAG TGAACAGGCG GCCAGGAAAG 101 AAGCGGGCCT GAACACCATG ATCCCTTTGG AGAAGCCAGG CAGCGGCGGC 151 TCCTCCCCAG GCGCCACCTC AGGCTCGGGC CGGGCAGGCC GGGGTCTGAG 201 CGGGCCGTGC CGGCCGCCGC CGCCGCCCCA GGCCCGCGGG CTGCTGACAG 251 AGATACGCGC CGTGGTGCGC ACCGAGCCCT TCCAGGACGG CTACAGCCTG 301 TGCCCGGGCC GGGAGCTGGG CAGGGGGAAA TTTGCAGTGG TGAGAAAATG 351 TATAAAGAAA GATTCTGGGA AAGAATTTGC TGCAAAGTTC ATGAGAAAAA 401 GAAGAAAAGG CCAAGATTGT CGGATGGAAA TAATTCATGA GATTGCTGTA 451 CTTGAACTAG CACAAGACAA TCCTTGGGTC ATTAATTTAC ATGAAGTTTA 501 TGAGACTGCA TCAGAAATGA TCTTAGTTCT GGAATATGCT GCTGGGGGTG 551 AAATCTTTGA CCAGTGTGTT GCAGACAGAG AAGAAGCCTT TAAAGAAAAA 601 GATGTTCAAA GACTTATGCG ACAGATTTTA GAAGGTGTTC ACTTTTTACA 651 CACTCGTGAT GTAGTTCATC TTGATTTGAA GCCTCAGAAT ATTCTGTTGA 701 CAAGTGAATC TCCATTGGGT GACATTAAGA TTGTTGATTT TGGCCTTTCA 751 AGAATATTGA AGAACAGTGA AGAGCTCCGA GAAATTATGG GTACCCCTGA 801 ATATGTGGCT CCTGAAATTC TTAGTTATGA TCCTATAAGC ATGGCAACAG 851 ATATGTGGAG CATTGGAGTG TTAACATATG TCATGCTTAC AGGAATATCA 901 CCTTTCTTAG GCAATGATAA ACAAGAAACA TTCTTAAACA TCTCACAGAT 951 GAATTTAAGT TATTCTGAGG AAGAATTTGA TGTTTTGTCT GAGTCGGCTG 1001 TTGATTTCAT CAGGACACTT TTAGTTAAGA AACCTGAAGA TCGAGCCACT 1051 GCTGAAGAAT GTCTAAAGCA CCCCTGGTTG ACACAGAGCA GTATTCAAGA 1101 GCCTTCTTTC AGGATGGAAA AGGCACTAGA AGAAGCAAAT GCCCTCCAAG 1151 AAGGTCATTC TGTGCCTGAA ATTAATTCGG ATACCGACAA ATCAGAAACC 1201 GAGGAATCCA TTGTAACCGA AGAGTTAATT GTAGTTACTT CATATACTCT 1251 AGGACAATGC AGACAGTCTG AAAAAGAGAA AATGGAGCAA AAGGCCATTT 1301 CCAAACGATT TAAATTTGAG GAACCTTTGC TACAAGAAAT TCCAGGAGAA 1351 TTTATCTACT GAGCAATATT TCCCTTTAGA ACTTCAAGAT TTCTACACTG 1401 AAAATGTTAA TATTATTTAT GGACCTCTGG CCAAATGGTA CATGTACTGG 1451 AAGTGGATAA CCAGTATCAC TTACACAAAC AAAAATAACT TTGTCAAATT 1501 TGTGGAGTTA GGTGGAAGCC AGATTTTAAA AGTTGCCAAC CAGGATATTT 1551 AACAGGTACA GTTACCCGTT TCAATGTTAT TTTTAAGAAG GGAGATGTTG 1601 GCACCTTTGA ATTCTACATC CTGTTTCTCC AGAATGAGAA TTTGTGTACA 1651 AAGATATTTG TATTCACTTT CTTTAAAAAA TCCAAGTAAA AGTGCCAAAA 1701 CTACATTTCT GTAAATCTCT TGCATTATTC ATATGTGTAT CTATATCTGC 1751 ATAATGTTTG TTAATGTCTA CTAAAATTGC TACTTTTTCA CTTTGGATTT 1801 GTTTTTGGCA AAATTTTAGT CTAAACAGAC ATCTAAATTT TCGAGACTTA 1851 GAATAACATT CACTAAAGTT TGATAATGTC TTTTTTAAAT ATTTTCTTAC 1901 TGCTTTATAG TGACTTGATT TGGTTTCTGT TGTGTTTTTG CCTAAATACT 1951 AGTAACATAT CAGTGAAAAA CCCTAATTTT TTTTCTCTTC AATGTCACTA 2001 GTAAGAGAGG TGGTAATTTT TCACCTCTGA AATTATTCTG GGTTTAGGTG 2051 TCAGCTTTTG AGCTGCTTCC CTCATCACAC CACACTCACA TGCATCTGTT 2101 CTCTCTCACA CTCTATACCC CTGCCACTTC CTGGCACCTC CCCCCATGCT 2151 TGTCATTTAA TTTTGGCCAC TTGTAGGTAT CAGTGTGATC TGATCAACAC 2201 TCTGGTTACC TTGGTCAGTG AAAAGATGCT ACTATATTGC TTTTGTCCCA 2251 AAGTGAGTAA AATCCCCTAG AGCAGAGAGA GAGAGAGAGA GAGAGTGTCT 2301 TCATGCCAAC CACAGCCTGT TTCTGCTGAG TTTCTTATCA GCCCTCAAGC 2351 TATGCTAGGA AAAGTTATAA AATGCCAAAA TATTTATAAA CATTTACTTT 2401 GTCCATAAAA AATTTACATT GGATACTCTG TAAGTAGGAG GTACTTTGTC 2451 CCAAAAAGAT GTATAAGAAT GTACTAATAG TTTTATTTGA TTAGGATTGA 2501 ACAGTTCAGT TGTATCTATG CCCCACAGTG ACCAGTAAAG TCCAATTAAA 2551 ATATGGAAAT GTAAAAGTGT ATGCCGAATA CCTTAAAGTA ACTAATTATC 2601 CTTACACACA AAAGGCTCAG TGCATTAAAT ATTTGCCTAT C // LOCUS AB014552 4040 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0652 protein, complete cds. ACCESSION AB014552 NID g3327117 VERSION AB014552.1 GI:3327117 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HK01711. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4040) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .4040 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HK01711" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 309. .1862 /gene="KIAA0652" CDS 309. .1862 /gene="KIAA0652" /codon_start=1 /product="KIAA0652 protein" /protein_id="BAA31627.1" /db_xref="PID:d1032588" /db_xref="PID:g3327118" /db_xref="GI:3327118" /translation="METDLNSQDRKDLDKFIKFFALKTVQVIVQARLGEKICTRSSSS PTGSDWFNLAIKDIPEVTHEAKKALAGQLPAVGRSMCVEISLKTSEGDSMELEIWCLE MNEKCDKEIKVSYTVYNRLSLLLKSLLAITRVTPAYRLSRKQGHEYVILYRIYFGEVQ LSGLGEGFQTVRVGTVGTPVGTITLSCAYRINLAFMSTRQFERTPPIMGIIIDHFVDR PYPSSSPMHPCNYRTAGEDTGVIYPSVEDSQEVCTTSFSTSPPSQLSSSRLSYQPAAL GVGSADLAYPVVFAAGLNATHPHQLMVPGKEGGVPLAPNQPVHGTQADQERLATCTPS DRTHCAATPSSSEDTETVSNSSEGRASPHDVLETIFVRKVGAFVNKPINQVTLTSLDI PFAMFAPKNLELEDTDPMVNPPDSPETESPLQGSLHSDGSSGGSSGNTHDDFVMIDFK PAFSKDDILPMDLGTFYREFQNPPQLSSLSIDIGAQSMAEDLDSLPEKLAVHEKNVRE FDAFVETLQ" BASE COUNT 898 a 1107 c 1027 g 1008 t ORIGIN 1 CGCCGGGAAG CGACCGGCTG CTGGGCTTAA GGCGGGAGTG ACCGCTTAAC 51 CAGTGAGGGA AGCACTGAAG AGCGCCAGTC GACGTGGGTG CGACAACTCG 101 CGGAGTCTTA GGAGCAAAAC GTCTGGGGCC TGCGAGCCAG GACCCTTCTG 151 AAGCCTTAGG TGTCTATCGG CGACGTGTAC GGTCACTGCA GCTCCGGAGC 201 GCGGAACCCT CAGCCAGGAG GCGCGGCTGG TCGGTCCCAG GTCCCGGCCT 251 CCGTAATGAG AGCCCGGAAC CACTCTTTGT GCCGCAGCTT CGCAGATTCC 301 TATAGGCAAT GGAAACTGAT CTCAATTCCC AGGACAGAAA GGACCTGGAC 351 AAGTTTATTA AATTTTTTGC CCTCAAGACT GTCCAAGTGA TTGTCCAGGC 401 TCGGCTTGGT GAAAAGATTT GCACTCGTTC ATCATCTTCT CCAACGGGTT 451 CAGATTGGTT CAACTTAGCA ATCAAAGACA TCCCAGAGGT TACACATGAA 501 GCAAAGAAGG CACTGGCAGG ACAGCTGCCT GCAGTCGGGA GGTCCATGTG 551 TGTGGAGATT TCACTTAAGA CTTCTGAGGG AGATTCCATG GAGCTGGAAA 601 TATGGTGTCT TGAAATGAAT GAAAAGTGTG ATAAAGAAAT CAAAGTTTCC 651 TACACGGTGT ACAACAGACT GTCATTGCTG CTGAAGTCCC TTCTTGCTAT 701 AACTAGGGTG ACACCAGCCT ATAGGCTCTC CAGGAAACAA GGGCATGAAT 751 ATGTCATATT ATACAGGATA TATTTTGGAG AAGTTCAGCT GAGTGGCTTA 801 GGAGAAGGCT TCCAGACAGT TCGTGTTGGG ACAGTGGGCA CCCCTGTGGG 851 CACCATCACT CTTTCTTGTG CTTACAGAAT TAACTTGGCA TTCATGTCTA 901 CCAGGCAATT TGAGAGGACC CCACCTATCA TGGGGATTAT TATTGATCAC 951 TTTGTGGACC GTCCCTATCC CAGCTCCTCT CCCATGCACC CCTGCAATTA 1001 CAGAACTGCT GGTGAGGACA CTGGAGTAAT ATACCCGTCT GTAGAAGACT 1051 CTCAAGAAGT GTGTACCACC TCTTTTTCCA CCTCCCCACC ATCCCAGCTC 1101 TCAAGCTCTC GCCTTTCCTA TCAGCCTGCT GCCCTGGGCG TTGGATCAGC 1151 TGACCTGGCT TATCCAGTAG TGTTTGCTGC TGGCTTAAAT GCTACACACC 1201 CTCACCAGCT GATGGTTCCT GGGAAGGAAG GTGGGGTACC CCTTGCTCCC 1251 AACCAGCCTG TCCATGGTAC CCAGGCTGAC CAGGAGAGAC TGGCAACCTG 1301 CACCCCTTCT GACAGAACCC ACTGTGCTGC CACACCCTCC AGTAGTGAGG 1351 ATACTGAAAC CGTATCAAAC AGCAGTGAGG GACGGGCCTC CCCTCACGAT 1401 GTCTTGGAGA CCATCTTTGT CCGAAAAGTG GGGGCTTTTG TCAACAAACC 1451 CATTAACCAG GTGACCCTGA CGAGTTTGGA TATACCCTTT GCCATGTTTG 1501 CTCCCAAGAA TTTGGAGCTG GAGGATACCG ATCCAATGGT GAATCCTCCA 1551 GATTCCCCAG AGACTGAATC TCCTCTCCAG GGCAGCCTGC ACTCAGATGG 1601 CTCCAGCGGG GGCAGCAGTG GCAATACCCA TGATGACTTT GTTATGATAG 1651 ACTTTAAACC AGCTTTTTCT AAAGATGACA TTCTTCCGAT GGACCTGGGG 1701 ACCTTCTATC GGGAGTTTCA GAACCCACCT CAGCTGAGCA GCCTCTCCAT 1751 AGATATTGGA GCACAGTCCA TGGCTGAAGA CTTGGACTCA TTACCAGAGA 1801 AGCTGGCTGT GCATGAGAAG AATGTCCGCG AGTTTGATGC CTTTGTGGAA 1851 ACCCTGCAGT AAAAGTATCC TTGAGTCCCA GCAGCACCCC CTTTTTGTGG 1901 CCCCAGGGCA TAAGCAGCCT CCCATGCATC AGCTGCTCCC ACCCCTCATC 1951 CTGCTCTGAG CCAGGTGGAA GGGAGGCTGG CTTCTCCCAT GGGGACCCAG 2001 AAGTCCCTAC TCTTGGACCT CCTGGAGACT CCGTGGCGGC AGTCAAGCCC 2051 AGTGCCCAGT TGGAGAAGAC TCACGTGCTG GCCTTGGAGA TGGGAAGAAC 2101 CTTCGTACGA AAAAGCCCTC AGCAGGGCCA TCTGTGTGCC CTGCCCATCA 2151 CCAACTGCTT CCCAAGGGTG TCATCCTGTT CCTCCTGCTG CCGGCCTCCT 2201 GCCTGGGCCT GCCTTGCAGC TGGCCCCTTC CCTGCCTGCT GTCACCATCC 2251 ACTGTTTGAC ATTCCAGCTG GTGGCCAAGA GATTGGTGTG GAGGCAGAAA 2301 GAGGAAGGAG ACAGTGCCAG GAGGAAGAAG GAAGGAGTCC CTTAGCTCTC 2351 TTCATTGTCC CCTTTACTTC CTGCTATCTT CTTCTCCTCT TCTTCTCTCT 2401 CTTGCCTCTA TGCCTGTATT TCTGGCAATA TGACAGGCCT GCCTACCCAA 2451 GATCAGAACT CCAAAACCAC TCCCACCCCT GAAGGTCGGG AGGGTCTGAG 2501 CAGCCCTGGT GGCTGCCTGT GCTCAGGTCC TCAGCTCCAT GGGAAATAAA 2551 AATGGCACCC TGAATCTCTA GGATTTTGTC ACTTGGAGTC ACAGCAAAGT 2601 TCTCTTCCTC TTGTCCCCCC GTTGCTGCTC CTTGGTTATA GAACATGGTA 2651 AATATTTATT ACTTTCAGAG AAACCAGATA TTTTATAGAG GAAATATGTT 2701 TGAGGTGAGT TGTTTTTCAC TTGGAGAAGG CGGAGGGCTC TTCCTGGGAC 2751 GGAGACCTCC TCCTCCGGAG GTTATTGAGA ATCCGGGCTG CTGCTTTGAG 2801 GATCTTCCCA CCATACAGAC AGCGAGATCC AAGAAGAGGG CTGGCCGGGG 2851 GCAAAGTCAC CTCCCAGTGT GGCTGCACTG GAACTGACTA AAGGCTTTAC 2901 CTTGGATAGT TGCGTATTCC TGGTGAGAGC CTTACATCTC CCACAGTTTC 2951 TGCAGAGTGA CTGACTCCAT TCTGGCAGCC CAGGAAGTCC TGGGTGCTAA 3001 ATGTGATGGC CACATGTAGT GGTTAGGGGA TGTTGTGTGT GTCCCCCAAC 3051 TGCCTGGGTA CTTGTTCCTG ATCCCTGGGG CTGTCCTGTG GAGCTTTTCC 3101 TCCTGCTTGG GCCTAGCTAC CATCTCCCTC TAATCCCAGG TTCTCTACAC 3151 TGCCCTGGGG TTTACCAGCT GGATTGGCTT CTGGTTGAGA AATCAAAGCT 3201 GGGCGTATGA TTGACTTAAC CCTTCAGGTA TTGTTACTTG AATAAGTCAA 3251 GTGCCTAGCC TCACCCACCT ATGATCTGTC CTTTCCCAGC CTCGCTGGTA 3301 GTCCTGGTCA AGGAGATCTA GGTCTACTCC ATTCCTCCTG GCCCACCTGG 3351 GGCATTCACT GGCAGCAGCT GTGCTTCAGT GGAGCAGGTG GTTCTCAGCT 3401 GCTTGTTAGT ATACTGCATG TGACACTGTT CCCACATACA AGGCTGACTT 3451 CTGAGGATTG GAGCAGGCTC TGGCGGGGAC CAGAGCTCTG CGTGCTGCTG 3501 CTGCCACCAA GAAGTGTTAG CAGAAGCAGT AGCAGCCAAC TGGCCCTCCT 3551 GACTTTGGCC CAGAGCACAT GCGTGGCTTG CTGAACCCAG GCTCAGGTTT 3601 ATCCCCAAGG CCCCAGCTTT GAGAAGGGGG AAGGCCCCTG GTAAGTTATT 3651 GATGCCCCCA TATTTCAGCT ACTGCTCTCT TTCCAAGGCC TTGCATGGAA 3701 AGGCCTAGCC ATTGTCTGAG GCAGCAATCT TTGGCATCTA CAGGTGGCAG 3751 CAGCCTTTCA CCAGGGCTCC ATCTGTGAAG AGTCTCAGCC ATGACTTTGA 3801 GCTGAGCTTG GGAGAAGTAA AGCAACTGTT AAGGCCAGCC CTTGCCCCTC 3851 AGACCTGCCA TGAAAGGAAT GAGCCCTAGA CTGACTCCTG CAGCACCCCC 3901 GGGACAGGCT GGGACCAGCT GTTTGTCTCC AGGTGTCAGA GTCCCTCCTC 3951 CTCCTCCAAC CTCTCCAACC TACTTTGTTT GGAAATACCG AGCTACACTT 4001 CAAAATGTAT TCAAGGGATT TCCAATAAAT TTTTTTCTGT // LOCUS AB014560 4210 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0660 protein, complete cds. ACCESSION AB014560 NID g3327133 VERSION AB014560.1 GI:3327133 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HK01902. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4210) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .4210 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HK01902" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 121. .1569 /gene="KIAA0660" CDS 121. .1569 /gene="KIAA0660" /codon_start=1 /product="KIAA0660 protein" /protein_id="BAA31635.1" /db_xref="PID:d1032596" /db_xref="PID:g3327134" /db_xref="GI:3327134" /translation="MVMEKPSPLLVGREFVRQYYTLLNKAPEYLHRFYGRNSSYVHGG VDASGKPQEAVYGQNDIHHKVLSLNFSECHTKIRHVDAHATLSDGVVVQVMGLLSNSG QPERKFMQTFVLAPEGSVPNKFYVHNDMFRYEDEVFGDSEPELDEESEDEVEEEQEER QPSPEPVQENANSGYYEAHPVTNGIEEPLEESSHEPEPEPESETKTEELKPQVEEKNL EELEEKSTTPPPAEPVSLPQEPPKAFSWASVTSKNLPPSGTVSSSGIPPHVKAPVSQP RVEAKPEVQSQPPRVREQRPRERPGFPPRGPRPGRGDMEQNDSDNRRIIRYPDSHQLF VGNLPHDIDENELKEFFMSFGNVVELRINTKGVGGKLPNFGFVVFDDSEPVQRILIAK PIMFRGEVRLNVEEKKTRAARERETRGGGDDRRDIRRNDRGPGGPRGIVGGGMMRDRD GRGPPPRGGMAQKLGSGRGTGQMEGRFTGQRR" BASE COUNT 1234 a 751 c 902 g 1323 t ORIGIN 1 GAGCCCGGGA GCCGGAGGTG TAGCGGCAGA GACATTGTTC TTGCCGGCTC 51 CCTACGGTGC CGTGTGTGCG TGAGAGAAGA CCAGTCTTTC CTCTAGCATT 101 TGACATTGTG CAGCAAAGAA ATGGTTATGG AGAAGCCCAG TCCGCTGCTT 151 GTAGGGCGGG AGTTTGTGAG GCAATATTAT ACTTTGCTGA ATAAAGCTCC 201 GGAATATTTA CACAGGTTTT ATGGCAGGAA TTCTTCCTAT GTTCATGGTG 251 GAGTAGATGC TAGTGGAAAG CCCCAGGAAG CTGTTTATGG CCAAAATGAT 301 ATACACCACA AAGTATTATC TCTGAACTTC AGTGAATGTC ATACTAAAAT 351 TCGTCATGTG GATGCTCATG CAACCTTGAG TGATGGAGTA GTTGTCCAGG 401 TCATGGGTTT GCTGTCTAAC AGTGGACAAC CAGAAAGAAA GTTTATGCAA 451 ACCTTTGTTC TGGCTCCTGA AGGATCTGTT CCAAATAAAT TTTATGTTCA 501 CAATGATATG TTTCGTTATG AAGATGAAGT GTTTGGTGAT TCTGAGCCTG 551 AACTTGATGA AGAATCAGAA GATGAAGTAG AAGAGGAACA AGAAGAAAGA 601 CAACCATCTC CTGAACCTGT GCAAGAAAAT GCTAACAGTG GTTACTATGA 651 AGCTCACCCT GTGACTAATG GCATAGAGGA GCCTTTGGAA GAATCCTCTC 701 ATGAACCTGA ACCTGAGCCA GAATCTGAAA CAAAGACTGA AGAGCTGAAA 751 CCACAAGTGG AGGAGAAGAA CTTAGAAGAA CTAGAGGAGA AATCTACTAC 801 TCCTCCTCCG GCAGAACCTG TTTCTCTGCC ACAAGAACCA CCAAAGGCTT 851 TCTCCTGGGC TTCAGTGACC AGTAAAAACC TGCCTCCTAG TGGTACTGTT 901 TCTTCCTCTG GAATTCCACC CCATGTTAAA GCACCAGTCT CACAGCCAAG 951 AGTCGAAGCT AAACCAGAAG TTCAATCTCA GCCACCTCGT GTGCGTGAAC 1001 AACGACCTAG AGAACGACCT GGTTTTCCTC CTAGAGGACC AAGACCAGGC 1051 AGAGGAGATA TGGAACAGAA TGACTCTGAC AACCGTAGAA TAATTCGCTA 1101 TCCAGATAGT CATCAACTTT TTGTTGGTAA CTTGCCACAT GATATTGATG 1151 AAAATGAGCT AAAGGAATTC TTCATGAGTT TTGGAAACGT TGTGGAACTT 1201 CGCATCAATA CCAAGGGTGT TGGGGGAAAG CTTCCAAATT TTGGTTTTGT 1251 GGTTTTTGAT GACTCTGAAC CAGTTCAGAG AATCTTAATT GCAAAACCGA 1301 TTATGTTTCG AGGGGAAGTA CGTTTAAATG TGGAAGAGAA AAAAACAAGA 1351 GCTGCAAGAG AGCGAGAAAC CAGAGGTGGT GGTGATGATC GCAGGGATAT 1401 TAGGCGCAAT GATCGAGGTC CCGGTGGTCC ACGTGGAATT GTGGGTGGTG 1451 GAATGATGCG TGATCGTGAT GGAAGAGGAC CTCCTCCAAG GGGTGGCATG 1501 GCACAGAAAC TTGGCTCTGG AAGAGGAACC GGGCAAATGG AGGGCCGCTT 1551 CACAGGACAG CGTCGCTGAA GCTCCACTGT TGGCAAAGTC TTGGCAGTGG 1601 TACATTATTC ATCGTGTTTG CATTCTTGTT AATTTTTTTT TTGGCTTTGG 1651 AATGTGACAC AGCCTTTTTG ATCATTTCTT TGATGTGAAA AGCATCTTTG 1701 GTTATCAGTT AAATTGAGGT GGACATTATT TCCCCAATTT CACAACAGGA 1751 TTCACATTGT TAATTTATAA ATCTAGACTT GGAGAATTAA GGACTGAGAA 1801 ATGACCATAT CTTAAACTAT CTACGACAAA GTGAACTTAA AAGGACATGC 1851 CCACTGAATT CAGGTCCTTT GAGTAAAAAA AAAATCTTCT GCTGCACATT 1901 TTGTTTAAGT GTTACTGTTT CTGCCTGTTA ATGCTGGGAA CACAAATAGT 1951 GCAATTTGTG CAATTGGAGA ATCTTGCCTT TTTTCTTGGC TCCCCCCAAA 2001 AATACAAACC AACAGAAACT TGTTATGCAC TCATCAAAAT GTACTAATGG 2051 GTACTCTGAA CTCATTAACA TTGACATCTG CAACAGGAGG CAACAGGGAA 2101 AAAATCTCAT CTTCTTTTCC AGTAGAAAAT AGTTTGTGAA ATGATGAGGG 2151 CATTTTATCT GCTTGCTGTG ACCAGCGTGT GTACACATAA ACCTTAACAA 2201 GACTACAAGT ATATTCCAGA AGGAAATCAT TTTAGTTATG AACTAAATAA 2251 TAAAAATTAG AACTTCAAAT GCGATGGTCT TGACTATTAG ACCAGATTTA 2301 GTAGCTCCAT ATCTAAGATT TTTCTACCTG CCCCTCTTCA GTACAGGGAT 2351 GGCTGGCTGC TCAACACACT CCTCCTCCCC TTTTTTCCTT TCTTTAAGCT 2401 GTGTACAGTG AAAATTGTCT TTACTGTATT TTTGTTCTCT GGTAATGTAA 2451 TAAGCATGAT GGTGCCTTCT ATTAATACAT CATTCCAGTC TTGCTGGTAA 2501 TTTTGTACAG TATAGTGTAT GAATTGCTGT GCTGCAAAGC CAAACAGCTG 2551 CAAAATGTTG AAAAATCATC GAAATGTATA AAAATTGCAG TATCTTTAAA 2601 ATCAGTAAAA TGGACTAGCA TATTATTTAT CTTGTTCTTC AGTTAACAAC 2651 TTTGTGTTCT CTGTGGGAGG GAGGGAGTCC TGTGTGTTTG TGGGGAGAGG 2701 GAAGGAGGAA GTCAGTTATT TGAGTAAGCC TCTAGTTGAC TTTTCTCTTA 2751 GCCTGAATGT GGACGTTGAA ACATATCACT TCAGGGCTTG GAAAAGTCAG 2801 TCAACTTGAC GTACATTTTT AGTGACATTT TAAAAGCAGT CAGATTCTAT 2851 AAATGGCAAG TAAGCCTGAA GTGAGGATAC TGCAATTTTC GGAGAAAAGA 2901 ACAGCAGCTC TTTAAGTGTT TGCATTTTCT ATTTGGGGGG CAGGGAACTG 2951 TCATTCATTT TGCACAATTC TTGAACTGAT GTCAGCACCC GAGTGGCTCC 3001 TGAATTTAAG TCTGGGACGA CATCTTTTAT TTTTACATGA ATCTTTAAAC 3051 AATTCTGTGA GCAAAGTTTG TAGCTGCTGG ATTATTGTCT GTCTTTATAG 3101 CAAGTTCCAG TAAACCACAA GTATGGCAAA GCTTATCCAA TTTTATGCTT 3151 GGAGCAGTCA GTACATACCA GTTTCTGATG TTTCAGGCAG GAGTGGGGTA 3201 AATAAGTGTG ACCACTTAAA GCTGCTCGTT AGCATGGAAG ACTTCTCCAT 3251 TCTATCTTTG TAAAACAGAC AAGATATGCA CTTGACATAG TAGCAAATTG 3301 GTTCTGAATT ATGCAACTGT TTGCTATTTA GTAAACTAGC AAATGATGCA 3351 TGTATTTTGT TTTTCATGTA CTGGGCAATA TGAGTAAAAT CTGTCCCTTT 3401 TTCCCCCTTT GAATGAGGTC TTCCATGTTT GAGGGAAAGT CTTGCACTAT 3451 TGCATATATT TTGGGGACAC AGATTTTCAT AGTTTCCATT TTTGGGGGGC 3501 TTAAGGATTT TTTTTTTTTC TGTTTGAAAC AGTTTTATAC TTTCTGATAT 3551 AGTACTTGAA ATTCTTACCA GAAAATTACT TTGGAGTTTT GAAGCCTTTA 3601 TTAATACTAC TTTTAAAGAA GCAGTTGTTT TATTGTCAAT GTTTTTTTTC 3651 CCCCAAGCAT ATTTTCTTGT ATTTCTGTTT CCATATATAT ATATATATAT 3701 AATTTCCAAT TCAGGATATT GCCCTGCCAT CCATGAAAAC TGTTCTGGCA 3751 CCAAAAGTAA TGACAAATGT TAAGTGTAAT AATAGAAAAG TAGAGCAAAG 3801 AGCCATTCAG CTTCAGTCTT TACATACCAT GAATAAAACA TTAAAACATC 3851 ATATGGAGAA GTTTACATGG TGATTGTTCA CCTGCAGTAC TGTGGAGTTT 3901 TAACATTTTG TCCTCTTTTC AGTGAAACAG AGTAAAAATA TTCATCTACC 3951 ATTACTGTTA TTTGCTGATT TTGTTTTATT TTTTGATGGT AATATTCTAT 4001 CCTTATGACA CTATTGCAAC CAAATTGGCT TTACCATCTT GGCTTTAGTA 4051 GGTATAGAAG ACAATGGATT ACCATCTTTA TTGCTGTAAT GTGTTAAGCA 4101 TTATATGCTA GTAGAATCTA GTTTAATTGT TTCAGGTGGA AAGTATTCTT 4151 TGAGTTTCCA TATTGAATGT GTTTGGACTA AACAAACAAT AAACTACTGA 4201 TGTCTGCAGC // LOCUS AB014569 4550 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0669 protein, complete cds. ACCESSION AB014569 NID g3327151 VERSION AB014569.1 GI:3327151 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HK02346. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4550) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .4550 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HK02346" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 1017. .3359 /gene="KIAA0669" CDS 1017. .3359 /gene="KIAA0669" /codon_start=1 /product="KIAA0669 protein" /protein_id="BAA31644.1" /db_xref="PID:d1032605" /db_xref="PID:g3327152" /db_xref="GI:3327152" /translation="MSKMPAKKKSCFQITSVTTAQVATSITEDTESLDDPDESRTEDV SSEIFDVSRATDYGPEEVCERSSSEETLNNVGDAETPGTVSPNLLLDGQLAAAAAAPA NGGGVVSARSVSGALASTLAAAATSAPAPGAPGGPQLAGSSAGPVTAAPSQPPTTCSS RFRVIKLDHGSGEPYRRGRWTCMEYYERDSDSSVLTRSGDCIRHSSTFDQTAERDSGL GATGGSVVVVVASMQGAHGPESGTDSSLTAVSQLPPSEKMSQPTPAQPQSFSVGQPQP PPPPVGGAVAQSSAPLPPFPGAATGPQPMMAAAQPSQPQGAGPGGQTLPPTNVTLAQP AMSLPPQPGPAVGAPAAQQPQQFAYPQPQIPPGHLLPVQPSGQSEYLQQHVAGLQPPS PAQPSSTGAAASPATAATLPVGTGQNASSVGAQLMGASSQPSEAMAPRTGPAQGGQVA PCQPTGVPPATVGGVVQPCLGPAGAGQPQSVPPPQMGGSGPLSAVPGGPHAVVPGVPN VPAAVPAPSVPSVSTTSVTMPNVPAPLAQSQQLSSHTPVSRSSSIIQHVGLPLAPGTH SAPTSLPQSDLSQFQTQTQPLVGQVDDTRRKSEPLPQPPLSLIAENKPVVKPPVADSL ANPLQLTPMNSLATSVFSIAIPVDGDEDRNPSTAFYQAFHLNTLKESKSLWDSASGGG VVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVERNSLLERENALLKSLSSND QLSQLPTQQANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA" BASE COUNT 995 a 1347 c 1207 g 1001 t ORIGIN 1 CCCAGCATCC CTGGCGCGGC CATTTCAGCC CCATCTTGGC CCAGCGGAGG 51 GAGCTGCAGC CGCCGCTTCA GCAGTTTGAA TTTCAGTTTC TTCGGGGTTT 101 AGGAATTGGG CGCACCGAGG AGGAGCCGGA GAAGCCACCA CCGCTACCTC 151 ACACAGCCGG GGCGCCTCCG GTCCCGTGCC AGCGGTCTGC CGCCGCCGCC 201 CCTCTCTGAG AGAAGCCGCG AGGGGAGGCC GAGACCGCCG TCGCCGCCCG 251 CCCGCAGGGG TGGTTCCCCC TGTAAGGGGA GGACCGAGCC GGCTTTCCCC 301 TCCCCCAGAG CGGGTTTCCG CCTGAGGCAG TCGACATGTC CCTGGGGCTG 351 AGCTCCGGCT AGAGCCGGGG AGGGAGGGCC GCCTGCCCGC GCCACAGCCC 401 CACCGCCCGC CCGCGGACCT TCAGCCAGCT CCGGCCCGCG GCTCCCGGCT 451 GAATTAGGCA TCTCCGACTC CGGACTCGCC GCTCCTTCCC TCAGTCAGAG 501 AGCCCAGCGC TGACGCCGGC ACCGGCCTGA GGAGCCCGAG CGGGGCAGCA 551 CCGCCCCTCA CGCCGCCGCC ACCGCCTCCT CCTCCTCCTC TTCGCCCTCC 601 CACTCCCACC CCCAGCCAAG GCCTGCTGAC CGCGCCGAGC GCCGAGCTGG 651 ACTGACCACG GCTGCCCGGA GACGAGAGAG GAAGCAGCCG GCCCGCCCCT 701 GGGTTCGCGC TCTCGCCGCC TCTGAGGGAA TTGAATTGAG GCGCCGCGGC 751 TGCGAGAGCT AAAAAGGAAG GAGGAGCCGC CGCGGGACTG AGACGGGGGC 801 AGAGCCGAAG AGACCGACAC AGAGAAGGAA ACGAGGAGGA GGATGTCTCA 851 CCGGGCGGCC AGCGCCTGGA TCAGCCCGTG ACTCTTAACA GCGGCGGGCC 901 TCAGACCCCA GCGCAGACTC GGACTTTGTC TTTGGGGGCC CGTGCTCTGC 951 CCTCCCCGGT TTCCGACAGG ACCCAGAGGA GCCGGCGTGC CTCTCTGCCC 1001 TCCAGCCTTC TTCACCATGT CCAAGATGCC GGCCAAGAAG AAGAGCTGCT 1051 TCCAGATCAC CAGTGTCACC ACGGCCCAGG TGGCCACTAG CATCACCGAG 1101 GACACCGAGA GCTTGGACGA CCCGGACGAG TCACGCACAG AGGACGTCTC 1151 CTCCGAGATT TTCGACGTCT CTCGGGCCAC GGATTATGGC CCTGAGGAGG 1201 TCTGCGAGCG CAGCTCTTCC GAAGAGACGC TTAACAATGT TGGGGATGCG 1251 GAGACTCCCG GGACCGTCTC CCCAAACCTC CTCCTAGATG GGCAGCTGGC 1301 AGCGGCGGCT GCTGCTCCCG CCAACGGAGG AGGAGTCGTT TCGGCCCGGA 1351 GCGTGTCTGG GGCGCTCGCC AGTACCCTGG CGGCGGCTGC CACTTCGGCC 1401 CCCGCCCCCG GAGCACCCGG CGGCCCCCAG CTCGCGGGCT CATCCGCCGG 1451 GCCAGTGACT GCAGCCCCAT CTCAGCCTCC CACCACATGT AGTTCCCGTT 1501 TTCGCGTGAT CAAGCTGGAC CACGGGAGCG GAGAGCCCTA TAGACGCGGC 1551 CGATGGACGT GTATGGAATA CTATGAGAGG GATTCAGACA GCAGCGTCCT 1601 GACTAGATCC GGGGATTGCA TTAGACACAG CAGTACTTTT GACCAGACTG 1651 CGGAGCGGGA CAGCGGCCTG GGCGCCACCG GAGGGTCGGT GGTGGTAGTA 1701 GTGGCCTCCA TGCAGGGGGC GCACGGGCCC GAGTCGGGAA CTGACAGCTC 1751 CTTGACTGCT GTGTCACAGC TACCCCCGTC GGAGAAAATG AGCCAGCCCA 1801 CTCCGGCCCA GCCGCAGAGT TTTAGCGTTG GGCAGCCACA GCCGCCGCCG 1851 CCACCCGTAG GTGGGGCTGT GGCTCAAAGC TCGGCTCCGC TGCCGCCGTT 1901 CCCGGGAGCC GCGACCGGGC CGCAGCCAAT GATGGCAGCC GCGCAGCCCA 1951 GCCAGCCCCA GGGAGCGGGG CCCGGGGGAC AGACTCTGCC GCCGACGAAT 2001 GTAACCCTGG CGCAGCCGGC TATGTCCCTG CCTCCGCAGC CGGGCCCTGC 2051 AGTGGGCGCC CCCGCGGCGC AGCAGCCCCA GCAGTTCGCG TATCCTCAGC 2101 CTCAGATACC GCCCGGACAT TTGCTGCCCG TCCAGCCCTC CGGCCAGAGT 2151 GAGTACCTGC AGCAGCACGT GGCCGGCCTG CAGCCGCCAA GCCCCGCGCA 2201 GCCCTCGTCC ACCGGCGCCG CAGCGAGCCC CGCCACGGCG GCCACCCTTC 2251 CCGTGGGCAC CGGCCAGAAT GCTTCCTCGG TGGGCGCGCA GCTCATGGGC 2301 GCGTCTTCCC AGCCCAGCGA AGCCATGGCC CCCCGGACGG GACCAGCGCA 2351 AGGCGGGCAG GTCGCGCCTT GTCAGCCGAC TGGAGTGCCC CCGGCTACTG 2401 TGGGAGGCGT GGTGCAGCCG TGCCTCGGTC CTGCCGGGGC TGGGCAGCCC 2451 CAGTCCGTGC CTCCGCCGCA GATGGGTGGC AGTGGTCCGC TGTCAGCCGT 2501 ACCTGGTGGC CCTCACGCCG TGGTGCCCGG AGTTCCAAAC GTGCCTGCAG 2551 CCGTGCCCGC TCCAAGCGTG CCTAGTGTGT CTACCACTTC TGTTACTATG 2601 CCAAATGTAC CCGCGCCTCT GGCCCAGTCG CAACAGCTGA GCAGCCATAC 2651 GCCAGTCAGC AGGAGCAGCA GCATAATCCA GCATGTTGGG CTGCCCTTAG 2701 CGCCAGGCAC ACACAGCGCA CCAACAAGTC TACCACAGTC TGACCTAAGC 2751 CAGTTTCAAA CTCAGACCCA GCCTTTAGTC GGGCAAGTCG ACGATACTAG 2801 AAGAAAATCA GAACCCCTAC CTCAACCACC ACTTTCTCTC ATTGCTGAAA 2851 ATAAGCCTGT TGTGAAGCCG CCTGTTGCAG ATTCCCTGGC AAACCCCCTT 2901 CAGTTAACAC CTATGAACAG TCTGGCCACC TCTGTATTCA GCATAGCTAT 2951 TCCTGTTGAT GGTGATGAAG ACAGGAATCC TTCAACTGCT TTCTACCAAG 3001 CGTTCCATTT GAACACGTTA AAGGAATCAA AGAGCCTCTG GGATAGTGCA 3051 TCTGGGGGAG GTGTTGTAGC CATTGACAAC AAAATAGAAC AAGCAATGGA 3101 TCTGGTGAAA AGCCATTTGA TGTATGCAGT AAGAGAAGAA GTGGAAGTTT 3151 TAAAGGAACA AATAAAAGAA TTAGTTGAAA GAAACTCTTT ACTTGAACGA 3201 GAAAATGCAC TGTTAAAATC TCTTTCAAGC AATGATCAAT TATCCCAACT 3251 CCCAACCCAA CAGGCCAATC CTGGTAGCAC TTCTCAACAG CAAGCAGTGA 3301 TAGCACAGCC TCCGCAGCCA ACGCAACCTC CACAGCAGCC GAATGTCTCC 3351 TCAGCATAAA GCTTTCTTAA GCCTCATTAA GAAAAAAACT GAAAGCAATC 3401 TATCCTTGTG TGCCACTGGT GTTCTTTCCA CTTTATACGA AAGCAAGTAG 3451 CCATGCTTTG GTTGTGTGTT TGGCCTTTTC AGTATTAGAC AATCATTCTA 3501 CAAGAGCTTT TCCTCTCTCT GAGATGTCAT GCAGCGCTGT TGATGTCCAG 3551 TTCTATGTCA TCAGTACACA AGGAGAATAA TAGATGGGGT TTATTAAAGC 3601 GAGCAAAGTC TGCATTTTAC CTGGTGCGCA TGAGTGGGGT CTTTAAGAGT 3651 TTTGGTGGCT CTCCCATGTT TCCTATTACC CATGGATTTA CCCTGAGCCT 3701 TCCTATCACA TTATAAATAA CAGTTCATCT AAAGAGCCAC TTTTCTTTCT 3751 GATTCAGTAA CATTTGCCTA CATAAGTTTT CATTTATTTG TGTTTTATTT 3801 ATTACAGGGC TGCTATTTTC ATAATGTACA TGAACAATGT CACAGAACTT 3851 TTTTAATTTT TTTGAATAAT TATAAGTATC AGTAAAGGAA GTGAAAGACA 3901 GGATTGCATT TAATAGATAA AACGTTTAGG CAATAATTGA ACAAAAGAAT 3951 CCTGGCATAT TTCTAACACT AATGGCAATT TACTTATGGT ATTTATTTTC 4001 AGTAGTAAAG ACCCAGCTTG AATGTAAATT TTGTATAGTG TAAGTATGAA 4051 GAACATAGTG CAACTGTACA GGTAGTCACC AGTTATTGTG ATATGATAAA 4101 TAATTGGGCT ATTTTGATGA AGAAAACTTT GTTCATTTGT TTCTACTTTC 4151 TAAGAGAAAT TGCCACGATT CCTCTGCTTT TCAACATTTC GTATGACTTT 4201 TTTTTCGGGT GGGAATAAAA AGCTGTGAAA TTGTTCAACC TACTTTGTAA 4251 CCAAAGAAGC AAAGCTGTGT AATGGAGTTT GGTTTTTTTT TTGTTGTTTT 4301 TTTTTTTTTT GTCTTTTTTT TTTTTATAAT GCACATTCTT TATGTATTTT 4351 TATTTAGTGT TTTCTCAGTC ACAATTTTCT TTACTGTCTA GCATGATCTG 4401 CATGACCTAT AATCTTTGAA CCACTTTCGT ACCTCATGTT TTTATCCAGC 4451 ACTCTTATTG TAATATGTAC TAGTCTGTGA ACAATGTCAA ATAAAAGAGA 4501 ACGAACAGGT AGTTTGGTGG AGCTGAGCTA GTGTACAATA CACTAGTTGT // LOCUS AB014585 3981 bp mRNA PRI 06-FEB-1999 DEFINITION Homo sapiens mRNA for KIAA0685 protein, complete cds. ACCESSION AB014585 NID g3327183 VERSION AB014585.1 GI:3327183 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HK02959. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3981) AUTHORS Ohara,O., Suyama,M., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (26-MAY-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) REFERENCE 2 (sites) AUTHORS Ishikawa,K., Nagase,T., Suyama,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. X. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 5 (3), 169-176 (1998) MEDLINE 98403880 FEATURES Location/Qualifiers source 1. .3981 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HK02959" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 354. .3137 /gene="KIAA0685" CDS 354. .3137 /gene="KIAA0685" /codon_start=1 /product="KIAA0685 protein" /protein_id="BAA31660.1" /db_xref="PID:d1032621" /db_xref="PID:g3327184" /db_xref="GI:3327184" /translation="MFWKFDLNTTSHVDKLLDKEHVTLQELMDEDDILQECKAQNQKL LDFLCRQQCMEELVSLITQDPPLDMEEKVRFKYPNTACELLTCDVPQISDRLGGDESL LSLLYDFLDHEPPLNPLLASFFSKTIGNLIARKTEQVITFLKKKDKFISLVLKHIGTS ALMDLLLRLVSCVEPAGLRQDVLHWLNEEKVIQRLVELIHPSQDEDRQSNASQTLCDI VRLGRDQGSQLQEALEPDPLLTALESQDCVEQLLKNMFDGDRTESCLVSGTQVLLTLL ETRRVGTEGLVDSFSQGLERSYAVSSSVLHGIEPRLKDFHQLLLNPPKKKAILTTIGV LEEPLGNARLHGARLMAALLHTNTPSINQELCRLNTMDLLLDLFFKYTWNNFLHFQVE LCIAAILSHAAREERTEASGSESRVEPPHENGNRSLETPQPAASLPDNTMVTHLFQKC CLVQRILEAWEANDHTQAAGGMRRGNMGHLTRIANAVVQNLERGPVQTHISEVIRGLP ADCRGRWESFVEETLTETNRRNTVDLAFSDYQIQQMTANFVDQFGFNDEEFADQDDNI NAPFDRIAEINFNIDADEDSPSAALFEACCSDRIQPFDDDEDEDIWEDSDTRCAARVM ARPRFGAPHASESCSKNGPERGGQDGKASLEAHRDAPGAGAPPAPGKKEAPPVEGDSE AGAMWTAVFDEPANSTPTAPGVVRDVGSSVWAAGTSAPEEKGWAKFTDFQPFCCSESG PRCSSPVDTECSHAEGSRSQGPEKAFSPASPCAWNVCVTRKAPLLASDSSSSGGSHSE DGDQKAASAMDAVSRGPGREAPPLPTVARTEEAVGRVGCADSRLLSPACPAPKEVTAA PAVAVPPEATVAITTALSKAGPAIPTPAVSSALAVAVPLGPIMAVTAAPAMVATLGTV TKDGQMPRQKELP" BASE COUNT 877 a 1140 c 1205 g 759 t ORIGIN 1 TCGGAGTGCC GCCCGCGGCC CCGAGTCGGT CTCGAGCCGC CGGCCGGCCG 51 TGCCGGTGTC CGTAGGCGCT GCGCCCTCGG CCGGGCCCAT GTGTGTGCGG 101 CCCGCCCGAG GCCGCCCGGG CTTTGCCTCC ACCAGCGCCC TGGCCTCCGC 151 TCGGGCCTCC ACACGGGCCT CCGAAGAGCT GCCGCGACGC CCGGCCCGCA 201 GGGCAGGTAA AGAGATTATA AATCTTCCAC TGAATGAAAA AAATTTTCTT 251 AAAGCTGCAT ATACTCCAAG AAAAAAACCA CAAATGTTTT TCTGTTTTGC 301 CTGAATACAT GATTTAAACA AGAGATTTCC ACAGAAGCTC TGCGGCCGTC 351 ACGATGTTCT GGAAGTTTGA CTTGAACACC ACGTCCCATG TTGACAAGCT 401 GCTGGACAAG GAGCATGTGA CGCTGCAGGA GTTAATGGAT GAAGATGACA 451 TCTTGCAGGA GTGTAAGGCT CAGAACCAGA AGCTGCTGGA CTTCCTGTGC 501 AGGCAGCAGT GCATGGAGGA GCTGGTGAGC CTCATCACAC AGGATCCGCC 551 CCTGGACATG GAGGAGAAGG TCCGCTTCAA ATATCCAAAC ACAGCCTGTG 601 AGCTTCTGAC TTGTGATGTG CCGCAGATCA GCGACCGCCT CGGTGGGGAC 651 GAGAGCCTGC TGAGCCTCCT GTACGACTTC TTGGACCATG AGCCGCCTCT 701 CAATCCTCTG CTCGCCAGTT TTTTCAGCAA GACCATTGGC AATCTCATTG 751 CAAGAAAAAC CGAACAGGTG ATTACGTTTT TGAAGAAGAA GGACAAGTTC 801 ATCAGCCTGG TGTTGAAGCA CATCGGCACC TCAGCGCTTA TGGACCTGCT 851 GCTGCGCCTG GTCAGCTGTG TGGAGCCAGC CGGGCTCCGG CAGGACGTCC 901 TGCACTGGCT GAATGAAGAG AAGGTCATCC AGAGGCTTGT GGAGTTGATC 951 CACCCGAGCC AGGATGAAGA TAGGCAGTCA AATGCTTCTC AGACTCTCTG 1001 TGACATAGTT AGGCTGGGCA GAGACCAGGG CAGTCAGCTG CAAGAGGCTC 1051 TGGAGCCAGA CCCGCTCCTC ACAGCGCTGG AGTCGCAGGA CTGTGTGGAG 1101 CAGCTTCTGA AGAACATGTT TGATGGAGAC CGGACGGAGA GCTGCCTCGT 1151 CAGTGGGACT CAGGTGTTAC TCACCTTGCT GGAAACCAGG CGGGTTGGGA 1201 CAGAGGGCTT GGTGGACTCC TTTTCTCAGG GACTGGAAAG GTCATACGCT 1251 GTCAGCAGCA GCGTACTACA CGGCATCGAG CCTCGGCTGA AGGACTTCCA 1301 CCAGCTCCTG CTCAACCCGC CCAAGAAGAA AGCGATCCTG ACCACCATTG 1351 GTGTGCTGGA GGAGCCCCTG GGGAATGCCC GTCTGCATGG CGCCCGCCTC 1401 ATGGCAGCAC TGCTGCACAC AAACACACCC AGCATCAACC AGGAGCTCTG 1451 CCGGCTCAAC ACGATGGACT TACTGCTGGA CTTGTTCTTT AAGTACACCT 1501 GGAATAACTT TTTGCACTTC CAAGTGGAAC TATGCATAGC CGCTATTCTC 1551 TCCCACGCTG CCCGTGAGGA GAGGACAGAA GCCAGCGGAT CCGAGAGCAG 1601 GGTGGAGCCT CCGCATGAGA ACGGGAACCG GAGCCTGGAG ACTCCCCAGC 1651 CGGCCGCCAG CCTCCCTGAC AACACAATGG TGACCCACCT GTTCCAGAAG 1701 TGCTGCCTGG TGCAGAGGAT CCTGGAGGCC TGGGAAGCCA ACGACCACAC 1751 GCAGGCAGCG GGTGGCATGA GACGTGGGAA CATGGGCCAC CTCACACGGA 1801 TCGCCAACGC GGTGGTGCAG AACCTGGAGC GGGGCCCTGT GCAGACGCAC 1851 ATCAGCGAGG TCATCCGAGG GCTCCCTGCG GACTGCCGTG GCCGCTGGGA 1901 GAGCTTCGTG GAGGAGACGC TGACGGAGAC GAACCGCAGG AACACTGTGG 1951 ACCTGGCCTT CTCTGACTAC CAGATCCAGC AGATGACAGC CAACTTCGTG 2001 GATCAGTTTG GCTTCAATGA TGAGGAGTTT GCCGACCAGG ACGACAACAT 2051 CAATGCCCCG TTTGACAGGA TCGCAGAGAT CAACTTCAAC ATCGACGCTG 2101 ACGAGGACAG TCCCAGCGCA GCTCTGTTTG AGGCCTGCTG CAGTGACCGC 2151 ATCCAGCCCT TTGATGATGA TGAGGACGAG GACATCTGGG AGGACAGTGA 2201 CACTCGCTGT GCTGCCCGGG TGATGGCCAG ACCCAGGTTT GGAGCCCCCC 2251 ATGCTTCAGA GAGTTGCTCA AAGAATGGCC CAGAGCGTGG AGGCCAGGAT 2301 GGGAAGGCGA GCTTGGAAGC ACACAGAGAT GCACCTGGGG CAGGTGCCCC 2351 ACCGGCCCCC GGGAAGAAGG AAGCCCCCCC TGTGGAGGGT GACTCAGAAG 2401 CAGGCGCCAT GTGGACGGCA GTGTTTGATG AGCCAGCGAA CTCAACGCCC 2451 ACAGCCCCAG GAGTGGTGAG GGACGTGGGT TCCAGTGTGT GGGCAGCTGG 2501 CACCTCAGCT CCAGAGGAGA AAGGCTGGGC CAAGTTCACT GACTTCCAAC 2551 CTTTCTGCTG CTCCGAGTCA GGGCCCAGGT GCAGCTCTCC GGTGGACACA 2601 GAATGCAGCC ATGCTGAGGG CAGCCGGAGC CAAGGCCCTG AGAAAGCCTT 2651 CAGCCCGGCT TCTCCATGTG CCTGGAACGT GTGTGTCACC AGGAAGGCCC 2701 CCCTGCTGGC CTCTGACAGT AGCTCCTCTG GGGGCTCCCA CAGCGAGGAT 2751 GGCGACCAGA AGGCAGCGAG TGCCATGGAT GCGGTGAGCA GGGGTCCCGG 2801 CCGGGAGGCC CCCCCGCTGC CCACAGTGGC CAGGACAGAG GAGGCGGTCG 2851 GCAGGGTCGG GTGTGCTGAC AGCCGGCTGT TAAGCCCTGC CTGCCCCGCG 2901 CCAAAGGAAG TGACTGCTGC CCCAGCCGTG GCTGTGCCCC CCGAGGCTAC 2951 TGTGGCCATC ACCACAGCAC TGAGCAAGGC TGGCCCCGCC ATACCCACCC 3001 CAGCAGTCTC TTCTGCACTG GCCGTGGCGG TCCCCCTAGG GCCCATCATG 3051 GCAGTCACAG CAGCCCCAGC CATGGTGGCC ACCCTGGGGA CAGTGACAAA 3101 GGACGGACAG ATGCCCCGCC AGAAGGAGCT GCCTTAAATG GCCCAGTGTG 3151 ATGCTGCTGC CGCCCGGCCA CGGCCCACCC TGGTCAGGCT GCCTCCTTAA 3201 TCGAGAAAAC TACCTGGTGA TGCAATCTTT TTTTTTTTAA TTTAATTTAA 3251 TTTTAAAATA AATGCTGCAT TGGTAAAGCT GGCAGTTGAA ACCAGTTGGA 3301 CGGCCCAGCT TGCGTCTCTT CTGCCTGAGT GGGCCTCTCA GGTCACTCGT 3351 GCCCTGCTGG AGGACAGAGG GGCACCTCAG CCGCCCCCAA GCCCAGAGCA 3401 CAGCAATAAG GTCGGCCTGC AGGAGCCGGG GTGGGGGTGG GGGGGGCAGG 3451 ACCCTGAGAT GCCACCAGGA CCTGATGGGC CAGGAAGGGC GTGGACATGG 3501 AGGCTGTTTT TACAGTTTTT TTTTGTTGTT GTTTTGTTTT TAAAGAATAC 3551 AGAAGGAGCC AAGCTTTTTT GCACTTTGTA TCCAGCTGCA AGCTCAGGGC 3601 AGAGTCAAGG GCCTGGGTTG GAAAAACCTG ACTCACAGGA ATGCATAATT 3651 GACCCTTGCA GCTACCCAAT AGCCCTTGGA GCTGGCACTG AACCAGGCTG 3701 CAAGATTTGA CTGCCTTAAA AACACAAGGC CCTCTAGGCC TGGCAGGGAT 3751 GTCCCTGTGC CCAGCACAGG GTGCCTGGCA GGGGGAGACC ACAGGTATGC 3801 AGGTGGGGGG ACATGGTGTG GCACTGGGGG CTCGAAGACT GGTTTCTAGC 3851 ACTACCGGTC ACGGCCATGT CGTCCTAGAA GGGTCCAGAA GATTATTTTA 3901 CGTTGAGTCC ATTTTTAATG TTCTGATCAC CTGACAGGGC ACCCCAAACC 3951 CCCAACTCCC AATAAAAGCC GTGACGTTCG G // LOCUS AB016247 2125 bp mRNA PRI 09-OCT-1998 DEFINITION Homo sapiens mRNA for sterol-C5-desaturase, complete cds. ACCESSION AB016247 NID g3721881 VERSION AB016247.1 GI:3721881 KEYWORDS sterol-C5-desaturase; C5D. SOURCE Homo sapiens (strain:caucasian) 9-year old female liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2125) AUTHORS Nishino,H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1998) to the DDBJ/EMBL/GenBank databases. Hideaki Nishino, Hokkaido University School of Medicine, Department of Biochemistry; N15W7 Kita-ku, Sapporo, Hokkaido 060-8638, Japan (E-mail:hideakin@med.hokudai.ac.jp, Tel:+81-11-706-5047, Fax:+81-11-706-5169) REFERENCE 2 (sites) AUTHORS Nishi,S., Nishino,H. and Ishibashi,T. TITLE Molecular cloning and expression of the human and mouse lathosterol 5-desaturase JOURNAL Unpublished (1998) REFERENCE 3 (sites) AUTHORS Matsushima,M., Inazawa,J., Takahashi,E., Suzumori,K. and Nakamura,Y. TITLE Molecular cloning and mapping of a human cDNA (SC5DL) encoding a protein homologous to fungal sterol-C5-desaturase JOURNAL Cytogenet. Cell Genet. 74 (4), 252-254 (1996) MEDLINE 97130614 FEATURES Location/Qualifiers source 1. .2125 /organism="Homo sapiens" /strain="caucasian" /db_xref="taxon:9606" /dev_stage="9-year old" /sex="female" /tissue_type="liver" gene 82. .981 /gene="C5D" CDS 82. .981 /gene="C5D" /EC_number="1.3.3.2" /codon_start=1 /product="sterol-C5-desaturase" /protein_id="BAA33729.1" /db_xref="PID:d1034698" /db_xref="PID:g3721882" /db_xref="GI:3721882" /translation="MDLVLRVADYYFFTPYVYPATWPEDDIFRQAISLLIVTNVGAYI LYFFCATLSYYFVFDHALMKHPQFLKNQVRREIKFTVQALPWISILTVALFLLEIRGY SKLHDDLGEFPYGLFELVVSIISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIP TPFASHAFHPIDGFLQSLPYHIYPFIFPLHKVVYLSLYILVNIWTISIHDGDFRVPQI LQPFINGSAHHTDHHMFFDYNYGQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGK RSSPSGNGCKNEKLFNGEFTKTE" conflict 725. .936 /gene="C5D" /note="deleted in D8518 (Matsushima, M. et al. Cytogenet. Cell Genet. 74(4), 252-254 (1996))" /citation=[3] BASE COUNT 670 a 388 c 379 g 688 t ORIGIN 1 GATCCCCCGG GCTGCAGGAA TTCCCGGGTC GACCCACGCG TCCGGTGCGG 51 ACGGGCGCGG ACCACCTCCA GGGGCTAAGT GATGGATCTT GTACTCCGTG 101 TTGCAGATTA CTATTTTTTT ACACCATACG TGTATCCAGC CACATGGCCA 151 GAAGATGACA TCTTCCGACA AGCTATTAGT CTTCTGATTG TAACAAATGT 201 TGGTGCTTAC ATCCTTTATT TCTTCTGTGC AACACTGAGC TATTATTTTG 251 TCTTCGATCA TGCATTAATG AAACATCCAC AATTTTTAAA GAATCAAGTC 301 CGTCGAGAGA TTAAGTTTAC TGTCCAGGCA TTGCCATGGA TAAGTATTCT 351 TACTGTTGCA CTGTTCTTGC TGGAGATAAG AGGTTACAGC AAATTACATG 401 ATGACCTAGG AGAGTTTCCA TATGGATTGT TTGAACTTGT CGTTAGTATA 451 ATATCTTTCC TCTTTTTCAC TGACATGTTC ATCTACTGGA TTCACAGAGG 501 CCTTCATCAT AGACTGGTAT ATAAGCGCCT ACATAAACCT CACCATATTT 551 GGAAGATTCC TACTCCATTT GCAAGTCATG CTTTTCACCC TATTGATGGC 601 TTTCTTCAGA GTCTACCTTA CCATATATAC CCTTTTATCT TTCCATTACA 651 CAAGGTGGTT TATTTAAGTC TGTACATCTT GGTTAATATC TGGACAATTT 701 CCATTCATGA CGGTGATTTT CGTGTCCCCC AAATCTTACA GCCATTTATT 751 AATGGCTCAG CTCATCATAC AGACCACCAT ATGTTCTTTG ACTATAATTA 801 TGGACAATAT TTCACTTTGT GGGATAGGAT TGGCGGCTCA TTCAAAAATC 851 CTTCATCCTT TGAGGGGAAG GGACCGCTCA GTTATGTGAA GGAGATGACA 901 GAGGGAAAGC GCAGCAGCCC TTCAGGAAAT GGCTGTAAGA ATGAAAAATT 951 ATTCAATGGA GAGTTTACAA AGACTGAATA GATTATTGCC CAGTTATTCT 1001 TAAGTAAGGA CAAAGAAGGA AATATCATCG TATTTCTTTT TTTTAATAAG 1051 GAAAAAATAA TCTCCATACA GTCAAGATAC ATAGTAAATG GTATCATTTG 1101 GAAATCAGCA TCGTGGGCAC TGCTGAGGAA TGATCCTAGT GGTAGGTCAG 1151 AAGAAGATGC TGTGAACACC AGGACTTTAA TCTTATGCTT AAAATGCCAG 1201 ATGTTGTTCG GGCCCCAACT TGTATTTCTA GCAGCAGATC TGTAGTTTGT 1251 ATAGCCTCAA CAACAATTTT AAATAAGATG GAGAATAAAT TATTGAGGGG 1301 ACTAGGCTAT ATGCATTTGC CTTCATCCAC CCATGTTTAT TAAGAATCAT 1351 TGTGCTTAAT AATACCAAGA CTAAGCACCA TAACCAAGAA ATACTAATGT 1401 AAAGATTGTT TCTTGTTTCA GGAATGGTTA ATTCTTCAAC GTTGGTATGA 1451 TAATGATAAC TTGTTTTGAC TTGAATAAAG TACTACATCA GTGTGGAAAA 1501 AAATTCTGAT ACATTAGCAG CTATGTAAAT GACCTAATTG ATAGCAGGTG 1551 TAATAAGACT ATCGTCTTCC TACACATAGG AGGCTCATTC TCTGGACACA 1601 CTATCACCTA TTACATTTTA CTGATTAACA AATAAATTGG AATTTAAAAA 1651 TATCGATATC ACCATGATTT AATCCAGATC TGGGATTATG TAGCTAAACA 1701 TTGTGATGAT TATTATTTAA AACCATTATT TAATAAGAGT AAAAATATGT 1751 GAATCTGGAT ATATTTAAAA AAAGAAATTT GATTGCCCAG ATAATATATT 1801 AGGCACTACT GATTTTTTAG TTAAATTGAT GCACTACACT TTTGATGTTT 1851 GAAGTTACAA CCTGTAATTT TTTTGTAAAG GAAATAATTG CCAAATACCT 1901 AGGCCCATTG CTGACGATTA GTTCTAAAAT CTTATTCCTC CTCTTCTCCC 1951 CTCACTTTTC CCTACTTCCT CTGCAAAAAG ATTTAACAAA TACATTCATA 2001 AGGAAATGTG TGTTGTAACA AATATATTGC AAAAACATAG TTTGTAAAGG 2051 CATTCTATAA GCTATTTATG TAAAATCAAT AAAAGTTGAT CATAATTAAA 2101 CTGTAAAAAA AAAAAAAAAG GCGGC // LOCUS AB016899 2800 bp mRNA PRI 08-JUN-1999 DEFINITION Homo sapiens HGC6.1.1 mRNA, complete cds. ACCESSION AB016899 NID g5006254 VERSION AB016899.1 GI:5006254 KEYWORDS . SOURCE Homo sapiens tissue_lib:skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Minaguchi,T., Matsushima,M., Saito,S., Kanamori,Y., Shirahama,S., Okamoto,S., Minami,M., Taketani,Y. and Nakamura,Y. TITLE Complete DNA sequence and characterization of a 330-kb VNTR-rich region on chromosome 6q27 that is commonly deleted in ovarian cancer JOURNAL DNA Res. 6 (2), 131-136 (1999) MEDLINE 99310344 REFERENCE 2 (bases 1 to 2800) AUTHORS Nakamura,Y., Matsushima,M. and Minaguchi,T. TITLE Direct Submission JOURNAL Submitted (12-AUG-1998) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, The University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) FEATURES Location/Qualifiers source 1. .2800 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27" /tissue_lib="skeletal muscle" gene 619. .1164 /gene="HGC6.1.1" CDS 619. .1164 /gene="HGC6.1.1" /note="partial homology to human s-laminin" /codon_start=1 /protein_id="BAA78634.1" /db_xref="PID:d1042406" /db_xref="PID:g5006255" /db_xref="GI:5006255" /translation="MGDIFKNNGVLQGRLRAVACAPHRFGPRLRCLHHDQGLTELAWG TWPHSHPVRHQPQMPSARECCSIVCMAAKEVSAPKAPGSPWMVPGDVAMSGHRVGALD ERGHPNPQTGHCRGGSVSVTWSSVSCCRGRLAAVRVMIARDPSTCHLAKGCSPAWGFL PQARGPAGTRTPQRRCSSHEA" polyA_site 2785 /note="15 a nucleotides" BASE COUNT 652 a 834 c 697 g 617 t ORIGIN 1 CGCCACCAAT GTGGCCCAGG CCACCAGCCC CCGTGACACT GCAGCCATCC 51 CCTTTGCTCC CCGTCGTGGC CTCCCGACCG AGCAGCCAGA GTGACCCCGA 101 AAGCCTAAAT CAGCCTGCGG CGCCCTCACC CGTCCTCCAG CTGCCATCGC 151 CTCCTGGGTG AAGAGGACGC AGGTCGCAGC CCCCGCAGCT CTGCCCCTCT 201 GCCCTCCGGC CAGGCCGCCC CGCTGTCCCC CTTGCTGTCC ACCTCTCTGT 251 CCACCCCTGC TGTCCCCCCC CGCTGTCCCC CCCGCTGTCA CACCCCCGCT 301 GTCCCCCCCC CGCTGTCCCC CCTGTTGTCC CCCCGCTGTC CCTCCCGCTG 351 TCCCTCCGGC GAGGAGGCTT CTGCACGTTC TGCGTGGGCC TCTCCCCGGG 401 GCCGCCCCAG GTCTCCCCTG ACGCCCACAC CCACCACCAG GCTGTTGTGC 451 GGGACCTTCC TTCTCTGCCT CTCCAGCTGG TGACGCGCAC TGACGTCCTG 501 CAAGGCCCGC TCCAGGTGCG CCACCTTTGA CTGGCATGCT GAAAGGACAA 551 AGGGACGGAG GCATTTTACC GGAGAGGACA TCCATGGGGC CCAGCAGGTC 601 AAATTCCTTT CTCTAAGTAT GGGAGACATT TTTAAAAATA ATGGAGTCCT 651 CCAGGGCCGG CTGCGTGCTG TGGCCTGTGC TCCTCACCGC TTTGGACCCA 701 GGCTGCGCTG TCTCCACCAC GACCAAGGAC TCACAGAGCT CGCCTGGGGG 751 ACCTGGCCCC ACAGTCACCC CGTCCGTCAT CAGCCTCAAA TGCCATCAGC 801 CCGTGAATGC TGCTCCATCG TCTGCATGGC AGCCAAGGAG GTCTCTGCTC 851 CTAAAGCACC TGGGAGCCCT TGGATGGTGC CAGGTGATGT GGCGATGAGT 901 GGGCATCGCG TGGGGGCCTT GGACGAGCGT GGACACCCCA ATCCCCAGAC 951 TGGCCATTGC CGTGGAGGCA GCGTGAGTGT GACATGGAGC TCGGTGTCGT 1001 GCTGCAGAGG ACGCCTCGCG GCTGTACGCG TCATGATAGC CAGGGACCCC 1051 TCCACATGTC ACTTAGCCAA AGGATGCAGC CCGGCCTGGG GCTTTCTACC 1101 CCAGGCCCGA GGACCTGCAG GGACAAGGAC CCCTCAGCGC AGATGCAGCT 1151 CTCATGAGGC CTGAAACCAA GCTGGACTTG CAGAACTGCG CACGCTCCCT 1201 TTCTGAAAGA AACACCCGTT ACCGGGTCCA GACTGAATGT GGGCGTAAGA 1251 AAGCGGGAAG GGTCCCCCAC AGCAAGAGAG AGAGCCTCCC AGTCACTTGG 1301 TCATCTGAGC CTTGAGTGTG TCCGGCCCCA GCCACCAGCC TGTCCCCACT 1351 ATCCCTCTTG TCAGCACGGC TGAAAGGAGC TGCCTGGGCA CCTGAGGTGG 1401 CTGAGGCTCA TCTTTGAGGA GCATGGAACT CGAGGGGAGA AGTCGCGCCC 1451 CCAGGGAAGC CAGGAAACTC TGACACAGAA CGCCCAACGT GGACACAGGT 1501 GCCAGTTCAG CAGTGACCGT CAAGACCGGG GAAGTGCCCT GTGGTGACCA 1551 GGAGAACAAG GAGCTCCTCC AGCAAGCAGG GCTGCAGTGG GAGCTGGGAG 1601 GGAGCAGTGC ACCTGCCAGC GTGGCCGAGT CACACAGTGC CTGCCCTCAC 1651 CTGTGGAATT TGGTGTCAGC TAAAAGGAAG AAGAAATGGA AACTTAAATT 1701 CTAGCCGGTC ATTTTCTTAA AACTGCAAAC ATTTCTCTCG AGTGTGGACT 1751 GGCCCGGCTT CGGTGAAAGG CGCCCTCCTA CCTAGCAGGG TCTCGGCACT 1801 CTGCTGTGTT CTTTGCACCG CGGTCACTAT TTCAGAAACA CGGTCCTGAT 1851 ACTCATCCTG ATAATCTCTT AAGAGCACCC GAGAAGAAGA ATGAAACGCC 1901 TGTATATCCT GGTTAAGTTT CTATAAAATA AGAACATACA CGATTCATAA 1951 AATTATATGA TTAATGATCA CAGTATAATT ACATAAGAAT TTGTTATATA 2001 TGTTTCTATA GAATAATAAC ATACCCACAC AATTAATATA AGGGATATGT 2051 TGTTAACTGG AAGGGTGTAA AGCAGTTCTT CGGCCCGGGC CTGTCGATGC 2101 CCGCCAGGAC TGTTTTCATG CTGCCTGTGC CTCGGCTCAC TTACAGAAGT 2151 TCTGACTCAA TACATCTGGG ATTTCTACTT TCAAAGCAGT ACATGTGATT 2201 TTATGCCCTT AAAGTATTTA AGAAACAGAA AGTCAAGTGT TTCACCAATT 2251 TCCGTAGGAA GGGCCGCTCA CTGCACGGTG GGGCGATGCT GGGCGCGAGC 2301 CGTGTTTCCG TCTATTGTTG ACTTCCTCAC GATGCTGGCT CAGGCTCAGA 2351 GCCTCTCATC ACCGCCTGTG AAATGGGGAC CACCCTCCTA ATCTACTATC 2401 CAGACCTACC TTCTGGAAGC TCAATTATCA CGAGCACATG GTGGAAAACG 2451 TGGGCCGGTG ACAACTTAAC ATGTAAATGT ACATAATAAA ATTCCAGGCA 2501 CAATACATAT GAACGCTTTT AGAGAACTTT GGGAGCAAAG CAACTGATGT 2551 CCTGCGTGGA AAGTCCTCTT CTCCCCATGA AATTATCTCA AGGATCTTCC 2601 ATGGGAAGAT GTGACAGAAC CAGGACACTA CTGCTTAATA TTTAATATCA 2651 GATACTCTTA AATCACAAGA GCATAACTAA AGTATTCTTC GTTTCTTTAG 2701 TCAAATCTCC TTTTCTATAA AATGGAGACG GCTTTCATAT GTTTGGGAAA 2751 CATTTTAATA ACGAATTAAT AAACATTTTA ATATCAAAAA AAAAAAAAAA // LOCUS AB018254 6706 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0711 protein, complete cds. ACCESSION AB018254 NID g3882142 VERSION AB018254.1 GI:3882142 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hg00358. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (5), 277-286 (1998) MEDLINE 99087487 REFERENCE 2 (bases 1 to 6706) AUTHORS Ohara,O., Suyama,M., Nagase,T., Ishikawa,K. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (08-OCT-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .6706 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hg00358" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 964. .2835 /gene="KIAA0711" CDS 964. .2835 /gene="KIAA0711" /codon_start=1 /product="KIAA0711 protein" /protein_id="BAA34431.1" /db_xref="PID:d1035412" /db_xref="PID:g3882143" /db_xref="GI:3882143" /translation="MEHAVAPCVLYPGTEPGAAGESESEGAASPAQTPCSLGASLCFS SGEESPPQSLASAAEGAATSPPSSGGPRVVERQWEAGSAGAASPEELASPEERACPEE PAAPSPEPRVWLEDPASPEEPGEPAPVPPGFGAVYGEPDLVLEVSGRRLRAHKAVLAA RSDYFRARASRDVLRVQGVSLTALRLLLADAYSGRMAGVRPDNVAEVVAGARRLQLPG AAQRATDAVGPQLSLANCYEVLSAAKRQRLNELRDAAYCFMSDHYLEVLREPAVFGRL SGAERDLLLRRRLRAGRAHLLAAALGPAGERAGSRPQSPSGDADARGDAAVYCFHAAA GEWRELTRLPEGAPARGCGLCVLYNYLFVAGGVAPAGPDGRARPSDQVFCYNPATDSW SAVRPLRQARSQLRLLALDGHLYAVGGECLLSVERYDPRADRWAPVAPLPRGAFAVAH EATTCHGEIYVSGGSLFYRLLKYDPRRDEWQECPCSSSRERSADMVALDGFIYRFDLS GSRGEAQAAGPSGVSVSRYHCLAKQWSPCVAPLRLPGGPTGLQPFRCAALDGAIYCVS RAGTWRFQPAREGEAGGDAGQGGGFEALGAPLDVRGVLIPFALSLPEKPPRGEQGAP" BASE COUNT 1478 a 1736 c 1957 g 1535 t ORIGIN 1 AGCCCACCCG CCTGGCTGCG CGTCCCGGGC CCGGCGGCTG AAGAGGAGCC 51 GCGGCGAGGA ACAAGAGTGT GGTGAGAGGA CGCGGAAACA CCTGATATCC 101 CAGCAAAGGA AGCTGCAGAG GAAGGGTCCT GCCCCGTGAG CGAGGACAGC 151 CTCCAGGAGC ACAGCGGCTT CTCCTAACAT CCCCCTGGCA GTGAATTACG 201 GATGACCCCG TATATGAGGA GGTGTGGATG CCGACACACG GGGAGAGCTC 251 CTGAGACCAG CACACAGGAC CAGTGTCCTC CCCGTGACCT TGCAGTTGTG 301 TAGCCACAAG TGGGAGCCAC AGAGAAGGAG AGCCCATAAG GGGACTCTTT 351 TCAAGACACG GTGTTAAATC AAGGCTCTGA AGTTGGGGCC ACAGCCGGTC 401 TCAGCTTTCT GTGTTCTGGG GGCGCGGTGA TATGACAGCA TGTGAAGTGA 451 GTGGGTGAGC ACGGACTCCT GAGAGCGAGG GCGCACCCAA TACCTAGTTA 501 TCCGGTACTG CAGAGAGACA CCTGGTCAGC CAGCGTTGCA AAGAGGGATA 551 GCTAGTCACT CAGGCTGCAG AGAGAGACAC CTGGTCACCC AGGCTGCAGA 601 AACAACCGCA GTCAACTGCA GCTCCAGTCA TTGCTGGATG TTGGCTGACG 651 CGGTCCTGGC GCCAGCTGGA GATCCATTCA CCAAGACTTT CTGGGCAGAT 701 TTAAAGTGGC TGGGGTTCAG AAGTTCAGCA AGTTGGACAC ACCCCTCCTC 751 GCTGGAGAGG AGAGGGCAAA GGCGAGGCGG GGGAGCAGCG TGTGAGATTC 801 CCCCCTTCAC ACACACAACA AAGCGTGGAC ACACAGAAGT GAAATCTGAT 851 CGCGTGCCAG GAAAAGCTGT GAGGCTGGAA ACCCCGGAGT AAGGCTCGAC 901 CTTGGCCAGA CCTGCAGGCT GCGGAACCGG GGGCGCGCGG GCGCAGCGCA 951 GCACAGCCCG GCCATGGAGC ACGCGGTGGC CCCCTGCGTC CTCTACCCAG 1001 GGACTGAGCC CGGGGCTGCC GGGGAGAGCG AGAGCGAGGG CGCCGCGTCC 1051 CCGGCGCAGA CACCCTGCAG TCTCGGCGCG TCCCTGTGCT TCAGCTCCGG 1101 GGAAGAGTCC CCGCCGCAGT CCCTCGCCTC AGCGGCGGAA GGCGCGGCCA 1151 CCTCCCCGCC CTCCAGCGGT GGCCCGCGGG TGGTGGAGCG GCAGTGGGAG 1201 GCCGGCAGCG CGGGCGCCGC GTCCCCGGAG GAGCTCGCGT CCCCTGAGGA 1251 GCGCGCGTGC CCGGAAGAGC CCGCGGCGCC GTCCCCCGAA CCGCGCGTTT 1301 GGCTTGAGGA CCCCGCGTCC CCCGAGGAGC CCGGGGAGCC CGCGCCCGTA 1351 CCCCCGGGGT TCGGGGCGGT GTACGGGGAG CCGGACCTGG TGCTGGAGGT 1401 GTCGGGGCGC CGGCTGCGCG CGCACAAGGC GGTGCTGGCG GCGCGCAGCG 1451 ACTACTTCCG CGCGCGCGCG TCGCGGGACG TGCTGCGGGT GCAGGGAGTG 1501 AGCCTGACGG CGCTGCGGCT GCTCCTCGCC GACGCCTACA GCGGGCGCAT 1551 GGCGGGCGTG CGGCCCGACA ACGTGGCCGA GGTGGTGGCC GGCGCGCGCC 1601 GCCTGCAGCT GCCCGGCGCC GCGCAGCGCG CCACCGACGC CGTGGGGCCG 1651 CAGCTGAGCC TGGCCAACTG CTACGAGGTC CTGAGCGCGG CCAAGCGGCA 1701 GCGGCTGAAC GAGCTGCGCG ACGCCGCCTA CTGCTTCATG AGCGACCACT 1751 ATCTGGAGGT GCTGCGCGAG CCCGCCGTGT TCGGCCGCCT GTCGGGCGCA 1801 GAGCGGGACC TGCTGCTGCG CCGCCGCCTG CGCGCCGGCC GCGCCCACCT 1851 CTTGGCCGCG GCGCTCGGGC CGGCGGGGGA GCGCGCGGGC AGCCGGCCTC 1901 AGAGCCCCTC GGGGGACGCG GACGCGCGCG GGGACGCGGC CGTCTACTGC 1951 TTCCACGCGG CGGCCGGAGA GTGGCGCGAG CTGACGCGGC TGCCCGAGGG 2001 CGCGCCGGCG CGGGGCTGCG GCCTGTGCGT CCTCTACAAC TACCTCTTCG 2051 TGGCGGGCGG CGTGGCGCCC GCGGGCCCCG ACGGCCGCGC GCGCCCGTCC 2101 GACCAGGTCT TCTGCTACAA CCCGGCCACG GACAGCTGGA GCGCCGTGAG 2151 GCCCCTGCGC CAGGCGCGCT CGCAGCTGCG GCTGCTGGCC CTGGACGGTC 2201 ACCTCTACGC CGTGGGCGGC GAGTGCCTGC TCAGCGTGGA GCGCTACGAC 2251 CCGCGCGCCG ACCGCTGGGC CCCCGTGGCG CCGCTGCCCC GGGGCGCCTT 2301 CGCCGTGGCG CATGAGGCCA CCACCTGCCA CGGCGAGATC TACGTGTCCG 2351 GGGGCTCCCT CTTCTATCGC CTGCTCAAGT ATGACCCGCG GCGCGACGAG 2401 TGGCAGGAGT GCCCGTGCAG CAGCAGCCGC GAGCGCTCGG CCGACATGGT 2451 GGCTCTCGAC GGCTTCATCT ACCGCTTCGA TCTGAGCGGC AGCCGCGGCG 2501 AGGCGCAGGC GGCGGGGCCG AGCGGGGTCA GCGTGTCCCG ATACCACTGC 2551 CTGGCCAAGC AGTGGAGCCC GTGCGTCGCG CCCCTGCGCC TCCCCGGCGG 2601 CCCCACGGGC CTGCAGCCCT TCCGCTGCGC CGCCCTGGAC GGCGCCATCT 2651 ACTGCGTGAG CCGCGCGGGC ACCTGGCGCT TCCAGCCTGC CCGGGAAGGC 2701 GAGGCCGGCG GCGACGCAGG CCAGGGCGGC GGCTTCGAGG CGCTGGGCGC 2751 CCCCTTGGAC GTCCGGGGTG TGCTCATCCC GTTCGCTCTC AGCCTGCCTG 2801 AGAAGCCGCC CCGAGGGGAG CAGGGCGCCC CGTAGGCCGG CGGGGTCGGC 2851 GGGCGTCTCC CTCGGCAGGG GTTTGCGGGG CCCAGGTCCC TTTGGGCCCG 2901 CGGAGGAGGA CGTGGTGGGG AGTCGGGGCC GCTGGCCACG CTGGTGGTTT 2951 GGACACTTCG AAGGAGCCCC GAGGACGCTC TCAGGGCCGC TTTCGCTTTG 3001 CTTTCCTTTT GCTTGTCTTT GCTTCTGGGG GTGGATGCCT TGAGACCCAG 3051 GAGGTGTGCG GATGGGTCCC TTGACAGACA GGACACAGAG AAGGCTGTGG 3101 GATCCAAAGG GTCAGCCTCA GGGTACAGTG GGGGTTCCTG AGGCAGCCTG 3151 CAGCCGGCCC CGGGGTGTCC CGAACCCCGC AGAGCACCGA GGCTGTGCGC 3201 AGGAGCCTGG GACCCTCAAG TAGGTCGCCG CGAACTATCG GGGGAAGCAC 3251 GCAGAGAGGG GTCACGCCTT TTATTTTTGG CTTGAGATTT AAAATTATAA 3301 CTGATAAGTA AAGCTCTTTC TGATTTAGTT GAAAATTTCG TACCATGTGC 3351 TCGCTTTGGT TGGCTCAACT CCAAAAAGAG TAATTTAATA AGCATTAAAG 3401 GTAAAATCTT TGATTACCAG TAAGGTTTCT TGTTCAATAT GCATTGGAAG 3451 TATTTCCTTC CCCACACCAT TGCTACTCAA ACTCACATGC CTAGAGCAGC 3501 TCCGTCTCAC TGTTGGACCG AGGGGGCTTT CATATTTTCA GTTGAAAGGA 3551 TGTTAACTGA TGTAGCGATG ATTTACCATT ATTTAAATTT TAAGTCTTCA 3601 GTGGCTAAAT GTGACCAAAC AGCACATCAT AAGTAGGAAA AACTTACCAG 3651 GGTGCTTGTC TATCTAAAAA GCAATTTTTG ATAGTCCACT CTGTATGCCC 3701 AGCCGCTTTC ATAATCTGGA ACGAGATAAA ATATTCCTAA AAAGCGGGGA 3751 AAATCATTTT GCTTTGACAG TTCTATAAAA AAAAGTGTAG GCACATTTTA 3801 AACCCACTGT ATATGATGTT TTCAATGTGG ATCGTGTAGT TCTGATGAGG 3851 GAGATTAATT TTTACAATCA GTATTCTAAG TGTGGCCGAG TGACAGTGGG 3901 CATAGATTTA TAACAAGGAA GTGACGTGCT TATCACCATA GATTTGCAAG 3951 TAAACTGCAT GTATTTAATT GTATTGAATT GAGTTCCAAA ATACCCTAAT 4001 AGAATTAACA CGAGGCTCAC TGCATTGACA GGGTATGAGG ATTAAAAACA 4051 AATCAGTTGG GTCGTTTCTC ATTTAACATT TTACTTTTCA AGTGTGTATA 4101 CAGAGGACTT ACTATTATGA CTTTGAGGAT GAGATCCATG CTCACAAATA 4151 GAGGCGAACA TTTGAACTCC GAATCCAACC CATTTTCTTA CTGTAAGAGG 4201 AAAAGTTACT GGAACGTAAC TAGTTGAAAT CCTGCCTACT TTAATATTAT 4251 TTGTTTGTTA ATCCAGTAAT GGGAACTGCC ATCTCTGTAG AAATCAGTGG 4301 GTAATTGAAA AATAGGTTAT GCTTTTTAAA GAGTCTGTGG TTATGAGAGG 4351 TCTCAGTTAA GTGTGTTTAG AAGTGATCAG CTTGAACCTT ATGCATGACT 4401 CGGGGGCTGG AATTTATGAT CTGGGTTACG GTATGTTCTG GGGACGTGTC 4451 TGCTTGCCCA TGGTTACTCA TGAACTGAGG GGATAGCTTG GCAACTTGGT 4501 TAATCATCTT GGGAAAGAAA AACAGACTTC ATATCGCCTG ACTTGATTGG 4551 CCTTTTATAG GAGTATACTG GAGAAATGGT TGTAGAAACA GTATTTACAG 4601 CAAAAGGAAA CAAATAATGT TCATTTTAAG CAATGTACCA TTCACACTGT 4651 CCTGCCTTTT CCTCTAGAAT TTTATTAATG GTAGAAATTT TTATATGAAA 4701 TGGGACCAGG ACCAGGCTAA TATTTTCAGT CCTTAAATAT CAAACTCATA 4751 TGCTGTTATC ACTGTGATTT TACTTGTGAA ATCATTCCTG TAATGTTTAG 4801 TGTTTGAAAA TGAAATATTG AAATTAGGTT TCCAGAGTAA CACTGTCCCC 4851 GGAAAAGGAT ATGAGAAGTG GTGGATGTTG GATGGGGGTG GTTGCACAGT 4901 CCTGTGCGGT TCCCATGGCT TTCCAGCGTT TCATTTAGTG AAGGAATGCT 4951 CACACTAGAT GTAGCACAGC TTCCTGTGGG GCCCGGCAGC AAAGCCCCAG 5001 GTGCTCCCTG TCACCTCACA ACAAAATGCA CTAAGAAACG TAAAGAATAA 5051 GAGGAATTAA TACCCACCAT TAAAGGATGT CCGGCCAACC TATTGTGGAA 5101 ATTTATAGAA TAGTATACAC CACAGTTCTG AATAGTGATA TCACATAAAA 5151 ATACATACTA GAGGACCTGC CAAGCCATGA GCAGTGTTTT TGCTTTTTGG 5201 GGGCCAGTCC CTGGGGTGTG GAGCCGCTAG GGTTTGCACC CATGAAACAG 5251 AAAAGCCACA CCCTCCAAGG TGTGGCTTTC ATTTTGGGAC TGCTGCAGGG 5301 AGGGCAGAGG CATTGCTGAG ACTGCCTGGC AACGGCTGAT GCCCCAGGTA 5351 GGACCTTTTC CATTTCAAAG TGGTGTTCTA AGTCTGCGTC CAACACTGTG 5401 TAGGAAAAAG GTTGGTGCAA AAATATTCCT GGTCATCCAC CCATTAAAAT 5451 AGTTAGATGA GGCTATTGCC TTGATGACAG CTGTCCACAC TCCTCATGAA 5501 ATTAACCCGT ATGCCGGGGC ATTTCCAAAT GTCTGACTCG TGAAATTAAC 5551 CCATACGCAG GGGCCTTTCC AAATGTCTGA AAAGGCAGTG GTGTCTTTTG 5601 GGGAAAATGT TATGCATGGA AGCCTGACCT TTTGCTTAGT TGACAGCAAT 5651 CCCTTCTGTA TTGCCAATCA AGGTTCATTT GAGATGCAGA GGAATGAGCT 5701 TGAGCCTTCC TCCTTTTCCT TCCGGTTTTA TTCTTCCTCT TGGGAACATC 5751 CCTCCACTCC GCACTGCTTC CTGCAGCTTT GTAGAGCTGG ATTTGGAACT 5801 TCGGGATTTG GTTTCTGAGT CTGTGGAGGC ACCGACTTCT CCTGTAAGAA 5851 AATGAATGTT GTGGAAATTC TTTGGCTACT TAACTAAAAC TCGTGACTGT 5901 ATAAGTTTGG CTACAAATAA GTAAGAAATT AATCATCTGC TCTGTTTCTG 5951 CTAATTTCTG GTGTCACTTC AGTAATTCTG GTAGCAGCCG TTGAATCTGT 6001 CAGTCTCTTA GGTAACTTCC TGTAAACGAT TTGGAAATAG GATGTTTTCA 6051 ACGTTCTTTT GTCTTTTGCT GAAGTCAGGA TAGATTCAAG ACATAATCTC 6101 TTGTAAGATC TAAATAGAGC AAATGTAAAC AAAAGTGCAT TTTTGTATTC 6151 TTGTTAATTT TAGATGCTTT CCTAGCTTAC AAAAAGTTCT ATTTTTGGGT 6201 TAAAAATCAA TCAACTTTCT GATATTTCCC CTTCTGCAAT GTTATTGTTC 6251 ATAAGAAAAC ACGAGCTGAA AATGGAAATC TGCAGTTGTT TCAGTTGTCT 6301 TGAATTTCTT TCAGTGGCCA CATCATTTCC ACGTTTTCCA CATCCGGGAG 6351 GAAGCCTGGA CTGTGCAGCC TTCGGGCACC CGGCACAGAC ACTGTGCTGG 6401 CAGGAGCTTC AGACACGCCA AGTGGATGGA TTTGGATTGA ACGCATATGA 6451 AACAGGAGAC GGGTTCTCAT GTGAGATCAA AGCTCCTCCA AAGCCTGTTC 6501 AAGCTCTAAG CGATTCTCAA ATGTTACCAT TTATTAAAGG TAAACTACAC 6551 CTGTTGAAGG CCAAGTTCAG GGCAGCTGTT GTGATCTGTG TAGTTAATGT 6601 ATTTATTAAT GCTTGACTTT TAAAATCCTG GGCATAAATA GTGCAGAGCC 6651 TCGTATGTTT GTCAGTTCAT GCCGAGATGA AATAAATCAC GCAGAAAGTG 6701 CCAGTC // LOCUS AB018259 4652 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0716 protein, complete cds. ACCESSION AB018259 NID g3882152 VERSION AB018259.1 GI:3882152 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj03473. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (5), 277-286 (1998) MEDLINE 99087487 REFERENCE 2 (bases 1 to 4652) AUTHORS Ohara,O., Suyama,M., Nagase,T., Ishikawa,K. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (08-OCT-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4652 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hj03473" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 192. .2489 /gene="KIAA0716" CDS 192. .2489 /gene="KIAA0716" /codon_start=1 /product="KIAA0716 protein" /protein_id="BAA34436.1" /db_xref="PID:d1035417" /db_xref="PID:g3882153" /db_xref="GI:3882153" /translation="MAGKWRFINCYCNSSNGEVVRLQNFYKTELNKEEMYIRYIHKLY DLHLKAQNFTEAAYTLLLYDELLEWSDRPLREFLTYPMQTEWQRKEHLHLTIIQNFDR GKCWENGIILCRKIAEQYESYYDYRNLSKMRMMEASLYDKIMDQQRLEPEFFRVGFYG KKFPFFLRNKEFVCRGHDYERLEAFQQRMLNEFPHAIAMQHANQPDETIFQAEAQYLQ IYAVTPIPESQEVLQREGVPDNIKSFYKVNHIWKFRYDRPFHKGTKDKENEFKSLWVE RTSLYLVQSLPGISRWFEVEKREVVEMSPLENAIEVLENKNQQLKTLISQCQTRQMQN INPLTMCLNGVIDAAVNGGVSRYQEAFFVKEYILSHPEDGEKIARLRELMLEQAQILE FGLAVHEKFVPQDMRPLHKKLVDQFFVMKSSLGIQEFSACMQASPVHFPNGSPRVCRN SAPASVSPDGTRVIPRRSPLSYPAVNRYSSSSLSSQASAEVSNITGQSESSDEVFNMQ PSPSTSSLSSTHSASPNVTSSAPSSARASPLLSDKHKHSRENSCLSPRERPCSAIYPT PVEPSQRMLFNHIGDGALPRSDPNLSAPEKASPARHTTSVSPSPAGRSPLKGSVQSFT PSPVEYHSPGLISNSPVLSGSYSSGISSLSRCSTSETSGFENQVNEQSAPLPVPVPVP VPSYGGEEPVRKESKTPPPYSVYERTLRRPVPLPHSLSIPVTSEPPALPPKPLAARSS HLENGARRTDPGPRPRPLPRKVSQL" BASE COUNT 1358 a 934 c 974 g 1386 t ORIGIN 1 GGGAGAAGCT AGGAAAAAAT GTCTTTGAGC TGTGAGATGC TTGTATATTT 51 TGAAAATATG ATTATATGCA TGTGTTTGTA TTTTATGACT TGGATAATCT 101 GAAAATCAAT TTGCTTTGTC AATGCTTCCT GGATTAGAAT TCCACTATTT 151 GGTCCCTATC CTAGTCTACT AAAGAAAATT GAGCGGGAAA CATGGCGGGA 201 AAGTGGCGTT TCATTAATTG CTACTGTAAC TCGTCTAATG GAGAGGTTGT 251 TAGATTACAG AACTTCTATA AGACTGAACT GAACAAGGAG GAGATGTATA 301 TACGCTACAT TCACAAACTC TATGATCTGC ATCTCAAAGC ACAGAACTTT 351 ACAGAAGCTG CATATACCCT CCTCTTATAT GACGAGCTAC TGGAATGGTC 401 TGATCGGCCC CTCAGGGAGT TCCTGACCTA CCCCATGCAA ACAGAATGGC 451 AGCGCAAAGA GCACCTGCAC CTCACCATCA TCCAGAACTT TGACAGAGGC 501 AAATGTTGGG AGAATGGCAT TATCTTGTGC CGGAAGATTG CAGAGCAGTA 551 TGAGAGTTAT TATGACTACA GAAACCTGAG CAAGATGCGG ATGATGGAAG 601 CCTCTTTGTA TGACAAAATT ATGGACCAGC AACGTCTTGA ACCAGAGTTC 651 TTCAGAGTTG GATTTTATGG AAAAAAATTT CCATTTTTCT TAAGAAATAA 701 GGAGTTTGTG TGTCGAGGGC ATGACTACGA GAGGCTGGAA GCCTTCCAAC 751 AGAGAATGCT GAACGAGTTC CCCCATGCCA TCGCCATGCA GCACGCCAAC 801 CAGCCCGATG AGACCATCTT CCAGGCAGAA GCTCAGTATT TGCAGATATA 851 TGCTGTGACT CCCATTCCAG AGAGCCAGGA GGTCCTGCAG AGAGAGGGTG 901 TTCCGGACAA CATCAAAAGC TTCTATAAAG TGAATCACAT CTGGAAATTC 951 CGCTATGACC GACCATTTCA CAAAGGCACA AAAGATAAAG AGAATGAATT 1001 CAAGAGTCTC TGGGTGGAGA GAACGTCATT ATACTTGGTG CAGAGTTTGC 1051 CTGGCATCTC TCGCTGGTTT GAAGTGGAAA AGCGTGAAGT GGTAGAAATG 1101 AGTCCTCTGG AAAATGCAAT TGAAGTGCTA GAAAATAAGA ATCAGCAGCT 1151 GAAGACTCTG ATTAGTCAGT GTCAGACAAG ACAGATGCAG AATATTAATC 1201 CCCTGACTAT GTGCCTGAAT GGAGTTATAG ATGCTGCAGT TAATGGTGGC 1251 GTTTCCAGGT ATCAAGAGGC ATTCTTTGTC AAAGAATATA TCTTAAGTCA 1301 CCCTGAAGAT GGGGAGAAAA TTGCACGATT AAGAGAGCTG ATGCTTGAGC 1351 AGGCACAGAT TCTGGAATTT GGTTTGGCCG TGCATGAGAA GTTTGTACCT 1401 CAAGATATGA GACCCCTTCA CAAAAAGCTG GTTGACCAAT TCTTTGTGAT 1451 GAAGTCGAGC TTAGGGATAC AGGAGTTCTC TGCTTGTATG CAAGCCAGTC 1501 CTGTCCATTT TCCTAATGGA AGCCCTCGTG TGTGTAGAAA CTCAGCACCT 1551 GCTTCTGTGA GCCCAGATGG TACCAGGGTA ATTCCTAGAC GCAGCCCGTT 1601 AAGTTACCCA GCTGTCAACC GATATTCTTC CTCCTCACTG TCCTCACAAG 1651 CTTCTGCTGA AGTAAGCAAT ATTACAGGGC AATCAGAAAG CTCTGATGAA 1701 GTCTTTAACA TGCAGCCAAG TCCATCTACC TCAAGCTTGA GTTCTACTCA 1751 CTCGGCTTCA CCTAATGTGA CAAGTTCTGC TCCATCGAGT GCCAGAGCTT 1801 CTCCTTTGTT GTCTGACAAA CACAAACATT CCCGAGAAAA CTCTTGCCTG 1851 TCACCAAGAG AGAGACCATG CAGTGCCATC TATCCAACAC CTGTGGAGCC 1901 TTCGCAGAGG ATGCTGTTTA ATCATATTGG AGACGGGGCC TTGCCACGCA 1951 GTGACCCAAA TCTCTCTGCA CCTGAAAAAG CTTCACCAGC AAGACACACG 2001 ACATCAGTAT CCCCCTCGCC TGCCGGGCGA TCTCCATTGA AGGGCTCTGT 2051 GCAGTCTTTC ACCCCCTCTC CAGTGGAGTA CCACTCGCCA GGACTCATCT 2101 CCAACTCCCC TGTCTTGTCG GGCAGCTACA GCAGTGGGAT TTCTTCTCTC 2151 AGCCGGTGCA GCACGTCGGA AACCTCAGGC TTTGAAAATC AGGTGAATGA 2201 ACAGTCGGCC CCCCTGCCGG TGCCAGTGCC GGTGCCCGTG CCGAGCTACG 2251 GCGGGGAGGA GCCAGTGCGC AAGGAGAGCA AGACTCCGCC CCCGTACAGC 2301 GTCTACGAGC GGACTCTGCG GCGCCCCGTC CCGCTACCTC ACAGCCTCTC 2351 CATCCCCGTC ACGTCGGAGC CGCCCGCGCT GCCCCCCAAG CCTCTGGCAG 2401 CGCGATCCAG CCACCTGGAG AATGGGGCCC GGAGGACTGA CCCCGGCCCG 2451 CGGCCCAGGC CCCTGCCCCG CAAGGTCTCT CAGTTATAAG TCACTTTTCT 2501 ATGTACCTGC GATGCATTCT TTGCCCGTTT ACAAAATAAG AAGTATGATG 2551 AGAAGACATT TAGTGTAGGC ACTTTAATAA CTTACTCAGC TCCTTCGATG 2601 AATGGAATTA AAACTTGCTT ATTAAATATC ATGTTGCACA ATATTAAAAG 2651 TTGCTGATCT AAAACGCCAG ATGTTAAATG AAGTATGGCT GAATTTCATT 2701 AAAACGTTTC TCATTTGGAA GTGGTAAATA GTGATAAAGA CTCCTTTTGT 2751 ACCTTTTTAT GTTCACTTTT TTTTATATAG TTTAATCTTA AAACCAATAC 2801 GATATTGTCA AACGATACAA TGTGTGACAA TGTTGTATCG TTTTTACTGA 2851 ATACTTGATA CTTGGAGAAA GCTTATTAAG TCAGTGCACA TCCTAACACA 2901 GTGGTCCTTA TTTTAGAAGA CTTCTGTAAA TAAGGCAAGG TTTATCAGTG 2951 CAGATCATCA GAATTAAAGT TCAAGCAGGC GAGCAAGACA GTATACTTAA 3001 GGGGTTGCAA AGCTTGGGAC TGGAAATTGT TTTGTTCTTG AAACAAAATA 3051 CTTCTTTAAG GTTGCTTTTG CTGTTTGACT GCTGTCTACA TTCGTAAAAT 3101 TCTATTTTGT GAATTGGTAG CTAAATCCCT TACTACCCTG ACACCGTGGT 3151 ATCTACTGTA TTTCTTTTCA AGGTGCAATT TGCTTCAGAG TTCCAATCAG 3201 CTAGATTAAG CAAGAGGCTC CAGAAGAAAT GTTTACTTGA ATTTTGCGCT 3251 TCCTTTCTTG ATAGTTTCCT ATATAAAATT TGTCATTGAA CAAGAGCAAA 3301 TGCTGAAGTA TTAATGAGGC ACAAATGACT GTGCCCCATT AGCAAGAATT 3351 CAGGAATCAA TACAGACAGT ATTAAATTAA TAGCTTAAGT GAAGAAAAAA 3401 AAAAACTTAG TGAAAATGTA TTAGCACGAT TAAATGGCAA AAGGACTTAT 3451 AAAAGGCAAG GGCATTAACT TTCAGTCCTG CACAAAATAA AAAATTCCTC 3501 ACGACTCTCC ACTTTTACCA GTGGAGTTTG TCTTAGCTGA CCTGTCGTCT 3551 TTCTCTTGAA GGAGGATTGC TGTAGACTTC TCTAGCTTGA ATATTGCAAC 3601 ATAGCATCTT AGGTCTAGAT AGGGATGCTA ATGCCAGTTG TAGAAGTGTG 3651 AAAAAAGCAC CTTGTATGTA GTAATGTATT TTATATCTTT GTTTTTTCTT 3701 TTACTGACTG TTTATAACAC TCAATTGACA ATAGATATGA ACTGTATTTT 3751 AAATCATACT GTTAAATATT TTCCCTCTTT TGTTGGGAAG CTCATTTTAG 3801 TTTAACCATG TTTGTTTTGT TGGTAGCTTA CCTGGAAGGC AGTGACCACT 3851 TTTTTATATT CTCTTAATGA AACCATTCAG CAGGTATATG CTGTTGAGGC 3901 TGGTTATAGA GGTTTTCTAT AATAAATGTT CAAGTATTTT TGTATATAAC 3951 TGGTTAATTT TAATAAGAGA TACCATTATG TGTAAAAAAA AGTAAAAATA 4001 AACGCAAACA GTTGTTGATG CAGTATGATT GTTATAATTA TGCCAAATAC 4051 TTTACGTATG GAAAAAGAAT ATTTGTACAT ATGTGCTTTT AACAATTCTG 4101 CCATATTGAC TTTACAATTT TGAATGTCGG AAAAATTAAT ATATGTTAAA 4151 TATTTATGTT TAGTGAAAGT GTTCATAATT GAGAAAAGGA ACATATGCAT 4201 TTTAGCTTTG TATCTTGCAA GTTTTGCAGT CAGAAATTTT TTGAACTAGC 4251 TTTTGCTTTT GATAACACTT CGTGTTTGTA ACCACATTCA TATATATATA 4301 CATATATATG TGAAGCTCCA TATTTCTGTT GCTTTAAAGA AGTAAAACCT 4351 TCCATTTAAA TAAGATGACA TGCATAAGAT AACAAAGCTT CCTTGATTTC 4401 CTTTTCCTGT GTAATTTAAT AGATTTGTTG ACTAGTGCTT GGGCACATTA 4451 TAAATCAGTG TTATTTGCTC TTGGAGCCAT TTTTTAAAAA AAATTTTGGC 4501 AGTGAGCAGT TGAATTTATC TTGAATTTAT CATGTGTGTG TATTTCTGAA 4551 GCAGCTACAT AGCAGAACAT TTTAAGAGAT TCTGTTAGCC CACATGTTCA 4601 TGTTGGTTGC TGCTGAATGG TAAATATTAA ATAAAATTAC CAGATTAATC 4651 TT // LOCUS AB018279 4353 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0736 protein, complete cds. ACCESSION AB018279 NID g3882192 VERSION AB018279.1 GI:3882192 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk03846. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (5), 277-286 (1998) MEDLINE 99087487 REFERENCE 2 (bases 1 to 4353) AUTHORS Ohara,O., Suyama,M., Nagase,T., Ishikawa,K. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (08-OCT-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4353 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk03846" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 435. .2663 /gene="KIAA0736" CDS 435. .2663 /gene="KIAA0736" /codon_start=1 /product="KIAA0736 protein" /protein_id="BAA34456.1" /db_xref="PID:d1035437" /db_xref="PID:g3882193" /db_xref="GI:3882193" /translation="MEEGFRDRAAFIRGAKDIAKEVKKHAAKKVVKGLDRVQDEYSRR SYSRFEEEDDDDDFPAPSDGYYRGEGTQDEEEGGASSDATEGHDEDDEIYEGEYQGIP RAESGGKGERMADGAPLAGVRGGLSDGEGPPGGRGEAQRRKEREELAQQYEAILRECG HGRFQWTLYFVLGLALMADGVEVFVVGFVLPSAEKDMCLSDSNKGMLGLIVYLGMMVG AFLWGGLADRLGRRQCLLISLSVNSVFAFFSSFVQGYGTFLFCRLLSGVGIGGSIPIV FSYFSEFLAQEKRGEHLSWLCMFWMIGGVYAAAMAWAIIPHYGWSFQMGSAYQFHSWR VFVLVCAFPSVFAIGALTTQPESPRFFLENGKHDEAWMVLKQVHDTNMRAKGHPERVF SVTHIKTIHQEDELIEIQSDTGTWYQRWGVRALSLGGQVWGNFLSCFGPEYRRITLMM MGVWFTMSFSYYGLTVWFPDMIRHLQAVDYASRTKVFPGERVEHVTFNFTLENQIHRG GQYFNDKFIGLRLKSVSFEDSLFEECYFEDVTSSNTFFRNCTFINTVFYNTDLFEYKF VNSRLINSTFLHNKEGCPLDVTGTGEGAYMVYFVSFLGTLAVLPGNIVSALLMDKIGR LRMLAGSSVMSCVSCFFLSFGNSESAMIALLCLFGGVSIASWNALDVLTVELYPSDKR TTAFGFLNALCKLAAVLGISIFTSFVGITKAAPILFASAALALGSSLALKLPETRGQV LQ" BASE COUNT 914 a 1181 c 1192 g 1066 t ORIGIN 1 GCCGGGAGCA GTCGCCGCTG CCGCCTCCGC CCGCGGCCGG GACCCCCGTC 51 CTCGCCCGGG ACTCCTTACC CGGGGAACCT AGACCAGGTC TCCAGAGGCT 101 TGTGGAAGAG AAGCAGGCGA CCCTTCCTGA GTTATCCTGG CTTAGCCTCC 151 CAATCTGGCT CCCCTTCCCC TTCCCATTCC CCTGCTCCCC CTGTCCCTTC 201 CCCATCCACC CAACTGAACT GGGTATAGGT CAAAGCTCCT CTCCTTCCTT 251 TTCCTTCCTA GGCACTCATT GGCTAGGACC TGTTTGCTCT TTTTTTTGTG 301 CCCAGAGATA CTGGAACACG CTTCATCTAA GTAACTGTGG GGAGGGGTCT 351 TTTTGACTCT ACAAGTCCTT GAGCAAAAAG CTGAAAAAGA AGCAGGAGGT 401 GGAGAAGACC CAGTGAAGTG CCCCAAGCCC CATCATGGAA GAGGGCTTCC 451 GAGACCGGGC AGCTTTCATC CGTGGGGCCA AAGACATTGC TAAGGAAGTC 501 AAAAAGCATG CGGCCAAGAA GGTGGTGAAG GGCCTGGACA GAGTCCAGGA 551 CGAATATTCC CGAAGATCGT ACTCCCGCTT TGAGGAGGAG GATGATGATG 601 ATGACTTCCC TGCTCCCAGT GATGGTTATT ACCGAGGAGA AGGGACCCAG 651 GATGAGGAGG AAGGTGGTGC ATCCAGTGAT GCTACTGAGG GCCATGACGA 701 GGATGATGAG ATCTATGAAG GGGAATATCA GGGCATTCCC CGGGCAGAGT 751 CTGGGGGCAA AGGCGAGCGG ATGGCAGATG GGGCGCCCCT GGCTGGAGTA 801 AGGGGGGGCT TGAGTGATGG GGAGGGTCCC CCTGGGGGCC GGGGGGAGGC 851 ACAACGACGG AAAGAACGAG AAGAACTGGC CCAACAGTAT GAAGCCATCC 901 TACGGGAGTG TGGCCACGGC CGCTTCCAGT GGACACTGTA TTTTGTGCTT 951 GGTCTGGCGC TGATGGCTGA CGGTGTGGAG GTCTTTGTGG TGGGCTTCGT 1001 GCTGCCCAGC GCTGAGAAAG ACATGTGCCT GTCCGACTCC AACAAAGGCA 1051 TGCTAGGCCT CATCGTCTAC CTGGGCATGA TGGTGGGAGC CTTCCTCTGG 1101 GGAGGTCTGG CTGACCGGCT GGGTCGGAGG CAGTGTCTGC TCATCTCGCT 1151 CTCAGTCAAC AGCGTCTTCG CCTTCTTCTC ATCTTTTGTC CAGGGTTACG 1201 GCACTTTCCT CTTCTGCCGC CTACTTTCTG GGGTTGGGAT TGGAGGGTCC 1251 ATCCCCATTG TCTTCTCCTA TTTCTCCGAG TTTCTGGCCC AGGAGAAACG 1301 AGGGGAGCAT TTGAGCTGGC TCTGCATGTT TTGGATGATT GGTGGCGTGT 1351 ACGCAGCTGC TATGGCCTGG GCCATCATCC CCCACTATGG GTGGAGTTTT 1401 CAGATGGGTT CTGCCTACCA GTTCCACAGC TGGAGGGTCT TCGTCCTCGT 1451 CTGCGCCTTT CCTTCTGTGT TTGCCATTGG GGCTCTGACC ACGCAGCCTG 1501 AGAGCCCCCG TTTCTTCCTA GAGAATGGAA AGCATGATGA GGCCTGGATG 1551 GTGCTGAAGC AGGTCCATGA TACCAACATG CGAGCCAAAG GACATCCTGA 1601 GCGAGTGTTC TCAGTAACCC ACATTAAGAC GATTCATCAG GAGGATGAAT 1651 TGATTGAGAT CCAGTCGGAC ACAGGGACCT GGTACCAGCG CTGGGGGGTC 1701 CGGGCCTTGA GCCTAGGGGG GCAGGTTTGG GGGAATTTTC TCTCCTGTTT 1751 TGGTCCCGAA TATCGGCGCA TCACTCTGAT GATGATGGGT GTGTGGTTCA 1801 CCATGTCATT CAGCTACTAT GGCCTGACCG TCTGGTTTCC TGACATGATC 1851 CGCCATCTCC AGGCAGTGGA CTACGCATCC CGCACCAAAG TGTTCCCCGG 1901 GGAGCGCGTA GAGCATGTAA CTTTTAACTT CACGTTGGAG AATCAGATCC 1951 ACCGAGGCGG GCAGTACTTC AATGACAAGT TCATTGGGCT GCGGCTCAAG 2001 TCAGTGTCCT TTGAGGATTC CCTGTTTGAA GAGTGTTATT TTGAGGATGT 2051 CACATCCAGC AACACGTTTT TCCGCAACTG CACATTCATC AACACTGTGT 2101 TCTATAACAC TGACCTGTTC GAGTACAAGT TTGTGAACAG CCGTCTGATA 2151 AACAGTACAT TCCTGCACAA CAAGGAGGGC TGCCCGCTAG ACGTGACAGG 2201 GACGGGCGAA GGTGCCTACA TGGTATACTT TGTGAGCTTC CTGGGGACAC 2251 TGGCAGTGCT TCCTGGGAAT ATCGTGTCTG CCCTGCTCAT GGACAAGATC 2301 GGCAGGCTCA GAATGCTTGC TGGCTCCAGC GTGATGTCCT GTGTCTCCTG 2351 CTTCTTCCTG TCTTTTGGGA ACAGTGAGTC GGCCATGATC GCTCTGCTCT 2401 GCCTTTTTGG CGGGGTCAGC ATTGCATCCT GGAATGCGCT GGACGTGTTG 2451 ACTGTTGAAC TCTACCCCTC AGACAAGAGG ACCACAGCTT TTGGCTTCCT 2501 GAATGCCCTG TGTAAGCTGG CAGCTGTGCT GGGGATCAGC ATCTTCACAT 2551 CCTTCGTGGG AATCACCAAG GCTGCACCCA TCCTCTTTGC CTCAGCTGCC 2601 CTTGCCCTTG GCAGCTCTCT GGCCCTGAAG CTGCCTGAGA CCCGGGGGCA 2651 GGTGCTGCAG TGAAGGGGTC TCTAGGGCTT TGGGATTGGC AGGCACACTG 2701 TGAGACCAAC AACTCCTTCC TTCCCCTCCC TGCCCTGCCA TCCTGACCTC 2751 CAGAGCCCTC ACTCCCCACT CCCCGTGTTT GGTGTCTTAG CTGTGTGTGC 2801 GTGTGCGTGT GCATGTGTGT AAACCCCGTG GGCAGGGACT ACAGGGAAGG 2851 CTCCTTCATC CCAGTTTTGA GATGAAGCTG TACTCCCCAT TTCCCACTGC 2901 CCTTGACTTT GCACAAGAGA AGGCTGAGCC CCATCCTTCT CCCCCTGTTA 2951 GAGAGGGGCC CTTGCTTCCC TGTTCCAGGG GTTCCAGAAT AGGCTTCCTG 3001 CCTTCCCCAT CATTCCCTCT GCCTAGGCCC TGGTGAAACC ACAGGTATGC 3051 AATTATGCTA GGGGCTGGGG CTCTGGTGTA GACCATGGAC CAAAAGAACT 3101 TCTTAGAGTC TGAAGAGTGG GCCTCGGGTG CCCTCTCACA TCTCCTGTTG 3151 GATGCTGGGG GAGAAGCAAT AAACCTCAGC CCTCTGGCCT CCACTTTCCT 3201 CTCAATTTGG GCTGCAAATA TGAAGCCTGA ATTTTATGAA ATTAGCTTTC 3251 TGATTCTTAT TTATTAATAG ATTAAGTTCT GAGGCAGCTC CGCAGGACTG 3301 TGTGTGAATG TGTATGTATA CTTACATATG TGTGTGCATG TGCCATGGGG 3351 CGGGGGGTAT CACTATACTG TCCTCAAATA TAAGCCAAGG GTAATTTCAG 3401 CGGATGCACA CACAACCCTG CCTCCCACAG TTCCTCCCCT AATCTGGTTT 3451 CTGTGTTGAG CCTGGGATGG AGGAGCCCTA GGCCAGCCTG GGATAAGAGT 3501 CCCACAGTCT AGGGAGATCT GAGGGCATCC GACAAGGCCC ATCTCCTTCC 3551 CTCCTCAAGA AGCAGAGGCC TCCTCTGGAG TGAGAGGCTC CACCCACTAC 3601 AGCACAGGCG GGAATAGCAC AGCTGCCCTC CCATGCTCCC TACCTGTCCC 3651 CTCACAGGGA GGGGAGCAGG GGAGGGAAAG AAACCAGGCA TCTGGTCAAA 3701 CCAGCAGATC AAAAAGCACA AAGAGCTGGG GCAGAGGCAG GAAGCAGGGG 3751 CCCTCCTGGC AGCTCCTCTG AGTGGGGAGA GGTTGGGCAG TGAGTGAGGG 3801 ACCCCTAATG CAGGGACTAG AAGCCTCAGT TTCCCCATTT TACCCTTCCA 3851 CACAATAGCC TCTGTAGGTT AGGCTGCCCC ATCCCACCCT ACTCTGTGTG 3901 GCTGCTTTCT TTGGTGCCCT CCCCTCACCC CACTGTAGCT GTGACGTGTT 3951 GTAGTTTTTA GATGTTTGTA AAATGTTTAA AAAAATGTTA AAAGGAAAAA 4001 AGTGAAAATA ACAAAAAAGA AAATCAAAAT TCACCTTCGT CATGCTGCGT 4051 CCAGTGCCCC AACCCTGTGG TCACTCTCCC CATTTTGTAA CACTGTACCA 4101 GGTGGTGACT GTTTAACTCT TTGGTGTCTG TGCTCAAAAG ACTGCCTTCT 4151 CCAGTGCCCA GTGTATGAGT GTGTGCCCTG TGCCCTTGTC CCTCACTCCC 4201 CACATGCTGG ACGTAGCCCT CTTCCTCGCA CCCCTGGGAG GGACCCATCC 4251 ATCTCCCTTG CTCTCCTGGG GAACCCTAAA CCCAACTCTG TTGATGTGAA 4301 AAATGCAGTG AAAAATATTG ACGAAAAATA AAACGGAAAC AAATCCTCAA 4351 AAT // LOCUS AB018287 4238 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0744 protein, complete cds. ACCESSION AB018287 NID g3882208 VERSION AB018287.1 GI:3882208 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk04110. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (5), 277-286 (1998) MEDLINE 99087487 REFERENCE 2 (bases 1 to 4238) AUTHORS Ohara,O., Suyama,M., Nagase,T., Ishikawa,K. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (08-OCT-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4238 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk04110" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 151. .1923 /gene="KIAA0744" CDS 151. .1923 /gene="KIAA0744" /codon_start=1 /product="KIAA0744 protein" /protein_id="BAA34464.1" /db_xref="PID:d1035445" /db_xref="PID:g3882209" /db_xref="GI:3882209" /translation="MHSMISSVDVKSEVPVGLEPISPLDLRTDLRMMMPVVDPVVREK QLQQELLLIQQQQQIQKQLLIAEFQKQHENLTRQHQAQLQEHIKELLAIKQQQELLEK EQKLEQQRQEQEVERHRREQQLPPLRGKDRGRERAVASTEVKQKLQEFLLSKSATKDT PTNGKNHSVSRHPKLWYTAAHHTSLDQSSPPLSGTSPSYKYTLPGAQDAKDDFPLRKT ASEPNLKVRSRLKQKVAERRSSPLLRRKDGNVVTSFKKRMFEVTESSVSSSSPGSGPS SPNNGPTGSVTENETSVLPPTPHAEQMVSQQRILIHEDSMNLLSLYTSPSLPNITLGL PAVPSQLNASNSLKEKQKCETQTLRQGVPLPGQYGGSIPASSSHPHVTLEGKPPNSSH QALLQHLLLKEQMRQQKLLVAGGVPLHPQSPLATKERISPGIRGTHKLPRHRPLNRTQ SAPLPQSTLAQLVIQQQHQQFLEKQKQYQQQIHMNKLLSKSIEQLKQPGSHLEEAEEE LQGDQAMQEDRAPSSGNSTRSDSSACVDDTLGQVGAVKVKEEPVDSDEDAQIQEMESG EQAAFMQQVIGKDLAPGFVIKVII" BASE COUNT 1340 a 816 c 850 g 1232 t ORIGIN 1 GGGGAAGAGA GGCACAGACA CAGATAGGAG AAGGGCACCG GCTGGAGCCA 51 CTTGCAGGAC TGAGGGTTTT TGCAACAAAA CCCTAGCAGC CTGAAGAACT 101 CTAAGCCAGA TGGGGTGGCT GGACGAGAGC AGCTCTTGGC TCAGCAAAGA 151 ATGCACAGTA TGATCAGCTC AGTGGATGTG AAGTCAGAAG TTCCTGTGGG 201 CCTGGAGCCC ATCTCACCTT TAGACCTAAG GACAGACCTC AGGATGATGA 251 TGCCCGTGGT GGACCCTGTT GTCCGTGAGA AGCAATTGCA GCAGGAATTA 301 CTTCTTATCC AGCAGCAGCA ACAAATCCAG AAGCAGCTTC TGATAGCAGA 351 GTTTCAGAAA CAGCATGAGA ACTTGACACG GCAGCACCAG GCTCAGCTTC 401 AGGAGCATAT CAAGGAACTT CTAGCCATAA AACAGCAACA AGAACTCCTA 451 GAAAAGGAGC AGAAACTGGA GCAGCAGAGG CAAGAACAGG AAGTAGAGAG 501 GCATCGCAGA GAACAGCAGC TTCCTCCTCT CAGAGGCAAA GATAGAGGAC 551 GAGAAAGGGC AGTGGCAAGT ACAGAAGTAA AGCAGAAGCT TCAAGAGTTC 601 CTACTGAGTA AATCAGCAAC GAAAGACACT CCAACTAATG GAAAAAATCA 651 TTCCGTGAGC CGCCATCCCA AGCTCTGGTA CACGGCTGCC CACCACACAT 701 CATTGGATCA AAGCTCTCCA CCCCTTAGTG GAACATCTCC ATCCTACAAG 751 TACACATTAC CAGGAGCACA AGATGCAAAG GATGATTTCC CCCTTCGAAA 801 AACTGCCTCT GAGCCCAACT TGAAGGTGCG GTCCAGGTTA AAACAGAAAG 851 TGGCAGAGAG GAGAAGCAGC CCCTTACTCA GGCGGAAGGA TGGAAATGTT 901 GTCACTTCAT TCAAGAAGCG AATGTTTGAG GTGACAGAAT CCTCAGTCAG 951 TAGCAGTTCT CCAGGCTCTG GTCCCAGTTC ACCAAACAAT GGGCCAACTG 1001 GAAGTGTTAC TGAAAATGAG ACTTCGGTTT TGCCCCCTAC CCCTCATGCC 1051 GAGCAAATGG TTTCACAGCA ACGCATTCTA ATTCATGAAG ATTCCATGAA 1101 CCTGCTAAGT CTTTATACCT CTCCTTCTTT GCCCAACATT ACCTTGGGGC 1151 TTCCCGCAGT GCCATCCCAG CTCAATGCTT CGAATTCACT CAAAGAAAAG 1201 CAGAAGTGTG AGACGCAGAC GCTTAGGCAA GGTGTTCCTC TGCCTGGGCA 1251 GTATGGAGGC AGCATCCCGG CATCTTCCAG CCACCCTCAT GTTACTTTAG 1301 AGGGAAAGCC ACCCAACAGC AGCCACCAGG CTCTCCTGCA GCATTTATTA 1351 TTGAAAGAAC AAATGCGACA GCAAAAGCTT CTTGTAGCTG GTGGAGTTCC 1401 CTTACATCCT CAGTCTCCCT TGGCAACAAA AGAGAGAATT TCACCTGGCA 1451 TTAGAGGTAC CCACAAATTG CCCCGTCACA GACCCCTGAA CCGAACCCAG 1501 TCTGCACCTT TGCCTCAGAG CACGTTGGCT CAGCTGGTCA TTCAACAGCA 1551 ACACCAGCAA TTCTTGGAGA AGCAGAAGCA ATACCAGCAG CAGATCCACA 1601 TGAACAAACT GCTTTCGAAA TCTATTGAAC AACTGAAGCA ACCAGGCAGT 1651 CACCTTGAGG AAGCAGAGGA AGAGCTTCAG GGGGACCAGG CGATGCAGGA 1701 AGACAGAGCG CCCTCTAGTG GCAACAGCAC TAGGAGCGAC AGCAGTGCTT 1751 GTGTGGATGA CACACTGGGA CAAGTTGGGG CTGTGAAGGT CAAGGAGGAA 1801 CCAGTGGACA GTGATGAAGA TGCTCAGATC CAGGAAATGG AATCTGGGGA 1851 GCAGGCTGCT TTTATGCAAC AGGTAATAGG CAAAGATTTA GCTCCAGGAT 1901 TTGTAATTAA AGTCATTATC TGAACATGAA ATGCATTGCA GGTTTGGTAA 1951 ATGGATATGA TTTCCTATCA GTTTATATTT CTCTATGATT TGAGTTCAGT 2001 GTTTAAGGAT TCTACCTAAT GCAGATATAT GTATATATCT ATATAGAGGT 2051 CTTTCTATAT ACTGATCTCT ATATAGATAT CAATGTTTCA TTGAAAATCC 2101 ACTGGTAAGG AAATACCTGT TATACTAAAA TTATGATACA TAATATCTGA 2151 GCAGTTAATA GGCTTTAAAT TTATCCCAAA GCCTGCTACA CCAATTACTT 2201 CTAAAGAAAA CAAATTCACT GTTATTTTGA GTTTATGTGT TGAGATCAGT 2251 GACTGCTGGA TAGTCTCCCA GTCTGATCAA TGAAGCATTC GATTAGTTTT 2301 TGATTTTTTG CAACATCTAG AATTTAATTT TCACATCACT GTACATAATG 2351 TATCATACTA TAGTCTTGAA CACTGTTAAA GGTAGTCTGC CCCTTCCTTC 2401 CTCTCTCTTT TTTTAGTTAA GTAGAAATGT TCTGGTCACC ATGCCAGTAG 2451 TCCTAGGTTA TTGTGTAGGT TGCAATTGAA CATATTAGGA ATACAGGTGG 2501 TTTTAAATAT ATAGATGCAA ATTGCAGCAC TACTTTAAAT ATTAGATTAT 2551 GTCTCACATA GCACTGCTCA TTTTACTTTT ATTTTGTGTA ATTTGATGAC 2601 ACTGTCTATC AAAAAAGAGC AAATGAAGCA GATGCAAATG TTAGTGAGAA 2651 GTAATGTGCA GCATTATGGT CCAATCAGAT ACAATATTGT GTCTACAATT 2701 GCAAAAAACA CAGTAACAGG ATGAATATTA TCTGATATCA AGTCAAAATC 2751 AGTTTGAAAA GAAGGTGTAT CATATTTTAT ATTGTCACTA GAATCTCTTA 2801 AGTATAATTC CATAATGACA TGGGCATATA CCGTAACATT CTGGCAAATA 2851 ACAATTAGAA AAGATAGGTT TAACAAAAAA ATTTACTTGT ATATAATGCA 2901 CCTTCAGGAG GACTATGTCC TTTGATGCTA TAAAATACAA ACAACTTTGA 2951 AGGCAACAGA AGACACTGTT TATTCAAGTC AGTTCTTTGT CAGGTTCCTG 3001 CTGTTCTCCT ACAGAAAAGT GATTCTGTGA GGGTGAACAG GAAATGCCTT 3051 GTGGAAACAG GAAGTCCAAG TGATTCATGT ACTGAGGAAT GTAGGAAAAA 3101 AAATCTGAGG ATAGTGCTTT ACTCTTTCTG TTTTTAAAGG GCACTCTATG 3151 AATTGATTTA TTGTCTAAGA AAATAACACC ACAAGTAGGG AAATTGTTAC 3201 GGAAGCTTTT CACTGGAACA TTTCCTTCAT ATTCCCTTTT GATATGTTTA 3251 CCTTGTTTTA TAGGTTTACT TTTGTTAAGC TAGTTAAAGG TTCGTTGTAT 3301 TAAGACCCCT TTAATATGGA TAATCCAAAT TGACCTAGAA TCTTTGTGAG 3351 GTTTTTTCTA TTAAAATATT TATATTTCTA AATCCGAGGT ATTTCAAGGT 3401 GTAGTATCCT ATTTCAAAGG AGATATAGCA GTTTTGCCAA ATGTAGACAT 3451 TGTTCAACTG TATGTTATTG GCACGTGTTG TTTACATTTT GCTGTGACAT 3501 TTAAAAATAT TTCTTTAAAA ATGTTACTGC TAAAGATACA TTATCCTTTT 3551 TTAAAAAGTC TCCATTCAAA TTAAATTAAC ATAACTAGAA GTTAGAAAGT 3601 TTAAAAGTTT TCCACATAAT GAAAGTCCTT CTGATAATTT GACAAATAGC 3651 TATAATAGGA ACACTCCCTA TCACCAACAT ATTTTGGTTA GTATATTCCT 3701 TCATATTAAA ATGACTTTTT GTCAGTTGTT TTGCATTAAA AATATGGCAT 3751 GCCTAAGATA AAATTGTATA TTTTTTCCAT CTCATAAATA TTCATTTTCT 3801 TCAAAGTCTT TTTTCAATCT CATAAAAAAG GGATAGTGCA TCTTTTAAAA 3851 TACATTTTAT TTGGGGAGGA ACATGTGGCT GAGCAGACTT TTGTATAATA 3901 TTACTTCAAA GATATGTAAT CACAAACAAA AAAAACTATT TTTTATAATG 3951 TCATTTGAGA GAGTTTCATC AGTACAGTTG GTGGACGTTA ATTGTTTGAA 4001 TTTGATAGTC TTTGAATTTA ATCAAGAAAC TACCTGGAAC CAGTGAAAAG 4051 GAAAGCTGGA CTTAAATAAT CTTAGAATTA ATTGATAAAT GTCTCTTTTA 4101 AAATCTACTG TATTTATTAT AATTTACACC CTTGAAGGTG ATCTCTTGTT 4151 TTGTGTTGTA AATATATTGT TTGTATGTTT CCCTTCTTGC CTTCTGTTAT 4201 AAGTCTCTTC CTTTCTCAAA TAAAGTTTTT TTTAAAAG // LOCUS AB018413 6006 bp mRNA PRI 06-APR-1999 DEFINITION Homo sapiens mRNA for Gab2, complete cds. ACCESSION AB018413 NID g4589374 VERSION AB018413.1 GI:4589374 KEYWORDS Gab2; gab2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nishida,K., Yoshida,Y., Itoh,M., Fukada,T., Ohtani,T., Shirogane,T., Atsumi,T., Takahashi-Tezuka,M., Ishihara,K., Hibi,M. and Hirano,T. TITLE Gab-family adapter proteins act downstream of cytokine and growth factor receptors and T- and B-cell antigen receptors JOURNAL Blood 93 (6), 1809-1816 (1999) MEDLINE 99168966 REFERENCE 2 (bases 1 to 6006) AUTHORS Hirano,T., Hibi,M. and Nishida,K. TITLE Direct Submission JOURNAL Submitted (05-OCT-1998) to the DDBJ/EMBL/GenBank databases. Toshio Hirano, Osaka University Medical School, Division of Molecular Oncology Biomedical Research Center; 2-2 Yamada-oka, Suita, Osaka 565-0871, Japan (E-mail:hirano@molonc.med.osaka-u.ac.jp, Tel:06-879-3880, Fax:06-879-3889) FEATURES Location/Qualifiers source 1. .6006 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1. .2031 /gene="gab2" CDS 1. .2031 /gene="gab2" /codon_start=1 /product="Gab2" /protein_id="BAA76737.1" /db_xref="PID:d1040490" /db_xref="PID:g4589375" /db_xref="GI:4589375" /translation="MSGGGDVVCTGWLRKSPPEKKLRRYAWKKRWFILRSGRMSGDPD VLEYYKNDHSKKPLRIINLNFCEQVDAGLTFNKKELQDSFVFDIKTSERTFYLVAETE EDMNKWVQSICQICGFNQAEESTDSLRNVSSAGHGPRSSPAELSSSSQHLLRERKSSA PSHSSQPTLFTFEPPVSNHMQPTLSTSAPQEYLYLHQCISRRAENARSASFSQGTRAS FLMRSDTAVQKLAQGNGHCVNGISGQVHGFYSLPKPSRHNTEFRDSTYDLPRSLASHG HTKGSLTGSETDNEDVYTFKTPSNTLCREFGDLLVDNMDVPATPLSAYQIPRTFTLDK NHNAMTVATPGDSAIAPPPRPPKPSQAETPRWGSPQQRPPISENSRSVAATIPRRNTL PAMDNSRLHRASSCETYEYPQRGGESAGRSAESMSDGVGSFLPGKMIVGRSDSTNSED NYVPMNPGSSTLLAMERAGDNSQSVYIPMSPGAHHFDSLGYPSTTLPVHRGPSRGSEI QPPPVNRNLKPDRKAKPTPLDLRNNTVIDELPFKSPITKSWSRANHTFNSSSSQYCRP ISTQSITSTDSGDSEENYVPMQNPVSASPVPSGTNSPAPKKSTGSVDYLALDFQPSSP SPHRKPSTSSVTSDEKVDYVQVDKEKTQALQNTMQEWTDVRQSSEPSKGAKL" BASE COUNT 1460 a 1634 c 1615 g 1297 t ORIGIN 1 ATGAGCGGCG GCGGCGACGT GGTGTGCACC GGCTGGCTGA GGAAATCGCC 51 TCCCGAGAAG AAGTTGAGGC GCTATGCCTG GAAGAAACGC TGGTTTATCC 101 TGCGGAGTGG CCGGATGAGC GGTGACCCAG ATGTTCTGGA ATACTACAAG 151 AACGATCACT CCAAGAAGCC TCTGCGGATC ATCAACCTGA ACTTCTGTGA 201 GCAGGTAGAT GCAGGCCTGA CCTTTAACAA GAAGGAGCTG CAGGATAGTT 251 TTGTGTTTGA CATCAAGACC AGTGAACGCA CCTTTTACCT GGTGGCTGAG 301 ACAGAAGAGG ACATGAATAA GTGGGTCCAG AGCATCTGCC AGATCTGTGG 351 CTTCAATCAG GCTGAGGAGA GCACAGACTC CCTGAGAAAT GTTTCCTCAG 401 CCGGTCATGG CCCCCGCTCT TCTCCAGCTG AGCTCAGCAG CTCTAGCCAG 451 CACCTTCTCC GAGAGCGCAA GTCCTCAGCC CCATCACACT CCAGCCAGCC 501 AACTCTGTTC ACGTTTGAAC CCCCTGTGTC AAACCACATG CAGCCCACCT 551 TGTCCACCAG CGCACCTCAG GAGTATCTCT ACTTGCACCA GTGCATAAGC 601 CGAAGAGCAG AAAATGCAAG GAGTGCCAGC TTCTCTCAGG GCACCAGAGC 651 CTCTTTTCTC ATGAGGAGTG ACACAGCTGT ACAAAAACTT GCCCAGGGCA 701 ATGGACACTG TGTCAACGGG ATCAGTGGTC AAGTCCATGG CTTCTATAGC 751 CTTCCCAAGC CGAGCCGGCA CAATACAGAA TTCAGAGACA GTACCTACGA 801 CCTCCCCCGC AGCCTGGCCT CCCATGGCCA CACCAAGGGC AGCCTCACAG 851 GCTCCGAGAC AGATAATGAG GATGTGTACA CCTTCAAGAC GCCCAGCAAC 901 ACCCTGTGCA GGGAGTTCGG GGACCTCCTG GTAGACAATA TGGATGTTCC 951 GGCCACCCCA CTCTCAGCCT ACCAGATCCC TAGGACATTC ACTCTGGACA 1001 AAAACCACAA TGCCATGACA GTGGCCACTC CTGGGGACTC AGCCATAGCT 1051 CCCCCACCCC GCCCCCCCAA GCCAAGTCAG GCAGAAACAC CTCGATGGGG 1101 CAGTCCTCAG CAGAGACCGC CAATCAGTGA AAATAGCAGA TCTGTCGCTG 1151 CCACCATCCC CAGACGCAAC ACCCTCCCTG CAATGGACAA CAGCCGACTT 1201 CACCGAGCTT CTTCCTGTGA GACCTACGAG TACCCACAGC GTGGTGGAGA 1251 GAGTGCAGGC CGGTCTGCTG AATCCATGAG TGATGGAGTT GGCTCTTTCC 1301 TGCCAGGGAA AATGATTGTG GGCCGATCGG ACAGCACCAA TTCTGAAGAC 1351 AACTATGTGC CCATGAATCC AGGTTCTTCC ACCCTGTTGG CCATGGAACG 1401 AGCAGGTGAT AATTCCCAGA GCGTCTACAT CCCAATGAGC CCAGGGGCCC 1451 ATCACTTTGA CTCACTTGGC TACCCATCAA CAACCCTTCC TGTGCACCGA 1501 GGCCCCAGCA GAGGAAGTGA GATTCAGCCA CCCCCTGTCA ACCGCAACCT 1551 CAAACCTGAT CGGAAAGCAA AGCCAACACC ACTTGACCTG AGGAACAACA 1601 CCGTCATCGA TGAACTCCCC TTCAAGTCAC CTATCACCAA GTCTTGGTCT 1651 AGGGCCAACC ACACCTTCAA CTCCAGCTCC TCCCAGTACT GCCGCCCCAT 1701 CTCCACCCAG AGCATCACCA GCACAGACTC AGGAGACAGC GAAGAGAACT 1751 ATGTCCCTAT GCAAAACCCA GTGTCTGCAT CTCCCGTTCC CAGTGGCACG 1801 AACAGTCCTG CCCCTAAGAA GAGCACCGGC AGCGTTGATT ATCTGGCCCT 1851 GGACTTCCAG CCGAGCTCCC CAAGCCCCCA CCGCAAGCCA TCTACTTCAT 1901 CCGTCACCTC TGATGAGAAG GTGGACTACG TTCAGGTGGA CAAGGAGAAG 1951 ACCCAGGCCC TGCAGAACAC CATGCAGGAG TGGACAGACG TGCGGCAGTC 2001 CTCAGAGCCT TCCAAGGGTG CCAAGCTGTG ATGAGAGGGC CACCGCAGAG 2051 CCCAGGAGGC AGCATCTCCA GAGCTGGCCC TTCCCATCTC CCCTCTCCCC 2101 TCTCCCGTTC TTCCTCCCAT CCACCTCCTC TCTACTCTGC CAGTCTCAGC 2151 CTTCAAAGCA CTTGACATCA GGGACCCTGA ACCCTTCCCC TGGGAGGTGA 2201 GGGCCTGATC AAGGCACCTC CTCTGCCCAC TCGGGGCCCA GCTGTGATTT 2251 TTATCAGTAA TGGCCATGCC TCCACCCACC TTAGTTAGGA GCTACTTCCA 2301 AAAAGCATCC TTCAGCCTCT TCCTGTCCTT TAGACCTGAC TCTCTACCAG 2351 ATGTTTGGAG GGAAGGGCTG GGGCTCTGAG CCAGATTCCA CACCTCACGT 2401 TCAGTCACAG CCCTCAGCTA TCTTCCCTCC GGCCACTGGG CTACCTCTCC 2451 TTCAGTCCCA GAAGACAAGT CTCACCAACC CAGGGAGTCA AGGACCAGCA 2501 AACCAAAGTG GATAATGGAC TTTTTCATTC CTGTTTTTCT TGGCAGGAGA 2551 GAAGCAAGGC CACTAAAAGA GGAGATGGTG GAGACGGAGG CTCAGCAGTG 2601 GTCTTGAGGG GTAAAGGACT TAGATGCCCA GATGAAGAGG GAAAGCTGAC 2651 ATCTGCAGGG AACCCACTTT GAGGCTGAGG CCATGGCAGG ACAGCTGCTG 2701 TGGGGTGCAG AGGCAGAAGA TGAAGGACAA AAGGAAGGGA AAACTGATGG 2751 CCAACCTAGA GCAGCAAGGA GCAGGGCTTG GAGCTCGGGT GGTGGAGATG 2801 ACAAGGACAC TGTGGGGTCT GGGTCCCCAG AACTCTGGAG CTACAGGCCA 2851 CTCTAGGCCC AAGGGCTAGT CCTCTTCCCC AGTCCCCTCA GAGGCCCCCG 2901 CCAGCCCCAC CTTGAAAGCA GCATACAGGG GAAGGCTTGG ACCAAGCTGG 2951 GCGACCAAGC ACATGGGGCA GGAACACATG GTAAAGGGGT GGGGAATATG 3001 GGAGGGAGTG TGGTGTGGAT GGGGGTGATG CAGGGACTGA GGGGAACCCT 3051 GGGACAGGCA CAGGCTGGGC AGAGGCACAG GGCAGTGCAG GGGACTCTGC 3101 AGTGGGGTCG GGAAGTGAGT TTCTTTGCAG TGAGCAGTGC AGTGGAAGTC 3151 GGGCACAGAG GTAGCAGACA GATGTGAAGC AGTGGTGAAG GCCATGTAGC 3201 AAGTGGGGAA ATACATCCAA AGGGCCTGGG AGTTGGGGGG TGCCCAACGC 3251 AATCCTTGGG GGTGCAGGGT GGAGCAGAAA GTGAAGGAGG GACACGTGCA 3301 AGAGTGGTGT GCATGGTGGT GTGACATGAG GACCGTTCCT AGGATGGGAC 3351 AGTGGGTCAG GCAGGACAAG GAGAAAGCAG GGCAGAATGA TGCCTAGAGG 3401 ACCACATCAG GCATGGCTGA CAGCTTGTGC CCATGGGCTG TGGCGTATGT 3451 CAGATCGCAG GGTAGGAACG AGTCTGGCCT GGTGCCGGCC CAGTGTTTCC 3501 TCAGCTCATC CGCCCTCTGT TGCTCCCTAG CATTCCAGGA GCCATCTTGG 3551 ACTCTCCTCC CCAGGTTTGA AAGGCCATCA GATTAGCAGG GACGGGGTGT 3601 AGGGCATCAC CCAAGGTTCC TTCTCTTAAA CTAAGGGTGG GGGATCTGAA 3651 TGTTTTTATG TTGACTGTTC TTGACTAAAT TTTCAAGAGT TTCAGAAGCA 3701 ACAGGACAGA CCAGACGTTT CATTCTACCC TGGGGCGAAC AGAACTTCTT 3751 CCTCCCAAAC AATGACTTCC TGCCATGTTT GATGGGGACA GCTACCACTG 3801 TCCTCTGCCC CCATTCCCCT TTCAGCTCCC ATGAGCATGC ATAGTTCACC 3851 AGACCAATGG CCTAGCCATT CTCTAAGTCC CATCCTGGAA GAAGTTATTT 3901 CTTCAAGAGC TGCACCTCTC CTCCTAGCAT TAGTTTAGAT CAACTCAAGG 3951 AGTATTTATT AATGGCTGCT GTCTCCAGTT TCTGGGGTTA AGCACTAAGG 4001 ACACAAGAAT CAATCAGACC TTCTCCCTGA ACTTAAGATA GCCACAATCA 4051 GAAAAAGGAC AAGGACATGA GACAGTGGTG ATGGCCATCA GACAGAGACT 4101 TCAAATGCTG ATGGAGGGCA GAGGAAGTAC TTAGGGAGGT TGGTGTCAGA 4151 GGCAGGAGTG GGGGATCAGG GAAGGTGGAT TCTAGGAAAA GGGAGTGCCT 4201 GAGGTAGGCC TTAGAAGGGG ATGAGTCAGA TTTTTACAGA GGAGGAGGGC 4251 AGGGCTTGGG TCCAGTGGAG GAAGAAGGAA GGAGAGGCTT GGAAAGCCTT 4301 GTGTCTTGGG AAAAAAAGGC CTTTGAGCAT ATGGGTCCAG CCACTCAGAA 4351 GTGCAGGGGC CATGCCTTGG TGTTCCAATA AGTGAATGGA AGCAGTGGTG 4401 GTAGCTACAC TGGGCAGAGT TGGCAGGGTG CTGGTTCACT CTGCCCAGCC 4451 CTGAATGTGT GCCTTAAAGG CCCCCTACAA GGGGCCCCAT ACGACAGAGC 4501 TTTTAACTGG TGCCTTCCCT GTACCCGCAG CAGCCACAAG TGGGCCCAGA 4551 CTATTGCAGC CTCCCATAAA CATGTGAGCA TGTTCTGAGT GTGCCATGAT 4601 GTGAGTGGAC CTGGCTGGAA TCTTCGGAGA GCGACTGAGG TGTTCAAATC 4651 GAATCTCCCA GGAGGCTTCC TTCCAGCCCC CTATTCTGGT AACTACCAGG 4701 AGGCTTCCTT CCAGCCCCCT ATTCTGGTAA CTACCAAAAT CCCTCGGGTG 4751 CAAGTGTAGG GGTAGAGATG GAAGGATGAG AGGTGAAATT GACCTTTTGA 4801 AAGCAAAGCT CTGGCTCACA GGCCCCAAAC TACCAGCCGT ATCTAGCATA 4851 TCCCCACCCT CCACCCACTA CCTCCTCCAA CAAAGGAGTC AACTCAGTTG 4901 AAAAAACTGG TCCTTTGGCC TATCCATGGG TCAAAGTCCA CCTCTCCTGG 4951 GGGCCTGGAG AGGACTGAGC CTACGGAAAG GGGATACCTT CCCACTCAGC 5001 ACTGCTTCAC ACAGGCCCCC TGCCTGGGGC TCTCCAAGGA GCCTTCTTCA 5051 CCCACTTCCA GCTCCACTTC TGCAAGGTTA AGTCAAGTGA GAACGATGAG 5101 AAATAGGGAG ATGGTGTCTC CTTAAGTCCT TGATCTGCCT GTCTGTGGAA 5151 TGGGAGGTTG GATTAGCTGC GCTGAGGTCC CATCCACAGC TGGTGCTCAG 5201 CTGCTTGAAG GGGAGACTCC CTCCTCTGTA ACTTCTTTCT GGGGGATTGG 5251 GGTGGGCAGT ACCTATCCCC AGTCCCCTCC TAGCTTGACT TTAGTGGTTT 5301 CCAATGTAGA AGTTAACAAA GTATGCCCCA TTCCTGTGAC AAAAGCACAA 5351 CCATTCTGAA GTTACTGGAG CATGGGCTCA GCTCATCCTC CCTCTGGCCC 5401 CTTCTCCCAT GGGGACATCT CGGCCCAGCA CCCCTATCCC ATTTCCAGAG 5451 TTCTTCCTTC CCCATCTGGG CCTTCATAAA ATGCAGGGGA AGCCAGACTG 5501 GTCTCAGGAG CGCTAAAGCC CTTCCGTGGG GGGTCGTCTT TCTGGGACTA 5551 GCCCTGCTGT TTAGGACCTG GGACCACAAT GGGGTACCTG CCGAGGGGGT 5601 CCCCAAGAGA TCCAGGCTGT CATGTGATTT ATGGTGGCAT GTGTTGTGTA 5651 TTTGTTGGCT ACTTGTGTCT TGAAATCTAG AATTATTTCA CGCAGAATTG 5701 TCACTGTTTG TCAGGAAGAG AAAATGGGCT AGTGGAAGCC CAGTCTTGAG 5751 TTCTTGTCTT GTTACCATTT AAAATTGACA TTTAATTTTC AAATCACTGT 5801 TGGTGCCTAA TCACTTAAGT TATTAATTTA TTCTGTTGTA TTCTTTTTTT 5851 TAAATTGTAA CATATTTATC CGGTGGGTGG GACAGGAGTG TGTTCAAGTG 5901 GGTCATGTTT TTGCTGTGGT GACACATGGT ACAGGCTTGG AGCTTGCAGG 5951 TCCCTTTCTA CTGTGGTGTT GGAGCAGGAC AATAAAGTCC ACTAGAAATG 6001 CACCCC // LOCUS AB019517 923 bp mRNA PRI 08-MAY-1999 DEFINITION Homo sapiens PKIG mRNA for protein kinase inhibitor gamma, complete cds. ACCESSION AB019517 NID g4760550 VERSION AB019517.1 GI:4760550 KEYWORDS protein kinase inhibitor gamma. SOURCE Homo sapiens 12-weeks fetus cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 923) AUTHORS Saito,T. and Miyajima,N. TITLE Protein kinase inhibitor gamma mRNA, complete cds JOURNAL Published Only in DataBase (1999) In press REFERENCE 2 (bases 1 to 923) AUTHORS Saito,T. and Miyajima,N. TITLE Direct Submission JOURNAL Submitted (04-NOV-1998) to the DDBJ/EMBL/GenBank databases. Toshiyuki Saito, National Institute of Radiological Sciences, Genome Research Group; Anagawa 4-9-1, Inage, Chiba 263-8555, Japan (E-mail:t_saito@nirs.go.jp, Tel:81-43-206-3135, Fax:81-43-251-9818) FEATURES Location/Qualifiers source 1. .923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q" /tissue_type="12-weeks fetus" gene 5. .235 /gene="PKIG" CDS 5. .235 /gene="PKIG" /codon_start=1 /product="protein kinase inhibitor gamma" /protein_id="BAA77336.1" /db_xref="PID:d1041102" /db_xref="PID:g4760551" /db_xref="GI:4760551" /translation="MMEVESSYSDFISCDRTGRRNAVPDIQGDSEAVSVRKLAGDMGE LALEGAEGQVEGSAPDKEAGNQPQSSDGTTSS" BASE COUNT 232 a 270 c 229 g 192 t ORIGIN 1 AGGCATGATG GAGGTCGAGT CCTCCTACTC GGACTTCATC TCCTGTGACC 51 GGACAGGCCG TCGGAATGCG GTCCCTGACA TCCAGGGAGA CTCAGAGGCT 101 GTGAGCGTGA GGAAGCTGGC TGGAGACATG GGCGAGCTGG CACTCGAGGG 151 GGCAGAAGGA CAGGTGGAGG GAAGCGCCCC AGACAAGGAA GCTGGCAACC 201 AGCCCCAGAG CAGCGATGGG ACCACCTCGT CTTGAATCTG ACCTTGTCCA 251 AGAAGGCTGG ACGAGAGACC TTCTGTCCCC TCCCAGAGGG GAAACCCTGG 301 CACTGGCCCA GCAGCCTCTT CTCTGAGCTC CATGTCCCAG ATAAACCAGG 351 CCAGACTGAG AAGGCTCCCC AGAGGCCTCT GTGGCCTCCA CTCCGGGAAA 401 GCCCTCTGCC CACACCCACA GGCTTCACAT TCCCACCACC TTCGCACCGT 451 GCCCAGGTAC ACTTTCAAGA CACTGTAACC ACAAGATGTT ATTTATTGAG 501 CTGGCGCCGG GACTTGGGCG GGGCCTGCCC TACAGTGAGC AGCCCACACA 551 GGAACGCTCC TCTCGCGAGC GGCCCGGGCA GGGACCCTGT CCCAACACCA 601 ACACCTCCTC TCCAGCCCAA TCTTCTGGGT CCAGACCTGC TTGTCCCTTT 651 TTTAGAAAAC ACTTTTAAAC TTTTTAAAAA TTTTAAACCT TTTTTCAGCA 701 GATATGGAGA GAGCTGACAA TCAATTCACA TTTTTTAAGC CATTTTAGCT 751 AAACTGTCAT TGTGCATCTC TGAGGTTCCC TCATGGAGCT CCACAGATCC 801 ATTTTTAGGG AAGGGATTTT GGCTCAAAAC GATCTGACCA CCTCTGCCCT 851 GTCCACCAGG ATAAGTGACA CCTAGGACCC AGGAAATAAA TGCCGATGAT 901 TTGTGTGAAA AAAAAAAAAA AAA // LOCUS AB020316 4196 bp mRNA PRI 12-MAY-1999 DEFINITION Homo sapiens mRNA for dermatan/chondroitin sulfate 2-sulfotransferase, complete cds. ACCESSION AB020316 NID g4803734 VERSION AB020316.1 GI:4803734 KEYWORDS dermatan/chondroitin sulfate 2-sulfotransferase. SOURCE Homo sapiens cell_line:Lymphoma Raji cell cDNA to mRNA, clone_lib:Raji cell cDNA library. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kobayashi,M., Sugumaran,G., Liu,J., Shworak,N.W., Silbert,J.E. and Rosenberg,R.D. TITLE Molecular cloning and characterization of a human uronyl 2-sulfotransferase that sulfates iduronyl and glucuronyl residues in dermatan/chondroitin sulfate JOURNAL J. Biol. Chem. 274 (15), 10474-10480 (1999) MEDLINE 99214613 REFERENCE 2 (bases 1 to 4196) AUTHORS Kobayashi,M. and Rosenberg,R.D. TITLE Direct Submission JOURNAL Submitted (19-NOV-1998) to the DDBJ/EMBL/GenBank databases. Robert D Rosenberg, Massachusetts Institute of Technology, Biology; 77 Massachusetts Avenue, Cambridge, MA 02139, USA (E-mail:masashik@MIT.EDU, Tel:1-617-253-8803, Fax:1-617-258-6553) FEATURES Location/Qualifiers source 1. .4196 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Lymphoma Raji cell" /clone_lib="Raji cell cDNA library" CDS 104. .1324 /codon_start=1 /product="dermatan/chondroitin sulfate 2-sulfotransferase" /protein_id="BAA77510.1" /db_xref="PID:d1041276" /db_xref="PID:g4803735" /db_xref="GI:4803735" /translation="MKKKQQHPGGGADPWPHGAPMGGAPPGLGSWKRRVPLLPFLRFS LRDYGFCMATLLVFCLGSLLYQLSGGPPRFLLDLRQYLGNSTYLDDHGPPPSKVLPFP SQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIHNKTRLTKNEQMELIKNISTAE QPYLFTRHVHFLNFSRFGGDQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQNHMIRTP SMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGEWALERAKLNVNE NFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPSPEAV QILYQRMRYEYEFYHYVKEQFHLLKRKFGLKSHVSKPPLRPHFFIPTPLETEEPIDDE EQDDEKWLEDIYKR" polyA_site 4196 /note="19 A nucleotides" BASE COUNT 1137 a 940 c 968 g 1151 t ORIGIN 1 CGGCCCTCCC ATGTGCAGCC CGGCCAGCCG GGCTCTCCTC CTCGCGGCGG 51 ATGGGTGACC TTTTCCTGGC ACGGGCAGGC TGTGGGAGGC AGCGGAGCAG 101 GCGATGAAGA AGAAGCAGCA GCATCCCGGC GGCGGCGCGG ATCCCTGGCC 151 CCATGGGGCC CCTATGGGGG GCGCCCCTCC GGGCCTGGGC AGCTGGAAGC 201 GTCGGGTGCC CCTGCTGCCT TTCCTGCGCT TCTCCCTCCG GGACTACGGC 251 TTCTGCATGG CCACCCTGCT GGTCTTCTGC CTGGGCTCCC TCCTCTATCA 301 GCTCAGCGGG GGACCCCCTC GCTTCCTGCT CGACCTGCGG CAGTACTTGG 351 GAAATTCCAC TTACTTGGAT GACCATGGAC CACCTCCTAG TAAGGTACTA 401 CCTTTCCCAA GCCAGGTGGT GTACAACAGG GTAGGCAAGT GTGGGAGCCG 451 TACTGTGGTC TTGCTTCTGA GAATCTTGTC GGAGAAGCAC GGATTTAATT 501 TGGTCACATC AGACATTCAC AACAAAACCA GGCTTACTAA AAATGAACAA 551 ATGGAACTGA TTAAAAATAT AAGTACTGCC GAACAACCCT ATTTATTCAC 601 TCGACATGTT CATTTCCTCA ACTTCTCAAG GTTTGGAGGA GACCAGCCTG 651 TCTACATCAA CATCATTAGA GACCCCGTCA ACCGGTTCTT ATCCAACTAT 701 TTTTTCCGTC GCTTTGGAGA CTGGAGAGGG GAACAAAATC ACATGATCCG 751 CACCCCCAGC ATGAGGCAGG AGGAGCGCTA CCTGGATATC AATGAGTGTA 801 TTCTTGAAAA CTATCCCGAG TGCTCCAACC CCAGGTTATT TTACATCATT 851 CCGTACTTTT GTGGACAGCA TCCCAGATGC AGGGAGCCTG GTGAATGGGC 901 CCTTGAGAGA GCAAAGCTGA ACGTGAATGA AAACTTCCTG CTCGTGGGGA 951 TTCTTGAAGA GTTGGAAGAT GTGCTGCTGT TACTGGAAAG ATTTTTACCT 1001 CATTACTTCA AGGGCGTGCT CAGTATCTAC AAAGACCCAG AGCACAGGAA 1051 GCTTGGAAAC ATGACTGTGA CGGTGAAGAA GACTGTCCCC TCTCCTGAGG 1101 CTGTGCAGAT CCTCTACCAG CGGATGAGAT ACGAGTACGA GTTTTACCAC 1151 TACGTCAAAG AGCAGTTCCA CCTGCTGAAG CGCAAGTTTG GACTTAAGTC 1201 TCACGTCAGC AAGCCCCCCC TGAGGCCACA CTTCTTTATC CCAACTCCAC 1251 TGGAAACCGA GGAGCCAATC GACGATGAAG AACAGGATGA TGAAAAGTGG 1301 CTGGAAGATA TTTATAAGAG GTGATGTGAC TGTGTTGCCT CTATGGCTTT 1351 ATCTCCCTTT TCCAGAAAGT TCTTTGTTTG GGGAAGTAAA ATCCTTAAGG 1401 GACTAAATTA ATGCTTGGGT GCATTAAAAA GAACAAAACA TTCCCACATG 1451 TTGGGGTCAT TGGGAGATGC CCGGTTTTGC GGGTTTTATT TGTTTAATTT 1501 TATTCTGTGT TTTCTCTTGG CTCTTTGGGT CTTTCCCGGG TACACTAGAT 1551 GGCTCCATCC CAAGGCATCT TGTCATAAAA CAGCTTTCCC CCACCCCATA 1601 TCATGGGAAA AGGGGGAGAA ATATAGCCCC TAGCCTAATA ACTTATCATT 1651 TGTAAAATGA CTTATAAAAA TATTACCTCA ATGGTAGGAG ACATCCAGAC 1701 TTGTATATTT CAGTGGAAAT ACAAAACCAC TTCAGAGACC AGGGTATCTC 1751 CTCTGGAAGG ATCTAAGAGA AGGTAAGACA GATTAGGACA TCGAAAAGGA 1801 GGATGGAGCC AGGTGCCATG GCTTGAGCCT ATAATCCGAG GCTGAGGTGG 1851 GAGGATCACT TGAGCCCAGG AGTTTGAGGT TGCAGTGAGC TGTGATCACA 1901 CCACTGCACT CCAGCCTGGG TGACAGAGTG AGACTCTGTC TCAATTAATT 1951 TTTTTTTTTT AAAGGAGGAG GATCTCCATG GGTAAGTGGT TTCTACCCGC 2001 ATGGGTAGAG TTCTGCCTCT GGTCCTTCTC AGGGGGCACT TTCACCAAGA 2051 GCAGTGTAAT TATCTCTGAA AGAGCAAGTC AGCTTGTGCC GCATCCCCAA 2101 CCAATCCACA GCCTGGAGTA CCTTTCAAGG TCAAAGTGCA TGGCCAGCTC 2151 CATTGAGACA TTCCATTTCA AAGCACCGTG CTGACAGATA TCAAAGTACT 2201 CTAGCAGGGA AAATAATTTG TTTGCTGTGT AAGGAAGAAT GTAGACAAGA 2251 CAGATAAATC TGAAGGTCAT GTGGCATCAG GGAAAGGGCA TGGCTGTGTC 2301 TTTTGCACCC AATATGAAAC ATCTTCTCCC AACACTGCTT TAATGGAAGT 2351 TCTAGGAACC AATTTAGCTC AGGCATTTGA CTCCTACAGC AGAAGTTCTG 2401 AGCCTGACCA CAGATGGTGT GTAATCTATC AAACACACCC CTGGCCAAGT 2451 TGGGTCCTAT AGGACCTGGT ACTATGTACT ATTGTAACTT CTAGTTCCCT 2501 AAGAGGTACC TGTTTTCAGT AAAAAGGGGT CCTGAGTTCT GTGCAGGTGG 2551 AAGAGCTACC CGAGAACTAC CTGAGTTCTG TGCAGGTAGA GTCCCATTTC 2601 TTATGGGACC TGTGTGCTCC TGAGAACTCT TACTTGAGAC ATCAAAAAGA 2651 AGCAGCAAGA GCTTCTGGGA CAGAGACTGC TTGGCCAGCT TTGTAAGTAA 2701 GTGGCTGCCT CCAATGTGAT GTGAGTACAT GTTGGGCAGT CTCACTGTCC 2751 TAAGGTATGT CTTCTTTCCA CCTCCCACTG CCCCTCCCCT GCCACCTATC 2801 AATGATGCCT TGGTTCAGTC ATTAGAAATC TGTTGCTTTG AGTTCTGAAA 2851 TATTTTCACC TTAAAAAAAA TGCTGAAAAT ACACATTCTC CTGGGAAGAC 2901 GATAAACAGC TAGCTAAGAA GCCGAGGTTC AGTGGTGGCA GCAGGAAGGA 2951 CACTGCCACA AATTTTGTCT ATTTCATATT TGTCCCCTAG AGCCAGCCCT 3001 AGCAAATGTG TGAGTTGGGA GTAGTTAATA GTAAATAAGA CTCTGACTTT 3051 ACACAAGCTA CACATTTTAT ACTTTTCATA AACCACAAAG TCTCTCTAGA 3101 ATTTTTTCTG CCTTCACTAA AATTGGACTG TAGCCAAGAT ATAAAGCAAG 3151 TCATTTGGAA CCTGCCGAGT GAGCACTGAA GCTACTTTAT CATGAGATGT 3201 GTGTTAAGAA GGCTGCAGCC CACAGGAGTC CAGGGAAGGC GGGGACCACA 3251 GAGGCACAGA GTCCAGCACT TGGCCGCTCA TGGGCCTTCT TTCTGCCTCA 3301 GAGGACGGGG GCAGAGAAGT GATGAAGGGA AATGTTCTTA GAGGAGGAAA 3351 TATCCTTTGT CCTGTTCAGA GAGACCAGGG CCCTACCATT AGGCATACTT 3401 TCAGAAGCAA CCTGGAGAAC AGCTATCAAT CATATTCAAA ACCAGTACAA 3451 GAACTGCTGC CTGGTACCCT GTGAGTCATT TCTATGAAAT TCCATATAAA 3501 GAATGATGAT AAGTTTACAC ACTGTGCAAT CTCACAATCT GAAAATAAAG 3551 TTGAGTTGGC TGTGTTTTCT CTGCTCTTGT CAGAACATTG GGACAATTGG 3601 TCGTTCAAAA ACATTCATCC TCTTACTGCA AGTTTATCTG GGTACTTTTA 3651 CCTGTGTGTT CAAAGGCATT TCTTTTCAGC AGTGATCATT ATAACTTCAC 3701 AAAAAAAGAT GCTGACGGAT TTACTTACAG GGCCTTAATG TTATTTTGTC 3751 CCAGCCAACA CCCTCTAGGT CCTAAAAGTC AAGGTACTTC AGTTTATTTG 3801 GCAAACATGA CAACATTTTT TTGGCCCTGG GCCCAACAGT TTGTACTTCA 3851 TGAAACATAT TGTACATTTT ACATAGTTTA ATTTAAAAAA TACCTTTTAA 3901 GCTAGTTGAT CTTTGACTGT CTTATTTATT ATAACCTTTC AGCACATTCC 3951 AAGGTTTTAG TTACTCAGGA AGGAGTTAAT TAAAATGATT TTATTTTGGT 4001 CTGATGGATG TTTTTTAAAA GGAAAATTAT TATTATGAAC CTTCAGCCTA 4051 CTTTCTTGAG TGCCGTAAAA GTGCTTGTAA ATCTTTTTTT TTTTTTAAGA 4101 AGAAAGAAAA AAATGGTGTT TGACGTTGAT GGAAATTCAA AAATATATAT 4151 GGAACTGAAA CATTAACTTA GCTAAAATAA AAGCAATCTG TGTTTG // LOCUS AB020655 4220 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0848 protein, complete cds. ACCESSION AB020655 NID g4240184 VERSION AB020655.1 GI:4240184 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk05607. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (6), 355-364 (1998) MEDLINE 99156230 REFERENCE 2 (bases 1 to 4220) AUTHORS Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (02-DEC-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4220 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk05607" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 112. .3045 /gene="KIAA0848" CDS 112. .3045 /gene="KIAA0848" /codon_start=1 /product="KIAA0848 protein" /protein_id="BAA74871.1" /db_xref="PID:d1038605" /db_xref="PID:g4240185" /db_xref="GI:4240185" /translation="MKPSIAEMLHRGRMLWIILLSTIALGWTTPIPLIEDSEEIDEPC FDPCYCEVKESLFHIHCDSKGFTNISQITEFWSRPFKLYLQRNSMRKLYTNSFLHLNN AVSINLGNNALQDIQTGAFNGLKILKRLYLHENKLDVFRNDTFLGLESLEYLQADYNV IKRIESGAFRNLSKLRVLILNDNLIPMLPTNLFKAVSLTHLDLRGNRLKVLFYRGMLD HIGRSLMELQLEENPWNCTCEIVQLKSWLERIPYTALVGDITCETPFHFHGKDLREIR KTELCPLLSDSEVEASLGIPHSSSSKENAWPTKPSSMLSSVHFTASSVEYKSSNKQPK PTKQPRTPRPPSTSQALYPGPNQPPIAPYQTRPPIPIICPTGCTCNLHINDLGLTVNC KERGFNNISELLPRPLNAKKLYLSSNLIQKIYRSDFWNFSSLDLLHLGNNRISYVQDG AFINLPNLKSLFLNGNDIEKLTPGMFRGLQSLHYLYFEFNVIREIQPAAFSLMPNLKL LFLNNNLLRTLPTDAFAGTSLARLNLRKNYFLYLPVAGVLEHLNAIVQIDLNENPWDC TCDLVPFKQWIETISSVSVVGDVLCRSPENLTHRDVRTIELEVLCPEMLHVAPAGESP AQPGDSHLIGAPTSASPYEFSPPGGPVPLSVLILSLLVLFFSAVFVAAGLFAYVLRRR RKKLPFRSKRQEGVDLTGIQMQCHRLFEDGGGGGGGSGGGGRPTLSSPEKAPPVGHVY EYIPHPVTQMCNNPIYKPREEEEVAVSSAQEAGSAERGGPGTQPPGMGEALLGSEQFA ETPKENHSNYRTLLEKEKEWALAVSSSQLNTIVTVNHHHPHHPAVGGVSGVVGGTGGD LAGFRHHEKNGGVVLFPPGGGCGSGSMLLDRERPQPAPCTVGFVDCLYGTVPKLKELH VHPPGMQYPDLQQDARLKETLLFSAEKGFTDHQTQKSDYLELRAKLQTKPDYLEVLEK TTYRF" BASE COUNT 1145 a 1000 c 910 g 1165 t ORIGIN 1 CTGATGGATT TGCATTCAGG TTCCAGCCCT GCGTTTCCTA TATTGACTCC 51 TTATACACGA CCTGGCGCTC CAGTTTAGGA GGAGACGTTG TTTTGTAATC 101 AACCACGAAC GATGAAACCT TCCATAGCTG AGATGCTTCA CAGAGGAAGG 151 ATGTTGTGGA TAATTCTTCT AAGCACAATT GCTCTAGGAT GGACTACCCC 201 GATTCCCCTA ATAGAGGACT CAGAGGAAAT AGATGAGCCC TGTTTTGATC 251 CATGCTACTG TGAAGTTAAA GAAAGCCTCT TTCATATACA TTGTGACAGT 301 AAAGGATTTA CAAATATTAG TCAGATTACC GAGTTCTGGT CAAGACCTTT 351 TAAACTGTAT CTGCAGAGGA ATTCTATGAG GAAATTATAT ACCAACAGTT 401 TTCTTCATTT GAATAATGCT GTGTCTATTA ATCTTGGGAA CAATGCATTG 451 CAGGACATTC AGACTGGAGC TTTCAATGGT CTTAAGATTT TAAAGAGACT 501 ATATCTACAT GAAAACAAAC TAGATGTCTT CAGAAATGAC ACCTTCCTTG 551 GCTTGGAAAG TCTAGAATAT CTGCAGGCAG ATTACAATGT CATTAAACGT 601 ATTGAGAGTG GGGCATTTCG GAACCTAAGT AAATTGAGGG TTCTGATTTT 651 AAATGATAAT CTCATCCCCA TGCTTCCAAC CAATTTATTT AAGGCTGTCT 701 CTTTAACCCA TTTGGACCTA CGTGGAAATA GGTTAAAGGT TCTTTTTTAC 751 CGAGGAATGC TAGATCACAT TGGCAGAAGC CTGATGGAGC TCCAGCTGGA 801 AGAAAACCCT TGGAACTGTA CATGTGAAAT TGTACAACTG AAGAGTTGGC 851 TGGAACGCAT TCCTTATACT GCCCTGGTGG GAGACATTAC CTGTGAGACC 901 CCTTTCCACT TCCATGGAAA GGACCTACGA GAAATCAGGA AGACAGAACT 951 CTGTCCCTTG TTGTCTGACT CTGAGGTAGA GGCTAGTTTG GGAATTCCAC 1001 ATTCGTCATC AAGTAAGGAG AATGCATGGC CAACTAAGCC TTCCTCAATG 1051 CTATCCTCTG TTCATTTTAC TGCTTCTTCT GTCGAATACA AGTCCTCAAA 1101 TAAACAGCCT AAGCCCACCA AACAGCCTCG AACACCAAGG CCACCCTCCA 1151 CCTCCCAAGC TTTATATCCT GGTCCAAACC AGCCTCCCAT TGCTCCTTAT 1201 CAGACCAGAC CACCAATCCC CATTATATGC CCCACTGGGT GTACCTGTAA 1251 TTTGCACATC AATGACCTTG GCTTGACTGT CAACTGCAAA GAGCGAGGAT 1301 TTAATAACAT TTCTGAACTT CTTCCAAGGC CCTTGAATGC CAAGAAACTG 1351 TATCTGAGTA GCAATCTGAT TCAGAAAATA TACCGTTCTG ATTTTTGGAA 1401 TTTTTCTTCC TTGGATCTCT TGCATCTGGG GAACAATCGT ATTTCCTATG 1451 TCCAAGATGG GGCCTTTATC AACTTGCCCA ACTTAAAGAG CCTCTTCCTT 1501 AATGGCAACG ATATAGAGAA GCTGACACCA GGCATGTTCC GAGGCCTACA 1551 GAGTTTGCAC TACTTGTACT TTGAGTTCAA TGTCATCCGG GAAATCCAGC 1601 CTGCAGCCTT CAGCCTCATG CCCAACTTGA AGCTGCTATT CCTCAATAAT 1651 AACTTACTGA GGACTCTGCC AACAGACGCC TTTGCTGGCA CATCCCTGGC 1701 CCGGCTCAAC CTGAGGAAGA ACTACTTCCT CTATCTTCCC GTGGCTGGTG 1751 TCCTGGAACA CTTGAATGCC ATTGTCCAGA TAGACCTCAA TGAGAATCCT 1801 TGGGACTGCA CCTGTGACCT GGTCCCCTTT AAACAGTGGA TCGAAACCAT 1851 CAGCTCAGTC AGTGTGGTTG GTGATGTGCT TTGCAGGAGC CCTGAGAACC 1901 TCACGCACCG TGATGTGCGC ACTATTGAGC TGGAAGTTCT TTGCCCAGAG 1951 ATGCTGCACG TTGCACCAGC TGGAGAATCC CCAGCCCAGC CTGGAGATTC 2001 TCACCTTATT GGGGCACCAA CCAGTGCATC ACCTTATGAG TTTTCTCCTC 2051 CTGGGGGCCC TGTGCCACTT TCTGTGTTAA TTCTCAGCCT GCTGGTTCTG 2101 TTTTTCTCAG CAGTCTTTGT TGCTGCAGGC CTCTTTGCCT ACGTGCTCCG 2151 AAGGCGTCGA AAGAAGCTGC CCTTCAGAAG CAAGCGGCAG GAAGGTGTGG 2201 ACCTTACTGG CATCCAAATG CAATGCCACA GGCTGTTTGA GGATGGTGGA 2251 GGTGGTGGTG GCGGAAGTGG GGGTGGTGGT CGACCAACTC TTTCCTCTCC 2301 AGAGAAGGCC CCTCCCGTGG GTCATGTGTA TGAGTACATC CCCCACCCGG 2351 TTACCCAAAT GTGCAACAAC CCCATCTACA AGCCTCGTGA GGAGGAGGAG 2401 GTGGCTGTTT CATCAGCCCA AGAAGCAGGG AGTGCAGAAC GTGGGGGTCC 2451 AGGGACACAA CCACCGGGAA TGGGTGAGGC TCTCCTAGGA AGTGAGCAGT 2501 TTGCTGAGAC ACCCAAGGAG AACCATAGTA ACTACCGGAC CTTGCTGGAA 2551 AAAGAGAAGG AGTGGGCCCT AGCAGTGTCC AGCTCCCAGC TTAACACCAT 2601 AGTGACGGTG AATCACCATC ACCCTCACCA CCCAGCAGTT GGTGGGGTTT 2651 CAGGAGTAGT TGGGGGAACT GGGGGAGACT TGGCAGGGTT CCGCCACCAT 2701 GAGAAAAATG GTGGGGTGGT GCTGTTTCCT CCTGGGGGAG GCTGTGGTAG 2751 TGGCAGTATG CTACTAGATC GAGAGAGGCC ACAGCCTGCC CCCTGCACAG 2801 TGGGATTTGT GGACTGTCTC TATGGAACAG TGCCCAAATT AAAGGAACTG 2851 CACGTGCACC CTCCTGGCAT GCAATACCCA GACTTACAGC AGGATGCCAG 2901 GCTCAAAGAA ACCCTTCTCT TCTCGGCTGA AAAGGGCTTC ACAGACCACC 2951 AAACCCAAAA AAGTGATTAC CTCGAGTTAA GGGCCAAACT TCAAACCAAG 3001 CCGGATTACC TCGAAGTCCT GGAGAAGACA ACATACAGGT TCTAACAGAG 3051 AGAAGAAAAT ATATTAGTGC TTTTTTTTTT TCAAAAGAAA AGGAAAATAA 3101 AAGAAATATA TCCCTTGCTC CCTTTACACT TGTCCCAGTA ACTCCATCCT 3151 CACGATCTTT CCTACCCTGA ACAAAACTAA AACCGCATGA TAACTAGAGA 3201 ATACAGATGT ATGCTCTCCC CTCTCAGATG CGATTTGGAG GAAGGGCCAT 3251 ACTCAGATCA TTAATCAATG AAAGTGCCTT CGCAGACTTT TGCCAGCAAA 3301 TGTTATCATT ATTTTTTTAT ACTGAAACTT GAGACTTTGA CTGTGCCATG 3351 TATAAGATAT ACTGGGGATC ATTGTATGGA TCCTAATTAA GTAAAATTCA 3401 ATGTGTCTTT TTATTTTCAG TAACTATTTT TTTTATAGTT GTAGTTTTGA 3451 TTTAAAGGGG GGGAAACAAG TTGACATTTG TCATTTGTGG CTTTCTTTCT 3501 TATCATCATG GCACAGATTC TGTACATGTA TTAACAATGC AGTTTGCTGC 3551 ATGCCTGGAA ACTGCAAGAT GGGGAGGGAG GGGGCGGGTC TTGCTCGAAT 3601 GTTCTCACTC ACTATCTTGT CTCTCATACA CATATCCATC TGCAGACCAT 3651 GGTCCCTAAA CCTGCAGTTT CTTCCTGCTG GTGCAGGTAC ACACACACAC 3701 ACACACACAC ACACACACAC ACACACACAC TCCATACCAT TTCCATTCAA 3751 CTCAATCTGC TTAGTTCGGG TTTGATCCAT TTTTCTCCTC TCCATAGTTC 3801 TACCACCCTT CACTGCTAGT GTACAGCTCA CAGCATCTCT CCTACCACCC 3851 TGGGAAAAGT CACATTCTAG GCTGGATTCT ACTGAAGTAG AGGCCTGGTG 3901 GTTTAACAGC CGAAGACACA CGCCTAGGGA TGAGCACCCT CTTTGTGACA 3951 CTTCATCCTG ATGCCAAGAT TTTTTGGAGA ACATCTGCAC TTTCTTATAT 4001 ATATATATAA TATTAAAAAA AGCAACTGGG TATATATCAC TAACAGCTTT 4051 GTAGGAAGCA CTTTTAATGT TTTCTCTCTC ACACACAAAG ATTCAAAAGA 4101 AATGTGCATA TATTTATTCT GTAATTTCAG TGATTATAAA TTGTAAAGGT 4151 AATGTTTAAT CATTGTATAG TGATTATGCC TCTGGTATAG CTTTCCTAAT 4201 AAAAAGTTTT TGAAAAACAT // LOCUS AB020659 4467 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0852 protein, complete cds. ACCESSION AB020659 NID g4240192 VERSION AB020659.1 GI:4240192 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hk06101. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 5 (6), 355-364 (1998) MEDLINE 99156230 REFERENCE 2 (bases 1 to 4467) AUTHORS Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (02-DEC-1998) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4467 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hk06101" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 1365. .4277 /gene="KIAA0852" CDS 1365. .4277 /gene="KIAA0852" /codon_start=1 /product="KIAA0852 protein" /protein_id="BAA74875.1" /db_xref="PID:d1038609" /db_xref="PID:g4240193" /db_xref="GI:4240193" /translation="MLCFLDDGAGMDPSDAASVIQFGKSAKRTPESTQIGQYGNGLKS GSMRIGKDFILFTKKEDTMTCLFLSRTFHEEEGIDEVIVPLPTWNARTREPVTDNVEK FAIETELIYKYSPFRTEEEVMTQFMKIPGDSGTLVIIFNLKLMDNGEPELDIISNPRD IQMAETSPEGTKPERRSFRAYAAVLYIDPRMRIFIHGHKVQTKRLSCCLYKPRMYKYT SSRFKTRAEQEVKKAEHVARIAEEKAREAESKARTLEVRLGGDLTRDSRVMLRQVQNR AITLRREADVKKRIKEAKQRALKEPKELNFVFGVNIEHRDLDGMFIYNCSRLIKMYEK VGPQLEGGMACGGVVGVVDVPYLVLEPTHNKQDFADAKEYRHLLRAMGEHLAQYWKDI AIAQRGIIKFWDEFGYLSANWNQPPSSELRYKRRRAMEIPTTIQCDLCLKWRTLPFQL SSVEKDYPDTWVCSMNPDPEQDRCEASEQKQKVPLGTFRKDMKTQEEKQKQLTEKIRQ QQEKLEALQKTTPIRSQADLKKLPLEVTTRPSTEEPVRRPQRPRSPPLPAVIRNAPSR PPSLPTPRPASQPRKAPVISSTPKLPALAAREEASTSRLLQPPEAPRKPANTLVKTAS RPAPLVQQLSPSLLPNSKSPREVPSPKVIKTPVVKKTESPIKLSPATPSRKRSVAVSD EEEVEEEAERRKERCKRGRFVVKEEKKDSNELSDSAGEEDSADLKRAQKDKGLHVEVR VNREWYTGRVTAVEVGKHVVRWKVKFDYVPTDTTPRDRWVEKGSEDVRLMKPPSPEHQ SLDTQQEGGEEEVGPVAQQAIAVAEPSTSECLRIEPDTTALSTNHETIDLLVQILRNC LRYFLPPSFPISKKQLSAMNSDELISFPLKEYFKQYEVGLQNLCNSYQSRADSRAKAS EESLRTSERKLRETEEKLQKLRTNIVALLQKVQEDIDINTDDELDAYIEDLITKGD" BASE COUNT 1235 a 1051 c 1127 g 1054 t ORIGIN 1 CGAGACTCAG ATTGTCTGAA TTGAGCTATC GCAACTTAAT GCTAAAAGCT 51 CCTTAAAGCT ACAGATTTAT GACATAGTTC CTTCCAAAAT ATTACATCAT 101 AAATCATTGA GAAGATTAAA AAAAAACACT TGAAGAAATT GTAGTTTTAA 151 ACATCTCTGC ATATATTTTG GATAGCTACT AGGTTACTTT AACTGTCATT 201 AAGGAGCACA GACTTACTGA AGCTTTACTG GACAGAATCC TGGGAAATCG 251 ATATCATTAT AAGGTTATAT TTCCCAGTTA GCGGGTGAAG GGCTGGAGAC 301 CTTATTGCAG TCATGGCTTT CACAAATTAC AGCAGTCTGA ATCGAGCTCA 351 GCTAACCTTT GAATATCTGC ACACAAATTC GTAAGTATCC TCTAGGTGCC 401 ACTGAGGTAA CCAGTAACTC GTTCCTTGAT ATTATATGGA AATCGTTTCC 451 CCAGAAAATT TTGCTTTTTC ACTTTTTGAG ATGTATCCCA CTGGAGTGAA 501 ATGTGTCACT GGATATCTTG AGCTCTGTAT TGAAGAACTG AGATCAGTGA 551 AATACTTGTT GCTAATCCAG AAGAATCTGA TTTTTGTTTA TTGGATCAAA 601 ATTTTCTAAA TGCAAACTTT AGTTATTTGA AGTCAATATG TTGAGTTGTT 651 TCATTCAAGT GTTTATAGGA ATCCAACAAA TACTGCTCTA TTGGATCGCC 701 AAATGTTGGA CTATTTTAGT ATCAACCGTT TCCCCTCTGT AGTGACAACG 751 TCCTAAACAG TTAGGTTTAT AACAAGTGTT TACTTTCTAA CAAGAAAACA 801 GAAGACATTT AAATGACAAC TTTCAAGAAG AAAATTTTTA TTTTTTCAGA 851 AGTTGGCATT ATCTTCCTGG CAGATTGCTC ACATCCAATA TTATTTGTAT 901 ATGCTAAACA GGAAACGGCA ACTTGTTTAT ATCTCTATTT AGATAGTCTT 951 TCCCCAAAAT TTCCACAGAA ACATACAGTG TTCATGGTTC TTGAGTTCAT 1001 GAAGGAGTAA TCTAATCACT CCAACATGGT CTGGAATGTT TCAGGTTTAA 1051 TCCATATGCC CACTCTCTTG GAGGCTGTCC AGTAGCGTCA AAACTTTAGT 1101 GTTTTAATAC ATTCACCTGT TACTTTTGAG ATGAAGTTCA CCTTTCTTGG 1151 ATCACATGCA AAGGATGTTT AGGTCTGTGA AGAAAAGAAT TTCTAGGCCG 1201 GGTGCTGTGG CTCACGCCTG TAATCCCAGC ACTTTGGGAG GCCGAGAACC 1251 ACTCACGAAT TCTTGTTTGG TGCTCTTGCT GAACTGGTTG ATAATGCAAG 1301 AGATGCTGAT GCCACCAGAA TAGATATTTA TGCAGAAAGA CGAGAGGACC 1351 TTCGAGGAGG ATTTATGCTT TGCTTTTTGG ATGATGGAGC AGGAATGGAT 1401 CCAAGTGATG CTGCCAGTGT GATCCAGTTT GGGAAGTCGG CCAAGCGAAC 1451 ACCTGAGTCT ACTCAGATTG GGCAGTACGG GAATGGGTTA AAATCGGGCT 1501 CAATGCGCAT TGGGAAGGAT TTTATCCTGT TCACCAAGAA GGAAGACACC 1551 ATGACGTGCC TCTTCCTGTC TCGCACGTTT CATGAGGAAG AAGGCATTGA 1601 TGAAGTGATA GTCCCACTGC CCACCTGGAA TGCTCGGACC CGGGAACCTG 1651 TCACAGACAA TGTAGAGAAA TTTGCCATTG AGACAGAACT CATCTATAAG 1701 TACTCTCCAT TCCGCACTGA GGAGGAAGTG ATGACCCAGT TTATGAAGAT 1751 TCCTGGGGAC AGCGGAACAT TGGTGATCAT CTTCAATCTC AAACTCATGG 1801 ATAATGGAGA GCCAGAACTA GACATAATCT CAAATCCAAG AGATATCCAG 1851 ATGGCAGAGA CGTCCCCAGA GGGCACGAAG CCAGAGCGGC GCTCGTTCCG 1901 TGCCTATGCC GCTGTGCTCT ATATTGATCC CCGGATGAGG ATCTTCATCC 1951 ATGGGCACAA GGTGCAGACC AAGAGGCTCT CCTGCTGCCT GTACAAGCCC 2001 AGGATGTACA AGTACACGTC AAGCCGTTTC AAGACCCGTG CGGAGCAGGA 2051 GGTGAAGAAA GCAGAGCACG TAGCAAGGAT TGCTGAAGAG AAGGCGCGGG 2101 AGGCAGAGAG CAAAGCTCGG ACATTAGAAG TACGCCTAGG TGGAGACCTC 2151 ACGCGGGACT CCAGGGTGAT GTTGCGACAG GTCCAGAACA GAGCCATCAC 2201 TCTGCGCAGA GAAGCCGATG TCAAGAAGAG GATCAAGGAG GCCAAGCAGC 2251 GAGCACTTAA AGAACCTAAG GAACTGAATT TTGTTTTTGG TGTCAACATT 2301 GAACACCGGG ATCTGGATGG CATGTTCATC TACAACTGTA GCCGACTGAT 2351 CAAAATGTAT GAGAAAGTGG GCCCACAGCT GGAAGGGGGC ATGGCATGTG 2401 GCGGGGTTGT TGGGGTTGTT GATGTGCCCT ACCTGGTCCT GGAGCCTACA 2451 CACAACAAAC AGGACTTTGC TGATGCCAAG GAGTACCGGC ACCTGCTCCG 2501 AGCAATGGGG GAGCACCTGG CGCAGTATTG GAAGGATATT GCCATCGCCC 2551 AGAGGGGAAT CATCAAGTTC TGGGATGAGT TTGGCTACCT CTCTGCCAAC 2601 TGGAACCAGC CCCCGTCCAG TGAGCTGCGT TACAAACGCC GGAGAGCTAT 2651 GGAAATCCCC ACCACCATCC AGTGCGATTT GTGTCTGAAA TGGAGAACCC 2701 TCCCCTTCCA GCTGAGTTCT GTGGAAAAAG ATTACCCTGA CACCTGGGTT 2751 TGCTCCATGA ACCCTGATCC TGAACAGGAC CGGTGTGAGG CTTCTGAACA 2801 AAAGCAGAAG GTTCCCCTGG GAACATTCAG AAAGGACATG AAGACGCAGG 2851 AAGAGAAGCA GAAACAACTG ACAGAGAAAA TTCGCCAGCA GCAGGAGAAG 2901 CTGGAGGCCC TTCAGAAAAC CACACCCATC CGCTCCCAAG CAGACCTGAA 2951 GAAATTGCCC TTGGAAGTGA CCACCAGACC TTCCACTGAG GAACCTGTGC 3001 GTAGACCTCA GCGTCCTCGG TCGCCCCCTT TACCTGCTGT GATCAGGAAC 3051 GCCCCCAGCA GACCCCCTTC TTTGCCAACT CCTAGACCAG CCAGCCAGCC 3101 CCGAAAGGCT CCTGTCATCA GCAGTACCCC AAAGCTCCCT GCTTTGGCAG 3151 CCCGGGAGGA GGCCAGCACA TCTAGGCTGC TCCAGCCACC TGAGGCACCC 3201 CGAAAGCCTG CCAACACTCT CGTCAAGACT GCATCCCGAC CTGCCCCTCT 3251 GGTGCAGCAA CTGTCACCAT CTTTACTGCC CAACTCCAAG AGCCCTCGGG 3301 AGGTTCCTTC TCCCAAAGTC ATCAAGACTC CAGTGGTGAA GAAGACAGAG 3351 TCACCCATCA AACTCTCCCC GGCTACCCCT AGTCGGAAGC GGAGTGTCGC 3401 AGTTTCTGAT GAGGAAGAAG TTGAGGAGGA AGCTGAGAGG AGGAAGGAGA 3451 GGTGCAAGCG GGGCAGATTT GTTGTGAAGG AGGAAAAGAA GGACTCGAAT 3501 GAGCTCTCAG ACAGTGCTGG GGAAGAGGAC TCGGCTGACC TCAAGAGAGC 3551 TCAGAAAGAT AAAGGGCTGC ACGTGGAGGT GCGTGTGAAC AGGGAGTGGT 3601 ACACGGGCCG TGTCACAGCC GTGGAGGTGG GCAAGCATGT GGTGCGGTGG 3651 AAGGTGAAGT TTGACTACGT GCCCACAGAC ACGACACCAA GAGACCGCTG 3701 GGTGGAGAAA GGCAGTGAGG ATGTGCGGCT GATGAAACCC CCTTCTCCGG 3751 AACATCAGAG CCTTGATACA CAACAGGAGG GCGGGGAGGA GGAGGTGGGC 3801 CCTGTGGCCC AGCAGGCCAT AGCTGTCGCA GAGCCCTCCA CTTCCGAATG 3851 CCTCCGCATT GAGCCTGACA CCACTGCCCT GAGCACCAAT CACGAGACCA 3901 TCGACCTGCT TGTCCAGATC CTCCGGAATT GTTTACGGTA CTTCCTGCCT 3951 CCAAGTTTCC CCATCTCCAA GAAGCAGCTG AGTGCTATGA ATTCAGATGA 4001 GCTAATATCT TTTCCTCTGA AGGAGTACTT CAAGCAATAT GAAGTAGGGC 4051 TCCAAAACCT GTGCAATTCC TACCAGAGCC GTGCTGACTC CCGGGCCAAG 4101 GCCTCCGAGG AAAGCCTGCG CACCTCCGAG AGGAAGCTCC GCGAGACGGA 4151 GGAGAAGCTG CAGAAGCTGA GGACCAACAT CGTGGCACTC CTGCAAAAGG 4201 TGCAGGAGGA CATAGACATC AACACAGATG ATGAGCTGGA CGCCTACATT 4251 GAGGACCTCA TCACCAAGGG GGACTGAAGG CAGGAGAGAG AGCAGCTCCC 4301 CTGCCCACCT GCCCCTCAAC CCTGTAGCTG CAGGGGGAGG GGACTTCATT 4351 CATGGGTTGG TGGTCGCACC TTGGTTTGAC TTACACGGGA CATTTGTGTT 4401 TTTGGAGGAA AAGATACCCT GATTCTTTGA ATCTTCCTTA AGTTTATAAA 4451 TATTTATTTT TTAAAAG // LOCUS AB022663 3056 bp mRNA PRI 09-JUN-1999 DEFINITION Homo sapiens HFB30 mRNA, complete cds. ACCESSION AB022663 NID g5019617 VERSION AB022663.1 GI:5019617 KEYWORDS HFB30. SOURCE Homo sapiens fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ueki,N., Seki,N., Yano,K., Masuho,Y., Saito,T. and Muramatsu,M. TITLE Isolation and characterization of a novel human gene (HFB30) which encodes a protein with a RING finger motif JOURNAL Biochim. Biophys. Acta 1445 (2), 232-236 (1999) MEDLINE 99255429 REFERENCE 2 (bases 1 to 3056) AUTHORS Ueki,N., Seki,N., Yano,K., Masuho,Y. and Muramatsu,M. TITLE Direct Submission JOURNAL Submitted (19-JAN-1999) to the DDBJ/EMBL/GenBank databases. Nobuhide Ueki, Helix Research Institute, Third Department; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:mmasaaki@hri.co.jp, Tel:81-438-52-3951, Fax:81-438-52-3952) FEATURES Location/Qualifiers source 1. .3056 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /dev_stage="fetal" /tissue_type="brain" gene 237. .1661 /gene="HFB30" CDS 237. .1661 /gene="HFB30" /codon_start=1 /protein_id="BAA78677.1" /db_xref="PID:d1042449" /db_xref="PID:g5019618" /db_xref="GI:5019618" /translation="MSSEDREAQEDELLALASIYDGDEFRKAESVQGGETRIYLDLPQ NFKIFVSGNSNECLQNSGFEYTICFLPPLVLNFELPPDYPSSSPPSFTLSGKWLSPTQ LSALCKHLDNLWEEHRGSVVLFAWMQFLKEETLAYLNIVSPFELKIGSQKKVQRRTAQ ASPNTELDFGGAAGSDVDQEEIVDERAVQDVESLSNLIQEILDFDQAQQIKCFNSKLF LCSICFCEKLGSECMYFLECRHVYCKACLKDYFEIQIRDGQVQCLNCPEPKCPSVATP GQVKELVEAELFARYDRLLLQSSLDLMADVVYCPRPCCQLPVMQEPGCTMGICSSCNF AFCTLCRLTYHGVSPCKVTAEKLMDLRNEYLQADEANKRLLDQRYGKRVIQKALEEME SKEWLEKNSKSCPCCGTPIEKLDGCNKMTCTGCMQYFCWICMGSLSRANPYKHFNDPG SPCFNRLFYAVDVDDDIWEDEVED" BASE COUNT 848 a 579 c 673 g 956 t ORIGIN 1 GAAGAGGCGG CGCGCCGGTT GAGCAGGGCG TTTCTAGGCC TGGTCGGCTG 51 GCGGCGATGG CAGGATTTTC ATAATATATG TAGTATGAGT TCCACATCTT 101 GGCCTCTTAC CCAGCTTCAG CAGTCTCAGC TCCACCAGTT AGAGAATAAA 151 TGGGATTTGC ATGAACTCCA CTCTGAGCTG AAATACAGAC TGCGGTGTTA 201 ATATCCTTTT CTTCTGATTG TTATTAACAG GTCCTTATGT CGTCAGAAGA 251 TCGAGAAGCT CAGGAGGATG AATTGCTGGC CCTGGCAAGT ATTTACGATG 301 GAGATGAATT TAGAAAAGCA GAGTCTGTCC AAGGTGGAGA AACCAGGATC 351 TATTTGGATT TGCCACAGAA TTTCAAGATA TTTGTGAGCG GCAATTCAAA 401 TGAGTGTCTC CAGAATAGTG GCTTTGAATA CACCATTTGC TTTCTGCCTC 451 CACTTGTGCT GAACTTTGAA CTGCCACCAG ATTATCCATC CTCTTCCCCA 501 CCTTCATTCA CACTTAGTGG CAAATGGCTG TCACCAACTC AGCTATCTGC 551 TCTATGCAAG CACTTAGACA ACCTATGGGA AGAACACCGT GGCAGCGTGG 601 TCCTGTTTGC CTGGATGCAA TTTCTTAAGG AAGAGACCCT AGCATACTTG 651 AATATTGTCT CTCCTTTTGA GCTCAAGATT GGTTCTCAGA AAAAAGTGCA 701 GAGAAGGACA GCTCAAGCTT CTCCCAACAC AGAGCTAGAT TTTGGAGGAG 751 CTGCTGGATC TGATGTAGAC CAAGAGGAAA TTGTGGATGA GAGAGCAGTG 801 CAGGATGTGG AATCACTGTC AAATCTGATC CAGGAAATCT TGGACTTTGA 851 TCAAGCTCAG CAGATAAAAT GCTTTAATAG TAAATTGTTC CTGTGCAGTA 901 TCTGTTTCTG TGAGAAGCTG GGTAGTGAAT GCATGTACTT CTTGGAGTGC 951 AGGCATGTGT ACTGCAAAGC CTGTCTGAAG GACTACTTTG AAATCCAGAT 1001 CAGAGATGGC CAGGTTCAAT GCCTCAACTG CCCAGAACCA AAGTGCCCTT 1051 CGGTGGCCAC TCCTGGTCAG GTCAAAGAGT TAGTGGAAGC AGAGTTATTT 1101 GCCCGTTATG ACCGCCTTCT CCTCCAGTCC TCCTTGGACC TGATGGCAGA 1151 TGTGGTGTAC TGCCCCCGGC CGTGCTGCCA GCTGCCTGTG ATGCAGGAAC 1201 CTGGCTGCAC CATGGGTATC TGCTCCAGCT GCAATTTTGC CTTCTGTACT 1251 TTGTGCAGGT TGACCTACCA TGGGGTCTCC CCATGTAAGG TGACTGCAGA 1301 GAAATTAATG GACTTACGAA ATGAATACCT GCAAGCGGAT GAGGCTAATA 1351 AAAGACTTTT GGATCAAAGG TATGGTAAGA GAGTGATTCA GAAGGCACTG 1401 GAAGAGATGG AAAGTAAGGA GTGGCTAGAG AAGAACTCAA AGAGCTGCCC 1451 ATGTTGTGGA ACTCCCATAG AGAAATTAGA CGGATGTAAC AAGATGACAT 1501 GTACTGGCTG TATGCAATAT TTCTGTTGGA TTTGCATGGG TTCTCTCTCT 1551 AGAGCAAACC CTTACAAACA TTTCAATGAC CCTGGTTCAC CATGTTTTAA 1601 CCGGCTGTTT TATGCTGTGG ATGTTGACGA CGATATTTGG GAAGATGAGG 1651 TAGAAGACTA GTTAACTACT GCTCAAGATA TGGAAGTGGA TTGTTTTTCC 1701 CTAATCTTCC GTCAAGTACA CAAAGTAACT TTGCGGGATA TTTAGGGTAC 1751 TATTCATTCA CTCTTCCTGC GTAGAAGATA TGGAAGAACG AGGTTTATAT 1801 TTTCATGTGG TACTACTGAA GAAGGTGCAT TGATACATTT TTAAATGTAA 1851 GTTGAGAAAA ATTTATAAGC CAAAGGTTCA GAAAATTAAA CTACAGAATA 1901 TTAAATATTA TAATGTGCCC AAAGCTCTGA ATAGTTAAAA ATTAAATATT 1951 TATTTTCTTC CCCAAGCTTT AGGTAAGGAG AAGAGGGGTC AAGAGTTAAA 2001 CTTAGAGACC CTTTGTCTCT GAGAAGCATC CTTCTAAGAC ATTCTGTTGG 2051 AGTTCCCTCA GTACTATTCC TTACAACTGG AGTGGGTAGA AGCCTTATGA 2101 AAATTATACT GACAACCTGA TCTCGTTTAC TCCATGTTAA TCACATTCCT 2151 ACCAACCTAA ATTTCTGCCA AGTCAGTTCT CTTTGGAGGA AGCCCATTAC 2201 TCCATAACTG ACTGGATGGT CCAGTGTCAT TTTGATCTGC TTTTCAGAAT 2251 GGAAATTTAT AATATAAATA TATGTTTTAA TACCACAAAC ACTGTGGGGC 2301 ACTTGATATT TATTAGTGGT GTATATGGTC CACTGGTTTC AGGCCAAATC 2351 TAGAATTTAG TGATACTGGC TCAGAAGAAT TTAAGTTCTA TTCAGCTTTC 2401 CTGGGTGTTA GAACCTAGAT TCAAAATGGC TTGTCTTTGC TACTTTTGTT 2451 CCACATTCTC TCTCTTTACC TTTGCCCTAC CTTCTGTTTG TAAGGATGAT 2501 TTTAATATTA ATTCCAGGAA CACATGTTTT TATTTCCTCT GCTGGAATAG 2551 GAGAAAGCTT AATATATTCC CAAGCTGAAT GTTCTTGAAT TATATCTGCA 2601 TTTACCACTG TATATGCATA TAGTGACAGT ATAACCTGTC TATACTACAG 2651 TAGTTCCAGT CCTAACTTTC TGAATTATTT TTAAAGATCT TCTCTAACAA 2701 GCTATGGGAA TTTGGCTTCA TACTCTTTCT TTGCAACAGC AGTGTTCTGG 2751 GTGATAATTT TGAATTGATA CCTGTTCCTT TTTCTGGGTT TTGTTGGCTT 2801 TTTGAAAAAT TGTCTTTCCT TATCATTGGT GGGAGGCTTG GTAGCAAAGT 2851 AACATTTTTT GGAAAAGAGG ACAGAAAAAT TGAACTACAG CTTGAGAACG 2901 TATTCTTTTT TTCCTACTTT GTTATTGCAA ATTGAGGAAT CACTTTTAAC 2951 TGTTTTAGGT GTGTGTGTCC AGAGTGAGCA AGGATTATGT TTTTGGATTG 3001 TCAAAGAGGA TGCTTAGTCT TAAAATAAAA ATAAATTTAA AAATCATCTT 3051 ATAATT // LOCUS AB023021 3019 bp mRNA PRI 23-JUN-1999 DEFINITION Homo sapiens FUT9 mRNA for alpha-1,3-fucosyltransferase IX, complete cds. ACCESSION AB023021 NID g5139692 VERSION AB023021.1 GI:5139692 KEYWORDS alpha-1,3-fucosyltransferase IX; FUT9. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kaneko,M., Kudo,T., Iwasaki,H., Ikehara,Y., Nishihara,S., Nakagawa,S., Sasaki,K., Shiina,T., Inoko,H., Saitou,N. and Narimatsu,H. TITLE Alpha 1,3-fucosyltransferase IX (Fuc-TIX) is very highly conserved between human and mouse; molecular cloning, characterization and tissue distribution of human Fuc-TIX JOURNAL FEBS Lett. 453, 237-242 (1999) REFERENCE 2 (bases 1 to 3019) AUTHORS Kaneko,M., Kudo,T. and Narimatsu,H. TITLE Direct Submission JOURNAL Submitted (29-JAN-1999) to the DDBJ/EMBL/GenBank databases. Mika Kaneko, Institute of Life Science, Soka University, Division of Cell Biology; Soka University, 1-236 Tangi-cho, Hachioji, Tokyo 192-8577, Japan (E-mail:mika@t.soka.ac.jp, Tel:81-426-91-2495(ex.5132), Fax:81-426-91-9315) FEATURES Location/Qualifiers source 1. .3019 /organism="Homo sapiens" /db_xref="taxon:9606" gene 295. .1374 /gene="FUT9" CDS 295. .1374 /gene="FUT9" /codon_start=1 /product="alpha-1,3-fucosyltransferase IX" /protein_id="BAA81685.1" /db_xref="PID:d1045487" /db_xref="PID:g5139693" /db_xref="GI:5139693" /translation="MTSTSKGILRPFLIVCIILGCFMACLLIYIKPTNSWIFSPMESA SSVLKMKNFFSTKTDYFNETTILVWVWPFGQTFDLTSCQAMFNIQGCHLTTDRSLYNK SHAVLIHHRDISWDLTNLPQQARPPFQKWIWMNLESPTHTPQKSGIEHLFNLTLTYRR DSDIQVPYGFLTVSTNPFVFEVPSKEKLVCWVVSNWNPEHARVKYYNELSKSIEIHTY GQAFGEYVNDKNLIPTISACKFYLSFENSIHKDYITEKLYNAFLAGSVPVVLGPSREN YENYIPADSFIHVEDYNSPSELAKYLKEVDKNNKLYLSYFNWRKDFTVNLPRFWESHA CLACDHVKRHQEYKSVGNLEKWFWN" BASE COUNT 937 a 594 c 541 g 947 t ORIGIN 1 GCGCGCGCGG CGCAGCAGCT CCAGATTCAC TGCTCTCCCC TGCAGCTCCC 51 CGCGCCCCCG CCGCTGTCGC TGCCTCGGTG TCCCCCAGCC CCAGTCGCGC 101 TCTTAGGACA GCGCCGCCAC CGCCGCCTGG CCCTGCCTGC CTCCTGCGCC 151 GCGCAGCCCT CGCGAGCGCC CCGGATGGCG CTTTACCCCT AGGACCGATT 201 TAGAATGTAA TAACTCAAGG ATTTGATAAT ACAGTGAAGT AGTATAACAA 251 CTGTCTACGT GCTTCCCATG ATATGTTCTC TATATTGAAA AATTATGACA 301 TCAACATCCA AAGGAATTCT TCGCCCATTT TTAATTGTCT GCATTATCCT 351 GGGCTGTTTC ATGGCATGTC TTCTCATTTA CATCAAACCT ACCAACAGCT 401 GGATCTTCAG TCCAATGGAA TCAGCCAGCT CTGTGCTGAA AATGAAAAAC 451 TTCTTTTCCA CCAAAACTGA TTATTTTAAT GAAACTACTA TTCTGGTGTG 501 GGTGTGGCCA TTTGGGCAGA CCTTTGACCT TACATCCTGC CAAGCAATGT 551 TCAACATCCA AGGATGCCAT CTCACAACGG ACCGTTCACT GTACAACAAA 601 TCCCATGCAG TTCTGATCCA TCACCGAGAC ATCAGTTGGG ATCTGACAAA 651 TTTACCTCAG CAAGCTAGGC CACCCTTCCA GAAATGGATT TGGATGAATT 701 TGGAATCACC AACTCACACT CCCCAAAAGA GTGGCATTGA GCACTTGTTT 751 AACCTGACTC TGACTTACCG CCGTGATTCA GATATCCAAG TGCCTTATGG 801 CTTCTTGACG GTAAGCACAA ATCCCTTCGT GTTTGAAGTG CCAAGCAAAG 851 AGAAATTGGT GTGCTGGGTT GTGAGTAACT GGAACCCTGA GCATGCCAGA 901 GTCAAGTATT ACAATGAGCT AAGCAAAAGC ATTGAAATCC ATACCTACGG 951 GCAAGCATTT GGAGAATATG TCAATGATAA AAATTTGATT CCTACCATAT 1001 CTGCTTGTAA ATTTTATCTT TCCTTTGAAA ATTCAATCCA CAAGGATTAC 1051 ATCACGGAAA AGCTATACAA TGCTTTTCTG GCTGGCTCTG TACCTGTTGT 1101 TCTGGGACCA TCTAGGGAAA ACTATGAGAA TTATATTCCA GCAGATTCAT 1151 TCATTCATGT GGAAGATTAT AACTCTCCCA GTGAGCTAGC AAAGTATCTG 1201 AAGGAAGTCG ACAAAAACAA TAAGTTATAC CTTAGTTACT TTAACTGGAG 1251 GAAGGATTTC ACTGTAAATC TTCCACGATT TTGGGAATCA CATGCATGTT 1301 TGGCTTGCGA TCATGTGAAA AGGCATCAAG AATATAAGTC TGTTGGTAAT 1351 TTAGAGAAAT GGTTTTGGAA TTAAAATTTT TCATCACTTG CACACTTGAT 1401 AAATATTTTG ATGAGATATC ATCCAAGTAT TGAGGATAAG AAGAGATGCA 1451 ACATACTACT TTTGTGTCAC AATTTATTTT TATCACCCTC TCTAGGGTAA 1501 CGTGTATATT TTGGTGGAGA TTTTTAAAAG CTCAGCATGA GCAATCATTC 1551 CATTCGGTTT TAAATTATCC TGTATATACC TAATTATGTG CACTGGAGAG 1601 TAATTTATTC TTCATTATCA TTTGTAAACA TTGCTTTTTC ACATTTTTGT 1651 AGTTGTCCAT AATGTAAGCT TGTGGTTTGA TTATTGTTTC CACACTGATC 1701 AGCTGTTTAA TCTATTTGGG AAATGAAGAT GCACATCTTA AAGTATGAAA 1751 AATTTTCACT AAGTATTACA ATGTCTAGTT CCAACTTTGC ATACTATAAC 1801 AGAGGAAGAA CATGTTGCGA TTGAATTCTA ACCTCTTTGA CTCCTAAGAT 1851 GAATGAAGTG TATAACTGTC TCTATTTGAT CTATTTTTTT TACCTGTTTA 1901 TCACATTTGT GAAGGTGAAA TTATTCATGG AGTGAATAAG AAAGATATGA 1951 AGCAGAACTG TTCTATTCAG GAAGCTATTA GACTTCTCAT TTATTTTCAT 2001 TAAGCTGATT TGCAGCTACT TATTCTCATG GTCTTAAATT AAATTATTCA 2051 AGTATTTTTA AATATCCAAT TTGTTGTGAT TTTCAGCACC TGGGAAGTAA 2101 TCCCAATAAT ACTTTAGAAA ATCTAAGACA GTTCTTTCTG CTACTGATGA 2151 CACTCATTGT CATAATAAAA CAAATAATTT CCTCAAATAA CAAAGAAAAA 2201 TGATACCTAT AAATATATTT ATAAATGGTG TCATTTATGA ACAATGTTTA 2251 ATTATGTATC AATTTAAGAT TTTTTTCTGA AGCCCTAATA TTTAAAATGG 2301 CCTTATTTTA CCATATGGAT ATAAGATTTG GCTCATAATG ATGAGCCCTA 2351 TCATTTGATT TGAGTTCTAT CATTTAAGAG AGCCTAAATA AAATTATCAT 2401 CAAGGTATTA AATATAAGAC GTTAAATATA ATAAAGTGGG GATATATAGA 2451 AAACACACAG TGTTAGCACA GAGTAAGATC TCAATGCACA TTTGTTGGAT 2501 GAATAAATAA ATGCAATTGA ATTCCCAGAA AAATGATTGT TTCAAGGAAG 2551 TGACAGTTCT ACTTTAGAAG TACTAATTGG AGATGACTTT TATATCCCAT 2601 TTTGGTAATT ATTCATACAT AGCACATATG ACCATGATGT TCAGGGCTTT 2651 ATAGAACCAA ATAAACCTAC CATTACATGA AAATTTTGAA GAGTATATTC 2701 CTGAACTTGC AGCTGCCTAT AGCATTCTCC TCCATAAGGC GAGAGGAGAG 2751 CCGGATATAA GCAGAGATCG TAGGTGAAAA AAGATGGGCC ATTAAGGATT 2801 CTTCAAGGAA AACATTCCTC AAAATGATCT TTAGTGCCTT TATTCCTTTC 2851 AGGCCAATAA TCCACCCAAA AATTAGAATG TTCTGAGACA TCCATTTGGG 2901 TCTATTTGTG ACTTATTCGT ATTACAAATT GACATCAGCA AAATTCAAAA 2951 AGAATTCAGT AAAAAGGGCA TAGATCCAGC TATCCAGACT ATCTTGTGAG 3001 AGAATAAGCT CACCCCAGC // LOCUS AB023155 5574 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0938 protein, complete cds. ACCESSION AB023155 NID g4589519 VERSION AB023155.1 GI:4589519 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hh04777. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 5574) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .5574 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hh04777" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 115. .3081 /gene="KIAA0938" CDS 115. .3081 /gene="KIAA0938" /codon_start=1 /product="KIAA0938 protein" /protein_id="BAA76782.1" /db_xref="PID:d1040535" /db_xref="PID:g4589520" /db_xref="GI:4589520" /translation="MCVTKKLFFIVQRTIFVGCVIWKFCLHYVLRGFLCFNSMQLDRN TLPKKGLRYTPSSRQANQEEGKEWLRSHSTGGLQDTGNQSPLVSPSAMSSSAAGKYHF SNLVSPTNLSQFNLPGPSMMRSNSIPAQDSSFDLYDDSQLCGSATSLEERPRAISHSG SFRDSMEEVHGSSLSLVSSTSSLYSTAEEKAHSEQIHKLRRELVASQEKVATLTSQLS ANAHLVAAFEKSLGNMTGRLQSLTMTAEQKESELIELRETIEMLKAQNSAAQAAIQGA LNGPDHPPKDLRIRRQHSSESVSSINSATSHSSIGSGNDADSKKKKKKNWVNSRGSEL RSSFKQAFGKKKSTKPPSSHSDIEELTDSSLPASPKLPHNAGDCGSASMKPSQSASAI CECTEAEAEIILQLKSELREKELKLTDIRLEALSSAHHLDQIREAMNRMQNEIEILKA ENDRLKAETGNTAKPTRPPSESSSSTSSSSSRQSLGLSLNNLNITEAVSSDILLDDAG DATGHKDGRSVKIIVSISKGYGRAKDQKSQAYLIGSIGVSGKTKWDVLDGVIRRLFKE YVFRIDTSTSLGLSSDCIASYCIGDLIRSHNLEVPELLPCGYLVGDNNIITVNLKGVE ENSLDSFVFDTLIPKPITQRYFNLLMEHHRIILSGPSGTGKTYLANKLAEYVITKSGR KKTEDAIATFNVDHKSSKELQQYLANLAEQCSADNNGVELPVVIILDNLHHVGSLSDI FNGFLNCKYNKCPYIIGTMNQGVSSSPNLELHHNFRWVLCANHTEPVKGFLGRYLRRK LIEIEIERNIRNNDLVKIIDWIPKTWHHLNSFLETHSSSDVTIGPRLFLPCPMDVEGS RVWFMDLWNYSLVPYILEAVREGLQMYGKRTPWEDPSKWVLDTYPWSSATLPQESPAL LQLRPEDVGYESCTSTKEATTSKHIPQTDTEGDPLMNMLMKLQEAANYSSTQSCDSES TSHHEDILDSSLESTL" BASE COUNT 1690 a 1145 c 1128 g 1611 t ORIGIN 1 CTATCACTAA ACTGTCATTG AATTGTACTG CATTAGAAAG GAACTCAAAT 51 ATGTGTGACG GCAATGGACA TCTTGTCACC TTTAGTTGGC CTTTTTCAAT 101 GAGTTAAGCA TTATATGTGT GTTACCAAAA AATTATTTTT TATAGTTCAG 151 AGAACCATTT TTGTTGGATG TGTAATTTGG AAGTTTTGTT TACATTATGT 201 CCTTAGGGGT TTTCTTTGTT TTAACAGCAT GCAGCTTGAC AGAAATACAC 251 TACCCAAAAA GGGACTAAGA TATACCCCAT CATCTCGGCA GGCCAACCAA 301 GAAGAGGGCA AAGAGTGGTT GCGTTCTCAT TCTACTGGAG GGCTTCAGGA 351 CACTGGCAAC CAGTCACCTC TGGTTTCCCC TTCTGCCATG TCATCTTCTG 401 CAGCTGGAAA ATACCACTTT TCTAACTTGG TGAGCCCAAC AAATTTGTCT 451 CAATTTAACC TTCCCGGGCC CAGCATGATG CGCTCAAACA GCATCCCAGC 501 CCAAGACTCT TCCTTCGATC TCTATGATGA CTCCCAGCTT TGTGGGAGTG 551 CCACTTCTCT GGAGGAAAGA CCTCGTGCCA TCAGTCATTC GGGCTCATTC 601 AGAGACAGCA TGGAAGAAGT TCATGGCTCT TCATTATCAC TGGTGTCCAG 651 CACTTCTTCT CTTTACTCTA CAGCTGAAGA AAAGGCTCAT TCAGAGCAAA 701 TCCATAAACT GCGGAGAGAG CTGGTTGCAT CACAAGAAAA AGTTGCTACC 751 CTCACATCTC AGCTTTCAGC AAATGCTCAC CTTGTAGCAG CTTTTGAAAA 801 GAGCTTAGGG AATATGACTG GCCGATTGCA AAGTCTAACT ATGACAGCGG 851 AACAAAAGGA ATCTGAACTT ATAGAACTAA GAGAAACCAT TGAAATGCTG 901 AAGGCTCAGA ATTCTGCTGC CCAGGCGGCT ATTCAGGGAG CACTGAATGG 951 TCCAGACCAT CCTCCCAAAG ATCTTCGCAT CAGAAGACAG CATTCCTCTG 1001 AAAGTGTTTC TAGTATCAAC AGTGCCACAA GCCATTCCAG TATTGGCAGT 1051 GGTAATGATG CCGACTCCAA GAAGAAGAAA AAGAAAAACT GGGTGAACTC 1101 TAGAGGAAGT GAGCTGAGAA GTTCTTTCAA ACAAGCCTTT GGGAAGAAAA 1151 AGTCCACCAA GCCTCCTTCA TCACATTCTG ACATTGAAGA GCTTACTGAT 1201 TCATCCCTTC CGGCATCCCC CAAGTTACCC CATAATGCTG GTGACTGTGG 1251 CTCAGCATCC ATGAAGCCCT CACAATCTGC TTCAGCGATC TGTGAATGCA 1301 CAGAAGCTGA GGCAGAGATA ATTCTGCAGC TGAAGAGCGA GCTCAGAGAA 1351 AAGGAATTAA AATTAACGGA TATTCGGCTG GAGGCCCTCA GCTCTGCTCA 1401 TCATCTTGAT CAGATCCGGG AAGCCATGAA CCGGATGCAG AATGAAATTG 1451 AAATACTGAA AGCTGAAAAT GACCGGTTGA AGGCAGAAAC TGGTAACACA 1501 GCTAAGCCTA CTCGGCCACC GTCAGAATCC TCAAGCAGCA CCTCCTCTTC 1551 ATCTTCCAGG CAGTCATTAG GACTTTCTCT AAACAATTTG AACATCACAG 1601 AGGCTGTTAG CTCAGATATT TTGCTAGATG ATGCTGGTGA TGCAACTGGA 1651 CATAAAGATG GCCGCAGTGT GAAAATTATA GTCTCCATAA GCAAGGGCTA 1701 TGGTCGAGCA AAGGACCAAA AATCTCAGGC ATATTTGATA GGATCCATTG 1751 GTGTTAGTGG AAAAACCAAG TGGGATGTCT TAGATGGTGT AATAAGACGT 1801 CTCTTTAAGG AATATGTATT CCGAATTGAT ACATCCACTA GCCTTGGTCT 1851 GAGCTCTGAC TGCATTGCTA GCTACTGTAT AGGAGACTTA ATTAGATCCC 1901 ATAACCTAGA AGTGCCTGAA TTGCTGCCTT GTGGATACCT TGTTGGAGAT 1951 AATAACATCA TCACTGTGAA CCTCAAAGGG GTAGAAGAAA ATAGTTTGGA 2001 CAGTTTTGTT TTTGATACGC TGATTCCTAA ACCAATTACC CAAAGGTACT 2051 TTAACTTGTT GATGGAGCAT CACAGAATTA TACTCTCAGG ACCGAGTGGT 2101 ACTGGAAAGA CCTATTTGGC AAACAAACTT GCTGAATATG TAATAACCAA 2151 ATCTGGAAGG AAAAAAACAG AGGATGCAAT TGCCACTTTT AATGTGGACC 2201 ACAAGTCAAG TAAGGAATTG CAACAATATC TAGCTAACCT GGCTGAACAG 2251 TGCAGTGCTG ATAATAATGG AGTGGAGCTC CCAGTTGTAA TAATTCTTGA 2301 TAATCTTCAT CATGTGGGCT CTCTGAGTGA TATCTTCAAT GGTTTTCTCA 2351 ATTGTAAATA CAACAAATGT CCATATATTA TTGGAACAAT GAATCAGGGA 2401 GTTTCTTCAT CACCAAATCT AGAGCTGCAT CACAATTTCA GGTGGGTATT 2451 ATGTGCAAAT CATACAGAAC CAGTGAAAGG CTTTTTAGGC AGATATCTTC 2501 GAAGAAAACT CATAGAGATA GAAATTGAAA GGAACATTCG CAATAATGAC 2551 CTAGTCAAAA TTATAGATTG GATTCCGAAG ACGTGGCATC ATCTCAACAG 2601 TTTTTTGGAA ACACACAGTT CTTCTGACGT TACCATTGGT CCCCGACTAT 2651 TCCTTCCTTG CCCCATGGAT GTAGAAGGTT CTAGAGTATG GTTCATGGAT 2701 CTCTGGAACT ATTCTTTAGT ACCTTATATT CTGGAGGCAG TGAGAGAGGG 2751 TCTTCAGATG TATGGGAAAC GCACACCATG GGAAGATCCT TCAAAGTGGG 2801 TGCTTGACAC ATATCCATGG AGCTCAGCAA CTCTGCCTCA GGAGAGCCCA 2851 GCCTTACTTC AGCTGCGACC AGAAGATGTT GGGTATGAAA GCTGCACATC 2901 CACTAAGGAA GCCACAACCT CAAAGCACAT TCCGCAAACT GACACAGAAG 2951 GAGATCCCCT GATGAATATG CTAATGAAAC TCCAAGAAGC AGCCAATTAC 3001 TCGAGCACAC AAAGCTGCGA CAGCGAAAGC ACCAGCCACC ATGAAGACAT 3051 TTTGGATTCA TCTCTTGAAT CTACCCTCTA GAGGGTGAAA AAAGTTAAGG 3101 GAAAAGACTT TGCTTTTAAA AAAATGTTTC AAAAGAAAGG TATTTTCACT 3151 AAACCACTGC CAGTATAAAA GCACCCTGTC AAGGGCCCTG ACCCAGAGTT 3201 GTGGTCTCCA AGGAGGCAGC AGAACTAAGT CTGAACCGCC AAGATGCTAA 3251 ATTGCAATGG AAGCTTAACT TTAGTTTATT TCTAAGCATT TTTTATATCT 3301 GTGGAGTAAT AGAAAGCTCC ATTACTCAAC TGGAAAGGAC CCTAATGACA 3351 GGGCAACTGA ACAGATTGCA CATGGGATAG CCAAACTGGA CTTTCTTTGT 3401 TTCCTCTTTA AAAGTTTACA ATGCAGACCA TTTTTTGTCC CTTCCTTTTG 3451 TTTCCTCTGA GGGGCTGTTC GCCCCAGGCA GGGTCCATCT TTCTGATCTG 3501 TCCAACCTCC TTTGTGCCAC ACGGTGCTGG TCACAGGGCT TCAGTAGTGT 3551 TTGTGTTGTG CGCTCACCCC ATTCCAGAAC AAATCCAAGA GGCCAGTCCT 3601 CCATAAGCAC AAATGGAATT GTGCAACCAC CAGAAAAACA CTACTGTGGC 3651 AAACTGGAGA AGTGCCAATT TAATTCTAAC TGCCACGTTC TCATGATGTG 3701 CTCCACCAAC TTTTTAGTAT ATGAGTCACT GGTTTTATAA GGTTGTTTTT 3751 ACCACAGTGG TCTTTTTAAA CCACCTGCCC ACTCCCTTAA CAAGAGTTTT 3801 ATACCAATTA TTAGTCAACA CTGATAAAAG GCTTTTTTAG GGCTTTATTT 3851 GTTTGAGCCT TTTCAGTGAA AGAAGGAACA TTTCCTATGG TGCTGTCTCA 3901 CTGCCTTAAA ACAGATTTCT ATGACAGTTT AACAGTTGGT TTAAATCCTA 3951 AACCATTGGT AATTTCCACT GTCTTTTCAT TTACAACCAA GCAACACCAG 4001 TTAACATAGT AGCCTCATCT CTATATATCT TTCTCTTTTT TTTTTTTTTT 4051 TGAAGAAATG GATAGGAGAA AGATCAGTAT TTTTAGCCTT GTGAATAGAT 4101 CGCTTTGCCT ATCCTCCAAA ATATTAAAAT AACCCAGAAA TGCTCTTTGA 4151 CCGTCACTTA AAACCTAAGA CATGTGGCGA AATTCCATCC AGTTCTAAGT 4201 GAAAGAGTTT CAGAAGGCAG GAGATTTTGA ATTATTATCC AGCAGGGCTG 4251 GAAGCACTAG ATGCAGCATG AGCACAACTA TTCGGCTTTC CTTCCCTATT 4301 GTTTTTGTTT TTTTAATGAG TTTTGACGCA TGTTGTTTTG ATTGCTATTG 4351 TTGTACATGA GAAATTCAGC ATTAAAGAAC ACTGAAGCGG TAAGGTCACT 4401 GTGGAAGAGG AAGCGTTTAT ACTGTAAAAG AAGGTTAGAT TTGCACAGTC 4451 TACTGGGTAG GTATTGTAAA TAATAATTTT TAAAACTTGC ACAAATCAAA 4501 ACAAACACAA ACAAAATTGT ATTTTATCCT ATTGGTGTTA AGAGGTGTTT 4551 CACTTGCTGA GATTTCCTGT ACATTGCAAA CAAATACAGA ATGCAAACCC 4601 TCAAAGCTGT ATTATCTGGT GTGTTTGTCC TGTATTTACA GTTGTTTTTG 4651 ACTATGCAGG AGCTATCAGT GCTAGAGTGA GCATGCTTCA AAACTGTACA 4701 TGAAGCCAAT ATATTTTTGG ATAAGTAAAA CTGTCTGAAA GTACATCTGT 4751 CATGGCAGGC TTTAAAGAGA GTGCATGAAA ACTGATCAGT CATTGGAGAA 4801 GTTACCACCA CACACAAAGG ACAGGTTTTA AGTTTATGAA ACCCAAGGGC 4851 TAGGCCATGG TATAGACTTC TTCTATGAGT GTGTGAAAAT GTGTTACTTT 4901 TAGGACGTGT ATTTGGTGCT ACTCTCTGTG ACCACCAATG GGTCAGTTGC 4951 TATAGAACAA CAACACCACG AAACATCTGT GCAGTTTTCA GAGTGTCACA 5001 AAGTCAATAG GTCCTTACAC GGTGCTATTG CCCTAAGGGA AATCCGAACT 5051 GAATTTATGC ACATAGAATT GTCACCCTGA CTTTGAAGCC TCAAACATGG 5101 ATCAAATCTG TTGTGAAACA TCAATATATG TAGCTGGATG AGTGACTAGT 5151 TTCCCTTGTA TAATATGTGA TCTAAGAAAA TTGCTAATCT TTCCCTGCCA 5201 TTTTGAGAAA CACAGTCCAA ACATGAGCAT AAACAGAATT TCCTGCAATA 5251 CATCCCAGTA GGTCCACCTA GTTTACAACT TAAACTAGTT TGTGAAACAT 5301 TTGTCTGTAT ACATTTTATA TTTTGTACAT TTTGATGTAA CATATCATGT 5351 AAATAGGCAG AAACAGTGAA ATAAATCATC TGAAAAGTTT TGTAGTCTTT 5401 GTAAAGCCCC AACAATAAGT ACTTGGTGTC AATGGACTTA ACTGGATGAT 5451 GTATTTTCTA TTGGTTTATT GTTCCTCTAG CTTGTAAACC AGCTTGCATA 5501 TATTTTTTTG CAAATGTGCA CCCTGTATCT GTCTAAATTA TTACTTTGCC 5551 ATTAAAGTGG AATTATTTAT TGAC // LOCUS AB023183 4924 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0966 protein, complete cds. ACCESSION AB023183 NID g4589575 VERSION AB023183.1 GI:4589575 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj06369. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 4924) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4924 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hj06369" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 167. .3565 /gene="KIAA0966" CDS 167. .3565 /gene="KIAA0966" /codon_start=1 /product="KIAA0966 protein" /protein_id="BAA76810.1" /db_xref="PID:d1040563" /db_xref="PID:g4589576" /db_xref="GI:4589576" /translation="MELFQAKDHYILQQGERALWCSRRDGGLQLRPATDLLLAWNPIC LGLVEGVIGKIQLHSDLPWWLILIRQKALVGKLPGDHEVCKVTKIAVLSLSEMEPQDL ELELCKKHHFGINKPEKIIPSPDDSKFLLKTFTHIKSNVSAPNKKKVKESKEKEKLER RLLEELLKMFMDSESFYYSLTYDLTNSVQRQSTGERDGRPLWQKVDDRFFWNKYMIQD LTEIGTPDVDFWIIPMIQGFVQIEELVVNYTESSDDEKSSPETPPQESTCVDDIHPRF LVALISRRSRHRAGMRYKRRGVDKNGNVANYVETEQLIHVHNHTLSFVQTRGSVPVFW SQVGYRYNPRPRLDRSEKETVAYFCAHFEEQLNIYKKQVIINLVDQAGREKIIGDAYL KQVLLFNNSHLTYVSFDFHEHCRGMKFENVQTLTDAIYDIILDMKWCWVDEAGVICKQ EGIFRVNCMDCLDRTNVVQAAIARVVMEQQLKKLGVMPPEQPLPVKCNRIYQIMWANN GDSISRQYAGTAALKGDFTRTGERKLAGVMKDGVNSANRYYLNRFKDAYRQAVIDLMQ GIPVTEDLYSIFTKEKEHEALHKENQRSHQELISQLLQSYMKLLLPDDEKFHGGWALI DCDPSLIDATHRDVDVLLLLSNSAYYVAYYDDEVDKVNQYQRLSLENLEKIEIGPEPT LFGKPKFSCMRLHYRYKEASGYFHTLRAVMRNPEEDGKDTLQCIAEMLQITKQAMGSD LPIIEKKLERKSSKPHEDIIGIRSQNQGSLAQGKNFLMSKFSSLNQKVKQTKSNVNIG NLRKLGNFTKPEMKVNFLKPNLKVNLWKSDSSLETMENTGVMDKVQAESDGDMSSDND SYHSDEFLTNSKSDEDRQLANSLESVGPIDYVLPSCGIIASAPRLGSRSQSLSSTDSS VHAPSEITVAHGSGLGKGQESPLKKSPSAGDVHILTGFAKPMDIYCHRFVQDAQNKVT HLSETRSVSQQASQERNQMTNQVSNETQSESTEQTPSRPSQLDVSLSATGPQFLSVEP AHSVASQKTPTSASSMLELETGLHVTPSPSESSSSRAVSPFAKIRSSMVQVASITQAG LTHGINFAVSKVQKSPPEPEIINQVQQNELKKMFIQCQTRIIQI" BASE COUNT 1427 a 1044 c 1115 g 1338 t ORIGIN 1 TCTCCTCCTA CCGGTCGGGT GCCCCGGGGC GTTCCCTCTG CCGCTGCTTC 51 TCGGCGCGGT TCCTACCCGG CCGCTCCCCG AGGCGCGGGC TCTGGCGGCC 101 TCGACCGACT AGGACGCCCC GTGCGCCGCC CGCGGGCCGC CGCCTCCCTG 151 GGCGCGCGGG GCCAGCATGG AGCTCTTCCA AGCCAAGGAC CACTACATCC 201 TGCAGCAGGG CGAGCGCGCG CTGTGGTGCA GCCGCCGCGA CGGCGGCCTC 251 CAGCTCCGAC CCGCTACTGA TCTACTTCTT GCCTGGAATC CCATTTGTTT 301 GGGGTTGGTA GAAGGTGTTA TTGGGAAAAT TCAACTTCAT TCAGATCTTC 351 CATGGTGGCT TATTCTAATT CGGCAGAAAG CATTGGTGGG CAAACTCCCA 401 GGAGACCATG AGGTCTGTAA AGTTACCAAA ATTGCTGTGC TCTCACTTTC 451 TGAAATGGAA CCTCAGGATC TTGAGCTAGA GCTCTGTAAG AAGCATCATT 501 TTGGTATTAA CAAACCAGAG AAGATCATAC CATCTCCTGA TGACTCAAAG 551 TTTCTACTGA AGACCTTTAC GCATATTAAA TCCAATGTGT CTGCTCCTAA 601 TAAAAAGAAA GTTAAGGAAA GTAAAGAGAA GGAGAAGTTG GAGAGGAGAT 651 TACTTGAAGA GTTGCTGAAG ATGTTCATGG ACTCAGAATC CTTTTATTAT 701 AGCTTGACCT ATGACCTGAC CAATTCCGTG CAGAGGCAGA GCACTGGGGA 751 GAGGGACGGT CGGCCCCTCT GGCAGAAGGT TGATGACCGA TTTTTTTGGA 801 ATAAATACAT GATACAAGAT CTTACTGAGA TTGGTACTCC AGATGTGGAC 851 TTTTGGATTA TCCCCATGAT CCAAGGTTTT GTGCAGATTG AAGAACTTGT 901 GGTTAATTAT ACCGAATCAT CTGATGATGA GAAAAGCAGC CCAGAGACCC 951 CCCCTCAGGA GTCCACCTGT GTAGATGATA TTCACCCACG ATTTCTAGTG 1001 GCTCTCATTT CACGCCGAAG TAGGCACAGA GCAGGAATGC GCTATAAACG 1051 AAGAGGAGTG GATAAAAATG GAAATGTTGC CAATTATGTG GAGACTGAGC 1101 AGTTGATTCA TGTTCATAAT CATACCCTGT CATTTGTTCA AACACGAGGC 1151 TCTGTGCCTG TCTTTTGGAG CCAGGTTGGG TATCGATATA ACCCAAGACC 1201 GCGGCTGGAC AGAAGTGAAA AGGAAACTGT TGCCTATTTC TGTGCCCATT 1251 TCGAAGAACA ACTGAACATT TACAAAAAAC AGGTTATTAT TAACTTGGTA 1301 GACCAGGCAG GAAGAGAGAA GATTATTGGC GATGCTTACC TGAAGCAAGT 1351 GTTGCTTTTC AACAACTCAC ACCTCACTTA CGTTTCGTTT GACTTCCATG 1401 AGCACTGCCG AGGAATGAAG TTTGAGAATG TTCAGACACT AACAGATGCC 1451 ATTTATGACA TTATTCTTGA TATGAAGTGG TGTTGGGTTG ATGAAGCTGG 1501 GGTAATATGT AAGCAGGAAG GGATTTTTCG TGTTAATTGT ATGGACTGCC 1551 TGGATCGCAC CAACGTGGTC CAAGCTGCCA TCGCGAGAGT GGTCATGGAA 1601 CAGCAGCTGA AAAAATTAGG TGTGATGCCC CCGGAACAGC CATTACCTGT 1651 GAAATGTAAT CGCATCTACC AGATAATGTG GGCCAATAAT GGTGACTCCA 1701 TTAGCAGACA GTATGCTGGG ACAGCTGCTC TGAAGGGTGA CTTTACAAGG 1751 ACAGGAGAAA GGAAGTTAGC AGGAGTTATG AAAGATGGAG TGAACTCAGC 1801 AAACAGATAT TACCTCAACC GATTTAAGGA TGCTTATAGG CAAGCTGTTA 1851 TAGATTTGAT GCAAGGCATT CCAGTGACAG AAGATCTTTA TTCCATATTT 1901 ACCAAGGAGA AAGAACATGA AGCTTTGCAT AAGGAAAATC AGAGAAGCCA 1951 CCAGGAACTA ATTAGCCAGC TCTTACAAAG TTACATGAAG TTACTACTGC 2001 CTGATGATGA GAAGTTCCAT GGGGGCTGGG CCCTCATTGA CTGTGACCCT 2051 AGCCTCATTG ATGCTACTCA CAGAGACGTG GATGTGCTGT TACTGCTTTC 2101 TAACTCTGCC TACTACGTGG CCTATTATGA TGATGAAGTT GATAAAGTAA 2151 ACCAGTATCA ACGACTAAGT CTAGAAAACC TGGAAAAAAT TGAAATAGGC 2201 CCTGAACCCA CTCTTTTTGG TAAGCCAAAG TTCTCCTGCA TGCGACTGCA 2251 CTACAGATAC AAAGAAGCGA GTGGCTATTT CCACACATTG CGAGCTGTAA 2301 TGCGTAATCC TGAAGAGGAT GGAAAAGATA CCCTTCAGTG CATTGCAGAG 2351 ATGCTGCAGA TCACCAAGCA AGCCATGGGA TCGGATTTAC CCATAATTGA 2401 GAAGAAACTT GAGAGGAAGA GCAGTAAACC TCACGAAGAC ATCATTGGTA 2451 TCAGGTCTCA AAACCAAGGT TCTTTGGCCC AGGGAAAGAA TTTTTTAATG 2501 AGCAAATTTT CATCTCTAAA TCAAAAAGTG AAGCAGACCA AATCCAATGT 2551 AAATATTGGC AACCTCCGAA AGCTAGGAAA CTTTACCAAA CCTGAAATGA 2601 AAGTTAACTT TCTAAAACCA AACTTAAAAG TAAATCTTTG GAAATCAGAT 2651 AGTAGTCTTG AAACTATGGA AAACACAGGA GTGATGGATA AGGTTCAGGC 2701 AGAGTCTGAT GGGGACATGT CTTCAGATAA TGACTCATAC CACTCTGATG 2751 AATTCCTTAC AAATTCTAAG TCTGATGAAG ACAGGCAGCT AGCTAACTCA 2801 TTAGAGAGTG TAGGGCCAAT AGATTACGTT CTTCCTAGTT GTGGTATTAT 2851 TGCCTCAGCG CCTCGATTGG GCAGTCGGTC CCAGTCTCTT AGCAGCACAG 2901 ATAGTAGCGT TCATGCTCCT TCAGAGATTA CTGTTGCTCA TGGGAGTGGG 2951 CTTGGAAAAG GCCAGGAGTC TCCTTTGAAG AAAAGTCCTT CTGCTGGCGA 3001 CGTACACATA TTGACTGGCT TTGCCAAGCC TATGGATATT TACTGCCACA 3051 GATTTGTGCA AGATGCACAG AACAAAGTGA CCCACCTATC AGAGACCAGA 3101 TCTGTGTCTC AGCAGGCTAG TCAGGAAAGA AATCAAATGA CCAATCAAGT 3151 TTCAAATGAA ACCCAATCAG AATCAACAGA ACAGACACCT TCTCGGCCAT 3201 CGCAATTAGA TGTCTCTCTT TCTGCAACAG GCCCACAGTT TTTGTCAGTT 3251 GAGCCAGCGC ATTCAGTTGC ATCTCAAAAA ACCCCCACCT CCGCTTCCAG 3301 CATGCTTGAA CTTGAGACAG GGCTTCATGT AACTCCTTCT CCTTCAGAGA 3351 GCAGTAGCAG CAGAGCAGTC TCTCCCTTTG CCAAGATTCG AAGTTCCATG 3401 GTCCAGGTTG CTAGTATTAC CCAAGCTGGA TTAACCCATG GGATAAACTT 3451 TGCAGTGTCA AAAGTTCAGA AGAGTCCTCC AGAACCTGAA ATCATTAATC 3501 AAGTCCAGCA AAATGAACTT AAAAAGATGT TTATACAATG CCAGACACGG 3551 ATAATTCAGA TTTAGCTTTT AGCCATAAGA ATCCTTCCAT GGCTTTTATT 3601 TAAAAATATG AAATTTTCAC CTCTTGGGGT ATTTTAATTG TACTGTCTGA 3651 ACCCAGGGAT CACAAATTCT GTTCATTGGA AAGGGTTTTA AACGGAGTCG 3701 GAACCTGAGT AGATTTCCAA ATTTTACAGC CAGGACTACA GAAGTGCATC 3751 ATTCTAGAAT GTGTAGACCT GAGTAGCTTA TACACTACAG AGCACTTTGC 3801 TTATTTGAAA GTAATTCAGC AACAGGTCAC TTTGGGATAT AACCTGAACC 3851 TTTTTTTGGA GTGGGGTGGG TAGACTACAG TAGACACAAG GGCTGGACAT 3901 GCAGATGCTT AGGGGATTAG CGTTTTTCAT AATTTGTTCT GTTTGTCAGT 3951 TCATTCCTGT GTGTTCTTAC CTCTACAAAG TAAATTACAC ATTTAGTTTT 4001 TAGTGACTTT AACATGTTAC TGAAGCATTT GAATATAAAG CTATTTTAGT 4051 TTTGATGGCT TAACTGTTCC CTGAGGAGTT GAGGGTTATT GACACTAGAG 4101 AAATGAATTC TCATTTGATC CTAATTTTCC CCGTATTCTA CTTGAACACA 4151 TTAAAAATAC TCTGCTGCCT ATACAATGTA AACCTAGGAG CATTAAGACT 4201 TGTCACACAG TAAACCTGAT ACATCAGAGG TGAATACCAG CACCTATTAG 4251 GTTTCATTTT GCTGTTTTCA GGAATGTAAG AACACCCATA TTGGCTACTG 4301 GAAATTCTAG CAGTCAGTCA GGTTTTAATT TATTCCAGGA GGGGCATCCT 4351 CGACATCTTA TGTAGATGAT CCACAACTTC AAAATTTAGT CTGGGCCTAG 4401 TGCAGTGGCT CACACCTATA ATCCCAACAC TTTGGGAGGC CAGGAGTTTT 4451 GAGACCAGCC TGGGAAACAT CTGCCTCTAC AAAAAAATAC AAAAATTAGC 4501 TGGGCATAGT GGTGCATGCC TGTGTTTCTA GCTACGCAGG AGGATTGCTT 4551 GAGCCCATGA GATTGAGGCT GCAGTGAGCT GTGATCGTGC CACTGACCTC 4601 CAGCCTGGGG GACAGAGCAA GACCGTGTCT CAAAAACAAT TTAGTCTGAA 4651 ACACAATTGT GCTGAATCTG TCTGACTATA ACTCTGACCA CACAGAACCA 4701 GGGCTGCCCC TGTAATCCCC ACAGTAAGAA AGTTGTATGG CATATTCCAA 4751 CAAGTATTGG TTCGTCTGGT GTCTTTAGAG CTTTACTCTG TTGAAGTGAC 4801 TGATTCTCAA CTGAACATTA TGTCGTTACT TTGATAAGCA TTCCACTTTT 4851 GTTATTTATT AGTGCTATCT TTTTTTTTAC GTGTTAAATC TTGTGATTAT 4901 TAAAATAAAG TACCATTGTA ATTT // LOCUS AB023188 4999 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA0971 protein, complete cds. ACCESSION AB023188 NID g4589585 VERSION AB023188.1 GI:4589585 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hj06832. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 4999) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .4999 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hj06832" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 59. .2005 /gene="KIAA0971" CDS 59. .2005 /gene="KIAA0971" /codon_start=1 /product="KIAA0971 protein" /protein_id="BAA76815.1" /db_xref="PID:d1040568" /db_xref="PID:g4589586" /db_xref="GI:4589586" /translation="MLTTLKPFGSVSVESKMNNKAGSFFWNLRQFSTLVSTSRTMRLC CLGLCKPKIVHSNWNILNNFHNRMQSTDIIRYLFQDAFIFKSDVGFQTKGISTLTALR IERLLYAKRLFFDSKQSLVPVDKSDDELKKVNLNHEVSNEDVLTKETKPNRISSRKLS EECNSLSDVLDAFSKAPTFPSSNYFTAMWTIAKRLSDDQKRFEKRLMFSHPAFNQLCE HMMREAKIMQYKYLLFSLHAIVKLGIPQNTILVQTLLRVTQERINECDEICLSVLSTV LEAMEPCKNVHVLRTGFRILVDQQVWKIEDVFTLQVVMKCIGKDAPIALKRKLEMKAL RELDRFSVLNSQHMFEVLAAMNHRSLILLDECSKVVLDNIHGCPLRIMINILQSCKDL QYHNLDLFKGLADYVAATFDIWKFRKVLFILILFENLGFRPVGLMDLFMKRIVEDPES LNMKNILSILHTYSSLNHVYKCQNKEQFVEVMASALTGYLHTISSENLLDAVYSFCLM NYFPLAPFNQLLQKDIISELLTSDDMKNAYKLHTLDTCLKLDDTVYLRDIALSLPQLP RELPSSHTNAKVAEVLSSLLGGEGHFSKDVHLPHNYHIDFEIRMDTNRNQVLPLSDVD TTSATDIQRLLTYISFAGLSELKS" BASE COUNT 1539 a 895 c 930 g 1635 t ORIGIN 1 GCTCTAAGGA AAACGACAGC ACGTGTTCTT TTTCACTAGT AGAAGTGACG 51 TTGGTTTCAT GTTGACAACT TTGAAGCCAT TTGGAAGTGT TTCAGTGGAG 101 AGCAAAATGA ATAACAAAGC GGGCTCCTTT TTCTGGAACC TTAGACAATT 151 CAGTACATTA GTTTCAACAA GCAGAACTAT GAGGCTATGT TGTTTGGGAC 201 TTTGCAAACC AAAAATAGTT CATTCAAACT GGAACATTTT AAATAACTTT 251 CATAACAGAA TGCAATCAAC TGATATCATT AGATATCTCT TTCAGGATGC 301 ATTCATTTTT AAATCAGATG TTGGCTTTCA AACAAAGGGC ATAAGCACTC 351 TAACAGCCCT TAGAATTGAA AGACTACTTT ATGCTAAAAG ACTGTTTTTT 401 GACTCAAAGC AGTCTCTTGT CCCTGTTGAT AAATCTGATG ATGAATTGAA 451 GAAAGTAAAC CTTAATCATG AAGTCTCCAA TGAAGATGTT CTTACCAAGG 501 AAACAAAACC AAACCGTATC AGCAGTAGAA AACTGTCTGA GGAATGTAAT 551 TCCCTGAGTG ATGTGTTAGA TGCATTTTCA AAAGCGCCCA CATTTCCTAG 601 TAGCAACTAT TTCACAGCAA TGTGGACAAT TGCCAAAAGA CTGTCTGATG 651 ACCAGAAGCG CTTTGAAAAA CGACTGATGT TTAGCCACCC TGCATTTAAT 701 CAGCTCTGTG AACATATGAT GAGAGAAGCC AAGATCATGC AGTATAAGTA 751 CCTACTGTTC AGTCTTCACG CCATAGTGAA GCTTGGAATC CCTCAGAACA 801 CTATTTTGGT GCAGACTTTG CTGAGGGTGA CCCAGGAACG TATCAATGAG 851 TGTGATGAGA TATGCCTTTC AGTTTTGTCA ACTGTTTTAG AGGCAATGGA 901 ACCATGCAAG AATGTTCATG TTCTACGAAC GGGATTCAGA ATACTAGTTG 951 ATCAGCAAGT TTGGAAAATA GAAGATGTCT TCACATTACA AGTTGTGATG 1001 AAGTGTATTG GAAAAGATGC ACCGATTGCT CTTAAGAGGA AACTGGAGAT 1051 GAAAGCCTTG AGGGAATTAG ACAGATTTTC TGTTTTGAAT AGCCAACACA 1101 TGTTTGAAGT ACTAGCTGCC ATGAATCACC GATCTCTTAT ACTCCTGGAT 1151 GAATGCAGTA AGGTGGTCCT AGATAATATC CATGGGTGTC CTTTAAGAAT 1201 AATGATCAAC ATATTGCAGT CCTGCAAAGA CCTCCAGTAC CATAATTTGG 1251 ATCTCTTCAA GGGACTTGCA GATTATGTGG CTGCAACTTT CGACATCTGG 1301 AAGTTCAGAA AAGTTCTTTT TATCCTCATT TTATTTGAAA ACCTTGGCTT 1351 TCGACCTGTT GGTTTAATGG ACCTGTTTAT GAAGAGAATA GTAGAGGATC 1401 CTGAATCCCT AAACATGAAA AACATTCTAT CTATTCTTCA TACTTACTCT 1451 TCTCTCAATC ATGTCTACAA ATGCCAGAAC AAAGAACAGT TCGTGGAAGT 1501 TATGGCTAGT GCTCTGACTG GTTATCTTCA CACTATTTCT TCTGAAAACT 1551 TATTGGATGC AGTATATTCA TTTTGCTTGA TGAATTACTT TCCCCTGGCT 1601 CCTTTTAATC AGCTTCTGCA AAAAGACATC ATCAGTGAGC TGCTGACATC 1651 AGATGACATG AAGAATGCTT ACAAGCTGCA TACTTTGGAT ACTTGTCTAA 1701 AACTTGATGA TACTGTCTAT CTGAGGGACA TAGCCTTGTC ACTCCCACAG 1751 CTGCCGCGGG AGCTGCCATC GTCACATACA AATGCAAAGG TGGCAGAGGT 1801 GCTGAGCAGC CTTCTGGGAG GTGAAGGACA CTTCTCAAAG GATGTGCACT 1851 TGCCACACAA TTATCATATT GATTTTGAAA TCAGAATGGA CACTAACAGG 1901 AATCAAGTGC TACCACTTTC TGATGTGGAT ACAACTTCTG CTACAGATAT 1951 TCAAAGGTTG CTTACATATA TTTCATTTGC TGGGCTTTCT GAATTAAAAT 2001 CCTAATTCTA AAAGACTTCA TATTAAATAG CAGTAGTAAA TCTAAGACTT 2051 GATGTTTGTA TGGAGCCTGC AGCACACCAA GAGACTTGTC ATCCATCAAT 2101 GTACTCCTTA GAAAGTGTAA ATGTGGACGT TATGGTTGAA TACAATGGAC 2151 TACAGTATTG GTCATTTAAT GAGACAGACA TTTTTTTAAA ATTAGGTTTT 2201 GAAGCTATAG TTCAATAAAA CAATATTAAT ATAATTGAAA CTATCAATGC 2251 AGTGTAGACT TTGGACAAGA TAATATAATA GTGATCATAA AGAATAAAGA 2301 CTCGAGGCTT CTACAAAAAT CAAATACCCA ATAAATATTT TGCTCTACCA 2351 CCAAGACTTT TCCAGAGGTA GAGAAACCGT AGTTTTTTAA TTAAACTGTA 2401 TTTGTATGGG TTAGAGCACA TTGAAGCTAA TCCTAAGTTG TTTCTTTGGC 2451 TCACACATTA CAAATAGAGT GTCTCTGGGG AGAAATATGA TGCTATACCA 2501 TGAGAGAACA TCCTGCCTCT TCAGTAGATG CATAAGATCA TGCCATCATT 2551 GCACACACCC GTGGGTCAGT TACCCTTTAT GTAAAAGCTC TGCTACATGG 2601 ATCCTGACCC CAAGGAACTT GTATCTTCTG TTAACTCCAC TGGACACACA 2651 CACACACATA AATATGATAG AAACATACAA AAAAATCTGT AGGGATCCAA 2701 GCTCTTATCT GCCCATCCCT CTTCATTCAC ACACAGTTCT GTGACTTTCA 2751 CCTCTTTCTG AATGCATCTA GCTTATTTCT TGGCCATCTT TTCAACCGCT 2801 GTCTTTCCTA TTAGTGCTAC CCATCTCCCC TCAGAAGAGG CTTTCTCTCT 2851 GACCTCATTC CAAGATCAGT TTTCTTTTTC GGGTACTCCG CTATTCTGGC 2901 ATAATAAAAT AACAGTTGTT CTCAGCATAA AGTCATGAGA AGTGAAGAAT 2951 GTGAGCATTC CTTTAAAAAC ATAAAATTTG TCTTATAACA GCCAGAGTAT 3001 AGGTGAAGGC AGTAAGGATA TAAATGTCTT TTTTTTTTCT TTCTTTCTCC 3051 CAGTAAACTT AGTATTTCCA AAGTGCCAGG AACAATGAAA AAACAAATTA 3101 CACCTTCCAC ATTTTCGTTA TATTTACAGA TTTTATTTTA ACAGTCTTGC 3151 CTTTCTCTTG GTGTTTTCTT TCTTGTATTT AAACTCCTAT TTCTGTTTTT 3201 TAAATGCTGC TTTCTCTGGG AGATGATATG TGTAAGAAGA GCATTTACTA 3251 TTTGGTATTA TTTACACCGT TACTACTACT GGTAGAATAA ATAATTATAC 3301 TCAGATCAGA AATCAGCACT ACACAGAAAA CCCTTAAAAT TGGTGCTTTC 3351 ATGTCTGAAA ATTTTCCTCT TCAAAAATGT AGCCATCCTT AAAGGGAACT 3401 GAGAGGTAAC TGTGTATAAT GTTGTATATT TTTGATTTAT AAAGTAGCCT 3451 TGTATAGTGG AATATCTGCA GAGGCCAAGT AAAAGAAGTG AAGGACTAAG 3501 ACAGGAATAT CAAGGATTGT GGTCGGTGGA GAGCTCAACT TGGAAATCTC 3551 CAGAGAACAA GAATGCAGCA AGACTAAATA GTAATATCAA TAACTGTTCT 3601 TGTTTTGTTT ATTTTTGTTT TCAGAGTAGC TGTGCTATGT GTTTCCAGAT 3651 CTGCTTATTG TTTGGGTTCA AGCCACCCCA GAGGATTCCT TGCTATGAAA 3701 ATGCGGCATT TGAATGCAAT GGGTTTTCAT GTGATCTTGG TGAGAAAAAA 3751 ATGCAAATGA AATATTCTTT AATTACTGAT GTACATTCTA ACAGCTGCCA 3801 GGAATCTTGT GGTTGAGACT GGTTACTAGG ACTAGAGGAA TTGTCAGCAT 3851 TATTGGAATG AGTGCTTTAG TTACTGTTTT CTTTGAAAAA GAACATTGCT 3901 AGTACACGAT TTGGGTTCTG CTCAGCTGAT TACTGTCATT ACTAGAGCTT 3951 TTACAAATGT TGTTCTGGTC AGCTTTAATC CTTTTTTGCT AAGAAGAAGA 4001 ATGCAAATAA TATATTTTCA TATTTAAGAT ACACAAAACC ATATTTAATT 4051 TCCAGGTTAT CTTGAACAAG GGAAGGAAAG ATGTTAAAGC TTTGTTTTCA 4101 TGGCCTGTGT GAAGCTGTTA CAGGATGTTT GTTGCCAGAA AAGGCTGGAT 4151 GGTTTTAAGA AGATGGGTCT TTCTCTTCCC TCTTATGCTT CAATTTTATT 4201 TTGTACCCCA CAGCTTCTCT TGAACAGCCT CTTCTGATGT TTCTAGGTAG 4251 TAGGAACCTT ACAAATACAT TTATATATAT GTTGCTGTTG GAATCTATCT 4301 AAATGCCACT CATTTGGTCC TGCGCATAAT GTAAGATGTA AACATTCCAA 4351 GTCTTCTATT ATTATACTTT AGTATTGGAA ATGGATTACG TGTGGACTAT 4401 TGAGATTAAG ATGACTCCTT TCTCTTAGTG GGCTACTATT TAAGACTTTG 4451 TTTTTTATAT TGCTGATTAT ATATATTCTA AGGAAGATTT TCTCAAGAAA 4501 GCAGTTTTTG CACCATTTAG AGGACCTTTA AAATTTGTAT AATTTTCACA 4551 GTTAAAGCTT TCTTAAACCC TAGCATGTAG TCTTATTATG CCTTTAAAAT 4601 ATGAGTAGTT ATAAGGTCAC ATTTGCTTTC AAAAAGCTAT TGTCCTCATG 4651 TTACTTTACT ATGTTTGGAT CTCTAAACTA GCTCTTTTGT TAGTATCGTC 4701 TCACAAACTG ATTATTTTCC TCTTTCTTTG GTAAGGTCAA TAACTGGGAG 4751 ATGGACAAAC TAGAGATGGA AGATGCAGTC ACATTTTTGA AGACTAAAAT 4801 CTATTCAGTA GAAGCTCTTC CTGTTGCTGC TGTAAATGTG CAAAGCACAC 4851 AATAAAGTGA AAATCAACCT TTTCATATTA GGAGACATGC ATTTGTAAAA 4901 ATTAATAAAG ATGACAAGTC AGTTGTCAAT GGAATTGAGC TATCTGCTAA 4951 GACAAAAAAT GTTACCTCAG TTCACTATTA AAATTAATTT TAGGAGTGG // LOCUS AB023225 5231 bp mRNA PRI 16-JUN-1999 DEFINITION Homo sapiens mRNA for KIAA1008 protein, complete cds. ACCESSION AB023225 NID g4589659 VERSION AB023225.1 GI:4589659 KEYWORDS . SOURCE Homo sapiens adult male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:hh04776. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6 (1), 63-70 (1999) MEDLINE 99246063 REFERENCE 2 (bases 1 to 5231) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (04-FEB-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .5231 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hh04776" /clone_lib="pBluescriptII SK plus" /dev_stage="adult" /sex="male" /tissue_type="brain" gene 94. .2880 /gene="KIAA1008" CDS 94. .2880 /gene="KIAA1008" /note="h04476 cDNA clone for KIAA1008 has a 1-bp deletion at position 1122 of the sequence of KIAA1008." /codon_start=1 /product="KIAA1008 protein" /protein_id="BAA76852.1" /db_xref="PID:d1040605" /db_xref="PID:g4589660" /db_xref="GI:4589660" /translation="MLKSKTFLKKTRAGGVMKIVREHYLRDDIGCGAPGCAACGGAHE GPALEPQPQDPASSVCPQPHYLLPDTNVLLHQIVSAWRPGTWASVASSLRLPGSLETY VEQEQGENANDRNDRAIRVAAKWYNEHLKKMSADNQLQVIFITNDRRNKEKAIEEGIP AFTCEEYVKSLTANPELIDRLACLSEEGNEIESGKIIFSEHLPLSKLQQGIKSGTYLQ GTFRASRENYLEATVWIHGDNEENKEIILQGLKHLNRAVHEDIVAVELLPKSQWVAPS SVVLHDEGQNEEDVEKEEERERMLKTAVSEKMLKPTGRVVGIIKRNWRPYCGMLSKSD IKESRRHLFTPADKRIPRIRIETRQASTLEGRRIIVAIDGWPRNSRYPNGHFVRNLGD VGEKETETEVLLLEHDVPHQPFSQAVLSFLPKMPWSITEKDMKNREDLRHLCICSVDP PGCTDIDDALHCRELENGNLEVGVHIADVSHFIRPGNALDQESARRGTTVYLCEKRID MVPELLSSNLCSLKCDVDRLAFSCIWEMNHNAEILKTKFTKSVINSKASLTYAEAQLR IDSANMNDDITTSLRGLNKLAKILKKRRIEKGALTLSSPEVRFHMDSETHDPIDLQTK ELRETNSMVEEFMLLANISVAKKIHEEFSEHALLRKHPAPPPSNYEILVKAARSRNLE IKTDTAKSLAESLDQAESPTFPYLNTLLRILATRCMMQAVYFCSGMDNDFHHYGLASP IYTHFTSPIRRYADVIVHRLLAVAIGADCTYPELTDKHKLADICKNLNFRHKMAQYAQ RASVAFHTQLFFKSKGIVSEEAYILFVRKNAIVVLIPKYGLEGTVFFEEKDKPNPQLI YDDEIPSLKIEDTVFHVFDKVKVKIMLDSSNLQHQKIRMSLVEPQIPGISIPTDTSNM DLNGPKKKKMKLGK" BASE COUNT 1669 a 950 c 1140 g 1472 t ORIGIN 1 AAGGGAAGAA CCTCCGGGGT TAGGCGTATT CTAGATTGAC GCCTTTTGCT 51 GGAAGAGCGC TGCTGGGGTT AGGATTCTGC GCGGCGAGGC AAGATGCTCA 101 AGTCCAAGAC GTTCTTAAAA AAGACCCGGG CGGGCGGCGT GATGAAGATC 151 GTGCGCGAGC ACTACCTGCG AGACGACATC GGCTGCGGTG CGCCCGGGTG 201 CGCAGCGTGT GGAGGGGCGC ACGAGGGGCC GGCCCTGGAG CCGCAGCCCC 251 AGGACCCGGC GAGCAGCGTC TGCCCGCAAC CGCACTACTT GCTGCCCGAC 301 ACTAATGTGT TACTGCACCA GATTGTAAGT GCCTGGAGGC CGGGGACCTG 351 GGCTTCTGTG GCCTCCAGCC TGCGACTCCC AGGCAGCTTA GAAACCTATG 401 TAGAACAAGA ACAGGGAGAA AATGCTAATG ACAGGAATGA TAGAGCGATT 451 CGAGTAGCAG CAAAATGGTA CAATGAACAT TTGAAAAAAA TGTCAGCAGA 501 CAACCAGCTG CAAGTTATCT TCATAACAAA TGACAGGAGA AACAAAGAGA 551 AAGCCATAGA AGAAGGAATA CCAGCTTTCA CTTGTGAAGA ATATGTAAAG 601 AGCCTAACTG CTAACCCCGA ACTCATAGAT CGTCTTGCTT GTTTGTCTGA 651 AGAAGGGAAT GAAATAGAAA GTGGAAAAAT AATATTTTCA GAGCATCTTC 701 CCTTAAGTAA GCTACAGCAA GGCATAAAAT CTGGTACATA CCTTCAAGGA 751 ACATTTAGAG CTAGCAGGGA AAATTACTTG GAAGCTACAG TATGGATTCA 801 TGGCGACAAT GAAGAAAATA AAGAGATAAT CTTACAGGGA CTTAAACATT 851 TAAACAGAGC TGTTCACGAA GATATTGTGG CTGTGGAGCT TCTCCCCAAG 901 AGTCAGTGGG TAGCACCATC TTCTGTGGTT TTACATGATG AAGGTCAAAA 951 TGAAGAAGAT GTGGAGAAAG AAGAAGAGAG AGAACGAATG CTTAAGACTG 1001 CTGTAAGCGA GAAAATGTTG AAGCCTACAG GTAGAGTTGT AGGAATAATA 1051 AAAAGGAATT GGAGACCATA TTGTGGCATG CTTTCCAAGT CTGACATTAA 1101 GGAGTCAAGA AGACATCTCT TTACACCTGC TGATAAGAGA ATCCCTCGAA 1151 TTCGCATAGA AACCAGACAG GCTTCCACAT TAGAAGGACG GAGAATTATT 1201 GTTGCTATTG ATGGTTGGCC CAGAAATTCC AGATATCCAA ATGGACACTT 1251 TGTGAGAAAT TTAGGTGATG TTGGAGAGAA AGAGACTGAA ACAGAAGTTT 1301 TGTTACTTGA ACACGATGTT CCCCATCAGC CTTTTTCACA GGCTGTTCTT 1351 AGTTTTCTGC CAAAGATGCC CTGGAGCATT ACTGAAAAGG ACATGAAAAA 1401 CCGAGAAGAC CTGAGGCATC TGTGTATTTG TAGTGTAGAC CCACCAGGAT 1451 GTACTGATAT AGACGATGCT CTACATTGTC GAGAACTCGA AAATGGAAAT 1501 TTGGAGGTTG GTGTTCATAT TGCTGATGTG AGCCATTTTA TTAGGCCAGG 1551 AAATGCCTTG GATCAAGAAT CAGCCAGAAG AGGAACAACT GTGTATCTTT 1601 GTGAAAAGAG GATTGACATG GTTCCAGAGT TGCTTAGCTC TAACTTGTGT 1651 TCCTTAAAAT GTGACGTGGA CAGGCTGGCA TTTTCATGTA TTTGGGAAAT 1701 GAATCACAAT GCTGAAATCT TAAAAACGAA GTTTACCAAA AGTGTTATTA 1751 ATTCAAAGGC ATCTCTGACA TATGCTGAAG CTCAGTTGAG AATTGATTCA 1801 GCAAACATGA ATGATGATAT TACCACTAGT CTCCGTGGAC TGAATAAACT 1851 AGCCAAAATT CTGAAGAAAA GAAGGATTGA AAAAGGGGCT TTGACTCTAT 1901 CCTCTCCTGA AGTTCGATTC CACATGGACA GTGAAACTCA CGATCCTATA 1951 GATCTGCAGA CCAAGGAACT TAGGGAAACA AATTCCATGG TTGAAGAATT 2001 TATGTTACTT GCCAATATTT CTGTTGCAAA AAAAATTCAT GAGGAATTTT 2051 CTGAACATGC TCTGCTTCGA AAACATCCTG CTCCACCTCC ATCAAATTAT 2101 GAAATTCTTG TTAAGGCAGC CAGGTCAAGG AATTTGGAAA TTAAGACTGA 2151 TACAGCCAAG TCTTTGGCTG AGTCTTTGGA TCAGGCCGAA TCTCCTACTT 2201 TTCCATATCT AAACACTCTG TTGAGAATAT TAGCCACTCG CTGTATGATG 2251 CAAGCTGTGT ACTTCTGTTC TGGAATGGAT AATGATTTTC ATCACTATGG 2301 CTTAGCGTCT CCAATATACA CACATTTTAC TTCACCCATT AGAAGATACG 2351 CAGATGTCAT TGTTCATCGG CTTTTGGCTG TGGCTATTGG GGCTGACTGT 2401 ACTTATCCAG AGTTGACAGA CAAACACAAG CTTGCAGATA TATGTAAAAA 2451 TCTAAATTTC CGGCACAAAA TGGCTCAATA TGCCCAACGT GCATCAGTGG 2501 CTTTTCATAC CCAGTTATTC TTCAAAAGCA AAGGAATAGT AAGTGAAGAA 2551 GCCTATATTT TATTTGTAAG AAAGAATGCC ATTGTGGTAT TAATTCCAAA 2601 GTATGGTTTA GAAGGGACAG TCTTTTTTGA AGAAAAGGAC AAACCAAACC 2651 CACAGCTTAT TTATGATGAT GAGATACCCT CACTTAAAAT AGAAGATACA 2701 GTGTTCCATG TATTTGATAA AGTTAAAGTG AAAATCATGT TAGACTCATC 2751 TAATCTTCAA CATCAGAAGA TCCGAATGTC CCTGGTAGAA CCACAGATAC 2801 CAGGAATAAG CATTCCTACT GATACTTCAA ACATGGACCT TAATGGACCA 2851 AAGAAAAAGA AGATGAAGCT TGGAAAATAG CTATATTCAA CAAAAATCTT 2901 CAAAGACTGG TTTCTTTTTT AAAAGAAAAA ACTTGAAAGA ACACTTCTAA 2951 GCCTAAGTGT GTGATACAGT TTGTTACTTT TAAGTACATT TTAATAATTT 3001 CAGACATCTG CATTTTTATT GAACAGTTGA CTGTATCTGA CCCATCATAC 3051 TACTATACTT CTGGGTTGAA CAGAATTATT TATGCAGAAT AATTCAATTG 3101 AATATCCATC ACTTAAATAC AGTGACAGGA CAGCAACTTC AGGGATCTGT 3151 AAAGATCATT TAAATGGAGT GCTCATCTCA TTGAGGAGCA GATTAATTTT 3201 GCGTAAGTAC TTTGATTATT TAATATTTGT AAGAAAAAAC TTTCATTTTC 3251 CTACAGAGGA AAATAGAACA ATTTTAGAAG CAAGGAACAA TCTCTTTTCT 3301 AAGTCTTGGA AGCTGTCAGT GTTGAGGATG TAATCTCCTT TGCCATCTTT 3351 AATTCACCTA ACTTACACTA GGTGTTCTCT TACTGTCTTT AAAAGCTTCC 3401 TGTATTTTAT TAGTGGTCCT TGAAAAACTG TGAATGTTTG GGAATTGGTA 3451 GAAAGGCAAA AAGTAGGATA TTTTGACCTG ACTGGAAAGA TGGTTGTGTT 3501 TTTATTGCCA GGTAATAAGT GTGATCATTG TTGAACTTCA GCTCCAGTGT 3551 CTCTCCAGAA TAAGACATTG GCATTCAAAT GTCTATATCT TGTTACTTAC 3601 AAAATAAAAA ACAGATAATT AGTGGCTTTT AAATTGTAGT TATATCAGTG 3651 TATATACACG AGGGGAACTG TATAAAGACA TACTAAAGGG AACAGATTAA 3701 AATAAGTATT ATTAATAAAA TTTGGTGTGC CAGACTGACT ACTTCCCTTG 3751 CTAATCACAG AGATTAGTAA TGATTAAATT AATATCTTCA GGAATATTTT 3801 GGGATAGGGT TGCCTTAAAA CATTTTACTT GGCTTATTCA ATTTCTAAAG 3851 CACTTACGTT GTGCCAGGTT CCACTCAAGT AAACTATCTC TGCCTTCAAG 3901 GAGCTTGCAG TATAGTGAGA AAAGCCTGCC AAGTAAATCA GCAGGTATAC 3951 TAAATAGGTA GTCATTTAGG CACTCAATAA ATGTTGACTA ATTTTTCCAG 4001 TTTCCTATTA CTGAGGAAAA CCTCCATACA CTGAGGCAGA AACTTGTGTC 4051 AGTTGGAGGA AAATAGACTT GAGTAGTCTT TGGTCCAGGT AAAATGGGTC 4101 AGTGACACTG AAACAGTAGA GTAGGAGTCA AAAAGCCTTT TCTGTGAAGT 4151 GCTAGAACAT TTTAGACTTT GTTGGCTATG GTCTGTATTG CAACTATAGT 4201 ACACAACTCT GCCGTTGATA GTGTGTAAGG AGCCATATAC AATATGTAAA 4251 CAAATCAACA TGACTGTGTT TCAACAAGGC CTGATTTGTT AAGTTTCATA 4301 TATAGTCATA TGTCACTTAA TGACAGGGAT ATGTTCTGAG ACATGTCACT 4351 TGGCAATTTC ATCATTGTGT GAAAATCATG ATGTAGTTAC ACAAACCTAG 4401 ATGGTATACC CTATGGTATC CTATATGGGA TAGCCTATTG CTCCTAACCT 4451 ACAAACCTGT ATGGCATATT ACTGTACTGA TTACTGTAGG CAGTTGTAAC 4501 ACAGTGCTAA GTATTTGTGT ATCTAAACAT ATCTAAACAT AGAAAAGGTA 4551 CAGTAAAAAT ATATATATAG ATATATATAA GATATAAAAT GGTACACCTA 4601 TTTAGGACAT TTACCATGAA CAAGTTGGCA GGACTGGAAG CTGGCTCTGG 4651 GTGAGTCAGT AAGTGGTTAG TGAATGTGAA GGCCATGATA TTACTGTGCA 4701 CTACTGTAGA CTTTTATAAA CACTATATAC TTAGGCTACA TAAAGTAATT 4751 TCACTATGAT GTTACCATGG CTGTGTCCCC AGGCAATAGG AATTTTTCAG 4801 CTCCATTGTA ATCTTACGGG ATCATCATCG TATATTTGGC CATTGTTGAC 4851 CAAAAGTTAT GCAGCACGTC ATTATAATTT TGATGTGTCA CAAAACATTA 4901 CTCATTTGAT TTCCCCCACC CCCGCCAACC ATTTAAAAAA GTTTGCCGGC 4951 TAGGTGCGGT GGCTCATGCC TGTAATCCTA GCATTTTGGG AGGACGAGGC 5001 GGGTGGATCA CTCGATGTCA GGAGTTTGAG ACCAGCCTGG CCAACATGGT 5051 AAAACGCCGT CTCTACTAAA AATACAAAAA CTTAGCTGGG TGTAACGGCG 5101 GATGCCTGTA ATCCCAGCTA CTTGGGAGGC TGAGGCAGGA GAATCGCTTG 5151 AACCTGGGAG GCGGAAGCTG CAGTGAGCCG AGATTGCACC ACTGCAGTTC 5201 AGCCTGGATG ATGAGAGTGA GACTCCATCG C // LOCUS AB026118 2819 bp mRNA PRI 06-AUG-1999 DEFINITION Homo sapiens mRNA for MALT1, complete cds. ACCESSION AB026118 NID g5706377 VERSION AB026118.1 GI:5706377 KEYWORDS MALT1. SOURCE Homo sapiens Pre B-cell cell_line:KOCL-33 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Akagi,T., Motegi,M., Tamura,A., Suzuki,R., Hosokawa,Y., Suzuki,H., Ota,H., Nakamura,S., Morishima,Y., Taniwaki,M. and Seto,M. TITLE A novel gene, MALT1 at 18q21, is involved in t(11;18)(q21;q21) found in low-grade B-cell lymphoma of mucosa-associated lymphoid tissue JOURNAL Oncogene (1999) In press REFERENCE 2 (bases 1 to 2819) AUTHORS Seto,M. TITLE Direct Submission JOURNAL Submitted (13-APR-1999) to the DDBJ/EMBL/GenBank databases. Masao Seto, Aichi Cancer Center Research Institute, Laboratory of Chemotherapy; 1-1 Kanokoden, Chikusa-ku, Nagoya, Aichi 464-8681, Japan (E-mail:mseto@aichi-cc.pref.aichi.jp, Tel:81-52-762-6111, Fax:81-52-764-2982) COMMENT Sequence updated (04-Aug-1999). FEATURES Location/Qualifiers source 1. .2819 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KOCL-33" /cell_type="Pre B-cell" /chromosome="18" /map="18q21" gene 66. .2507 /gene="MALT1" CDS 66. .2507 /gene="MALT1" /codon_start=1 /product="MALT1" /protein_id="BAA83099.1" /db_xref="PID:d1046926" /db_xref="PID:g5706378" /db_xref="GI:5706378" /translation="MSLLGDPLQALPPSAAPTGPLLAPPAGATLNRLREPLLRRLSEL LDQAPEGRGWRRLAELAGSRGRLRLSCLDLEQCSLKVLEPEGSPSLCLLKLMGEKGCT VTELSDFLQAMEHTEVLQLLSPPGIKITVNPESKAVLAGQFVKLCCRATGHPFVQYQW FKMNKEIPNGNTSELIFNAVHVKDAGFYVCRVNNNFTFEFSQWSQLDVCDIPESFQRS VDGVSESKLQICVEPTSQKLMPGSTLVLQCVAVGSPIPHYQWFKNELPLTHETKKLYM VPYVDLEHQGTYWCHVYNDRDSQDSKKVEIIIDELNNLGHPDNKEQTTDQPLAKDKVA LLIGNMNYREHPKLKAPLVDVYELTNLLRQLDFKVVSLLDLTEYEMRNAVDEFLLLLD KGVYGLLYYAGHGYENFGNSFMVPVDAPNPYRSENCLCVQNILKLMQEKETGLNVFLL DMCRKRNDYDDTIPILDALKVTANIVFGYATCQGAEAFEIQHSGLANGIFMKFLKDRL LEDKKITVLLDEVAEDMGKCHLTKGKQALEIRSSLSEKRALTDPIQGTEYSAESLVRN LQWAKAHELPESMCLKFDCGVQIQLGFAAEFSNVMIIYTSIVYKPPEIIMCDAYVTDF PLDLDIDPKDANKGTPEETGSYLVSKDLPKHCLYTRLSSLQKLKEHLVFTVCLSYQYS GLEDTVEDKQEVNVGKPLIAKLDMHRGLGRKTCFQTCLMSNGPYQSSAATSGGAGHYH SLQDPFHGVYHSHPGNPSNVTPADSCHCSRTPDAFISSFAHHASCHFSRSNVPVETTD EIPFSFSDRLRISEK" BASE COUNT 820 a 557 c 668 g 774 t ORIGIN 1 CCGGGGCCGA GGCCCGTGAC GGGGCGGGCG GGAGCCCCGG CAGTCCGGGG 51 TCGCCGGCGA GGGCCATGTC GCTGTTGGGG GACCCGCTAC AGGCCCTGCC 101 GCCCTCGGCC GCCCCCACGG GGCCGCTGCT CGCCCCTCCG GCCGGCGCGA 151 CCCTCAACCG CCTGCGGGAG CCGCTGCTGC GGAGGCTCAG CGAGCTCCTG 201 GATCAGGCGC CCGAGGGCCG GGGCTGGAGG AGACTGGCGG AGCTGGCGGG 251 GAGTCGCGGG CGCCTCCGCC TCAGTTGCCT AGACCTGGAG CAGTGTTCTC 301 TTAAGGTACT GGAGCCTGAA GGAAGCCCCA GCCTGTGTCT GCTGAAGTTA 351 ATGGGTGAAA AAGGTTGCAC AGTCACAGAA TTGAGTGATT TCCTGCAGGC 401 TATGGAACAC ACTGAAGTTC TTCAGCTTCT CAGCCCCCCA GGAATAAAGA 451 TTACTGTAAA CCCAGAGTCA AAGGCAGTCT TGGCTGGACA GTTTGTGAAA 501 CTGTGTTGCC GGGCAACTGG ACATCCTTTT GTTCAATATC AGTGGTTCAA 551 AATGAATAAA GAGATTCCAA ATGGAAATAC ATCAGAGCTT ATTTTTAATG 601 CAGTGCATGT AAAAGATGCA GGCTTTTATG TCTGTCGAGT TAATAACAAT 651 TTCACCTTTG AATTCAGCCA GTGGTCACAG CTGGATGTTT GCGACATCCC 701 AGAGAGCTTC CAGAGAAGTG TTGATGGCGT CTCTGAATCC AAGTTGCAAA 751 TCTGTGTTGA ACCAACTTCC CAAAAGCTGA TGCCAGGCAG CACATTGGTT 801 TTACAGTGTG TTGCTGTTGG AAGCCCTATT CCTCACTACC AGTGGTTCAA 851 AAATGAATTA CCATTAACAC ATGAGACCAA AAAGCTATAC ATGGTGCCTT 901 ATGTGGATTT GGAACACCAA GGAACCTACT GGTGTCATGT ATATAATGAT 951 CGAGACAGTC AAGATAGCAA GAAGGTAGAA ATCATCATAG ATGAATTAAA 1001 TAATCTTGGT CATCCTGATA ATAAAGAGCA AACAACTGAC CAGCCTTTGG 1051 CGAAGGACAA GGTTGCCCTT TTGATAGGAA ATATGAATTA CCGGGAGCAC 1101 CCCAAGCTCA AAGCTCCTTT GGTGGATGTG TACGAATTGA CTAACTTACT 1151 GAGACAGCTG GACTTCAAAG TGGTTTCACT GTTGGATCTT ACTGAATATG 1201 AGATGCGTAA TGCTGTGGAT GAGTTTTTAC TCCTTTTAGA CAAGGGAGTA 1251 TATGGGTTAT TATATTATGC AGGACATGGT TATGAAAATT TTGGGAACAG 1301 CTTCATGGTC CCCGTTGATG CTCCAAATCC ATATAGGTCT GAAAATTGTC 1351 TGTGTGTACA AAATATACTG AAATTGATGC AAGAAAAAGA AACTGGACTT 1401 AATGTGTTCT TATTGGATAT GTGTAGGAAA AGAAATGACT ACGATGATAC 1451 CATTCCAATC TTGGATGCAC TAAAAGTCAC CGCCAATATT GTGTTTGGAT 1501 ATGCCACGTG TCAAGGAGCA GAAGCTTTTG AAATCCAGCA TTCTGGATTG 1551 GCAAATGGAA TCTTTATGAA ATTTTTAAAA GACAGATTAT TAGAAGATAA 1601 GAAAATCACT GTGTTACTGG ATGAAGTTGC AGAAGATATG GGTAAGTGTC 1651 ACCTTACCAA AGGCAAACAG GCTCTAGAGA TTCGAAGTAG TTTATCTGAG 1701 AAGAGAGCAC TTACTGATCC AATACAGGGA ACAGAATATT CTGCTGAATC 1751 TCTTGTGCGG AATCTACAGT GGGCCAAGGC TCATGAACTT CCAGAAAGTA 1801 TGTGTCTTAA GTTTGACTGT GGTGTTCAGA TTCAATTAGG ATTTGCAGCT 1851 GAGTTTTCCA ATGTCATGAT CATCTATACA AGTATAGTTT ACAAACCACC 1901 GGAGATAATA ATGTGTGATG CCTACGTTAC TGATTTTCCA CTTGATCTAG 1951 ATATTGATCC AAAAGATGCA AATAAAGGCA CACCTGAAGA AACTGGCAGC 2001 TACTTGGTAT CAAAGGATCT TCCCAAGCAT TGCCTCTATA CCAGACTCAG 2051 TTCACTGCAA AAATTAAAGG AACATCTAGT CTTCACAGTA TGTTTATCAT 2101 ATCAGTACTC AGGATTGGAA GATACTGTAG AGGACAAGCA GGAAGTGAAT 2151 GTTGGGAAAC CTCTCATTGC TAAATTAGAC ATGCATCGAG GTTTGGGAAG 2201 GAAGACTTGC TTTCAAACTT GTCTTATGTC TAATGGTCCT TACCAGAGTT 2251 CTGCAGCCAC CTCAGGAGGA GCAGGGCATT ATCACTCATT GCAAGACCCA 2301 TTCCATGGTG TTTACCATTC ACATCCTGGT AATCCAAGTA ATGTTACACC 2351 AGCAGATAGC TGTCATTGCA GCCGGACTCC AGATGCATTT ATTTCAAGTT 2401 TCGCTCACCA TGCTTCATGT CATTTTAGTA GAAGTAATGT GCCAGTAGAG 2451 ACAACTGATG AAATACCATT TAGTTTCTCT GACAGGCTCA GAATTTCTGA 2501 AAAATGACCT CCTTGTTTTT GAAAGTTAGC ATAATTTTAG ATGCCTGTGA 2551 AATAGTACTG CACTTACATA AAGTGAGACA TTCTGAAAAG GCAAATTTGT 2601 ATATGTAGAG AAAGAATAGT AGTAACTGTT TCATAGCAAA CTTCAGGACT 2651 TTGAGATGTT GAAATTACAT TATTTAATTA CAGACTTCCT CTTTCTAAGA 2701 TTTTGTGAAT TGGTTGAATA GTTCTATACA AATGAAGTAT GGAGGTGTGT 2751 ATGTTTATAT GTATATAACA AAATATTTTC ATTGTGACCA CTCTGAAGTA 2801 AGAGCAATGG GAATGGCAT // LOCUS AB026190 2238 bp mRNA PRI 23-APR-1999 DEFINITION Homo sapiens mRNA for Kelch motif containing protein, complete cds. ACCESSION AB026190 NID g4650843 VERSION AB026190.1 GI:4650843 KEYWORDS Kelch motif containing protein. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2238) AUTHORS Yoshida,K. and Sugano,S. TITLE Kelch motif containing protein JOURNAL Published Only in DataBase (1999) In press REFERENCE 2 (bases 1 to 2238) AUTHORS Yoshida,K. and Sugano,S. TITLE Direct Submission JOURNAL Submitted (15-APR-1999) to the DDBJ/EMBL/GenBank databases. Kenichi Yoshida, Institute of Medical Science, University of Tokyo, Department of Virology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan (E-mail:kyoshida@ims.u-tokyo.ac.jp, Tel:81-3-5449-5343) FEATURES Location/Qualifiers source 1. .2238 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 121. .1950 /codon_start=1 /product="Kelch motif containing protein" /protein_id="BAA77027.1" /db_xref="PID:d1040790" /db_xref="PID:g4650844" /db_xref="GI:4650844" /translation="MEGKPMRRCTNIRPGETGMDVTSRCTLGDPNKLPEGVPQPARMP YISDKHPRQTLEVINLLRKHRELCDVVLVVGAKKIYAHRVILSACSPYFRAMFTGELA ESRQTEVVIRDIDERAMELLIDFAYTSQITVEEGNVQTLLPAACLLQLAEIQEACCEF LKRQLDPSNCLGIRAFADTHSCRELLRIADKFTQHNFQEVMESEEFMLLPANQLIDII SSDELNVRSEEQVFNAVMAWVKYSIQERRPQLPQVLQHVRLPLLSPKFLVGTVGSDPL IKSDEECRDLVDEAKNYLLLPQERPLMQGPRTRPRKPIRCGEVLFAVGGWCSGDAISS VERYDPQTNEWRMVASMSKRRCGVGVSVLDDLLYAVGGHDGSSYLNSVERYDPKTNQW SSDVAPTSTCRTSVGVAVLGGFLYAVGGQDGVSCLNIVERYDPKENKWTRVASMSTRR LGVAVAVLGGFLYAVGGSDGTSPLNTVERYNPQENRWHTIAPMGTRRKHLGCAVYQDM IYAVGGRDDTTELSSAERYNPRTNQWSPVVAMTSRRSGVGLAVVNGQLMAVGGFDGTT YLKTIEVFDPDANTWRLYGGMNYRRLWGGVGVIKMTHCESHIW" BASE COUNT 633 a 452 c 595 g 558 t ORIGIN 1 CGGCGGTGGA GGAGGCAGAG AGGAGTGGAG GGCGGAGTAG ACGGAGGAGG 51 CTGCTGCAGA GAAGAAAGTG TCAGAGCCGG TTCGGCTTTA GAGTGTGGTG 101 AAGGGTACTT TTCATGGTGC ATGGAAGGAA AGCCAATGCG CAGGTGTACC 151 AACATTCGAC CAGGAGAGAC TGGAATGGAT GTAACAAGCC GCTGCACCCT 201 TGGAGACCCC AACAAACTGC CAGAAGGGGT TCCCCAACCT GCCCGCATGC 251 CCTATATCTC AGACAAGCAC CCTCGACAAA CCTTGGAAGT GATTAACCTT 301 CTGAGAAAGC ACCGGGAGCT ATGTGATGTG GTGCTAGTTG TGGGCGCCAA 351 GAAGATATAT GCCCATCGAG TCATTTTGTC AGCCTGTAGT CCCTACTTCC 401 GAGCTATGTT TACAGGAGAA TTGGCAGAGA GCCGTCAGAC AGAAGTAGTG 451 ATCCGAGACA TTGACGAGAG GGCTATGGAA TTACTGATTG ACTTTGCGTA 501 TACCTCCCAG ATAACAGTAG AAGAGGGCAA TGTTCAGACT CTTCTGCCAG 551 CTGCTTGCCT CCTCCAGCTG GCAGAAATAC AGGAAGCCTG CTGTGAATTC 601 TTAAAGAGAC AATTAGATCC TTCTAACTGC CTGGGCATTC GGGCTTTTGC 651 TGACACACAT TCATGTCGTG AGTTGCTAAG GATAGCAGAC AAGTTCACCC 701 AACATAACTT TCAAGAGGTA ATGGAGAGTG AAGAGTTCAT GTTGCTTCCA 751 GCCAATCAAC TCATTGATAT AATATCCAGT GATGAGCTAA ACGTTCGCAG 801 TGAAGAACAA GTGTTCAATG CAGTGATGGC CTGGGTCAAA TACAGTATTC 851 AGGAAAGACG TCCTCAATTA CCCCAGGTGC TGCAGCATGT TCGTTTGCCT 901 TTGCTTAGTC CCAAGTTCCT GGTCGGCACA GTAGGCTCTG ATCCCCTCAT 951 CAAAAGTGAT GAAGAATGCA GAGACTTGGT AGATGAGGCT AAAAACTACC 1001 TCCTATTGCC GCAAGAACGA CCACTAATGC AAGGACCAAG GACGAGACCA 1051 CGGAAACCTA TCCGATGTGG GGAAGTACTC TTTGCAGTTG GTGGTTGGTG 1101 CAGTGGAGAT GCCATTTCCA GTGTTGAACG ATATGATCCA CAGACCAATG 1151 AATGGAGAAT GGTGGCTTCA ATGAGCAAAA GGAGATGCGG AGTTGGGGTC 1201 AGTGTTCTTG ATGATCTGTT ATATGCAGTA GGAGGCCATG ATGGATCCTC 1251 TTATCTCAAT AGTGTTGAAA GGTATGACCC CAAAACAAAC CAGTGGAGCA 1301 GTGATGTGGC CCCTACAAGC ACCTGCAGGA CAAGTGTTGG TGTAGCAGTA 1351 CTTGGAGGCT TTCTTTATGC TGTGGGTGGC CAGGATGGTG TGTCTTGCCT 1401 CAACATTGTT GAGAGGTATG ATCCGAAGGA GAACAAGTGG ACTCGGGTAG 1451 CTTCTATGAG TACCAGAAGA CTAGGTGTGG CTGTGGCTGT GTTAGGAGGG 1501 TTCTTATATG CTGTAGGTGG CTCTGACGGG ACATCTCCTC TCAACACAGT 1551 GGAACGTTAC AATCCTCAGG AAAACAGATG GCACACTATA GCCCCTATGG 1601 GGACCCGGAG GAAACACCTA GGCTGTGCAG TATATCAGGA CATGATCTAT 1651 GCTGTAGGAG GTAGAGATGA CACTACAGAG CTGAGCAGTG CTGAGAGATA 1701 CAACCCCAGA ACCAACCAGT GGTCTCCAGT GGTGGCCATG ACATCACGCC 1751 GTAGTGGAGT TGGCCTGGCA GTGGTCAATG GACAGCTCAT GGCAGTAGGA 1801 GGTTTTGATG GCACAACATA CTTGAAGACC ATAGAAGTTT TTGATCCTGA 1851 TGCCAATACA TGGAGGTTAT ATGGCGGGAT GAATTACCGT CGGCTATGGG 1901 GTGGCGTAGG AGTTATTAAA ATGACACATT GTGAATCCCA TATTTGGTGA 1951 ACACAGAGAA GACAGTCTTG TATATATTCC TCTGTATTCT GGGGAGCTTT 2001 GACCTTGGAG CTTTGTACAG CTTGAGAAAA CATTAGAACA AATTTTATTA 2051 TTTGCCGGTG CCTCAACAAA TGGAAATACA ATCCAATGAA AGTACTTCAC 2101 CTGCAAGATG CACAATAATT TTCACTCTGT GCAGAAGAAT ATTTATTTTT 2151 GGTTTTAATT TATCATGGTT TTTTGTTGTT TTCGTTTTGA ACTTATCCTT 2201 CCTCCCACAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA // LOCUS AB028964 5341 bp mRNA PRI 04-AUG-1999 DEFINITION Homo sapiens mRNA for KIAA1041 protein, complete cds. ACCESSION AB028964 NID g5689418 VERSION AB028964.1 GI:5689418 KEYWORDS . SOURCE Homo sapiens brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:fh02801. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kikuno,R., Nagase,T., Ishikawa,K., Hirosawa,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro JOURNAL DNA Res. 6, 197-205 (1999) REFERENCE 2 (bases 1 to 5341) AUTHORS Ohara,O., Nagase,T. and Kikuno,R. TITLE Direct Submission JOURNAL Submitted (17-JUN-1999) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1. .5341 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="fh02801" /clone_lib="pBluescriptII SK plus" /tissue_type="brain" gene 313. .2181 /gene="KIAA1041" CDS 313. .2181 /gene="KIAA1041" /codon_start=1 /product="KIAA1041 protein" /protein_id="BAA82993.1" /db_xref="PID:d1046820" /db_xref="PID:g5689419" /db_xref="GI:5689419" /translation="MGLYGQACPSVTSLRMTSELESSLTSMDWLPQLTMRAAIQKSDA TQNAHGTGISKKNALLDPNTTLDQEEVQQHKDGKPPYSYASLITFAINSSPKKKMTLS EIYQWICDNFPYYREAGSGWKNSIRHNLSLNKCFLKVPRSKDDPGKGSYWAIDTNPKE DALPTRPKKRARSVERASTPYSIDSDSLGMECIISGSASPTLAINTVTNKVTLYNTDQ DGSDSPRSSLNNSLSDQSLASVNLNSVGSVHSYTPVTSHPESVSQSLTPQQQPQYNLP ERDKQLLFSEYNFEDLSASFRSLYKSVFEQSLSQQGLMNIPSESSQQSHTSCTYQHSP SSTVSTHPHSNQSSLSNSHGSGLNTTGSNSVAQVSLSHPQMHPQPSPHPPHRPHGLPQ HPQRSPHPAPHPQQHSQLQSPHPQHPSPHQHIQHHPNHQHQTLTHQAPPPPQQVSCNS GVSNDWYATLDMLKESCRIASSVNWSDVDLSQFQGLMESMRQADLKNWSLDQVQFADL CSSLNQFFTQTGLIHSQSNVQQNVCHGAMHPTKPSQHIGTGNLYIDSRQNLPPSVMPP PGYPHIPQALSTPGTTMAGHHRAMNQQHMMPSQAFQMRRSLPPDDIQDDFDWDSIV" BASE COUNT 1466 a 1131 c 1128 g 1616 t ORIGIN 1 GCGAGACGCC CTCTCTCTTC CAGCCGCAGC CGCGTTGCGG GTTCTTCCCT 51 GTGGAACAGT AGAGACCTAG TCTGCCTTCT TTCACAAGAT AATCCTTCAA 101 TTATTTGAAG GTGATTATTC AGCATATATC TCTTGAGTAC CTGTGCATGC 151 AAGACACTGT CCTGGATATT GTGAGAGATC CAAAGTTTAG TATGAGATCA 201 TTTTCCCTTG CCTGAGTTTT GAGCTGATTG AGAAATAAGA ACCTACAGAT 251 GAAGAAGCAA GACAGTACCT TTAAAAAAAT TAGCTTTTTA TTCAGATTCT 301 CTTTGTGGCC AGATGGGTTT GTATGGACAG GCTTGTCCAT CTGTAACTTC 351 ATTAAGGATG ACATCTGAAC TGGAGAGCAG CCTAACGTCT ATGGACTGGT 401 TACCACAGCT CACCATGAGA GCAGCCATCC AAAAATCTGA TGCTACACAA 451 AATGCACATG GAACAGGAAT TTCTAAGAAG AATGCACTCC TTGACCCAAA 501 TACAACTCTG GACCAGGAAG AAGTCCAACA GCACAAAGAT GGGAAACCTC 551 CATACAGTTA TGCCAGCCTC ATTACATTTG CAATTAATAG CTCACCCAAA 601 AAGAAAATGA CTTTAAGTGA AATTTATCAG TGGATTTGTG ATAACTTCCC 651 ATATTATAGA GAGGCTGGCA GTGGTTGGAA GAATTCCATA CGACATAATC 701 TGTCATTGAA CAAATGTTTC CTTAAAGTGC CTCGATCTAA GGATGACCCT 751 GGAAAGGGGT CCTACTGGGC AATAGACACC AATCCGAAGG AAGATGCGCT 801 GCCTACTCGG CCAAAGAAGA GGGCACGATC TGTAGAACGG GCCTCAACTC 851 CATATAGCAT AGATTCAGAT TCTTTGGGAA TGGAGTGTAT TATTTCGGGA 901 AGTGCCTCTC CAACTCTGGC AATCAACACT GTGACTAACA AAGTAACATT 951 GTATAACACT GATCAGGATG GTAGTGATAG CCCACGCAGT AGCCTTAACA 1001 ACAGTCTCTC AGACCAGAGT TTGGCATCTG TTAATTTGAA CAGTGTTGGA 1051 AGTGTGCATA GTTATACACC GGTGACAAGC CATCCAGAAT CAGTCTCTCA 1101 ATCATTAACT CCTCAGCAGC AACCACAGTA CAACCTTCCA GAAAGAGACA 1151 AGCAACTACT TTTCTCAGAA TATAATTTTG AAGATCTTAG TGCCTCATTT 1201 CGGAGCCTTT ATAAGTCAGT TTTTGAGCAG TCACTTAGTC AACAAGGTTT 1251 GATGAACATC CCTTCTGAAT CTTCCCAGCA GTCCCACACT TCATGTACCT 1301 ATCAGCACTC TCCCAGCAGT ACAGTGAGCA CTCACCCACA CAGCAACCAA 1351 AGCAGCCTGT CCAACAGTCA TGGCAGTGGC CTCAACACCA CAGGCAGTAA 1401 TTCGGTTGCA CAGGTCTCAC TGTCTCACCC CCAGATGCAC CCACAGCCAT 1451 CTCCACATCC TCCCCATCGA CCGCATGGTT TACCGCAGCA TCCGCAGCGT 1501 TCCCCACACC CAGCACCACA CCCACAGCAA CACAGCCAGC TCCAGTCCCC 1551 TCACCCCCAG CATCCCTCTC CACATCAACA CATACAGCAC CATCCGAACC 1601 ATCAGCATCA GACGTTAACA CATCAGGCAC CCCCACCCCC ACAACAGGTA 1651 TCCTGTAATT CTGGTGTTTC AAATGATTGG TATGCGACAC TTGATATGCT 1701 AAAAGAAAGC TGTCGAATTG CCAGCAGTGT TAATTGGTCA GATGTAGACC 1751 TTTCACAGTT TCAAGGTCTG ATGGAGAGTA TGAGACAGGC AGATCTCAAG 1801 AACTGGTCTT TAGACCAGGT TCAGTTTGCC GATCTTTGTT CTTCTCTTAA 1851 TCAGTTCTTT ACACAAACTG GCCTTATACA TTCACAGAGT AATGTTCAAC 1901 AAAATGTTTG TCATGGTGCC ATGCATCCAA CAAAACCTTC CCAACACATT 1951 GGAACAGGAA ATTTGTACAT AGATTCTAGG CAAAATCTCC CTCCTTCAGT 2001 GATGCCACCC CCTGGTTATC CTCATATCCC ACAGGCACTC AGCACTCCAG 2051 GAACAACGAT GGCAGGCCAT CACAGAGCCA TGAACCAGCA GCACATGATG 2101 CCTTCCCAAG CCTTCCAGAT GCGGCGTTCC CTGCCTCCAG ATGACATCCA 2151 GGATGACTTT GATTGGGATT CAATTGTGTA GGGCTTGTTT CTGCAAGACA 2201 CCAGACCCTA ACGTTACCTT TCTGTGCAGT GAAGGGAAAG GTTTAAGAGA 2251 ATCCAGTTGA GAAAACAAAC TTGCTAATCA CTTTACCAAT GTTATCAAAA 2301 TTACTTTTGA AGACAATCAG AAGGATTTTA GCTGGATAAC TTACTGCTTT 2351 TATCTGACCC AAGCAAGTAC TACATGTTTG TCTCCCTGCC AGCTGCCCTA 2401 TGTAGCTCCT AACTGTTGTG TGATTTGGAC GGCTTTTTGC ATATTTGTGT 2451 CAGTTTGATG TTAACCACAA GTGCCAGACT GATTTTTCAG ACGGAGCCTA 2501 TTTTGCTGCA AGCAGTTTAT ATAAAGATAC ATATGTGTAA ATATATGTAC 2551 AAAAATTACT GAAAGGCTTC AGTTTTTTCT AATTGGATTA TTATGTCTTG 2601 AAAGTTATTG TCAGTTTTTA TTCCTTGTTA GGCTATTTTC TGCAGGATGC 2651 TTTTAACTGA TGTAGGAAAC TGAAAGGAAA TAGATTTTTT CCAAAACCCA 2701 GTTCCCCTTA TTTAATCTTT TTTAGAAATG TGGGTAATGA ATTCTATCTA 2751 ATAAGTCAAG GAAACCAGAA TTTGACACAC TCCAACAATC CAAAGGGGCA 2801 TGTTGCTCCT GAGCAGCATG AAGAACTGAC CAAATTGGTT TTGATGCTTG 2851 GGGGATCATA GAGTATTTAT GTCTGCTTTT CTAAATCTGC ATTATAATAG 2901 CTCTAAAATT TGTTGATTGG TAAGAAATTG GGCATTGCTT GGCTCTTTAA 2951 ACACATCAGT GCTTCCACAT TCACCTATGT ATTTATTATT CAAAAGTGTC 3001 ATTTTAATAT TTATTGCTAC CTTCTGTGAA TGCTTAGCTC CTGTCGGGTT 3051 CATTAAGGAG AAATGTGTCT GAAAGCACAG AACTATTATT ATTTTTTTCT 3101 TTTTTTGAGA TGGAGTCTTG CCCTGTTGCC CAGGCTGGAG TGCAGTGGCA 3151 CGATCTCGGC TCACTGCAAC CTCCGCCTCC CGGGTTCAAG CAATTCTCCT 3201 GCCTCAGCCT CCCGAGTAGC TGGGAGTGCA GGTGCACACC GCCATGCCTG 3251 GCTGATTTTT TGTATTTTAG TAGAGATGGG GTTTCACCAT GTTGCCTGGG 3301 CTGGTTGCAA ACTCTTGAGC TCAGGCAGTC TGCCTGCCTC AGCCTCCTAG 3351 AGTGCTGGGA TTACAGACAT GAGCCACTGC GCCTGGCCCA GCGTTACTTT 3401 TCTTGATAAG AATTTACAGA TAAGACACTC AGAGAGACAT TGTGTCATTC 3451 TCTATCATCA CTCCTTTTAC CATGTGAATA GATTTCTCTG TCTTGGCATT 3501 CTTGCTGATT CAGACTTATT AGTTAAATCG GATTTTTCTG GAATTTGAAT 3551 CAGATTTCAT TTGGGCAACA ACAGCAGGGC AGGGTTTCTA AAGCAGGTTT 3601 CCCCACTCAT CTAGGTTGTC TTGAAAGGAG GATAGAGCCA CTTACAGTTT 3651 TATTTGCTTC AGTGTTTAAT GTAGATATTA TGTGGCCTGC TGTTTCTGTT 3701 GTCTTGTATG TCTGTGTATG CATGACATTT GGTCCCTTTC TTTGAAATGC 3751 AGCCCCTCTT ACGGGTTTTT TAGTGGTGGT AGTTTGTTTT GTTTCATATC 3801 ATATCACAGT TTGCATGGAC TGATTATCTT AGTTTTATGA TAAAACTAGA 3851 AGAAATGAGG GTTTTTTTTA ATGAATAGAT TTTGAATTGA TTCTTTAGAC 3901 CCCCCAAAAA GTGTCAGTTC TAATGGGGAA ATATATTCCA TCAAGTCAAA 3951 GAGTAATAAC TGCTCACTAG TTGCTTTTTA GCATCTCTGG TCTTATCAGC 4001 CATGCTAAAT CACTTTAATT AGCCTTAGTG ATTCTGTGGT TGAGTAATCT 4051 CTACTTGAAC TAAACAAACA TCTTTTGTTT CTGTGTGTGT GTGCTGTGTG 4101 TGAGAGTGTG CGCGCGCGTG TGTGTGTGTG TGTTTTAATG GAGTGTTGCC 4151 TTGAATGAAT CACTGGGAAG CCAGCCATGG TAAGGGCTGG TGAGGTTGGG 4201 GAGAAAGGAA GAGCTTTATG TTTCTCTGTT GTTTGGACCC TACTTGGCAT 4251 GAAAAAGGAA GCTCAGTTCC AGCCCCTTGG ATCAACGAAA ATCAGAGGAT 4301 TCTGGAAAGG CAGCCAACTT GCGCCTCTTA GAAGGATCAG AGGCAAGATG 4351 AGATGGCAGC CTGCAGAGTA AATGCTTGAA AAAAGAGGGG TGATTTTCAA 4401 TGGTTTGCTT AAGTCACTGT TTTCTAGACA CCAAAATAGC TGTTTTGAAA 4451 CTGTTTTAAT TGCTTGGGTA GCAATGTGCA CTTTAAACAA TTTGGATATT 4501 GGAATGCAGT CTCATACTGA GTGATTTGAG TAGAACCACT GATGATGATT 4551 TTATAAATTG TGTGAAGCGA ATTTCCCATT TGGCAATCAT TTACTGATTT 4601 GCAGTGATGA ATATTTTTAT GAGAATTTAA ACTTAGCAAG AATGGCCATG 4651 GAGGCAAAGC CTTCACCCAG ACCCATCCCA CTCTCCTGTG ATCCAGGTGG 4701 TCCAGGAGCC CAGGACAGGC CTTTTCTGTG GGCCCTGGCC AGACAGGGTT 4751 ACCTGGTGAG GTGCAGAGAG TCCCTCTAGT GGCCATTTTG TATGGTAGTT 4801 GCTAATGCAG AACAAGTTCT GTCTTGGGCT TAAATTGACT GAAGACTTTA 4851 GGGGGAAAGA ATAGTAAATG CATGTAAACA AATGGGGACA CTCTGTTCAG 4901 GAGAATAATC CGACTGGCAT TTGTGGCAGT TTTTGAAATG TAAATGTATT 4951 CATGTGTGTT CTTGTAAATA CGTGTCGCTC AGATGTCCTT TGAAGTGGGA 5001 GGGAATCAAT CCGGGGATAA TTTCAAATGG AATAGAGTAT TTTGATATTG 5051 TTCATTCAGA GGGTGATGTG TACATATCTA TATTGTATAT ATGTGATGAA 5101 AATGCATTGG CTTTTTGTGC AGATACAACC TGCTCTCTGT ACTGCTGTTG 5151 GACAGTCAGT GTTTTAATGT TTCTACAGTT TTGCTATTGC ACGATTTCAT 5201 ATTTTGCCTC TATGATGAAC GGCAACCATT ATTTGTAACT GTTTAGTGCT 5251 GTAAAGAAAT ATTCCAAGTG TCATTAGGAT TGTTGCTGCC AGAACTGATA 5301 TGCATGAATG GCACTTAAAA TAAATATATT ATGTTAACTC T // LOCUS AB030653 4152 bp mRNA PRI 04-AUG-1999 DEFINITION Homo sapiens mRNA for epsilon-adaptin, complete cds. ACCESSION AB030653 NID g5689376 VERSION AB030653.1 GI:5689376 KEYWORDS epsilon-adaptin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Takatsu,H., Sakurai,M., Shiba,Y., Yoshida,Y. and Nakayama,K. TITLE Identification and characterization of epsilon-adaptin that constitutes AP-4 clathrin adaptor-related complex JOURNAL Unpublished (1999) REFERENCE 2 (bases 1 to 4152) AUTHORS Nakayama,K. TITLE Direct Submission JOURNAL Submitted (31-JUL-1999) to the DDBJ/EMBL/GenBank databases. Kazuhisa Nakayama, University of Tsukuba, Institute of Biological Sciences; 1-1-1 Tennohdai, Tsukuba, Ibaraki 305-8572, Japan (E-mail:kazunaka@sakura.cc.tsukuba.ac.jp, Tel:+81-298-53-6005, Fax:+81-298-53-6006) FEATURES Location/Qualifiers source 1. .4152 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 31. .3444 /codon_start=1 /product="epsilon-adaptin" /protein_id="BAA82969.1" /db_xref="PID:d1046796" /db_xref="PID:g5689377" /db_xref="GI:5689377" /translation="MSDIVEKTLTALPGLFLQNQPGGGPAAAKASFSSRLGSLVRGIT ALTSKHEEEKLIQQELSSLKATVSAPTTTLKMMKECMVRLIYCEMLGYDASFGYIHAI KLAQQGNLLEKRVGYLAVSLFLHESHELLLLLVNTVVKDLQSTNLVEVCMALTVVSQI FPREMIPAVLPLIEDKLQHSKEIVRRKAVLALYKFHLIAPNQVQHIHIKFRKALCDRD VGVMAASLHIYLRMIKENSSGYKDLTGSFVTILKQVVGGKLPVEFNYHSVPAPWLQIQ LLRILGLLGKDDQRTSELMYDVLDESLRRAELNHNVTYAILFECVHTVYSIYPKSELL EKAAKCIGKFVLSPKINLKYLGLKALTYVIQQDPTLALQHQMTIIECLDHPDPIIKRE TLELLYRITNAQNITVIVQKMLEYLHQSKEEYVIVNLVGKIAELAEKYAPDNAWFIQT MNAVFSVGGDVMHPDIPNNFLRLLAEGFDDETEDQQLRLYAVQSYLTLLDMENVFYPQ RFLQVMSWVLGEYSYLLDKETPEEVIAKLYKLLMNDSVSSETKAWLIAAVTKLTSQAH SSNTVERLIHEFTISLDTCMRQHAFELKHLHENVELMKSLLPVDRSCEDLVVDASLSF LDGFVAEGLSQGAAPYKPPHQRQEEKLSQEKVLNFEPYGLSFSSSGFTGRQSPAGISL GSDVSGNSAETGLKETNSLKLEGIKKLWGKEGYLPKKESKTGDESGALPVPQESIMEN VDQAITKKDQSQVLTQSKEEKEKQLLASSLFVGLGSESTINLLGKADTVSHKFRRKSK VKEAKSGETTSTHNMTCSSFSSLSNVAYEDDYYSNTLHDTGDKELKKFSLTSELLDSE SLTELPLVEKFSYCSLSTPSLFANNNMEIFHPPQSTAASVAKESSLASSFLEETTEYI HSNAMEVCNNETISVSSYKIWKDDCLLMVWSVTNKSGLELKSADLEIFPAENFKVTEQ PGCCLPVMEAESTKSFQYSVQIEKPFTEGNLTGFISYHMMDTHSAQLEFSVNLSLLDF IRPLKISSDDFGKLWLSFANDVKQNVKMSESQAALPSALKTLQQKLRLHIIEIIGNEG LLACQLLPSIPCLLHCRVHADVLALWFRSSCSTLPDYLLYQCQKVMEGS" BASE COUNT 1281 a 752 c 850 g 1269 t ORIGIN 1 GCGGCGGCGG CATCGCGGGC GGCGGCGGCG ATGAGCGACA TAGTGGAGAA 51 GACGCTGACG GCGCTGCCGG GACTCTTTCT GCAGAACCAG CCCGGTGGTG 101 GGCCCGCGGC CGCCAAGGCG TCCTTCTCCT CGAGGCTGGG CAGCCTTGTC 151 CGCGGCATCA CAGCCCTCAC CTCCAAGCAC GAAGAAGAAA AATTAATCCA 201 GCAGGAACTG AGTAGTCTGA AAGCGACTGT TTCTGCTCCT ACTACAACAC 251 TGAAAATGAT GAAGGAATGT ATGGTGAGAC TTATATATTG TGAAATGCTT 301 GGATATGATG CTTCCTTTGG CTATATTCAT GCAATCAAGT TAGCCCAACA 351 AGGAAACCTC TTAGAAAAAA GAGTAGGTTA TTTGGCTGTT TCCTTATTTC 401 TACATGAAAG TCATGAATTA TTGCTTCTCC TTGTGAATAC AGTTGTAAAG 451 GATCTGCAGA GCACTAACCT AGTAGAAGTG TGTATGGCAC TGACTGTTGT 501 TAGCCAGATT TTCCCCCGCG AAATGATTCC AGCTGTTCTT CCATTAATAG 551 AAGATAAACT TCAACATTCT AAGGAGATTG TACGAAGAAA AGCTGTTCTG 601 GCATTATACA AATTCCATCT CATTGCTCCT AATCAAGTAC AACATATTCA 651 TATTAAGTTT CGGAAAGCAC TTTGTGACAG AGATGTTGGG GTCATGGCTG 701 CCTCCTTGCA TATATATCTT AGAATGATTA AGGAGAATTC ATCTGGATAT 751 AAAGACTTGA CTGGGAGTTT TGTAACCATT TTGAAGCAAG TAGTTGGAGG 801 AAAGCTCCCA GTAGAATTCA ATTACCACAG TGTGCCAGCA CCATGGTTAC 851 AAATTCAGCT CTTGAGAATA CTGGGACTTC TAGGAAAAGA TGATCAAAGG 901 ACAAGTGAAT TAATGTATGA TGTTCTTGAT GAATCCTTAC GAAGAGCTGA 951 GTTAAATCAC AATGTCACAT ATGCTATTTT GTTTGAATGT GTGCATACAG 1001 TCTATTCTAT TTATCCTAAA TCGGAATTAC TTGAGAAGGC TGCCAAGTGC 1051 ATTGGAAAAT TTGTTCTGTC ACCTAAAATA AATCTAAAAT ATTTAGGACT 1101 GAAGGCTCTT ACCTATGTTA TCCAACAGGA TCCTACTCTG GCTCTTCAAC 1151 ACCAGATGAC AATAATTGAA TGTTTAGATC ATCCTGATCC CATTATTAAA 1201 AGAGAGACTC TGGAACTTCT TTACAGAATT ACTAATGCAC AGAATATAAC 1251 AGTTATTGTC CAGAAAATGC TTGAATATTT ACATCAGAGC AAAGAAGAGT 1301 ATGTCATCGT CAATTTGGTC GGCAAAATAG CAGAGCTGGC TGAGAAATAT 1351 GCTCCTGATA ATGCATGGTT TATTCAGACA ATGAATGCTG TGTTTTCAGT 1401 AGGAGGAGAT GTAATGCATC CTGATATTCC CAATAACTTT CTGAGACTAC 1451 TAGCGGAAGG TTTTGATGAT GAAACAGAAG ATCAGCAATT AAGACTCTAT 1501 GCAGTTCAGT CTTATCTCAC TTTACTGGAT ATGGAAAATG TGTTCTATCC 1551 ACAGAGATTT CTTCAAGTTA TGAGTTGGGT ATTAGGGGAA TATTCCTACC 1601 TCTTAGATAA GGAAACGCCA GAGGAAGTTA TAGCTAAGCT CTACAAGTTA 1651 CTTATGAATG ACTCTGTGTC TTCAGAAACA AAAGCCTGGT TAATTGCTGC 1701 TGTGACCAAA TTGACATCTC AGGCGCACTC TTCTAATACA GTTGAGAGAT 1751 TAATCCATGA ATTTACCATA TCTTTGGATA CTTGTATGAG ACAACATGCA 1801 TTTGAATTAA AACATTTGCA TGAGAATGTG GAACTTATGA AGAGCTTGCT 1851 TCCAGTTGAC AGGAGTTGTG AAGACTTGGT GGTAGATGCT TCTTTATCTT 1901 TTCTGGATGG TTTTGTGGCT GAAGGACTCA GTCAGGGTGC AGCGCCTTAC 1951 AAACCTCCCC ATCAACGCCA GGAGGAAAAG CTTTCTCAGG AAAAAGTTCT 2001 CAATTTTGAA CCATATGGAC TCTCCTTTTC TTCATCTGGC TTCACTGGAC 2051 GACAGTCTCC TGCTGGCATT TCTCTTGGTT CAGATGTATC TGGGAATAGT 2101 GCTGAGACAG GACTGAAAGA GACAAATAGC TTGAAGCTGG AAGGTATAAA 2151 GAAATTGTGG GGGAAAGAAG GCTATCTTCC CAAGAAGGAA AGCAAAACTG 2201 GTGATGAAAG TGGAGCTCTG CCTGTTCCTC AAGAGAGTAT AATGGAGAAT 2251 GTAGATCAAG CTATAACTAA AAAGGATCAA TCTCAAGTTC TTACCCAATC 2301 TAAAGAGGAG AAAGAAAAGC AGCTGCTGGC ATCATCATTA TTTGTTGGTC 2351 TAGGATCAGA AAGTACAATC AACCTGCTGG GAAAAGCAGA TACTGTCTCT 2401 CACAAGTTCA GAAGGAAATC AAAAGTCAAA GAAGCTAAAA GTGGCGAAAC 2451 AACCAGTACT CATAATATGA CCTGTTCTTC CTTTAGTTCT TTGTCAAATG 2501 TGGCATATGA AGATGATTAT TATTCGAATA CTTTGCACGA TACAGGAGAC 2551 AAGGAATTAA AGAAATTTTC TCTCACTTCA GAACTTTTGG ATTCTGAGTC 2601 ACTCACAGAA CTGCCCTTGG TTGAGAAATT CTCATATTGT AGTCTGTCTA 2651 CACCTTCATT GTTTGCTAAT AACAACATGG AAATTTTTCA CCCTCCTCAA 2701 TCTACTGCAG CCTCAGTTGC CAAGGAAAGC TCTTTAGCTT CATCTTTTTT 2751 GGAAGAAACT ACTGAATACA TACACTCAAA TGCTATGGAA GTCTGTAATA 2801 ATGAAACTAT ATCAGTGTCT TCTTATAAAA TTTGGAAAGA TGATTGTTTA 2851 TTGATGGTCT GGTCAGTCAC TAATAAGAGT GGTTTGGAAT TGAAAAGTGC 2901 TGACTTAGAA ATTTTTCCTG CAGAAAATTT CAAGGTGACT GAGCAACCTG 2951 GATGCTGTTT GCCTGTAATG GAAGCAGAAA GCACCAAAAG CTTTCAATAT 3001 AGTGTGCAGA TAGAAAAACC TTTTACAGAA GGAAATCTTA CTGGTTTTAT 3051 TAGTTATCAT ATGATGGATA CTCATTCTGC TCAGCTGGAA TTTTCTGTAA 3101 ACTTATCACT ATTAGATTTC ATTAGACCAT TAAAAATCTC AAGTGACGAC 3151 TTTGGGAAAC TCTGGTTATC CTTCGCAAAT GATGTGAAAC AAAATGTAAA 3201 AATGTCAGAA TCTCAAGCTG CACTTCCTTC TGCACTAAAG ACTCTGCAAC 3251 AGAAACTAAG ACTCCATATT ATTGAGATTA TAGGCAATGA AGGGCTATTG 3301 GCCTGTCAGC TGCTCCCATC CATCCCCTGC TTACTGCATT GCCGAGTTCA 3351 TGCAGATGTA TTAGCCCTGT GGTTCAGATC CTCCTGTTCT ACTCTTCCTG 3401 ACTATTTACT GTATCAATGT CAAAAGGTGA TGGAGGGATC CTAGCAGAAG 3451 CCCTGCTAAA TTTTACTCCA TCAAGATCAA TGGTTTACAT AGATAAACTT 3501 ATTTACCAAA GTAAAAAGAA CTCATGGTAC TTCTAATGAA AATGGGGATT 3551 ATTACAAGTG TGGTTTATAT GTTTTCTTTA TGATTCCTGG TCAAGAAAGA 3601 TCCCCAAAAC TGTATCCCTA ACCTTTAACT CAGGATTGTA CAGTATGTTT 3651 AGGTCCCTCA AAAAGTGACC TAAGCTAATG TTATAAACTG CTAATGATTT 3701 ATATATCACT TAGTGTGTAG AGGGACTGAA AATATTTATT TTGTTATAAA 3751 TAATTTTATA GCACGTTTAC TCTAGTGCTA GCTAATTTGT AATAAAGCCA 3801 AGTCTCAGTT TTCTGCATTA AATGGAAGGG AGACATGAAA TTGATAATCT 3851 CAAACTTAAT TCATATTTGG CTTTGGAATG TAGTGTATGG TTTTTGGTAG 3901 GGAAGTCAAT ATTTTCGATT ATGTTTTGCT TAGATCAGTG TTGAACTAAG 3951 TTGGCATAGC ACACACTAAC AGTTGTAGGA GATCCATGAG CATGCTGGAT 4001 ATTGAGGGGA TCCAATCTGT GATCTATTTT TTATTATAGA TAACTGTTAC 4051 ACATAATTTC AGATCGTTTG GCTGTAAAAT TTGTCTGTTT TCCACTTGGA 4101 GTTATTTTAA AATTCAAAGT AAATCACTCT ATTCTTATTT TGAAAACTCA 4151 GA // LOCUS AF000145 4380 bp mRNA PRI 16-JUL-1999 DEFINITION Homo sapiens germinal center kinase related protein kinase mRNA, complete cds. ACCESSION AF000145 NID g3095031 VERSION AF000145.1 GI:3095031 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4380) AUTHORS Diener,K., Wang,X.S., Chen,C., Meyer,C.F., Keesler,G., Zukowski,M., Tan,T.H. and Yao,Z. TITLE Activation of the c-Jun N-terminal kinase pathway by a novel protein kinase related to human germinal center kinase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (18), 9687-9692 (1997) MEDLINE 97420743 REFERENCE 2 (bases 1 to 4380) AUTHORS Katrina,D., Wang,X.S., Chen,C., Zukowski,M., Tan,T.-H. and Yao,Z. TITLE Direct Submission JOURNAL Submitted (16-APR-1997) 261, Amgen, Inc., 3200 Walnut St., Boulder, CO 80301, USA FEATURES Location/Qualifiers source 1. .4380 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="macrophage" CDS 361. .3015 /note="GLK; germinal center kinase-like kinase" /codon_start=1 /product="germinal center kinase related protein kinase" /protein_id="AAC15472.1" /db_xref="PID:g3095032" /db_xref="GI:3095032" /translation="MAQEDFELIQRIGSGTYGDVYKARNVNTGELAAIKVIKLEPGED FAVVQQEIIMMKDCKHPNIVAYFGSYLRRDKLWICMEFCGGGSLQDIYHVTGPLSELQ IAYVSRETLQGLYYLHSKGKMHRDIKGANILLTDNGHVKLADFGVSAQITATIAKRKS FIGTPYWMAPEVAAVERKGGYNQLCDLWAVGITAIELAELQPPMFDLHPMRALFLMTK SNFQPPKLKDKMKWSNSFHHFVKMALTKNPKKRPTAEKLLQHPFVTQHLTRSLAIELL DKVNNPDHSTYHDFDDDDPEPLVAVPHRIHSTSRNVREEKTRSEITFGQVKFDPPLRK ETEPHHELPDSDGFLDSSEEIYYTARSNLDLQLEYGQGHQGGYFLGADKSLLKSVEEE LHQRGHVAHLEDDEGDDDESKHSTLKAKIPPPLPPKPKSIFIPQEMHSTEDENQGTIK RCPMSGSPAKPSQVPPRPPPPRLPPHKPVALGNGMSSFQLNGERDGSLCQQQNEHRGT NLSRKEKKDVPKPISNGLPPTPKVHMGACFSKVFNGCPLKIHCASSWINPDTRDQYLI FGAEEGIYTLNLNELHETSMEQLFPRRCTWLYVMNNCLLSISGKASQLYSHNLPGLFD YARQMQKLPVAIPAHKLPDRILPRKFSVSAKIPETKWCQKCCVVRNPYTGHKYLCGAL QTSIVLLEWVEPMQKFMLIKHIDFPIPCPLRMFEMLVVPEQEYPLVCVGVSRGRDFNQ VVRFETVNPNSTSSWFTESDTPQTNVTHVTQLERDTILVCLDCCIKIVNLQGRLKSSR KLSSELTFDFQIESIVCLQDSVLAFWKHGMQGRSFRSNEVTQEISDSTRIFRLLGSDR VVVLESRPTDNPTANSNLYILAGHENSY" BASE COUNT 1509 a 745 c 835 g 1291 t ORIGIN 1 GAAGAAGAGA TTTTAAACAA AAAACGATCT AAAAAAATTC AGAAGAAATA 51 TGATGAAAGG AAAAAGAATG CCAAAATCAG CAGTCTCCTG GAGGAGCAGT 101 TCCAGCAGGG CAAGCTTCTT GCGTGCATCG CTTCAAGGCC GGGACAGTGT 151 GGCCGAGCAG ATGGCTATGT TGCTAGAGGG CAAAGAGTTG GAGTTCTATC 201 TTAGGAAAAT CAAGGCCGCA AAGGCAAATA AATCCTTGTT TTGTCTTCAC 251 CCATGTAATA AAGGTGTTTA TTGTTTTGTT CCCACCAAAA AAAAAAAAAA 301 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 351 AAAAAAAACC ATGGCGCAGG AGGACTTCGA GCTGATTCAG CGCATCGGCA 401 GCGGCACCTA CGGCGACGTC TACAAGGCAC GGAATGTTAA CACTGGTGAA 451 TTAGCAGCAA TTAAAGTAAT AAAATTGGAA CCAGGAGAAG ACTTTGCAGT 501 TGTGCAGCAA GAAATTATTA TGATGAAAGA CTGTAAACAC CCAAATATTG 551 TTGCTTATTT TGGAAGCTAT CTCAGGCGAG ATAAGCTTTG GATTTGCATG 601 GAGTTTTGTG GAGGTGGTTC TTTACAGGAT ATTTATCACG TAACTGGACC 651 TCTGTCAGAA CTGCAAATTG CATATGTTAG CAGAGAAACA CTGCAGGGAT 701 TATATTATCT TCACAGTAAA GGAAAAATGC ACAGAGATAT AAAGGGAGCT 751 AACATTCTAT TAACGGATAA TGGTCATGTG AAATTGGCTG ATTTTGGAGT 801 ATCTGCACAG ATAACAGCTA CAATTGCCAA ACGGAAGTCT TTCATTGGCA 851 CACCATATTG GATGGCTCCA GAAGTTGCAG CTGTTGAGAG GAAGGGGGGT 901 TACAATCAAC TCTGTGATCT CTGGGCAGTG GGAATCACTG CCATAGAACT 951 TGCAGAGCTT CAGCCTCCTA TGTTTGACTT ACACCCAATG AGAGCATTAT 1001 TTCTAATGAC AAAAAGCAAT TTTCAGCCTC CTAAACTAAA GGATAAAATG 1051 AAATGGTCAA ATAGTTTTCA TCACTTTGTG AAAATGGCAC TTACCAAAAA 1101 TCCGAAAAAA AGACCTACTG CTGAAAAATT ATTACAGCAT CCTTTTGTAA 1151 CACAACATTT GACACGGTCT TTGGCAATCG AGCTGTTGGA TAAAGTAAAT 1201 AATCCAGATC ATTCCACTTA CCATGATTTC GATGATGATG ATCCTGAGCC 1251 TCTTGTTGCT GTACCACATA GAATTCACTC AACAAGTAGA AACGTGAGAG 1301 AAGAAAAAAC ACGCTCAGAG ATAACCTTTG GCCAAGTGAA ATTTGATCCA 1351 CCCTTAAGAA AGGAGACAGA ACCACATCAT GAACTTCCCG ACAGTGATGG 1401 TTTTTTGGAC AGTTCAGAAG AAATATACTA CACTGCAAGA TCTAATCTGG 1451 ATCTGCAACT GGAATATGGA CAAGGACACC AAGGTGGTTA CTTTTTAGGT 1501 GCAGACAAGA GTCTTCTCAA GTCTGTTGAA GAAGAATTGC ATCAGCGAGG 1551 ACACGTCGCA CATTTAGAAG ATGATGAAGG AGATGATGAT GAATCTAAAC 1601 ACTCAACTCT GAAAGCAAAA ATTCCACCTC CTTTGCCACC AAAGCCTAAG 1651 TCTATCTTCA TACCACAGGA AATGCATTCT ACTGAGGATG AAAATCAAGG 1701 AACAATCAAG AGATGTCCCA TGTCAGGGAG CCCAGCAAAG CCATCCCAAG 1751 TTCCACCTAG ACCACCACCT CCCAGATTAC CCCCACACAA ACCTGTTGCC 1801 TTAGGAAATG GAATGAGCTC CTTCCAGTTA AATGGTGAAC GAGATGGCTC 1851 ATTATGTCAA CAACAGAATG AACATAGAGG CACAAACCTT TCAAGAAAAG 1901 AAAAGAAAGA TGTACCAAAG CCTATTAGTA ATGGTCTTCC TCCAACACCT 1951 AAAGTGCATA TGGGTGCATG TTTTTCAAAA GTTTTTAATG GGTGTCCCTT 2001 GAAAATTCAC TGTGCATCAT CATGGATAAA CCCAGATACA AGAGATCAGT 2051 ACTTGATATT TGGTGCCGAA GAAGGGATTT ATACCCTCAA TCTTAATGAA 2101 CTTCATGAAA CATCAATGGA ACAGCTATTC CCTCGAAGGT GTACATGGTT 2151 GTATGTAATG AACAATTGCT TGCTATCAAT ATCTGGTAAA GCTTCTCAGC 2201 TTTATTCCCA TAATTTACCA GGGCTTTTTG ATTATGCAAG ACAAATGCAA 2251 AAGTTACCTG TTGCTATTCC AGCACACAAA CTCCCTGACA GAATACTGCC 2301 AAGGAAATTT TCTGTATCAG CAAAAATCCC TGAAACCAAA TGGTGCCAGA 2351 AGTGTTGTGT TGTAAGAAAT CCTTACACGG GCCATAAATA CCTATGTGGA 2401 GCACTTCAGA CTAGCATTGT TCTATTAGAA TGGGTTGAAC CAATGCAGAA 2451 ATTTATGTTA ATTAAGCACA TAGATTTTCC TATACCATGT CCACTTAGAA 2501 TGTTTGAAAT GCTGGTAGTT CCTGAACAGG AGTACCCTTT AGTTTGTGTT 2551 GGTGTCAGTA GAGGTAGAGA CTTCAACCAA GTGGTTCGAT TTGAGACGGT 2601 CAATCCAAAT TCTACCTCTT CATGGTTTAC AGAATCAGAT ACCCCACAGA 2651 CAAATGTTAC TCATGTAACC CAACTGGAGA GAGATACCAT CCTTGTATGC 2701 TTGGACTGTT GTATAAAAAT AGTAAATCTC CAAGGAAGAT TAAAATCTAG 2751 CAGGAAATTG TCATCAGAAC TCACCTTTGA TTTCCAGATT GAATCAATAG 2801 TGTGCCTACA AGACAGTGTG CTAGCTTTCT GGAAACATGG AATGCAAGGT 2851 AGAAGTTTTA GATCTAATGA GGTAACACAA GAAATTTCAG ATAGCACAAG 2901 AATTTTCAGG CTGCTTGGAT CTGACAGGGT CGTGGTTTTG GAAAGTAGGC 2951 CAACTGATAA CCCCACAGCA AATAGCAATT TGTACATCCT GGCGGGTCAT 3001 GAAAACAGTT ACTGAGAATT GTTGTGCTTT GACAGTTAAC TCTAGAAAGA 3051 AAGAACACTA CCACTGCAAC ATTAATGGAT GCTTGAAGCT GTACAAAAGC 3101 TGCAGTAACC TGTCTTCAGT TACTTTGTAA TTTATTGTGG CATGAGATAA 3151 GATGGGGAAA ATTTTGTTTT ATGTGGTATG GATATATTTA GCATATTGAA 3201 CCACACAAGT GCTTAATTCA TTGTTATGTA ATCTTTGTAC ATATAGGCAG 3251 TATTTTTTCT GTGAAACTTC ATATTGCTGA AGACATACAC TAAGAATTTA 3301 TGTAGATAAT GTACTTTTAT GAGATGTACA AGTAAGTGTC TTATCTGTAC 3351 AGATGTAAAT GTTGATGAAA ATGCAATTGG GGTTAATATT TTAAGAATTC 3401 TTTAGTATAT TCTTGGGTGT GGCTATATTA CAAAATGGGA TGCTGGCAAT 3451 GAAACAATAC ATTTAACACT ATTGTATTTT TATTATATGT AATTTAGTAA 3501 TATGAATATA AATCTTGTAA CTTTTAAAAT TGTAATGGAG GCTGTAATCA 3551 TTTTATAATC TTTTTAATTT TAATGCAAGT ACACTGGTGT TTATATTTGC 3601 ACAAAGTATT GATATGTGAT GTATTAAGTC ACAAAAGTAA GCTGTGACAT 3651 TGTCTATAAG CATTTGGCTC CACAAATGTA TTTGGATTGT TTTCTATGTG 3701 AAGCAAACCA ATTATAATTA ACCACATGTT GTAGTAACTG GTCTTTTTAT 3751 ATTTAAGCAG AATCCTGTAA GATTGCTTGT CTTTGCTTAA AAACAATACC 3801 TTTGAACATT TTTGAATCAC AGAATAGCGG TACCATGATA GAATACTGCA 3851 ATTGTGGTCA GAATTACAGT ATGCACAAAG AATTAATTAG CATTATTAAA 3901 GAGTCCTCAC TAAACATTTC ATATGATCAC ACTGAAGAAC TGTAACATTC 3951 CATAGAGTGA AGTGGTTCAA ATTTCTCTTG GAATTTTTAC TTTTGTTGGC 4001 CTTATTTTAT GATCCTTTTC ATATTTCTTT TGACTTAGAG TATTAATACA 4051 TGGCCAAAAT AATTTAGTTA CTACCTCATA CAAACAATAT AATGGTTACT 4101 ACACATCACA GGAACTTAGT TTTGGTTTAA GTCATTTTTG ATTGCTTTTT 4151 TCCAATGGAA TATGTATATA CCAGGTTTTA GCAAAATGCA CACTTTTGGC 4201 TCTTTTTGGT ATATGTTCTT TATATTTTAA TGTGAGTATA TACACTAAGA 4251 ACAAACTAAA TTGTGATTTA TGATCTTCAT TTATTTTAAT GATAATGGTT 4301 TTAAAATATG TTCCTGATTG TACATATTGT AAAATAAACA TGTTTTTTAA 4351 CAAAAAAAAA AAAGAAAAAA AAAAAAAAAA // LOCUS AF000367 4152 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens cdc14 homolog mRNA, complete cds. ACCESSION AF000367 NID g2662416 VERSION AF000367.1 GI:2662416 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4152) AUTHORS Li,L., Ernsting,B.R., Wishart,M.J., Lohse,D.L. and Dixon,J.E. TITLE A family of putative tumor suppressors is structurally and functionally conserved in humans and yeast JOURNAL J. Biol. Chem. 272 (47), 29403-29406 (1997) MEDLINE 98037751 REFERENCE 2 (bases 1 to 4152) AUTHORS Li,L. and Dixon,J.E. TITLE Direct Submission JOURNAL Submitted (18-APR-1997) Biochemistry, University of Michigan, 1301 Catherine, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1. .4152 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 398. .2140 /codon_start=1 /product="cdc14 homolog" /protein_id="AAB88277.1" /db_xref="PID:g2662417" /db_xref="GI:2662417" /translation="MKDRLYFATLRNRPKSTVNTHYFSIDEELVYENFYADFGPLNLA MVYRYCCKLNKKLKSYSLSRKKIVHYTCFDQRKRANAAFLIGAYAVIYLKKTPEEAYR ALLSGSNPPYLPFRDASFGNCTYNLTILDCLQGIRKGLQHGFFDFETIDVDEYEHYER VENGDFNCIVPGKFLAFSGPHPKSKIENGYPLHAPEAYFPYFKKHNVTAVVRLNKKIY EAKRFTDAGFEHYDLFFIDGSTPSDNIVRRFLNICENTEGAIAVHCKAGLGRTGTLIA CYVMKHYRFTHAEIIAWIRICRPGSIIGPQQHFLEEKQASLWVQGDIFRSKLKNRPSS EGSINKILSGLDDMSIGGNLSKTQNMERFGEDNLEDDDVEMKNGITQGDKLRALKSQR QPRTSPSCAFRSDDTKGHPRAVSQPFRLSSSLQGSAVTLKTSKMALSPSATAKRINRT SLSSGATVRSFSINSRLASSLGNLNAATDDPENKKTSSSSKAGFTASPFTNLLNGSSQ PTTRNYPELNNNQYNRSSNSNGGNLNSPPGPHSAKTEEHTTILRPSYTGLSSSSARFL SRSIPSLQSEYVHY" BASE COUNT 1279 a 815 c 841 g 1217 t ORIGIN 1 ATCACTTTGG AAGCCGGGGG GAACACTTTG CCCTGCCCTG AGAGCTGGTC 51 TGCGTTTCCC AGGCGCGGCG GCGGCGGAGC AGCAGCTGCA GCAGCCGAGT 101 CCAAATAGGA GCGGCCACAG CCAGGGGCGT GTGCGCCCCG CGCGGAGCGA 151 GCTCGGGTTC CCCTCGGAAT GTCCCCGGGG CGCCCGGCGC GCTGACCCCG 201 AAGCCGCCTC CGCCTTCGGC GCCTGCTGCC TCCCTCGGCC AGGCTTGTTG 251 TTCGGGACTG TGAGCTTCCT GGCTCCTGGG CAGTGGGGAA GCCCCCGGGG 301 GCGAGTGACC TCAGCTGGCC ACGACCCAGC CCTCCCCCGT GCGTATCTCG 351 CTTAAGATGG CAGCGGAGTC AGGGAACTAA TCGGGGCTTG TGAGTTCATG 401 AAAGATCGGT TATATTTTGC TACTTTAAGG AATAGACCAA AAAGCACAGT 451 AAATACCCAC TATTTCTCCA TCGATGAGGA GCTGGTCTAT GAAAATTTCT 501 ATGCAGATTT TGGACCGCTG AACTTGGCAA TGGTGTACAG ATATTGCTGC 551 AAACTAAACA AGAAACTAAA ATCATACAGT TTGTCAAGAA AGAAAATAGT 601 GCACTACACC TGTTTTGACC AACGGAAAAG AGCAAATGCA GCATTTTTGA 651 TAGGTGCCTA TGCAGTAATC TATTTAAAGA AGACACCAGA AGAAGCCTAC 701 AGAGCACTCC TGTCTGGCTC AAACCCCCCC TATCTTCCAT TCAGGGATGC 751 TTCCTTTGGA AATTGCACTT ACAATCTCAC CATTCTCGAC TGTTTGCAGG 801 GAATCAGAAA GGGATTACAA CATGGATTTT TTGACTTTGA GACAATTGAT 851 GTGGATGAAT ATGAACATTA TGAGCGAGTT GAAAATGGTG ACTTCAACTG 901 TATTGTTCCA GGAAAATTTT TAGCATTTAG TGGACCACAT CCTAAAAGCA 951 AAATTGAGAA TGGTTATCCT CTTCACGCCC CTGAAGCCTA CTTTCCTTAT 1001 TTCAAAAAGC ATAATGTGAC TGCAGTTGTG AGGCTAAACA AAAAGATTTA 1051 TGAGGCAAAG CGCTTCACAG ACGCTGGCTT CGAGCACTAT GACCTCTTCT 1101 TCATAGATGG CAGCACACCC AGTGACAACA TCGTGCGAAG GTTCCTGAAC 1151 ATCTGTGAGA ACACCGAAGG GGCCATCGCC GTTCACTGCA AAGCTGGTCT 1201 TGGAAGAACA GGGACATTGA TAGCCTGTTA TGTAATGAAA CACTACAGGT 1251 TTACACATGC TGAAATAATT GCTTGGATTA GAATATGCCG GCCAGGCTCT 1301 ATTATAGGAC CCCAGCAGCA CTTCCTGGAA GAAAAACAAG CATCGTTGTG 1351 GGTCCAAGGA GACATTTTCC GATCCAAACT GAAAAATCGA CCATCCAGTG 1401 AAGGAAGTAT TAATAAAATT CTTTCTGGCC TAGATGATAT GTCTATTGGT 1451 GGAAATCTTT CAAAAACACA AAACATGGAA CGATTTGGAG AGGATAACTT 1501 AGAAGATGAT GATGTGGAAA TGAAAAATGG TATAACCCAG GGAGACAAAC 1551 TACGTGCCTT AAAAAGTCAG AGACAGCCAC GTACCTCACC ATCCTGTGCA 1601 TTTAGGTCAG ATGATACAAA AGGACATCCA AGAGCAGTGT CCCAGCCTTT 1651 CAGATTAAGT TCATCCCTGC AAGGATCTGC AGTTACTTTG AAGACATCAA 1701 AAATGGCACT GTCCCCTTCA GCAACGGCCA AGAGGATCAA CAGAACTTCT 1751 TTGTCTTCGG GTGCCACTGT AAGAAGCTTT TCCATAAACT CCCGGCTAGC 1801 CAGTTCTCTA GGGAACTTGA ATGCTGCAAC AGATGATCCA GAGAACAAAA 1851 AGACCTCCTC ATCCTCTAAG GCAGGCTTCA CAGCCAGCCC GTTTACCAAC 1901 CTCTTGAATG GCAGCTCCCA GCCAACTACC AGAAATTACC CTGAGCTCAA 1951 CAATAATCAG TACAACAGAA GCAGCAACAG CAACGGGGGC AACCTGAACA 2001 GCCCCCCAGG CCCCCACAGC GCCAAGACAG AGGAGCACAC CACCATCCTC 2051 CGACCCTCCT ACACCGGGCT TTCTTCTTCT TCAGCGAGAT TCCTGAGCCG 2101 TTCTATCCCT TCCCTTCAGT CTGAATATGT TCATTACTAA GGCCTTGCCA 2151 CTCCAGTGAA AGCTGTTCTT CTCTTAGACA CAATTTCTTC ATCTGGACGA 2201 GCAGTGGAGA GGGAAAGCAA CTTCTTGCTG GAAGAATATC TCTGCCTTCT 2251 TACCTTAAAT TAAAAAGAGC ACTAAGATAA CACCTTCAAG AGACTTGAAA 2301 ACAGAAAACT GGTTAATGAC TACTATAAAT GCACTGAAAC TATGTTATGG 2351 AGATTTCCAT ACTTTTAAAG ACAGTTTTAA TGTTGAATTT GGTATTTTGA 2401 AGGGTTATTT TTAATGTATT TTGGTAATAC ATTTATTATT ATATTTACAT 2451 GTACAGTGTT ACATTATATA TGTATTGTGA ACTTTAAAAG ACTATTTTGA 2501 TAAATTTATA AATATATAAA ATTATGTAAA AACTACACTA TATTTTGATT 2551 TAGATTTTCC TGCTGTTTGC TACCAAAAAT TTGTATTTTA AATCTGTTTA 2601 GTTTTAGTAT GGTTTTGTCT CTAATGAATA AATAATTCCT TCTTATTAAG 2651 AAGAAGTAAG GGAGAAAGTT TTTAGAAAGT GATTTTTATG CTCGCACTAT 2701 AAATATGGCA GGTCAGTTCA TTCTTTTGGG AAGTCAGTTT AGTTACACTG 2751 AGTTTATCCA AGTTTATCTC TACCAAGAGT ATAATGGCAT GGGATGGCTT 2801 ATTTAGGACA ATTCCCTTTC CTATTGTTTT TGTTGCTGAG CCAATTTGAG 2851 TTAGTTTTGC ATCCTGGGGG GCTTTAAAAT ACAGCATGCA GTGAAAGATC 2901 AGAATTCACT GAATATTTCT TCTGAGAGCA TGGTTTCATG GTTTTTCTCT 2951 ATGAAATGAC TCAATATTCC AAATGTTTTT TTTTCCTTCC TCCTTTCAAA 3001 AGAGTTCTTA ACCCAATTAG GATATCCTGC TTTGGGTATG AGGTTGTTGT 3051 TGCCTGTAAT CACACATGGT TTGACATCAG TTTTAAATCA ATGGAGAGAA 3101 AAAACTGAAA AAGATGCTGC TAAGTAGTTC TCTGTATTAA AGGAGATATT 3151 TTTAAAACAG GGTACAACCC CCTGCTGCAC ACGCTAGCAT ATCTGGAACC 3201 TACTATGAAA ATGAAAGGAC CCTTATAGGT ACTCACAGCC CTTTCATGTA 3251 AGTATGATCT GATATTTAGG TCTTCAGAAG CCTGTAGGTT TCATTTCTAT 3301 GAGGAATCGA GGAGCGTTAC ATCCTGATAT CCTTCCAGGC TGCTTAAGAA 3351 TGGACTGCTT CGACACTGAA AGTGCTAGTT AAATGGATTC ATATGAAGTG 3401 CTTTACTCCC AACCATTGAG TTATTTATAA TGTATTTATT AGGGGAGGGT 3451 ACCTTGAGTC TATTATATAT GCTTCATCAA AACATCTTGT TCATGTTTTA 3501 TGTTTTTAAA AAAGGCATTT GAATGAATGT TTGACTCAGG TTTGTTAAAT 3551 TAACTTCAGT AACTGCAGTA CCAAAAATTA CACTCAACTG ATGAAAAAAA 3601 CGAATTGTAT GATTTAGGAA TCAAAAACTA AAATAAGTGG AATTATGTAT 3651 CTTTTCTAAA GTTAAAAAAG TAAAATATTT TATTATGAGT TATTATAAAA 3701 ATTGGTTAAT TGTATAGGAA GATGACAGTA TTTTTTTCAA GTTATCATAA 3751 AAAGTAATTC AGATGACATT TGAGAAGTAG GGGAAAGGGA ATCATGTTGA 3801 CAGTTTTAGT TCTGTGAACA CTAATTTGTG TGAAGCTATT AAAATGATTG 3851 TAAAGTTGAC TACTGTAAAT TTCCCATAAT TATGTGTGTA TATGTGTCAT 3901 ATGTATGTAC ATGTATATGT CTAAAAATTA CTTTACACAT GTGCCTACAT 3951 AGACACACCA AGAAGTGGAT GTATATAATA TAGAAAGTAT ATAGCAAAGT 4001 AATTTTACTC TGATAATAAA AATTGTTTGA CATGTATTTT GTTATGAATA 4051 GTTTATCTTC CAAAAGATAT TTTGCTCTAT TTTAAAGTGT AGAAGAATAC 4101 ACTGCTAATA AATAATAAAA GTTTTATTCA ATTTAAAAAA AAAAAAAAAA 4151 AA // LOCUS HSAF000993 5418 bp mRNA PRI 31-OCT-1997 DEFINITION Homo sapiens ubiquitous TPR motif, X isoform (UTX) mRNA, alternative transcript 2, complete cds. ACCESSION AF000993 NID g2580571 VERSION AF000993.1 GI:2580571 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5418) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 5418) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1. .5418 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" gene 1. .5418 /gene="UTX" /note="X/Y homologous gene" CDS 27. .4232 /gene="UTX" /note="alternative transcript 2" /codon_start=1 /product="ubiquitous TPR motif, X isoform" /protein_id="AAC51840.1" /db_xref="PID:g2580572" /db_xref="GI:2580572" /translation="MKSCGVSLATAAAAAAAFGDEEKKMAAGKASGESEEASPSLTAE EREALGGLDSRLFGFVRFHEDGARTKALLGKAVRCYESLILKAEGKVESDFFCQLGHF NLLLEDYPKALSAYQRYYSLQSDYWKNAAFLYGLGLVYFHYNAFQWAIKAFQEVLYVD PSFCRAKEIHLRVGLMFKVNTDYESSLKHFQLALVDCNPCTLSNAEIQFHIAHLYETQ RKYHSAKEAYEQLLQTENLSAQVKATVLQQLGWMHHTVDLLGDKATKESYAIQYLQKS LEADPNSGQSWYFLGRCYSSIGKVQDAFISYRQSIDKSEASADTWCSIGVLYQQQNQP MDALQAYICAVQLDHGHAAAWMDLGTLYESCNQPQDAIKCYLNATRSKSCSNTSALAA RIKYLQAQLCNLPQGSLQNKTKLLPSIEEAWSLPIPAELTSRQGAMNTAQQNTSDNWS GGHAVSHPPVQQQAHSWCLTPQKLQHLEQLRANRNNLNPAQKLMLEQLESQFVLMQQH QMRPTGVAQVRSTGIPNGPTADSSLPTNSVSGQQPQLALTRVPSVSQPGVRPACPGQP LANGPFSAGHVPCSTSRTRGSTDTILIGNNHITGNGSNGNVPYLQRNALTLPHNRTNL TSSAKEPWKNQLSNSTQGLHKGQSSHSAGPNGERPLSSTGPSQHLQAAGSGIQNQNGH PTLPSNSVTQGAALNHLSSHTATSGGQQGITLTKESKPSGNILTVPETSRHTGETPNS TASVEGLPNHVHQMTADAVCSPSHGDSKSPGLLSSDNPQLSALLMGKANNNVGTGTCD KVNNIHPAVHTKTDNSVASSPSSAISTATPSPKSTEQTTTNSVTSLNSPHSGLHTING EGMEESQSPMKTDLLLVNHKPSPQIIPSMSVSIYPSSAEVLKACRNLGKNGLSNSSIL LDKCPPPRPPSSPYPPLPKDKLNPPTPSIYLENKRDAFFPPLHQFCTNPNNPVTVIRG LAGALKLDLGLFSTKTLVEANNEHMVEVRTQLLQPADENWDPTGTKKIWHCESNRSHT TIAKYAQYQASSFQESLREENEKRSHHKDHSDSESTSSDNSGRRRKGPFKTIKFGTNI DLSDDKKWKLQLHELTKLPAFVRVVSAGNLLSHVGHTILGMNTVQLYMKVPGSRTPGH QENNNFCSVNINIGPGDCEWFVVPEGYWGVLNDFCEKNNLNFLMGSWWPNLEDLYEAN VPVYRFIQRPGDLVWINAGTVHWVQAIGWCNNIAWNVGPLTACQYKLAVERYEWNKLQ SVKSIVPMVHLSWNMARNIKVSDPKLFEMIKYCLLRTLKQCQTLREALIAAGKEIIWH GRTKEEPAHYCSICEVEVFDLLFVTNESNSRKTYIVHCQDCARKTSGNLENFVVLEQY KMEDLMQVYDQFTLAPPLPSASS" BASE COUNT 1639 a 1146 c 1109 g 1524 t ORIGIN 1 AAAGCAAAAG AATTCGCTGC GTTTCCATGA AATCCTGCGG AGTGTCGCTC 51 GCTACCGCCG CCGCTGCCGC CGCCGCTTTC GGTGATGAGG AAAAGAAAAT 101 GGCGGCGGGA AAAGCGAGCG GCGAGAGCGA GGAGGCGTCC CCCAGCCTGA 151 CAGCCGAGGA GAGGGAGGCG CTCGGCGGAC TGGACAGCCG CCTCTTTGGG 201 TTCGTGAGAT TTCATGAAGA TGGCGCCAGG ACGAAGGCCC TACTGGGCAA 251 GGCTGTTCGC TGCTATGAAT CTCTAATCTT AAAAGCTGAA GGAAAAGTGG 301 AGTCTGATTT CTTTTGTCAA TTAGGTCACT TCAACCTCTT ATTGGAAGAT 351 TATCCAAAAG CATTATCTGC ATACCAGAGG TACTACAGTT TACAGTCTGA 401 CTACTGGAAG AATGCTGCCT TTTTATATGG TCTTGGTTTG GTCTACTTCC 451 ATTATAATGC ATTTCAGTGG GCAATTAAAG CATTTCAGGA GGTGCTTTAT 501 GTTGATCCCA GCTTTTGTCG AGCCAAGGAA ATTCATTTAC GAGTTGGGCT 551 TATGTTCAAA GTGAACACAG ACTATGAGTC TAGTTTAAAG CATTTTCAGT 601 TAGCTTTGGT TGACTGTAAT CCCTGCACTT TGTCCAATGC TGAAATTCAA 651 TTTCACATTG CCCACTTATA TGAAACCCAG AGGAAATATC ATTCTGCAAA 701 AGAAGCTTAT GAACAACTTT TGCAGACAGA GAATCTTTCT GCACAAGTAA 751 AAGCAACTGT CTTACAACAG TTAGGTTGGA TGCATCACAC TGTAGATCTC 801 CTGGGAGATA AAGCCACCAA GGAAAGCTAT GCTATTCAGT ATCTCCAAAA 851 GTCCTTGGAA GCAGATCCTA ATTCTGGCCA GTCCTGGTAT TTCCTCGGAA 901 GGTGCTATTC AAGTATTGGG AAAGTTCAGG ATGCCTTTAT ATCTTACAGG 951 CAGTCTATTG ATAAATCAGA AGCAAGTGCA GATACATGGT GTTCAATAGG 1001 TGTGCTATAT CAGCAGCAAA ATCAGCCCAT GGATGCTTTA CAGGCCTATA 1051 TTTGTGCTGT ACAATTGGAC CATGGCCATG CTGCAGCCTG GATGGACCTA 1101 GGCACTCTCT ATGAATCCTG CAACCAGCCT CAGGATGCCA TTAAATGCTA 1151 CTTAAATGCA ACTAGAAGCA AAAGTTGTAG TAATACCTCT GCACTTGCAG 1201 CACGAATTAA GTATTTACAG GCTCAGTTGT GTAACCTTCC ACAAGGTAGT 1251 CTACAGAATA AAACTAAATT ACTTCCTAGT ATTGAGGAGG CGTGGAGCCT 1301 ACCAATTCCC GCAGAGCTTA CCTCCAGGCA GGGTGCCATG AACACAGCAC 1351 AGCAGAATAC TTCTGACAAT TGGAGTGGTG GACATGCTGT GTCACATCCT 1401 CCAGTACAGC AACAAGCTCA TTCATGGTGT TTGACACCAC AGAAATTACA 1451 GCATTTGGAA CAGCTCCGCG CAAATAGAAA TAATTTAAAT CCAGCACAGA 1501 AACTGATGCT GGAACAGCTG GAAAGTCAGT TTGTCTTAAT GCAACAACAC 1551 CAAATGAGAC CAACAGGAGT TGCACAGGTA CGATCTACTG GAATTCCTAA 1601 TGGGCCAACA GCTGACTCAT CACTGCCTAC AAACTCAGTC TCTGGCCAGC 1651 AGCCACAGCT TGCTCTGACC AGAGTGCCTA GCGTCTCTCA GCCTGGAGTC 1701 CGTCCTGCCT GCCCTGGGCA GCCTTTGGCC AATGGACCCT TTTCTGCAGG 1751 CCATGTTCCC TGTAGCACAT CAAGAACGCG GGGAAGTACA GACACTATTT 1801 TGATAGGCAA TAATCATATA ACAGGAAATG GAAGTAATGG AAACGTGCCT 1851 TACCTGCAGC GAAACGCACT CACTCTACCT CATAACCGCA CAAACCTGAC 1901 CAGCAGCGCA AAGGAGCCGT GGAAAAACCA ACTATCTAAC TCCACTCAGG 1951 GGCTTCACAA AGGTCAGAGT TCACATTCGG CAGGTCCTAA TGGTGAACGA 2001 CCTCTCTCTT CCACTGGGCC TTCCCAGCAT CTCCAGGCAG CTGGCTCTGG 2051 TATTCAGAAT CAGAACGGAC ATCCCACCCT GCCTAGCAAT TCAGTAACAC 2101 AGGGGGCTGC TCTCAATCAC CTCTCCTCTC ACACTGCTAC CTCAGGTGGA 2151 CAACAAGGCA TTACCTTAAC CAAAGAGAGC AAGCCTTCAG GAAACATATT 2201 GACGGTGCCT GAAACAAGCA GGCACACTGG AGAGACACCT AACAGCACTG 2251 CCAGTGTCGA GGGACTTCCT AATCATGTCC ATCAGATGAC GGCAGATGCT 2301 GTTTGCAGTC CTAGCCATGG AGATTCTAAG TCACCAGGTT TACTAAGTTC 2351 AGACAATCCT CAGCTCTCTG CCTTGTTGAT GGGAAAAGCC AATAACAATG 2401 TGGGTACTGG AACCTGTGAC AAAGTCAATA ACATCCACCC AGCTGTTCAT 2451 ACAAAGACTG ATAACTCTGT TGCCTCTTCA CCATCTTCAG CCATTTCAAC 2501 AGCAACACCT TCTCCAAAAT CCACTGAGCA GACAACCACA AACAGTGTTA 2551 CCAGCCTTAA CAGCCCTCAC AGTGGGCTAC ACACAATTAA TGGAGAAGGG 2601 ATGGAAGAAT CTCAGAGCCC CATGAAAACA GATCTGCTTC TGGTTAACCA 2651 CAAACCTAGT CCACAGATCA TACCATCAAT GTCTGTGTCC ATATACCCCA 2701 GCTCAGCAGA AGTTCTGAAG GCATGCAGGA ATCTAGGTAA AAATGGCTTA 2751 TCTAACAGTA GCATTTTGTT GGATAAATGT CCACCTCCAA GACCACCATC 2801 TTCACCATAC CCTCCCTTGC CAAAGGACAA GTTGAATCCA CCTACACCTA 2851 GTATTTACTT GGAAAATAAA CGTGATGCTT TCTTTCCTCC ATTACATCAA 2901 TTTTGTACAA ATCCGAACAA CCCTGTTACA GTAATACGTG GCCTTGCTGG 2951 AGCTCTTAAG TTAGACCTGG GACTTTTCTC TACTAAAACT TTGGTGGAAG 3001 CTAACAATGA ACATATGGTA GAAGTGAGGA CACAGTTGTT GCAGCCAGCA 3051 GATGAAAACT GGGATCCCAC TGGAACAAAG AAAATCTGGC ATTGTGAAAG 3101 TAATAGATCT CATACTACAA TTGCTAAATA TGCACAGTAC CAGGCCTCCT 3151 CATTCCAGGA ATCATTGAGA GAAGAAAATG AAAAAAGAAG TCATCATAAA 3201 GACCACTCAG ATAGTGAATC TACATCGTCA GATAATTCTG GGAGGAGGAG 3251 GAAAGGACCC TTTAAAACCA TAAAGTTTGG GACCAATATT GACCTATCTG 3301 ATGACAAAAA GTGGAAGTTG CAGCTACATG AGCTGACTAA ACTTCCTGCT 3351 TTTGTGCGTG TCGTATCAGC AGGAAATCTT CTAAGCCATG TTGGTCATAC 3401 CATATTGGGC ATGAACACAG TTCAACTATA CATGAAAGTT CCAGGGAGCA 3451 GAACACCAGG TCATCAGGAA AATAACAACT TCTGTTCAGT TAACATAAAT 3501 ATTGGCCCAG GTGACTGTGA ATGGTTTGTT GTTCCTGAAG GTTACTGGGG 3551 TGTTTTGAAT GACTTCTGTG AAAAAAATAA TTTGAATTTC CTAATGGGTT 3601 CTTGGTGGCC CAATCTTGAA GATCTTTATG AAGCAAATGT TCCAGTGTAT 3651 AGGTTTATTC AGCGACCTGG AGATTTGGTC TGGATAAATG CAGGCACTGT 3701 TCATTGGGTT CAGGCTATTG GCTGGTGCAA CAACATTGCT TGGAATGTTG 3751 GTCCACTTAC AGCCTGCCAG TATAAATTGG CAGTGGAACG GTACGAATGG 3801 AACAAATTGC AAAGTGTGAA GTCAATAGTA CCCATGGTTC ATCTTTCCTG 3851 GAATATGGCA CGAAATATCA AGGTCTCAGA TCCAAAGCTT TTTGAAATGA 3901 TTAAGTATTG TCTTCTAAGA ACTCTGAAGC AATGTCAGAC ATTGAGGGAA 3951 GCTCTCATTG CTGCAGGAAA AGAGATTATA TGGCATGGGC GGACAAAAGA 4001 AGAACCAGCT CATTACTGTA GCATTTGTGA AGTGGAGGTT TTTGATCTGC 4051 TTTTTGTCAC TAATGAGAGT AATTCACGAA AGACCTACAT AGTACATTGC 4101 CAAGATTGTG CACGAAAAAC AAGCGGAAAC TTGGAAAACT TTGTGGTGCT 4151 AGAACAGTAC AAAATGGAGG ACCTGATGCA AGTCTATGAC CAATTTACAT 4201 TAGCTCCTCC ATTACCATCC GCCTCATCTT GATATTGTTC CATGGACATT 4251 AAATGAGACC TTTTCTGCTA TTCAGGAAAT AACCCAGTTC TGCACCACTG 4301 GTTTTTGTAG CTATCTCGTA AGGCTGCTGG CTGAAAACTG TGTCTATGCA 4351 ACCTTCCAAG TGCGGAGTGT CAACCAACTG GACGGGAGAG AGTACTGCTC 4401 CTACTCCAGG ACTCTCACAA AGCTGATGAG CTGTACTTCA GAAAAAAATA 4451 ATAATTTCCA TGTTTTGTAT ATATCTGACA AAACTGGCAA CATCTTACAG 4501 ACTACTGACT TGAAGACAAC CTCTTTTATA TTTCTCTATT TCTGGGCTGA 4551 TGAATTTGTT TTCATCTGTC TTTTCCCCCT TCAGAATTTT CCTTGGAAAA 4601 AAAATACTAG CCTAGCTGGT CATTTCTTTG TAAGGTAGTT AGCAATTTTA 4651 AGTCTTTCTT TGGTCAACTT TTTTTTAATG TGAAAAGTTA GGTAAGACAC 4701 TTTTTTACTG CTTTTATGTT TTTCTGTCTT GTTTTGAGAC CATGATGGTT 4751 ACACTTTTGG TTCCTAAATA AAATTTAAAA AATTAACAGC CAAGTCACAA 4801 AGGTAATGGA TTGCACATAG ACTAAGGAAT AAACTTCAGA TTTGTGATTT 4851 TTGTTTCTAA TCTTGATGTA AATTTACACT ATTATAATAC ATATTTATTG 4901 CTTGAAAATA TTTGTGAATG GAATGCTGTT ATTTTTTCCA GATTTACCTG 4951 CCATTGAAAT TTTAAGGAGT TCTGTAATTT CAAACACTAC TCCTATTACA 5001 TTTTCTATGT GTAAATAAAA CTGCTTAGCA TTGTACAGAA ACTTTTATTA 5051 AAATTGTTTA ATGTTTAAAG AGTTTCTATT GTTTGAGTTT AAAAAAGACT 5101 TTATGTACAG TGCCCAGTTT TGTTCATTTT GAAATCTGAT AAATATATTT 5151 ATATATACTT ATGTATGTAT ATATAATATA TATAGAAATC TGGATATATA 5201 TGTATAAATC TTTAGAACTT AAATTTTTCT CGTTTAGTTC ACATCTATGG 5251 TAGATTTTTG AGGTGTCTAC TGTAAAGTAT TGCTTACAAA AAGTATGATT 5301 ATTTTTAAAG AAATATATAT GGTATGTATC CTCAAGACCT AAAATGTCAG 5351 ACTGGTTTAT TGTTAAGTTG CAATTACTGC AATGACAGAC CAATAAACAA 5401 TTGCTGCCAA AAAAAAAA // LOCUS AF001042 4998 bp mRNA PRI 25-MAY-1997 DEFINITION Homo sapiens RNA editase (RED1) mRNA, complete cds. ACCESSION AF001042 NID g2114492 VERSION AF001042.1 GI:2114492 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4998) AUTHORS Villard,L., Tassone,F., Haymowicz,M., Welborn,R. and Gardiner,K. TITLE Map location, genomic organization and expression patterns of the human RED1 RNA editase JOURNAL Somat. Cell Mol. Genet. (1997) In press REFERENCE 2 (bases 1 to 4998) AUTHORS Villard,L., Tassone,F. and Gardiner,K. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Eleanor Roosevelt, 1899 Gaylord Street, Denver, CO 80206, USA FEATURES Location/Qualifiers source 1. .4998 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.3; within 300 kb of the CD18 gene" gene 1. .4998 /gene="RED1" CDS 16. .2241 /gene="RED1" /codon_start=1 /product="RNA editase" /protein_id="AAB58300.1" /db_xref="PID:g2114493" /db_xref="GI:2114493" /translation="MDIEDEENMSSSSTDVKENRNLDNVSPKDASTPGPGEGSQLSNG GGGGPGRKRPLEEGSNGHSKYRLKKRRKTPGPVLPKNALMQLNEIKPGLQYTLLSQTG PVHAPLFVMSVEVNGQVFEGSGPTKKKAKLHAAEKALRSFVQFPNASEAHLAMGRTLS VNTDFTSDQADFPDTLFNGFETPDKAEPPFYVGSNGDDSFSSSGDLSLSASPVPASLA QPPLPVLPPFPPPSGKNPVMILNELRPGLKYDFLSESGESHAKSFVMSVVVDGQFFEG SGRNKKLAKARAAQSALAAIFNLHLDQTPSRQPIPSEGLQLHLPQVLADAVSRLVLGK FGDLTDNFSSPHARRKVLAGVVMTTGTDVKDAKVISVSTGTKCINGEYMSDRGLALND CHAEIISRRSLLRFLYTQLELYLNNKDDQKESIFQKSERGGFRLKENVQFHLYISTSP CGDARIFSPHEPILEGSRSYTQAGLQWCNHGSLQPRPPGLLSDPSTSTFQGAGTTEPA DRHPNRKARGQLRTKIESGEGTIPVRSNASIQTWDGVLQGERLLTMSCSDKIARWNVV GIQGSLLSIFVEPIYFSSIILGSLYHGDHLSRAMYQRISNIEDLPPLYTLNKPLLSGI SNAEARQPGKAPNFSVNWTVGDSAIEVINATTGKDELGRASRLCKHALYCRWMRVHGK VPSHLLRSKITKPNVYHESKLAAKEYQAAKARLFTAFIKAGLGAWVEKPTEQDQFSLT P" exon 44. .978 /gene="RED1" /number=2 exon 979. .1093 /gene="RED1" /number=3 exon 1094. .1262 /gene="RED1" /number=4 exon 1263. .1411 /gene="RED1" /number=5 exon 1412. .1531 /gene="RED1" /number=6 exon 1532. .1700 /gene="RED1" /number=7 exon 1701. .1882 /gene="RED1" /number=8 exon 1883. .2061 /gene="RED1" /number=9 exon 2062. .>2241 /gene="RED1" BASE COUNT 1140 a 1292 c 1457 g 1109 t ORIGIN 1 CAAAAGTATT TTGCCATGGA TATAGAAGAT GAAGAAAACA TGAGTTCCAG 51 CAGCACTGAT GTGAAGGAAA ACCGCAATCT GGACAACGTG TCCCCCAAGG 101 ATGCGAGCAC ACCTGGGCCT GGCGAGGGCT CTCAGCTCTC CAATGGGGGT 151 GGTGGTGGCC CCGGCAGAAA GCGGCCCCTG GAGGAGGGCA GCAATGGCCA 201 CTCCAAGTAC CGCCTGAAGA AAAGGAGGAA AACACCAGGG CCCGTCCTCC 251 CCAAGAACGC CCTAATGCAG CTGAATGAGA TCAAGCCTGG TTTGCAGTAC 301 ACACTCCTGT CCCAGACTGG GCCCGTGCAC GCGCCTTTGT TTGTCATGTC 351 TGTGGAGGTA AATGGCCAGG TTTTTGAGGG CTCCGGTCCC ACAAAGAAAA 401 AGGCAAAACT CCATGCTGCT GAGAAGGCCT TGAGGTCTTT CGTTCAGTTT 451 CCTAATGCCT CTGAGGCCCA CCTGGCCATG GGGAGGACCC TGTCTGTCAA 501 CACGGACTTC ACATCTGACC AGGCGGACTT CCCTGACACG CTCTTCAATG 551 GTTTTGAAAC TCCTGACAAG GCGGAGCCTC CCTTTTACGT GGGCTCCAAT 601 GGGGATGACT CCTTCAGTTC CAGCGGGGAC CTCAGCTTGT CTGCTTCCCC 651 GGTGCCTGCC AGCCTAGCCC AGCCTCCTCT CCCTGTCTTA CCACCATTCC 701 CACCCCCGAG TGGGAAGAAT CCCGTGATGA TCTTGAACGA ACTGCGCCCA 751 GGACTCAAGT ATGACTTCCT CTCCGAGAGC GGGGAGAGCC ATGCCAAGAG 801 CTTCGTCATG TCTGTGGTCG TGGATGGTCA GTTCTTTGAA GGCTCGGGGA 851 GAAACAAGAA GCTTGCCAAG GCCCGGGCTG CACAGTCTGC CCTGGCCGCC 901 ATTTTTAACT TGCACTTGGA TCAGACGCCA TCTCGCCAGC CTATTCCCAG 951 TGAGGGTCTT CAGCTGCATT TACCGCAGGT TTTAGCTGAC GCTGTCTCAC 1001 GCCTGGTCCT GGGTAAGTTT GGTGACCTGA CCGACAACTT CTCCTCCCCT 1051 CACGCTCGCA GAAAAGTGCT GGCTGGAGTC GTCATGACAA CAGGCACAGA 1101 TGTTAAAGAT GCCAAGGTGA TAAGTGTTTC TACAGGAACA AAATGTATTA 1151 ATGGTGAATA CATGAGTGAT CGTGGCCTTG CATTAAATGA CTGCCATGCA 1201 GAAATAATAT CTCGGAGATC CTTGCTCAGA TTTCTTTATA CACAACTTGA 1251 GCTTTACTTA AATAACAAAG ATGATCAAAA AGAATCCATC TTTCAGAAAT 1301 CAGAGCGAGG GGGGTTTAGG CTGAAGGAGA ATGTCCAGTT TCATCTGTAC 1351 ATCAGCACCT CTCCCTGTGG AGATGCCAGA ATCTTCTCAC CACATGAGCC 1401 AATCCTGGAA GGGTCTCGCT CTTACACCCA GGCTGGATTG CAGTGGTGCA 1451 ATCATGGCTC ACTGCAGCCT CGACCTCCTG GGCTCTTAAG CGATCCTTCC 1501 ACCTCAACCT TCCAAGGAGC TGGGACTACA GAACCAGCAG ATAGACACCC 1551 AAATCGTAAA GCAAGAGGAC AGCTACGGAC CAAAATAGAG TCTGGTGAGG 1601 GGACAATTCC AGTGCGCTCC AATGCGAGCA TCCAAACGTG GGACGGGGTG 1651 CTGCAAGGGG AGCGGCTGCT CACCATGTCC TGCAGTGACA AGATTGCACG 1701 CTGGAACGTG GTGGGCATCC AGGGATCCCT GCTCAGCATT TTCGTGGAGC 1751 CCATTTACTT CTCGAGCATC ATCCTGGGCA GCCTTTACCA CGGGGACCAC 1801 CTTTCCAGGG CCATGTACCA GCGGATCTCC AACATAGAGG ACCTGCCACC 1851 TCTCTACACC CTCAACAAGC CTTTGCTCAG TGGCATCAGC AATGCAGAAG 1901 CACGGCAGCC AGGGAAGGCC CCCAACTTCA GTGTCAACTG GACGGTAGGC 1951 GACTCCGCTA TTGAGGTCAT CAACGCCACA ACTGGGAAGG ATGAGCTGGG 2001 CCGCGCGTCC CGCCTGTGTA AGCACGCGTT GTACTGTCGC TGGATGCGTG 2051 TGCACGGCAA GGTTCCCTCC CACTTACTAC GCTCCAAGAT TACCAAGCCC 2101 AACGTGTACC ATGAGTCCAA GCTGGCGGCA AAGGAGTACC AGGCCGCCAA 2151 GGCGCGTCTG TTCACAGCCT TCATCAAGGC GGGGCTGGGG GCCTGGGTGG 2201 AGAAGCCCAC CGAGCAGGAC CAGTTCTCAC TCACGCCCTG ACCCGGGCAG 2251 ACATGATGGG GGGTGCAGGG GGCTGTGGGC ATCCAGCGTC ATCCTCCAGA 2301 ACCTCACATC TGAACTGGGG GCAGGTGCAT ACCTTGGGGA GGGAGTAGGG 2351 GGACACGGGG GACCACCAGG TGTCCACGGT TGTCCCCAGC ATCTCACATC 2401 AGACCTGGGG CAGGTGCGCA GTGTGGGGAG GGGATGGGGT GCGTCAGGGC 2451 CCAGCATCGC CGCCTGGCAT CTCCTCGCCG CAGCATTTCC CCTTCTGAAC 2501 CGTCCAGTGA CTGCTTTCAA TCTCGGTTTA CGTTTAGAAA TTGAGTTCTA 2551 CTGAGTAGGG CTTCCTTAAG TTTAGGAAAA TAGAAATTAC TTTGTGTGAA 2601 ATTCTTGAAT AAATAATTTA TTCAGAGCTA GGAATGTGGT TTATAAAATA 2651 GGAAGTAATT GTGTCAGGTC ACTTTTATGC CACATTATTT TAATTGCAAA 2701 AAAGCATCTA TATATGGAGG AGGGTGGGAA AATAGAGGTA GGAAATAGTA 2751 GCCTAAAGGA AATCGCCACA CGTCTGTCTA AACTTAGGTC TCTTTTCTCC 2801 GTAGGTACCT CCCTGGGTAG TTCCACACAC TAGGTTGTAA CAGTCTCTCC 2851 CTGAGGAGCA GACTCCCAGC ATGGTGTAGC GTGGCCCTGT CATGCACATG 2901 GGGTCCCGCA GCAGTGACTG TGTGTCCTGC AGAGGCGTGA CCCAGGCCCC 2951 TGTAGCCCTC AGCCTCCTCT AGAAGCTTCT GTACTCCTTG TAGGATCAGA 3001 TCATGGAAAA CTTTTCTCAG TTTACTTCTA AGTAATCACA GATAATACAT 3051 GGCCAGTAAT CCCAGGCTGG CCATTCATTC AGGTTTTTTA AAGGATATTT 3101 AACTTTTATG GACTAGAAGG AATCACGAGG GCTACTGCAC AATACATGGC 3151 CTAAGTTCCC TCTGTTCCTT CCTCTGAATC GAATGGATGT GGGTGACCGC 3201 CCGAAGGCCT TCACAGGATG GAAGTAGAAT GATTTCAGTA GATACTCATT 3251 CTTGGAAAAT GCCATAGTTT TAAATTATTG TTTCCAGCTT TATCAAAGAC 3301 ATGTTTGAAA AATAAAAAGC ATCCAAGTGA GAGCTGGTGA GACCACGTGC 3351 TGCTGGCGTA GTGTAGGCCA GACATTGACA GTCCTGACGG GAGCTCAGGG 3401 CTGCCCAGCG CCCAGCGTGC ACGGGACGGC CCCACGACAG AGGGAGTCAG 3451 CCCGGGAGGT CAGGAGCGCG GCGGGCGAGG GCCCTGTGTG GACCACCTCC 3501 ACCAAGCTCA GAGATTTGCA CCAGGTGCCT TGTTGCCTCC GCTCAGGATG 3551 AAAGAGGAGC TGAGAGAAGT GCTCTGCCTG CCAGTGCAGT GCCCAGCTCC 3601 AAGGCTCTAG AGGGTGTTCA GGTGGGTCTC CTGGGGCCAT GGGGAGAGAT 3651 TGGTGCAGAC CTTACCCCAC AGCATACACC TGCCACAGCG AAATCCAGGG 3701 TGTTGGCACC TGTGTGTCCG TGATGAGCCT AGGAAACCAG AGCAGGGGCA 3751 GAGGGGCGTC ATCCTCCCAC CGGACGCTGG GAGCTCAGAC CCCAAAACTG 3801 AAACACCGTG GCTTCGGCGG GGGGTGTGCC TCCTGATGTC AGGAGCCCCA 3851 TCCACGTGTG TCCACACAGA TCTCGTCGCA GCACGGCAGG AAGGGGTGCT 3901 GCTTAGGGCT CATTGTTGGG GACATGACCG GGTTCAGCGG CTAAAACATC 3951 TGCCCCACAG CAGCCTCCTC CTCCACCGAA AAGGGTAGTT GTCTCCCTGA 4001 AGCAGTCACA GCAGGCGTCT CTGCCGCTCC GTCACCACAG TGGGGTTTTG 4051 TTCAGGCAAA TCGCGCTGGG GTTCTGCACC TGCAAAAGGA GAGGGGTCTG 4101 TTGTCGCTGG CTTTCCCCCA AGCAGGCTCT TGCACACTCT AGAAAAAACA 4151 CCTTGTAAGT CTGTGCATTT TTATTGTCTT GATAAATTGT ATTTTTTTCT 4201 AATGGGGATT GGGAGATGGA CTTCGTTTTT AAAAATATGT GGATTTTGGT 4251 TACCAAGTTT AGTGTTAATA TATTCCATAT ACATACAAAA CTACCCGGTA 4301 TGTCTGGCTT TTCCCTTCTG TCAGGTAATA GCTAAAGTCA GCATGATTGC 4351 TCCCTGTACC ACCCCAAATA AGTGAGTGCC TCACCTTGTG GGGCCTGAGC 4401 AGCTACCTTG AGACCATGTG AGGTGGCACC TTTCCGGGGT GGACTCGTGC 4451 GGCCTTGAGG ACAGGCACAG GGCACCCTAT CCCAAGCCGT CCAGGCAGGA 4501 GGAAGGCAGC CAAGGCAACT GGGTTCTGGG AGCCCTGGGT GGGGCAGCTG 4551 TGGGGAGGAA CTGGGTTCGG GGCAGCCCTG GGCAGGGCGG CTGTGGGGCA 4601 AGAACTGGGT TCGGGGAGCC CTGTCCGGCG GGGGGCTGTG GGGCGGGGAG 4651 CTGGGTTCGG GGAGCCCTGG GCGGGGTGGC TGTTGGGGGG AACTGGGTTC 4701 GGGGAGCCCT GGGCGGGGTG GCTTTTGGGG GGAACTGGGT TCGGGGAGCC 4751 CTGTGCGGGG TGGCTGTTGA GGAGGACTTG GGTTCAGGGA GCCCTGGGCG 4801 GGGTGGCTGT CAGGGGGAAC TGGTTTCCGG GAGCCCTGGG CCGGGGCAGG 4851 GGGCGGCTGT AGGAAGGAAC TGGTTTCGGG GAGCCCTGGG CGGGGCGGCT 4901 GTGGGGAGGA AGGTGACGTG CAGGGGACCA GAGGCTCTGC ACTGCTCCTA 4951 GGACAGCTCA TCTGTAATCA GAAAAAAAAT AAACAAAATA CAGAACGC // LOCUS AF001846 3058 bp mRNA PRI 05-JAN-1999 DEFINITION Homo sapiens lymphoid phosphatase LyP1 mRNA, complete cds. ACCESSION AF001846 NID g4100631 VERSION AF001846.1 GI:4100631 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3058) AUTHORS Roifman,C.M. TITLE Human cDNA of LyP1 Protein Tyrosine Phosphatase JOURNAL Unpublished REFERENCE 2 (bases 1 to 3058) AUTHORS Roifman,C.M. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) Immunology and Allergy, The Hospital For Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada FEATURES Location/Qualifiers source 1. .3058 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="thymocytes" CDS 42. .2468 /codon_start=1 /product="lymphoid phosphatase LyP1" /protein_id="AAD00904.1" /db_xref="PID:g4100632" /db_xref="GI:4100632" /translation="MDQREILQKFLDEAQSKKITKEEFANEFLKLKRQSTKYKADKTY PTTVAENAKNIKKNRYKDILPYDYSRVELSLITSDEDSSYINANFIKGVYGPKAYIAT QGPLSTTLLDFWRMIWEYSVLIIVMACMEYEMGKKKCERYWAEPGEMQLEFGPFSVSC EAEKRKSDYIIRTLKVKFNSETRTIYQFHYKNWPDHDVPSSIDPILELIWDVRCYQED DSVPICIHCSAGCGRTGVICAIVDYTWMLLKDGIIPENFSVFSLIREMRTQRPSLVQT QEQYELVYNAVLELFKRQMDVIRDKHSGTESQAKHCIPEKNHTLQADSYSPNLPKSTT KAAKMMNQQRTKMEIKESSSFDFRTSEISAKEELVLHPAKSSTSFDFLELNYSFDKNA DTTMKWQTKAFPIVGEPLQKHQSLDLGSLLFEGCSNSKPVNAAGRYFNSKVPITRTKS TPFELIQQRETKEVDSKENFSYLESQPHDSCFVEMQAQKVMHVSSAELNYSLPYDSKH QIRNASNVKHHDSSALGVYSYIPLVENPYFSSWPPSGTSSKMSLDLPEKQDGTVFPSS LLPTSSTSLFSYYNSHDSLSLNSPTNISSLLNQESAVLATAPRIDDEIPPPLPVRTPE SFIVVEEAGEFSPNVPKSLSSAVKVKIGTSLEWGGTSEPKKFDDSVILRPSKSVKLRS PKSELHQDRSSPPPPLPERTLESFFLADEDCMQAQSIETYSTSYPDTMENSTSSKQTL KTPGKSFTRSKSLKILRNMKKSICNSCPPNKPAESVQSNNSSSFLNFGFANRFSKPKG PRNPPPTWNI" BASE COUNT 1015 a 592 c 553 g 898 t ORIGIN 1 TCCCTCAACC TACTTATAGA CTATTTTTCT TGCTCTGCAG CATGGACCAA 51 AGAGAAATTC TGCAGAAGTT CCTGGATGAG GCCCAAAGCA AGAAAATTAC 101 TAAAGAGGAG TTTGCCAATG AATTTCTGAA GCTGAAAAGG CAATCTACCA 151 AGTACAAGGC AGACAAAACC TATCCTACAA CTGTGGCTGA GAATGCCAAG 201 AATATCAAGA AAAACAGATA TAAGGATATT TTGCCCTATG ATTATAGCCG 251 GGTAGAACTA TCCCTGATAA CCTCTGATGA GGATTCCAGC TACATCAATG 301 CCAACTTCAT TAAGGGAGTT TATGGACCCA AGGCTTATAT TGCCACCCAG 351 GGTCCTTTAT CTACAACCCT CCTGGACTTC TGGAGGATGA TTTGGGAATA 401 TAGTGTCCTT ATCATTGTTA TGGCATGCAT GGAGTATGAA ATGGGAAAGA 451 AAAAGTGTGA GCGCTACTGG GCTGAGCCAG GAGAGATGCA GCTGGAATTT 501 GGCCCTTTCT CTGTATCCTG TGAAGCTGAA AAAAGGAAAT CTGATTATAT 551 AATCAGGACT CTAAAAGTTA AGTTCAATAG TGAAACTCGA ACTATCTACC 601 AGTTTCATTA CAAGAATTGG CCAGACCATG ATGTACCTTC ATCTATAGAC 651 CCTATTCTTG AGCTCATCTG GGATGTACGT TGTTACCAAG AGGATGACAG 701 TGTTCCCATA TGCATTCACT GCAGTGCTGG CTGTGGAAGG ACTGGTGTTA 751 TTTGTGCTAT TGTTGATTAT ACATGGATGT TGCTAAAAGA TGGGATAATT 801 CCTGAGAACT TCAGTGTTTT CAGTTTGATC CGGGAAATGC GGACACAGAG 851 GCCTTCATTA GTTCAAACGC AGGAACAATA TGAACTGGTC TACAATGCTG 901 TATTAGAACT ATTTAAGAGA CAGATGGATG TTATCAGAGA TAAACATTCT 951 GGAACAGAGA GTCAAGCAAA GCATTGTATT CCTGAGAAAA ATCACACTCT 1001 CCAAGCAGAC TCTTATTCTC CTAATTTACC AAAAAGTACC ACAAAAGCAG 1051 CAAAAATGAT GAACCAACAA AGGACAAAAA TGGAAATCAA AGAATCTTCT 1101 TCCTTTGACT TTAGGACTTC TGAAATAAGT GCAAAAGAAG AGCTAGTTTT 1151 GCACCCTGCT AAATCAAGCA CTTCTTTTGA CTTTCTGGAG CTAAATTACA 1201 GTTTTGACAA AAATGCTGAC ACAACCATGA AATGGCAGAC AAAGGCATTT 1251 CCAATAGTTG GGGAGCCTCT TCAGAAGCAT CAAAGTTTGG ATTTGGGCTC 1301 TCTTTTGTTT GAGGGATGTT CTAATTCTAA ACCTGTAAAT GCAGCAGGAA 1351 GATATTTTAA TTCAAAGGTG CCAATAACAC GGACCAAATC AACTCCTTTT 1401 GAATTGATAC AGCAGAGAGA AACCAAGGAG GTGGACAGCA AGGAAAACTT 1451 TTCTTATTTG GAATCTCAAC CACATGATTC TTGTTTTGTA GAGATGCAGG 1501 CTCAAAAAGT AATGCATGTT TCTTCAGCAG AACTGAATTA TTCACTGCCA 1551 TATGACTCTA AACACCAAAT ACGTAATGCC TCTAATGTAA AGCACCATGA 1601 CTCTAGTGCT CTTGGTGTAT ATTCTTACAT ACCTTTAGTG GAAAATCCTT 1651 ATTTTTCATC ATGGCCTCCA AGTGGTACCA GTTCTAAGAT GTCTCTTGAT 1701 TTACCTGAGA AGCAAGATGG AACTGTTTTT CCTTCTTCTC TGTTGCCAAC 1751 ATCCTCTACA TCCCTCTTCT CTTATTACAA TTCACATGAT TCTTTATCAC 1801 TGAATTCTCC AACCAATATT TCCTCACTAT TGAACCAGGA GTCAGCTGTA 1851 CTAGCAACTG CTCCAAGGAT AGATGATGAA ATCCCCCCTC CACTTCCTGT 1901 ACGGACACCT GAATCATTTA TTGTGGTTGA GGAAGCTGGA GAATTCTCAC 1951 CAAATGTTCC CAAATCCTTA TCCTCAGCTG TGAAGGTAAA AATTGGAACA 2001 TCACTGGAAT GGGGTGGAAC ATCTGAACCA AAGAAATTTG ATGACTCTGT 2051 GATACTTAGA CCAAGCAAGA GTGTAAAACT CCGAAGTCCT AAATCAGAAC 2101 TACATCAAGA TCGTTCTTCT CCCCCACCTC CTCTCCCAGA AAGAACTCTA 2151 GAGTCCTTCT TTCTTGCCGA TGAAGATTGT ATGCAGGCCC AATCTATAGA 2201 AACATATTCT ACTAGCTATC CTGACACCAT GGAAAATTCA ACATCTTCAA 2251 AACAGACACT GAAGACTCCT GGAAAAAGTT TCACAAGGAG TAAGAGTTTG 2301 AAAATTTTGC GAAACATGAA AAAGAGTATC TGTAATTCTT GCCCACCAAA 2351 CAAGCCTGCA GAATCTGTTC AGTCAAATAA CTCCAGCTCA TTTCTGAATT 2401 TTGGTTTTGC AAACCGTTTT TCAAAACCCA AAGGACCAAG GAATCCACCA 2451 CCAACTTGGA ATATTTAATA AAACTCAGAT TTATAATAAT ATGGGCTGCA 2501 AGTACACCTG CAAATAAAAC TACTAGAATA CTGCTAGTTA AAATAAGTGC 2551 TCTATATGCA TAATATGAAG ATATGCTAAT GTGTTAATAG CTTTTAAAAG 2601 AAAAGCAAAA TGCCAATAAG TGCCAGTTTT GCATTTTCAT ATCATTTGCA 2651 TTGAGTTGAA AACTGCAAAT AAAAGTTTGT CACTTGAGCT TATGTACAGA 2701 ATGCTATATG AGAAACACTT TTAGAATGGA TTTATTTTTC ATTTTTGCCA 2751 GTTATTTTTA TTTTCTTTTA CTTTTCTACA TAAACATAAA CTTCAAAAGG 2801 TTTGTAAGAT TTGGATCTCA ACTAATTTCT ACATTGCCAG AATATACTAT 2851 AAAAAGTTAA AAAAAAAAAC TTACTTTGTG GGTTGCAATA CAAACTGCTC 2901 TTGACAATGA CTATTCCCTG ACAGTTATTT TTGCCTAAAT GGAGTATACC 2951 TTGTAAATCT TCCCAAATGT TGTGGAAAAC TGGAATATTA AGAAAATGAG 3001 AAATTATATT TATTAGAATA AAATGTGCAA ATAATGACAA TTATTTGAAT 3051 GTAACAAG // LOCUS AF002697 1535 bp mRNA PRI 20-MAY-1998 DEFINITION Homo sapiens E1B 19K/Bcl-2-binding protein Nip3 mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF002697 NID g2511528 VERSION AF002697.1 GI:2511528 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS Chen,G., Ray,R., Dubik,D., Shi,L., Cizeau,J., Bleackley,R.C., Saxena,S., Gietz,R.D. and Greenberg,A.H. TITLE The E1B 19K/Bcl-2-binding protein Nip3 is a dimeric mitochondrial protein that activates apoptosis JOURNAL J. Exp. Med. 186 (12), 1975-1983 (1997) MEDLINE 98060856 REFERENCE 2 (bases 1 to 1535) AUTHORS Chen,G., Shi,L., Ray,R., Dubik,D., Bleackley,C., Gietz,R.D. and Greenberg,A.H. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Cell Biology, University of Manitoba, 100 Olivia St., Winnipeg, MB R3E 0V9, Canada FEATURES Location/Qualifiers source 1. .1535 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" CDS 127. .711 /note="pro-apoptotic mitochondrial protein" /codon_start=1 /product="E1B 19K/Bcl-2-binding protein Nip3" /protein_id="AAC16738.1" /db_xref="PID:g2511529" /db_xref="GI:2511529" /translation="MSQNGAPGMQEESLQGSWVELHFSNNGNGGSVPASVSIYNGDME KILLDAQHESGRSSSKSSHCDSPPRSQTPQDTNRASETDTHSIGEKNSSQSEEDDIER RKEVESILKKNSDWIWDWSSRPENIPPKEFLFKHPKRTATLSMRNTSVMKKGGIFSAE FLKVFLPSLLLSHLLAIGLGIYIGRRLTTSTSTF" misc_feature 616. .678 /note="encodes transmembrane domain" BASE COUNT 454 a 324 c 323 g 434 t ORIGIN 1 CCTCCGCTCA GTCCGGGAGC GCACGTGGGC CGCGGCGCTC CGACCTCCGC 51 TTTCCCACCG CCCGCAGCTG AAGCACATCC CGCAGCCCGG CGCGGACTCC 101 GATCGCCGCA GTTGCCCTCT GGCGCCATGT CGCAGAACGG AGCGCCCGGG 151 ATGCAGGAGG AGAGCCTGCA GGGCTCCTGG GTAGAACTGC ACTTCAGCAA 201 TAATGGGAAC GGGGGCAGCG TTCCAGCCTC GGTTTCTATT TATAATGGAG 251 ACATGGAAAA AATACTGCTG GACGCACAGC ATGAGTCTGG ACGGAGTAGC 301 TCCAAGAGCT CTCACTGTGA CAGCCCACCT CGCTCGCAGA CACCACAAGA 351 TACCAACAGG GCTTCTGAAA CAGATACCCA TAGCATTGGA GAGAAAAACA 401 GCTCACAGTC TGAGGAAGAT GATATTGAAA GAAGGAAAGA AGTTGAAAGC 451 ATCTTGAAGA AAAACTCAGA TTGGATATGG GATTGGTCAA GTCGGCCGGA 501 AAATATTCCC CCCAAGGAGT TCCTCTTTAA ACACCCGAAG CGCACGGCCA 551 CCCTCAGCAT GAGGAACACG AGCGTCATGA AGAAAGGGGG CATATTCTCT 601 GCAGAATTTC TGAAAGTTTT CCTTCCATCT CTGCTGCTCT CTCATTTGCT 651 GGCCATCGGA TTGGGGATCT ATATTGGAAG GCGTCTGACA ACCTCCACCA 701 GCACCTTTTG ATGAAGAACT GGAGTCTGAC TTGGTTCGTT AGTGGATTAC 751 TTCTGAGCTT GCAACATAGC TCACTGAAGA GCTGTTAGAT CCTGGGGTGG 801 CCACGTCACT TGTGTTTATT TGTTCTGTAA ATGCTGCGTT CCTAATTTAG 851 TAAAATAAAA GAATAGACAC TAAAATCATG TTGATCTATA ATTACACCTA 901 TGGGATCAAT AAGCATGTCA GACTGATTAA TGTCTACTGT GAAAATTTGG 951 TAGTAAATTT TCATTTGATA TTAGATATAA ATATCTGAAT ATAAATAATT 1001 TTAATATACT AGTCATGATG TGTGTTGTAT TTTAAAAATT ATCTGCAACC 1051 TTAATTCAGC TGAAGTACTT TATATTTCAA AAGAATGAAT AACATTGATA 1101 ATAAAATCGC TACTTTAAGG GGTTTGTCCA AAATAAATAT TGTGGCCTTA 1151 TATATCACAC TATTGTAGAA AGTATTATTT AATTTAAATG GATGCAGGTT 1201 GTCTACTAAA GAAAGATTAT ATATAACTAT GCTAATTGTT CATAATCAAC 1251 AGAAACCAAG ATAGAGCTAC AAACTCAGCT GTACAGTTCG TACACTAAAC 1301 TCTTCTTGCT TTTGCATTAT AAGGAATTAA GTCTCCGATT ATTAGGTGAT 1351 CACCCTGGAT GATCAGTTTT CTGCTGAAGG CACCTACTCA GTATCTTTTC 1401 CTCTTTATCA CTCTGCATTG GTGAATTTAA TCCTCTCCTT TGTGTTCAAC 1451 TTTTGTGTGC TTTTAAAATC AGCTTTATTC TAAGCAAATC TGTGTCTACT 1501 TTAAAAAACT GGAAATGGAA AAAAAAATAA ATCTT // LOCUS AF003837 5942 bp mRNA PRI 06-SEP-1997 DEFINITION Homo sapiens Jagged1 (JAG1) mRNA, complete cds. ACCESSION AF003837 NID g2228792 VERSION AF003837.1 GI:2228792 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5942) AUTHORS Oda,T., Elkahloun,A.G., Pike,B.L., Okajima,K., Krantz,I.D., Genin,A., Piccoli,D.A., Meltzer,P.S., Spinner,N.B., Collins,F.S. and Chandrasekharappa,S.C. TITLE Mutations in the human Jagged1 gene (JAG1) are responsible for the Alagille syndrome JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 5942) AUTHORS Oda,T., Elkahloun,A.G., Meltzer,P.S. and Chandrasekharappa,S.C. TITLE Identification and cloning of the human homolog (JAG1) of the rat Jagged1 gene from the Alagille syndrome critical region at 20p12 JOURNAL Genomics 43 (3), 376-379 (1997) MEDLINE 97422615 REFERENCE 3 (bases 1 to 5942) AUTHORS Oda,T. and Chandrasekharappa,S.C. TITLE Direct Submission JOURNAL Submitted (12-MAY-1997) LGT, NHGRI, NIH, 49 Convent Dr., MSC4442, Bldg. 49, Room 3C36, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1. .5942 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20p12" gene 1. .5942 /gene="JAG1" exon 1. .540 /gene="JAG1" /number=1 CDS 460. .4116 /gene="JAG1" /note="similar to R. norvegicus Jagged1 protein" /codon_start=1 /product="Jagged1" /protein_id="AAC51731.1" /db_xref="PID:g2228793" /db_xref="GI:2228793" /translation="MRSPRTRGRSGRPLSLLLALLCALRAKVCGASGQFELEILSMQN VNGELQNGNCCGGARNPGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSTPVI GGNTFNLKASRGNDRNRIVLPFSFAWPRSYTLLVEAWDSSNDTVQPDSIIEKASHSGM INPSRQWQTLKQNTGVAHFEYQIRVTCDDYYYGFGCNKFCRPRDDFFGHYACDQNGNK TCMEGWMGRECNRAICRQGCSPKHGSCKLPGDCRCQYGWQGLYCDKCIPHPGCVHGIC NEPWQCLCETNWGGQLCDKDLNYCGTHQPCLNGGTCSNTGPDKYQCSCPEGYSGPNCE IAEHACLSDPCHNRGSCKETSLGFECECSPGWTGPTCSTNIDDCSPNNCSHGGTCQDL VNGFKCVCPPQWTGKTCQLDANECEAKPCVNAKSCKNLIASYYCDCLPGWMGQNCDIN INDCLGQCQNDASCRDLVNGYRCICPPGYAGDHCERDIDECASNPCLDGGHCQNEINR FQCLCPTGFSGNLCQLDIDYCEPNPCQNGAQCYNRASDYFCKCPEDYEGKNCSHLKDH CRTTPCEVIDSCTVAMASNDTPEGVRYISSNVCGPHGKCKSQSGGKFTCDCNKGFTGT YCHENINDCESNPCRNGGTCIDGVNSYKCICSDGWEGAYCETNINDCSQNPCHNGGTC RDLVNDFYCDCKNGWKGKTCHSRDSQCDEATCNNGGTCYDEGDAFKCMCPGGWEGTTC NIARNSSCLPNPCHNGGTCVVNGESFTCVCKEGWEGPICAQNTNDCSPHPCYNSGTCV DGDNWYRCECAPGFAGPDCRININECQSSPCAFGATCVDEINGYRCVCPPGHSGAKCQ EVSGRPCITMGSVIPDGAKWDDDCNTCQCLNGRIACSKVWCGPRPCLLHKGHSECPSG QSCIPILDDQCFVHPCTGVGECRSSSLQPVKTKCTSDSYYQDNCANITFTFNKEMMSP GLTTEHICSELRNLNILKNVSAEYSIYIACEPSPSANNEIHVAISAEDIRDDGNPIKE ITDKIIDLVSKRDGNSSLIAAVAEVRVQRRPLKNRTDFLVPLLSSVLTVAWICCLVTA FYWCLRKRRKPGSHTHSASEDNTTNNVREQLNQIKNPIEKHGANTVPIKDYENKNSKM SKIRTHNSEVEEDDMDKHQQKARFAKQPAYTLVDREEKPPNGTPTKHPNWTNKQDNRD LESAQSLNRMEYIV" exon 541. .846 /gene="JAG1" /number=2 exon 847. .898 /gene="JAG1" /number=3 exon 899. .1153 /gene="JAG1" /number=4 exon 1154. .1214 /gene="JAG1" /number=5 exon 1215. .1345 /gene="JAG1" /number=6 exon 1346. .1465 /gene="JAG1" /number=7 exon 1466. .1579 /gene="JAG1" /number=8 exon 1580. .1693 /gene="JAG1" /number=9 exon 1694. .1807 /gene="JAG1" /number=10 exon 1808. .1854 /gene="JAG1" /number=11 exon 1855. .2028 /gene="JAG1" /number=12 exon 2029. .2179 /gene="JAG1" /number=13 exon 2180. .2344 /gene="JAG1" /number=14 exon 2345. .2458 /gene="JAG1" /number=15 exon 2459. .2572 /gene="JAG1" /number=16 exon 2573. .2686 /gene="JAG1" /number=17 exon 2687. .2803 /gene="JAG1" /number=18 exon 2804. .2831 /gene="JAG1" /number=19 exon 2832. .2917 /gene="JAG1" /number=20 exon 2918. .3031 /gene="JAG1" /number=21 exon 3032. .3141 /gene="JAG1" /number=22 exon 3142. .3375 /gene="JAG1" /number=23 exon 3376. .3507 /gene="JAG1" /number=24 exon 3508. .3658 /gene="JAG1" /number=25 exon 3659. .5942 /gene="JAG1" /number=26 BASE COUNT 1520 a 1393 c 1544 g 1485 t ORIGIN 1 CCGGGTCCTT CTCCGAGAGC CGGGCGGGCA CGCGTCATTG TGTTACCTGC 51 GGCCGGCCCG CGAGCTAGGC TGGTTTTTTT TTTTTCTCCC CTCCCTCCCC 101 CCTTTTTCCA TGCAGCTGAT CTAAAAGGGA ATAAAAGGCT GCGCATAATC 151 ATAATAATAA AAGAAGGGGA GCGCGAGAGA AGGAAAGAAA GCCGGGAGGT 201 GGAAGAGGAG GGGGAGCGTC TCAAAGAAGC GATCAGAATA ATAAAAGGAG 251 GCCGGGCTCT TTGCCTTCTG GAACGGGCCG CTCTTGAAAG GGCTTTTGAA 301 AAGTGGTGTT GTTTTCCAGT CGTGCATGCT CCAATCGGCG GAGTATATTA 351 GAGCCGGGAC GCGGCGGCCG CAGGGGCAGC GGCGACGGCA GCACCGGCGG 401 CAGCACCAGC GCGAACAGCA GCGGCGGCGT CCCGAGTGCC CGCGGCGCGC 451 GGCGCAGCGA TGCGTTCCCC ACGGACGCGC GGCCGGTCCG GGCGCCCCCT 501 AAGCCTCCTG CTCGCCCTGC TCTGTGCCCT GCGAGCCAAG GTGTGTGGGG 551 CCTCGGGTCA GTTCGAGTTG GAGATCCTGT CCATGCAGAA CGTGAACGGG 601 GAGCTGCAGA ACGGGAACTG CTGCGGCGGC GCCCGGAACC CGGGAGACCG 651 CAAGTGCACC CGCGACGAGT GTGACACATA CTTCAAAGTG TGCCTCAAGG 701 AGTATCAGTC CCGCGTCACG GCCGGGGGGC CCTGCAGCTT CGGCTCAGGG 751 TCCACGCCTG TCATCGGGGG CAACACCTTC AACCTCAAGG CCAGCCGCGG 801 CAACGACCGC AACCGCATCG TGCTGCCTTT CAGTTTCGCC TGGCCGAGGT 851 CCTATACGTT GCTTGTGGAG GCGTGGGATT CCAGTAATGA CACCGTTCAA 901 CCTGACAGTA TTATTGAAAA GGCTTCTCAC TCGGGCATGA TCAACCCCAG 951 CCGGCAGTGG CAGACGCTGA AGCAGAACAC GGGCGTTGCC CACTTTGAGT 1001 ATCAGATCCG CGTGACCTGT GATGACTACT ACTATGGCTT TGGCTGCAAT 1051 AAGTTCTGCC GCCCCAGAGA TGACTTCTTT GGACACTATG CCTGTGACCA 1101 GAATGGCAAC AAAACTTGCA TGGAAGGCTG GATGGGCCGC GAATGTAACA 1151 GAGCTATTTG CCGACAAGGC TGCAGTCCTA AGCATGGGTC TTGCAAACTC 1201 CCAGGTGACT GCAGGTGCCA GTACGGCTGG CAAGGCCTGT ACTGTGATAA 1251 GTGCATCCCA CACCCGGGAT GCGTCCACGG CATCTGTAAT GAGCCCTGGC 1301 AGTGCCTCTG TGAGACCAAC TGGGGCGGCC AGCTCTGTGA CAAAGATCTC 1351 AATTACTGTG GGACTCATCA GCCGTGTCTC AACGGGGGAA CTTGTAGCAA 1401 CACAGGCCCT GACAAATATC AGTGTTCCTG CCCTGAGGGG TATTCAGGAC 1451 CCAACTGTGA AATTGCTGAG CACGCCTGCC TCTCTGATCC CTGTCACAAC 1501 AGAGGCAGCT GTAAGGAGAC CTCCCTGGGC TTTGAGTGTG AGTGTTCCCC 1551 AGGCTGGACC GGCCCCACAT GCTCTACAAA CATTGATGAC TGTTCTCCTA 1601 ATAACTGTTC CCACGGGGGC ACCTGCCAGG ACCTGGTTAA CGGATTTAAG 1651 TGTGTGTGCC CCCCACAGTG GACTGGGAAA ACGTGCCAGT TAGATGCAAA 1701 TGAATGTGAG GCCAAACCTT GTGTAAACGC CAAATCCTGT AAGAATCTCA 1751 TTGCCAGCTA CTACTGCGAC TGTCTTCCCG GCTGGATGGG TCAGAATTGT 1801 GACATAAATA TTAATGACTG CCTTGGCCAG TGTCAGAATG ACGCCTCCTG 1851 TCGGGATTTG GTTAATGGTT ATCGCTGTAT CTGTCCACCT GGCTATGCAG 1901 GCGATCACTG TGAGAGAGAC ATCGATGAAT GTGCCAGCAA CCCCTGTTTG 1951 GATGGGGGTC ACTGTCAGAA TGAAATCAAC AGATTCCAGT GTCTGTGTCC 2001 CACTGGTTTC TCTGGAAACC TCTGTCAGCT GGACATCGAT TATTGTGAGC 2051 CTAATCCCTG CCAGAACGGT GCCCAGTGCT ACAACCGTGC CAGTGACTAT 2101 TTCTGCAAGT GCCCCGAGGA CTATGAGGGC AAGAACTGCT CACACCTGAA 2151 AGACCACTGC CGCACGACCC CCTGTGAAGT GATTGACAGC TGCACAGTGG 2201 CCATGGCTTC CAACGACACA CCTGAAGGGG TGCGGTATAT TTCCTCCAAC 2251 GTCTGTGGTC CTCACGGGAA GTGCAAGAGT CAGTCGGGAG GCAAATTCAC 2301 CTGTGACTGT AACAAAGGCT TCACGGGAAC ATACTGCCAT GAAAATATTA 2351 ATGACTGTGA GAGCAACCCT TGTAGAAACG GTGGCACTTG CATCGATGGT 2401 GTCAACTCCT ACAAGTGCAT CTGTAGTGAC GGCTGGGAGG GGGCCTACTG 2451 TGAAACCAAT ATTAATGACT GCAGCCAGAA CCCCTGCCAC AATGGGGGCA 2501 CGTGTCGCGA CCTGGTCAAT GACTTCTACT GTGACTGTAA AAATGGGTGG 2551 AAAGGAAAGA CCTGCCACTC ACGTGACAGT CAGTGTGATG AGGCCACGTG 2601 CAACAACGGT GGCACCTGCT ATGATGAGGG GGATGCTTTT AAGTGCATGT 2651 GTCCTGGCGG CTGGGAAGGA ACAACCTGTA ACATAGCCCG AAACAGTAGC 2701 TGCCTGCCCA ACCCCTGCCA TAATGGGGGC ACATGTGTGG TCAACGGCGA 2751 GTCCTTTACG TGCGTCTGCA AGGAAGGCTG GGAGGGGCCC ATCTGTGCTC 2801 AGAATACCAA TGACTGCAGC CCTCATCCCT GTTACAACAG CGGCACCTGT 2851 GTGGATGGAG ACAACTGGTA CCGGTGCGAA TGTGCCCCGG GTTTTGCTGG 2901 GCCCGACTGC AGAATAAACA TCAATGAATG CCAGTCTTCA CCTTGTGCCT 2951 TTGGAGCGAC CTGTGTGGAT GAGATCAATG GCTACCGGTG TGTCTGCCCT 3001 CCAGGGCACA GTGGTGCCAA GTGCCAGGAA GTTTCAGGGA GACCTTGCAT 3051 CACCATGGGG AGTGTGATAC CAGATGGGGC CAAATGGGAT GATGACTGTA 3101 ATACCTGCCA GTGCCTGAAT GGACGGATCG CCTGCTCAAA GGTCTGGTGT 3151 GGCCCTCGAC CTTGCCTGCT CCACAAAGGG CACAGCGAGT GCCCCAGCGG 3201 GCAGAGCTGC ATCCCCATCC TGGACGACCA GTGCTTCGTC CACCCCTGCA 3251 CTGGTGTGGG CGAGTGTCGG TCTTCCAGTC TCCAGCCGGT GAAGACAAAG 3301 TGCACCTCTG ACTCCTATTA CCAGGATAAC TGTGCGAACA TCACATTTAC 3351 CTTTAACAAG GAGATGATGT CACCAGGTCT TACTACGGAG CACATTTGCA 3401 GTGAATTGAG GAATTTGAAT ATTTTGAAGA ATGTTTCCGC TGAATATTCA 3451 ATCTACATCG CTTGCGAGCC TTCCCCTTCA GCGAACAATG AAATACATGT 3501 GGCCATTTCT GCTGAAGATA TACGGGATGA TGGGAACCCG ATCAAGGAAA 3551 TCACTGACAA AATAATTGAT CTTGTTAGTA AACGTGATGG AAACAGCTCG 3601 CTGATTGCTG CCGTTGCAGA AGTAAGAGTT CAGAGGCGGC CTCTGAAGAA 3651 CAGAACAGAT TTCCTTGTTC CCTTGCTGAG CTCTGTCTTA ACTGTGGCTT 3701 GGATCTGTTG CTTGGTGACG GCCTTCTACT GGTGCCTGCG GAAGCGGCGG 3751 AAGCCGGGCA GCCACACACA CTCAGCCTCT GAGGACAACA CCACCAACAA 3801 CGTGCGGGAG CAGCTGAACC AGATCAAAAA CCCCATTGAG AAACATGGGG 3851 CCAACACGGT CCCCATCAAG GATTACGAGA ACAAGAACTC CAAAATGTCT 3901 AAAATAAGGA CACACAATTC TGAAGTAGAA GAGGACGACA TGGACAAACA 3951 CCAGCAGAAA GCCCGGTTTG CCAAGCAGCC GGCGTATACG CTGGTAGACA 4001 GAGAAGAGAA GCCCCCCAAC GGCACGCCGA CAAAACACCC AAACTGGACA 4051 AACAAACAGG ACAACAGAGA CTTGGAAAGT GCCCAGAGCT TAAACCGAAT 4101 GGAGTACATC GTATAGCAGA CCGCGGGCAC TGCCGCCGCT AGGTAGAGTC 4151 TGAGGGCTTG TAGTTCTTTA AACTGTCGTG TCATACTCGA GTCTGAGGCC 4201 GTTGCTGACT TAGAATCCCT GTGTTAATTT AAGTTTTGAC AAGCTGGCTT 4251 ACACTGGCAA TGGTAGTTTC TGTGGTTGGC TGGGAAATCG AGTGCCGCAT 4301 CTCACAGCTA TGCAAAAAGC TAGTCAACAG TACCCTGGTT GTGTGTCCCC 4351 TTGCAGCCGA CACGGTCTCG GATCAGGCTC CCAGGAGCCT GCCCAGCCCC 4401 CTGGTCTTTG AGCTCCCACT TCTGCCAGAT GTCCTAATGG TGATGCAGTC 4451 TTAGATCATA GTTTTATTTA TATTTATTGA CTCTTGAGTT GTTTTTGTAT 4501 ATTGGTTTTA TGATGACGTA CAAGTAGTTC TGTATTTGAA AGTGCCTTTG 4551 CAGCTCAGAA CCACAGCAAC GATCACAAAT GACTTTATTA TTTATTTTTT 4601 TAATTGTATT TTTGTTGTTG GGGGAGGGGA GACTTTGATG TCAGCAGTTG 4651 CTGGTAAAAT GAAGAATTTA AAGAAAAAAA TGTCAAAAGT AGAACTTTGT 4701 ATAGTTATGT AAATAATTCT TTTTTATTAA TCACTGTGTA TATTTGATTT 4751 ATTAACTTAA TAATCAAGAG CCTTAAAACA TCATTCCTTT TTATTTATAT 4801 GTATGTGTTT AGAATTGAAG GTTTTTGATA GCATTGTAAG CGTATGGCTT 4851 TATTTTTTTG AACTCTTCTC ATTACTTGTT GCCTATAAGC CAAAATTAAG 4901 GTGTTTGAAA ATAGTTTATT TTAAAACAAT AGGATGGGCT TCTGTGCCCA 4951 GAATACTGAT GGAATTTTTT TTGTACGACG TCAGATGTTT AAAACACCTT 5001 CTATAGCATC ACTTAAAACA CGTTTTAAGG ACTGACTGAG GCAGTTTGAG 5051 GATTAGTTTA GAACAGGTTT TTTTGTTTGT TTGTTTTTTG TTTTTCTGCT 5101 TTAGACTTGA AAAGAGACAG GCAGGTGATC TGCTGCAGAG CAGTAAGGGA 5151 ACAAGTTGAG CTATGACTTA ACATAGCCAA AATGTGAGTG GTTGAATATG 5201 ATTAAAAATA TCAAATTAAT TGTGTGAACT TGGAAGCACA CCAATCTGAC 5251 TTTGTAAATT CTGATTTCTT TTCACCATTC GTACATAATA CTGAACCACT 5301 TGTAGATTTG ATTTTTTTTT TAATCTACTG CATTTAGGGA GTATTCTAAT 5351 AAGCTAGTTG AATACTTGAA CCATAAAATG TCCAGTAAGA TCACTGTTTA 5401 GATTTGCCAT AGAGTACACT GCCTGCCTTA AGTGAGGAAA TCAAAGTGCT 5451 ATTACGAAGT TCAAGATCAA AAAGGCTTAT AAAACAGAGT AATCTTGTTG 5501 GTTCACCATT GAGACCGTGA AGATACTTTG TATTGTCCTA TTAGTGTTAT 5551 ATGAACATAC AAATGCATCT TTGATGTGTT GTTCTTGGCA ATAAATTTTG 5601 AAAAGTAATA TTTATTAAAT TTTTTTGTAT GAAAACATGG AACAGTGTGG 5651 CTCTTCTGAG CTTACGTAGT TCTACCGGCT TTGCCGTGTG CTTCTGCCAC 5701 CCTGCTGAGT CTGTTCTGGT AATCGGGGTA TAATAGGCTC TGCCTGACAG 5751 AGGGATGGAG GAAGAACTGA AAGGCTTTTC AACCACAAAA CTCATCTGGA 5801 GTTCTCAAAG ACCTGGGGCT GCTGTGAAGC TGGAACTGCG GGAGCCCCAT 5851 CTAGGGGAGC CTTGATTCCC TTGTTATTCA ACAGCAAGTG TGAATACTGC 5901 TTGAATAAAC ACCACTGGAT TAATGGAAAA AAAAAAAAAA AA // LOCUS AF004291 1842 bp mRNA PRI 23-MAR-1998 DEFINITION Homo sapiens germ cell nuclear factor (GCNF) mRNA, complete cds. ACCESSION AF004291 NID g2209118 VERSION AF004291.1 GI:2209118 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1842) AUTHORS Agoulnik,I.Y., Cho,Y., Niederberger,C., Kieback,D.G. and Cooney,A.J. TITLE Cloning, expression analysis and chromosomal localization of the human nuclear receptor gene GCNF JOURNAL FEBS Lett. 424 (1-2), 73-78 (1998) MEDLINE 98196867 REFERENCE 2 (bases 1 to 1842) AUTHORS Agoulnik,I.U., Cooney,J.A. and Kieback,D.G. TITLE Direct Submission JOURNAL Submitted (14-MAY-1997) OB/GYN, Baylor College of Medicine, 6550 Fannin, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1. .1842 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1. .1842 /gene="GCNF" CDS 187. .1551 /gene="GCNF" /function="nuclear receptor" /codon_start=1 /product="germ cell nuclear factor" /protein_id="AAC52054.1" /db_xref="PID:g2209119" /db_xref="GI:2209119" /translation="MERDEPPPPRNGFCQDELAELDPGTNDRAEQRTCLICGDRATGL HYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRK AIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASE SNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQQAR SLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAWI KKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRY WYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTS VGKE" BASE COUNT 452 a 470 c 501 g 419 t ORIGIN 1 GGCACGAGGC GGCGCGGAGG GGCGCGGAGC GGCGCGGAAC CGGGCGGCTC 51 GGGGCCCAGA GAGAGCCGCG GCCGGGAGCT CGCGGGCTCC TGACAACCTC 101 CTCCCCTCGG CGGACGACGA CCACGGCGAC TAGGGCGCCG GTCATGGCGG 151 AGCAACAAAC CCGGCGCGGA CCCTAGGCAC CACCGCATGG AGCGGGACGA 201 ACCTCCGCCG CCGCGCAACG GTTTCTGTCA GGATGAATTG GCAGAGCTTG 251 ACCCAGGCAC TAATGATCGG GCTGAACAAC GAACCTGTCT CATTTGTGGG 301 GACCGCGCTA CAGGCTTGCA CTATGGGATC ATCTCCTGTG AGGGCTGCAA 351 AGGGTTTTTC AAGCGGAGCA TTTGCAACAA ACGGGTATAT CGATGCAGTC 401 GTGACAAGAA CTGTGTCATG TCTCGGAAGC AGAGGAACAG GTGCCAGTAC 451 TGCCGCCTGC TCAAATGCCT CCAGATGGGG ATGAACCGGA AGGCTATCAG 501 AGAAGATGGC ATGCCTGGAG GCCGGAATAA GAGCATTGGG CCAGTCCAGA 551 TATCGGAAGA AGAAATCGAA AGGATCATGT CTGGGCAGGA GTTTGAGGAA 601 GAGGCCAATC ACTGGAGCAA CCATGGTGAT AGTGACCACA GTTCCCCTGG 651 GAACAGGGCT TCGGAGAGCA ACCAGCCCTC ACCAGGCTCC ACACTGTCTT 701 CCAGTAGGTC TGTGGAACTG AATGGATTCA TGGCCTTCAG GGAACAGTAC 751 ATGGGAATGT CTGTGCCTCC ACATTACCAA TATATACCGC ACCTTTTTAG 801 CTATTCTGGC CACTCACCAC TTCTGCCCCA ACAAGCTCGC AGCCTGGATC 851 CCCAGTCATA CAGTCTGATT CACCAGCTGT TATCAGCCGA GGACCTGGAA 901 CCATTGGGCA CGCCCATGTT GATTGAAGAT GGATACGCTG TGACACAGGC 951 AGAACTATTT GCCCTGCTTT GCCGCCTGGC CGACGAGCTG CTCTTTAGGC 1001 AGATTGCCTG GATCAAGAAA CTGCCTTTCT TCTGCGAGCT CTCAATCAAG 1051 GATTACACGT GCCTCTTGAG CTCTACGTGG CAGGAGCTAA TCCTGCTGTC 1101 TTCCCTCACC GTTTACAGCA AGCAGATCTT TGGGGAACTG GCTGATGTCA 1151 CTGCCAAGTA CTCGCCCTCC GATGAAGAAC TACACAGATT TAGTGATGAA 1201 GGGATGGAGG TGATCGAGCG GCTCATCTAC CTCTATCACA AGTTCCATCA 1251 GCTAAAGGTC AGCAACGAGG AGTATGCTTG CATGAAAGCA ATTAACTTCC 1301 TAAATCAAGA TATCAGGGGT CTGACCAGTG CCTCACAGCT GGAACAATTG 1351 AATAAACGAT ACTGGTACAT TTGCCAGGAT TTTACTGAAT ATAAATACAC 1401 ACATCAGCCG AACCGCTTTC CTGATCTCAT GATGTGCTTA CCTGAGATTC 1451 GATATATTGC AGGAAAGATG GTGAATGTGC CCCTGGAGCA GCTGCCCCTC 1501 CTCTTTAAGG TGGTGCTGCA TTCCTGCAAG ACCAGTGTGG GCAAGGAATG 1551 ACCTGTTCCA GGCGCCCTCC TCAGGCCAAC CACAGCGTCT TGGGTGGGCA 1601 GGACAGGCTC TGGAGGGAAA AGCCAGAGAG ACCAAGATGG AGGCTGTGGA 1651 GCAGCATTTC CCGTTGCCTC CATAGCAAGA AGAGTTTTTG TTTGTTTGTC 1701 TGTTTTTTTA ACCTCATTTT TCTATATATT TATTTCACGA CAGAGTTGAA 1751 TGTATGGCCT TCAACATGAT GCACATGCTT TTGTGTGAAT GCAGCCAATG 1801 CATTTTCTTA CAGTTTACAG AATGTGAAGA TGTTTGTAAT TT // LOCUS AF004562 3773 bp mRNA PRI 09-APR-1998 DEFINITION Homo sapiens hUNC18a alternatively-spliced mRNA, complete cds. ACCESSION AF004562 NID g3041872 VERSION AF004562.1 GI:3041872 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3773) AUTHORS Swanson,D.A., Steel,J.M. and Valle,D. TITLE Identification and characterization of the human ortholog of rat STXBP1, a protein implicated in vesicle trafficking and neurotransmitter release JOURNAL Genomics 48 (3), 373-376 (1998) MEDLINE 98207254 REFERENCE 2 (bases 1 to 3773) AUTHORS Swanson,D.A., Steel,J.M. and Valle,D. TITLE Direct Submission JOURNAL Submitted (16-MAY-1997) Pediatrics, Johns Hopkins University S.O.M., 725 N. Wolfe St. PCTB 802, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1. .3773 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34.1" CDS 121. .1905 /function="implicated in vesicle trafficking and neurotransmitter release" /note="alternatively-spliced; similar to rat n-Sec1" /codon_start=1 /product="hUNC18a" /protein_id="AAC39688.1" /db_xref="PID:g3041873" /db_xref="GI:3041873" /translation="MAPIGLKAVVGEKIMHDVIKKVKKKGEWKVLVVDQLSMRMLSSC CKMTDIMTEGITIVEDINKRREPLPSLEAVYLITPSEKSVHSLISDFKDPPTAKYRAA HVFFTDSCPDALFNELVKSRAAKVIKTLTEINIAFLPYESQVYSLDSADSFQSFYSPH KAQMKNPILERLAEQIATLCATLKEYPAVRYRGEYKDNALLAQLIQDKLDAYKADDPT MGEGPDKARSQLLILDRGFDPSSPVLHELTFQAMSYDLLPIENDVYKYETSGIGEARV KEVLLDEDDDLWIALRHKHIAEVSQEVTRSLKDFSSSKRMNTGEKTTMRDLSQMLKKM PQYQKELSKYSTHLHLAEDCMKHYQGTVDKLCRVEQDLAMGTDAEGEKIKDPMRAIVP ILLDANVSTYDKIRIILLYIFLKNGITEENLNKLIQHAQIPPEDSEIITNMAHLGVPI VTDSTLRRRSKPERKERISEQTYQLSRWTPIIKDIMEDTIEDKLDTKHYPYISTRSSA SFSTTAVSARYGHWHKNKAPGEYRSGPRLIIFILGGVSLNEMRCAYEVTQANGKWEVL IGSTHILTPQKLLDTLKKLNKTDEEISS" BASE COUNT 929 a 1046 c 914 g 884 t ORIGIN 1 CTGACGCGCG GCTGCGGGGC GGAGAGCTGC GGCTGGCCCA GCGCGCCCAC 51 CTGAGGAGGC GGCGGGGTCC GCAGGCGTCG CGGGACGAGG AGATCGGAGC 101 CGGGAGACTC GCGCAGCGCC ATGGCCCCCA TTGGCCTCAA AGCTGTTGTC 151 GGAGAGAAGA TTATGCATGA TGTGATAAAG AAGGTCAAGA AGAAGGGGGA 201 ATGGAAGGTG CTGGTGGTGG ATCAGTTAAG CATGAGGATG CTGTCCTCCT 251 GCTGCAAGAT GACAGACATC ATGACCGAGG GCATAACGAT TGTGGAAGAT 301 ATCAATAAGC GCAGAGAGCC GCTCCCCAGC CTGGAGGCTG TGTATCTCAT 351 CACTCCATCC GAGAAGTCCG TCCACTCTCT CATCAGTGAC TTTAAGGACC 401 CGCCGACTGC TAAATACCGG GCTGCACACG TCTTCTTCAC TGACTCTTGT 451 CCAGATGCCC TGTTTAATGA ACTGGTAAAA TCCCGAGCAG CCAAAGTCAT 501 CAAAACTCTG ACGGAAATCA ATATTGCATT TCTCCCGTAT GAATCCCAGG 551 TCTATTCCTT GGACTCTGCT GACTCTTTCC AAAGCTTCTA CAGTCCCCAC 601 AAGGCTCAGA TGAAGAATCC TATACTGGAG CGCCTGGCAG AGCAGATCGC 651 GACCCTTTGT GCCACCCTGA AGGAGTACCC GGCTGTGCGG TATCGGGGGG 701 AATACAAGGA CAATGCCCTG CTGGCTCAGC TAATCCAGGA CAAGCTCGAT 751 GCCTATAAAG CTGATGATCC AACAATGGGG GAGGGCCCAG ACAAGGCACG 801 CTCCCAGCTC CTGATCCTGG ATCGAGGCTT TGACCCCAGC TCCCCTGTGC 851 TCCATGAATT GACTTTTCAG GCTATGAGTT ATGATCTGCT GCCTATCGAA 901 AATGATGTAT ACAAGTATGA GACCAGCGGC ATCGGGGAGG CACGGGTGAA 951 GGAGGTGCTC CTGGACGAGG ACGACGACCT GTGGATAGCA CTGCGCCACA 1001 AGCACATCGC AGAGGTGTCC CAGGAAGTCA CCCGGTCTCT GAAAGATTTT 1051 TCTTCTAGCA AGAGAATGAA TACTGGAGAG AAGACCACCA TGCGGGACCT 1101 GTCCCAGATG CTGAAGAAGA TGCCTCAGTA CCAGAAAGAG CTCAGCAAGT 1151 ACTCCACCCA CCTGCACCTT GCTGAGGACT GTATGAAGCA TTACCAAGGC 1201 ACCGTAGACA AACTCTGCCG AGTGGAGCAG GACCTGGCCA TGGGCACAGA 1251 TGCTGAGGGA GAGAAGATCA AGGACCCTAT GCGAGCCATC GTCCCCATTC 1301 TGCTGGATGC CAATGTCAGC ACTTATGACA AAATCCGCAT CATCCTTCTC 1351 TACATCTTTT TGAAGAATGG CATCACGGAG GAAAACCTGA ACAAACTGAT 1401 CCAGCACGCC CAGATACCCC CGGAGGATAG TGAGATCATC ACCAACATGG 1451 CTCACCTCGG CGTGCCCATC GTCACCGATT CCACGCTGCG TCGCCGGAGC 1501 AAGCCGGAGC GGAAGGAACG CATCAGCGAG CAGACCTACC AGCTCTCACG 1551 GTGGACTCCG ATTATCAAGG ACATCATGGA GGACACTATT GAGGACAAAC 1601 TTGACACCAA ACACTACCCT TATATCTCTA CCCGTTCCTC TGCCTCCTTC 1651 AGCACCACCG CCGTCAGCGC CCGCTATGGG CACTGGCATA AGAACAAGGC 1701 CCCAGGCGAG TACCGCAGTG GCCCCCGCCT CATCATTTTC ATCCTTGGGG 1751 GTGTGAGCCT GAATGAGATG CGCTGCGCCT ACGAGGTGAC CCAGGCCAAC 1801 GGAAAGTGGG AGGTGCTGAT AGGATCCACA CACATCCTCA CCCCACAGAA 1851 ACTGCTGGAC ACACTGAAGA AACTGAATAA AACAGATGAA GAAATAAGCA 1901 GTTAAAAAAA TAAGTCGCCC CTCCAAAACA CGCCCCCATC CCACAGCGCT 1951 CCGCAGCTTC CCACCACCGC CCGCCTCAGT TCCTTTGCGT CTGTTGCCTC 2001 CCCAGCCCTG CACGCCCTGG CTGGCACTGT TGCCGCTGCA TTCTCGTGTT 2051 CAGTGATGCC CTCTTCTTGT TTGAAACAAA AGAAAATAAT GCATTGTGTT 2101 TTTTAAAAAG AGTATCTTAT ACATGTATCC TAAAAAGAGA AGCTCATGTG 2151 CAATTGGTGC ACAGCAGGAG AAATTTCTGG ACTGTTAGGA TGAATGGACG 2201 CCTTCTCCCC GTTATTTAAG ATTTGTGACC TTGTACATAA CCCTGGGTGA 2251 CGTGCACATT GCTTGGGTAT GGAACGGTAG AAATTTGGGT GTTTTTAAAA 2301 CCTTGTTTGG GGTTGTTCCT GTCCTTGTTG AGAATCATAG AGATGTCTGT 2351 GTTCTTGGAG TATTTCACAC TGAGGACTAA TCTGCTATCT TCATTCCAGT 2401 CCCTACCCCT CAGTGCCTGC TCTCATCCAA ATAACCTGGG AGGTGACAAT 2451 CAGGATATCT CAGGAGGTCC AAGGTGGAAC AGACCTCTTT GCCTTTCCCA 2501 GCGTCTCATA CCCCCGGTAG TGCAGCTGTG GGTGGAGGCT GGGGTGTCTG 2551 CACGAAGTCA GGCCAGCGTC CTCCTCCACA GCCTGTCACT GCCCCCTCCC 2601 CAGCCTGTGT CCACAGTGCT GTGATCCCGA GGGAAGTCCT CCAGTCTAAG 2651 TCACAGTGCC CTGACAGGTG AGAAGCAAAC TCCCGCTGGA AGCCTCCATC 2701 TCTTTGGAAA AACAGTTAGT CTGGAGCCTG TGGCCCAGGC CCTTCTGTCC 2751 CCAGGCATCA TCCCAACAGC TCATTTTCCC TAGTCCGCCT TCGTTCAAGG 2801 GTCAGGAATG GACCAGAACA GATGGGTTCT GGAGGCCCCT GAACAGAGGG 2851 CTATGGCTGT GGAGAAGGTT CTTGGCCCGT TGGACTCACA CAGACCCTGT 2901 ACCCTCTCGG CAAGCATCTT CAGTCAGATT ATCCTCAGTT TCAGATACTT 2951 CATAATACCT TGTGTTGTGT GGGGTCATAC ATCATCGTGT TTGTAAGAGA 3001 AGATGGTCAT TTTATTCTCT GTATAAAACT TAGCTCTAAA GCAGAAACTA 3051 AAGCAGCAAA TGCAGGAAGG CTGTCTCGCC ATCCTCAAGA CTCAGCAGCT 3101 CTCATTCTCC AGTGGTGAGC ACACCATTTG TGCTGCTGCT GTTGTCGTGA 3151 AATATAATAA CAGTGGAAGT CACAAAAATG TCCCCTGCCC AGCCCCCTCG 3201 CCGCCCTTGA CCTCCTGCAG GCCATGTGTG TATTACTTGT CTAGTGATGT 3251 CCTCTCAAAG TGCTGTACGC GAGCTCGGCG CCACCTCCGC CTCCCTTTCA 3301 GAGCCTGCTC CCCGCCCTCT CTGCTCGCTG CATTGTGGTG TTCTCTTCTC 3351 AAGGCTTTGA AATCTCCCCT TGCACTGAGA TTAGTCGTCA GATCTCTCCC 3401 CGTCTCCCTC CCAACTTATA CGACCTGATT TCCTTAGGAC GGAACCGCAG 3451 GCACCTGCGC CGGGCGTCTT ACTCCCGCTG CTTGTTCTGT CCCCTCCCTC 3501 GGACCAAACA GTGCTCATGC TTCAGGACCT TGTTTGTCGA AGATGTTGGT 3551 TTCCCTTTCT CTGTTATTTA TATAAAAATA ATTTATCAAA AGGATATTTT 3601 AAAAAAGCTA GTCTGTCTTG AAACTTGTTT ACCTTAAAAT TATCAGAATC 3651 TCAGTGTTTG AAAGTACTGA AGCACAAACA TATATCATCT CTGTACCATT 3701 CTGTACTAAA GCACTTGAGT CTAATAAATA AAGAAATCAG CACCCCTTCC 3751 CGGTGTCCAG GGGGAAAAAA AAA // LOCUS AF004563 3899 bp mRNA PRI 09-APR-1998 DEFINITION Homo sapiens hUNC18b alternatively-spliced mRNA, complete cds. ACCESSION AF004563 NID g3041874 VERSION AF004563.1 GI:3041874 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3899) AUTHORS Swanson,D.A., Steel,J.M. and Valle,D. TITLE Identification and characterization of the human ortholog of rat STXBP1, a protein implicated in vesicle trafficking and neurotransmitter release JOURNAL Genomics 48 (3), 373-376 (1998) MEDLINE 98207254 REFERENCE 2 (bases 1 to 3899) AUTHORS Swanson,D.A., Steel,J.M. and Valle,D. TITLE Direct Submission JOURNAL Submitted (16-MAY-1997) Pediatrics, Johns Hopkins University S.O.M., 725 N. Wolfe St. PCTB 802, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1. .3899 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34.1" CDS 121. .1932 /function="implicated in vesicle trafficking and neurotransmitter release" /note="alternatively-spliced; similar to rat n-Sec1" /codon_start=1 /product="hUNC18b" /protein_id="AAC39689.1" /db_xref="PID:g3041875" /db_xref="GI:3041875" /translation="MAPIGLKAVVGEKIMHDVIKKVKKKGEWKVLVVDQLSMRMLSSC CKMTDIMTEGITIVEDINKRREPLPSLEAVYLITPSEKSVHSLISDFKDPPTAKYRAA HVFFTDSCPDALFNELVKSRAAKVIKTLTEINIAFLPYESQVYSLDSADSFQSFYSPH KAQMKNPILERLAEQIATLCATLKEYPAVRYRGEYKDNALLAQLIQDKLDAYKADDPT MGEGPDKARSQLLILDRGFDPSSPVLHELTFQAMSYDLLPIENDVYKYETSGIGEARV KEVLLDEDDDLWIALRHKHIAEVSQEVTRSLKDFSSSKRMNTGEKTTMRDLSQMLKKM PQYQKELSKYSTHLHLAEDCMKHYQGTVDKLCRVEQDLAMGTDAEGEKIKDPMRAIVP ILLDANVSTYDKIRIILLYIFLKNGITEENLNKLIQHAQIPPEDSEIITNMAHLGVPI VTDSTLRRRSKPERKERISEQTYQLSRWTPIIKDIMEDTIEDKLDTKHYPYISTRSSA SFSTTAVSARYGHWHKNKAPGEYRSGPRLIIFILGGVSLNEMRCAYEVTQANGKWEVL IGSTHILTPTKFLMDLRHPDFRESSRVSFEDQAPTME" BASE COUNT 966 a 1080 c 939 g 914 t ORIGIN 1 CTGACGCGCG GCTGCGGGGC GGAGAGCTGC GGCTGGCCCA GCGCGCCCAC 51 CTGAGGAGGC GGCGGGGTCC GCAGGCGTCG CGGGACGAGG AGATCGGAGC 101 CGGGAGACTC GCGCAGCGCC ATGGCCCCCA TTGGCCTCAA AGCTGTTGTC 151 GGAGAGAAGA TTATGCATGA TGTGATAAAG AAGGTCAAGA AGAAGGGGGA 201 ATGGAAGGTG CTGGTGGTGG ATCAGTTAAG CATGAGGATG CTGTCCTCCT 251 GCTGCAAGAT GACAGACATC ATGACCGAGG GCATAACGAT TGTGGAAGAT 301 ATCAATAAGC GCAGAGAGCC GCTCCCCAGC CTGGAGGCTG TGTATCTCAT 351 CACTCCATCC GAGAAGTCCG TCCACTCTCT CATCAGTGAC TTTAAGGACC 401 CGCCGACTGC TAAATACCGG GCTGCACACG TCTTCTTCAC TGACTCTTGT 451 CCAGATGCCC TGTTTAATGA ACTGGTAAAA TCCCGAGCAG CCAAAGTCAT 501 CAAAACTCTG ACGGAAATCA ATATTGCATT TCTCCCGTAT GAATCCCAGG 551 TCTATTCCTT GGACTCTGCT GACTCTTTCC AAAGCTTCTA CAGTCCCCAC 601 AAGGCTCAGA TGAAGAATCC TATACTGGAG CGCCTGGCAG AGCAGATCGC 651 GACCCTTTGT GCCACCCTGA AGGAGTACCC GGCTGTGCGG TATCGGGGGG 701 AATACAAGGA CAATGCCCTG CTGGCTCAGC TAATCCAGGA CAAGCTCGAT 751 GCCTATAAAG CTGATGATCC AACAATGGGG GAGGGCCCAG ACAAGGCACG 801 CTCCCAGCTC CTGATCCTGG ATCGAGGCTT TGACCCCAGC TCCCCTGTGC 851 TCCATGAATT GACTTTTCAG GCTATGAGTT ATGATCTGCT GCCTATCGAA 901 AATGATGTAT ACAAGTATGA GACCAGCGGC ATCGGGGAGG CACGGGTGAA 951 GGAGGTGCTC CTGGACGAGG ACGACGACCT GTGGATAGCA CTGCGCCACA 1001 AGCACATCGC AGAGGTGTCC CAGGAAGTCA CCCGGTCTCT GAAAGATTTT 1051 TCTTCTAGCA AGAGAATGAA TACTGGAGAG AAGACCACCA TGCGGGACCT 1101 GTCCCAGATG CTGAAGAAGA TGCCTCAGTA CCAGAAAGAG CTCAGCAAGT 1151 ACTCCACCCA CCTGCACCTT GCTGAGGACT GTATGAAGCA TTACCAAGGC 1201 ACCGTAGACA AACTCTGCCG AGTGGAGCAG GACCTGGCCA TGGGCACAGA 1251 TGCTGAGGGA GAGAAGATCA AGGACCCTAT GCGAGCCATC GTCCCCATTC 1301 TGCTGGATGC CAATGTCAGC ACTTATGACA AAATCCGCAT CATCCTTCTC 1351 TACATCTTTT TGAAGAATGG CATCACGGAG GAAAACCTGA ACAAACTGAT 1401 CCAGCACGCC CAGATACCCC CGGAGGATAG TGAGATCATC ACCAACATGG 1451 CTCACCTCGG CGTGCCCATC GTCACCGATT CCACGCTGCG TCGCCGGAGC 1501 AAGCCGGAGC GGAAGGAACG CATCAGCGAG CAGACCTACC AGCTCTCACG 1551 GTGGACTCCG ATTATCAAGG ACATCATGGA GGACACTATT GAGGACAAAC 1601 TTGACACCAA ACACTACCCT TATATCTCTA CCCGTTCCTC TGCCTCCTTC 1651 AGCACCACCG CCGTCAGCGC CCGCTATGGG CACTGGCATA AGAACAAGGC 1701 CCCAGGCGAG TACCGCAGTG GCCCCCGCCT CATCATTTTC ATCCTTGGGG 1751 GTGTGAGCCT GAATGAGATG CGCTGCGCCT ACGAGGTGAC CCAGGCCAAC 1801 GGAAAGTGGG AGGTGCTGAT AGGTTCTACT CACATTCTTA CTCCCACCAA 1851 ATTTCTCATG GACCTGAGAC ACCCCGACTT CAGGGAGTCC TCTAGGGTAT 1901 CTTTTGAGGA TCAGGCTCCA ACAATGGAGT GAGAGCCAAA GAAACAAAGA 1951 TCCACACACA TCCTCACCCC ACAGAAACTG CTGGACACAC TGAAGAAACT 2001 GAATAAAACA GATGAAGAAA TAAGCAGTTA AAAAAATAAG TCGCCCCTCC 2051 AAAACACGCC CCCATCCCAC AGCGCTCCGC AGCTTCCCAC CACCGCCCGC 2101 CTCAGTTCCT TTGCGTCTGT TGCCTCCCCA GCCCTGCACG CCCTGGCTGG 2151 CACTGTTGCC GCTGCATTCT CGTGTTCAGT GATGCCCTCT TCTTGTTTGA 2201 AACAAAAGAA AATAATGCAT TGTGTTTTTT AAAAAGAGTA TCTTATACAT 2251 GTATCCTAAA AAGAGAAGCT CATGTGCAAT TGGTGCACAG CAGGAGAAAT 2301 TTCTGGACTG TTAGGATGAA TGGACGCCTT CTCCCCGTTA TTTAAGATTT 2351 GTGACCTTGT ACATAACCCT GGGTGACGTG CACATTGCTT GGGTATGGAA 2401 CGGTAGAAAT TTGGGTGTTT TTAAAACCTT GTTTGGGGTT GTTCCTGTCC 2451 TTGTTGAGAA TCATAGAGAT GTCTGTGTTC TTGGAGTATT TCACACTGAG 2501 GACTAATCTG CTATCTTCAT TCCAGTCCCT ACCCCTCAGT GCCTGCTCTC 2551 ATCCAAATAA CCTGGGAGGT GACAATCAGG ATATCTCAGG AGGTCCAAGG 2601 TGGAACAGAC CTCTTTGCCT TTCCCAGCGT CTCATACCCC CGGTAGTGCA 2651 GCTGTGGGTG GAGGCTGGGG TGTCTGCACG AAGTCAGGCC AGCGTCCTCC 2701 TCCACAGCCT GTCACTGCCC CCTCCCCAGC CTGTGTCCAC AGTGCTGTGA 2751 TCCCGAGGGA AGTCCTCCAG TCTAAGTCAC AGTGCCCTGA CAGGTGAGAA 2801 GCAAACTCCC GCTGGAAGCC TCCATCTCTT TGGAAAAACA GTTAGTCTGG 2851 AGCCTGTGGC CCAGGCCCTT CTGTCCCCAG GCATCATCCC AACAGCTCAT 2901 TTTCCCTAGT CCGCCTTCGT TCAAGGGTCA GGAATGGACC AGAACAGATG 2951 GGTTCTGGAG GCCCCTGAAC AGAGGGCTAT GGCTGTGGAG AAGGTTCTTG 3001 GCCCGTTGGA CTCACACAGA CCCTGTACCC TCTCGGCAAG CATCTTCAGT 3051 CAGATTATCC TCAGTTTCAG ATACTTCATA ATACCTTGTG TTGTGTGGGG 3101 TCATACATCA TCGTGTTTGT AAGAGAAGAT GGTCATTTTA TTCTCTGTAT 3151 AAAACTTAGC TCTAAAGCAG AAACTAAAGC AGCAAATGCA GGAAGGCTGT 3201 CTCGCCATCC TCAAGACTCA GCAGCTCTCA TTCTCCAGTG GTGAGCACAC 3251 CATTTGTGCT GCTGCTGTTG TCGTGAAATA TAATAACAGT GGAAGTCACA 3301 AAAATGTCCC CTGCCCAGCC CCCTCGCCGC CCTTGACCTC CTGCAGGCCA 3351 TGTGTGTATT ACTTGTCTAG TGATGTCCTC TCAAAGTGCT GTACGCGAGC 3401 TCGGCGCCAC CTCCGCCTCC CTTTCAGAGC CTGCTCCCCG CCCTCTCTGC 3451 TCGCTGCATT GTGGTGTTCT CTTCTCAAGG CTTTGAAATC TCCCCTTGCA 3501 CTGAGATTAG TCGTCAGATC TCTCCCCGTC TCCCTCCCAA CTTATACGAC 3551 CTGATTTCCT TAGGACGGAA CCGCAGGCAC CTGCGCCGGG CGTCTTACTC 3601 CCGCTGCTTG TTCTGTCCCC TCCCTCGGAC CAAACAGTGC TCATGCTTCA 3651 GGACCTTGTT TGTCGAAGAT GTTGGTTTCC CTTTCTCTGT TATTTATATA 3701 AAAATAATTT ATCAAAAGGA TATTTTAAAA AAGCTAGTCT GTCTTGAAAC 3751 TTGTTTACCT TAAAATTATC AGAATCTCAG TGTTTGAAAG TACTGAAGCA 3801 CAAACATATA TCATCTCTGT ACCATTCTGT ACTAAAGCAC TTGAGTCTAA 3851 TAAATAAAGA AATCAGCACC CCTTCCCGGT GTCCAGGGGG AAAAAAAAA // LOCUS AF004715 2889 bp mRNA PRI 06-AUG-1997 DEFINITION Homo sapiens jerky gene product homolog mRNA, complete cds. ACCESSION AF004715 NID g2314828 VERSION AF004715.1 GI:2314828 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2889) AUTHORS Zeng,Z., Kyaw,H., Gakenheimer,K.R., Augustus,M., Fan,P., Zhang,X., Su,K., Carter,K.C. and Li,Y. TITLE Cloning, mapping, and tissue distribution of a human homologue of the mouse jerky gene product JOURNAL Biochem. Biophys. Res. Commun. 236 (2), 389-395 (1997) MEDLINE 97382443 REFERENCE 2 (bases 1 to 2889) AUTHORS Zeng,Z.Z., Kyaw,H., Gakenheimer,K.R., Augustus,M., Fan,P., Zhang,X.C., Su,K., Carter,K.C. and Li,Y. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) Protein Therapeutics, Human Genome Sciences, Inc., 9410 Key West Avenue, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1. .2889 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 207. .1535 /note="similar to Mus musculus jerky gene product encoded for by the sequence presented in the file with GenBank Accession Number U35730" /codon_start=1 /product="jerky gene product homolog" /protein_id="AAB65833.1" /db_xref="PID:g2314829" /db_xref="GI:2314829" /translation="MLEWFNQQRAKGNPISGPICAKRAEFFFYALGMDGDFNPSAGWL TRFKQRHSIREINIRNERLNGDETAVEDFCNNFRDFIERENLQPEQIYNADETGLFWK CLPSRISVIKGKCTVPGHKSIEERVTIMCCANATGLHKLKLCVVGKAKKPRSFKSTDT LNLPVSYFSQKGAWMDLSIFRQWFDKIFVPQVREYLRSKGLQEKAVLLLDNSPTHPNE NVLRSDDGQIFAKYLPPNVASLIQPSDQGVIATMKRNYRAGLLQNNLEEGNDLKSFWK KLTLLDALYEIAMAWNLVKPVTISRAWKKILPMVEEKESLDFDVEDISVATVAAILQH TKGLENVTTENLEKWLEVDSTEPGYEVLTDSEIIRRAQGQADESSENEEEEIELIPEK HINHAAALQWTENLLDYLEQQGDMILPDRLVIRKLRATIRNKQKMTKSSQ" BASE COUNT 981 a 415 c 572 g 921 t ORIGIN 1 CCACGCGTCC GATAATAAAG AAACTTGAAG ACGGAGGTTC TTCCAAACAA 51 CTGGCAGTGA TTTATGGAAT TGGTGAAACA ACAGTTCGGG ATATAAGAAA 101 AAATAAGGAA AAGATTATAA CTTATGCAAG CAGTTCTGAT TCCACAAGTC 151 TTTTGGCCAA GAGGAAATCT ATGAAGCCAT CCATGTATGA GGAATTGGAC 201 AGGGCAATGC TGGAATGGTT CAACCAGCAA AGAGCAAAAG GGAATCCCAT 251 ATCTGGACCA ATTTGTGCAA AAAGGGCAGA GTTCTTCTTT TATGCTTTGG 301 GAATGGATGG TGATTTTAAC CCCTCTGCCG GTTGGCTAAC TCGTTTTAAG 351 CAGCGGCACA GCATTAGAGA GATTAACATT AGAAATGAAA GATTAAATGG 401 AGATGAGACT GCGGTGGAAG ATTTTTGTAA TAACTTTCGA GATTTTATTG 451 AACGAGAGAA TTTACAGCCT GAACAAATCT ACAATGCAGA TGAAACTGGA 501 CTCTTTTGGA AGTGCTTGCC TTCTAGGATT TCAGTAATCA AAGGTAAATG 551 CACTGTCCCT GGGCACAAAT CAATTGAAGA AAGAGTCACA ATCATGTGTT 601 GTGCCAATGC AACAGGTTTA CACAAACTTA AACTTTGTGT TGTGGGGAAA 651 GCAAAGAAAC CTCGCTCCTT TAAATCAACT GACACCTTAA ACCTGCCAGT 701 CTCTTATTTC AGCCAAAAAG GTGCATGGAT GGATCTTTCC ATTTTCCGAC 751 AATGGTTTGA TAAAATTTTT GTGCCGCAAG TTCGAGAGTA TTTAAGATCT 801 AAAGGCTTGC AGGAAAAGGC TGTGCTCTTG TTGGATAATT CACCAACACA 851 TCCAAATGAA AATGTCCTAA GGTCAGATGA TGGCCAAATA TTTGCTAAAT 901 ATTTACCACC TAATGTGGCC TCATTGATTC AGCCTTCAGA TCAGGGAGTC 951 ATAGCAACGA TGAAGAGAAA TTATCGTGCA GGTCTTCTCC AGAACAACTT 1001 GGAAGAAGGT AATGACCTGA AATCATTCTG GAAGAAGCTA ACTCTGTTGG 1051 ATGCACTTTA TGAAATAGCA ATGGCATGGA ACTTAGTAAA ACCAGTTACC 1101 ATTAGCAGAG CATGGAAGAA GATTCTCCCT ATGGTAGAGG AGAAAGAGAG 1151 CCTGGACTTT GATGTTGAAG ATATTTCTGT GGCTACTGTG GCTGCCATTT 1201 TACAACACAC CAAAGGATTG GAAAATGTGA CTACTGAGAA CCTTGAAAAA 1251 TGGCTTGAAG TAGACAGTAC TGAACCAGGC TATGAAGTGT TAACTGATAG 1301 CGAAATCATC AGAAGAGCAC AAGGCCAGGC AGATGAATCC AGTGAAAATG 1351 AGGAGGAGGA AATAGAACTA ATTCCAGAGA AACATATTAA TCATGCAGCT 1401 GCCCTCCAGT GGACTGAAAA TTTATTGGAT TATCTAGAAC AACAAGGTGA 1451 TATGATTCTA CCTGATAGAC TGGTAATACG TAAACTTCGA GCCACCATCA 1501 GAAATAAACA GAAGATGACA AAGTCAAGTC AATAATGTCA TTTCAATTTT 1551 ATTGTTCTGC TCATTGTGTT TGTGACAAAC TCTTTGCAAT ATGGCTTAAT 1601 TTTCTTTGTG TTCTGAATTC TCAGACTTGG TCCTGTGAAA TACAGGCACA 1651 AAATGTATCT GAAGTGGTTT GAGGATTATG TGTTTTCATC ATCTGTGTCT 1701 TTTGTCCTTT TATTTGTACA GATAATCAGA AGATGATACT GAATAGATAT 1751 AAATTACATG TACACATGTA TTCACTTTTT AGAATCTGCA ATTATACCTT 1801 CTGTAACAGT GGCATTCCCT TAATTTTCTA GTGAAAGTTA GAGATAACTG 1851 AACAGACTGA AGCACTTTTC TGAAATCTTT TGCTTGATTT ATGAAGGCTG 1901 CCATAGTTAT CTTTTCTTGT GTTAACCATC TTAAATGATG TTTTGTATAT 1951 TTTATAGACT GATAGGATGA GAAAGATTTA TATTATTAGA TTTCAGGATG 2001 ATTTATAATA ATTCAAAAAT GAAATTCAAT AATGGGGAAA TAATTATGAA 2051 TTATAGAAAT TATGCCTTCA TTCTCTTACA TTTGTGTGGG TTGCAAGAGG 2101 GGGAGATTAT TCTGGAAATG AAGTAAATAT GGGAAGTTAT TGCCAAATGA 2151 GAGAGAAACT ATGGGAAAGC TGATCTATAA AGAGGCATTC TGATCAATTC 2201 ATTTGTAGGA AACTGGGAAA TAAAAACCTG GGGAACTTTA GGTTATTTAT 2251 ACAAAGGGAA TAAATAGGCT GATTTTAATT TGGTAAGTTG ATCTTTTTAT 2301 TATGAATTTG GTAATAGTAT AGGTTTATTA TTTATTCATC TAATTTTATA 2351 GTACAGGTTT TGTAATGTTA CATGTGATGA TATGAGCTCC CACCTTATAT 2401 GGGGGAACAT CTTGGGAATT TGAGATTTAA TAAGTTTTTT TTTTTTTTTT 2451 TTAGTGTTTT TACTGCATAC TCACAAATGT TGTCTATAAT TTGAAAAATA 2501 TTGTCATATC TGGCCCTTTG ATGAGAAAAG GAAATTACAA TAATAAAGTT 2551 TTATGATTTT AAATAAGTCA TATGTTTGTA TCCTGTTTTA TGAAGAAAGC 2601 AGAAATAATT ACTGAAAGTG CCAGACACTA AGGAATATTA TTGTTTATTA 2651 TTTTAATACA TATAAAAAGG GATTAATCTG CTAAAATGTA ATCTAAATCA 2701 GAATTTTGAT AAATTTTTTT TGTAAACTAA GTATGTTTAT TCAAGACATT 2751 GAAACTACTT TGCACATATG AATATTAATG TAACTTGTAA TTTAAAAGTA 2801 AAGTGTTTCC ATGCTATTTC ATGTTTTGGC CAAAAATTTT TAAAAAATAA 2851 ATTACAATTG TTCTCTATTA GTAAAAAAAA AAAAAAAAA // LOCUS AF004841 3986 bp mRNA PRI 09-SEP-1998 DEFINITION Homo sapiens CDO mRNA, complete cds. ACCESSION AF004841 NID g3559765 VERSION AF004841.1 GI:3559765 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3986) AUTHORS Kang,J.S., Gao,M., Feinleib,J.L., Cotter,P.D., Guadagno,S.N. and Krauss,R.S. TITLE CDO: an oncogene-, serum-, and anchorage-regulated member of the Ig/fibronectin type III repeat family JOURNAL J. Cell Biol. 138 (1), 203-213 (1997) MEDLINE 97362072 REFERENCE 2 (bases 1 to 3986) AUTHORS Krauss,R.S., Kang,J.S., Gao,M. and Feinleib,J.L. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) Biochemistry, Mount Sinai Medical Center, One Gustave L. Levy Place, New York, NY 10029, USA REFERENCE 3 (bases 1 to 3986) AUTHORS Krauss,R.S., Kang,J.S., Gao,M. and Feinleib,J.L. TITLE Direct Submission JOURNAL Submitted (09-SEP-1998) Biochemistry, Mount Sinai Medical Center, One Gustave L. Levy Place, New York, NY 10029, USA REMARK Nucleotide and amino acid sequence updated by submitter COMMENT On Sep 9, 1998 this sequence version replaced gi:2406629. FEATURES Location/Qualifiers source 1. .3986 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q23-q24" CDS 1. .3723 /note="immunoglobulin superfamily member; contains fibronectin type III-like domain" /codon_start=1 /product="CDO" /protein_id="AAC34901.1" /db_xref="PID:g3559766" /db_xref="GI:3559766" /translation="MDLAPYFTSEPLSAVQKLGGPVVLHCSAQPVTTRISWLHNGKTL DGNLEHIKIHQGTLTILSLNSSLLGYYQCLANNSIGAIVSGPATVSVAVLGDFGSSTK HVITAEEKSAGFIGCRVPESNPKAEVRYKIRGKWLEHSTENYLILPSGNLQILNVSLE DKGSYKCAAYNPVTHQLKVEPIGRKLLVSRPSSDDVHILHPTHSQALAVLSRSPVTLE CVVSGVPAPQVYWLKDGQDIAPGSNWRRLYSHLATDSVDPADSGNYSCMAGNKSGDVE YVTYMVNVLEHASISKGLQDQIVSLGATVHFTCDVHGNPAPNCTWFHNAQPIHPSARH LTAGNGLKISGVTVEDVGMYQCVADNGIGFMHSTGRLEIENDGGFKPVIITAPVSAKV ADGDFVTLSCNASGLPVPVIRWYDSHGLITSHPSQVLRSKSRKSQLSRPEGLNLEPVY FVLSQAGASSLHIQAVTQEHAGKYICEAANEHGTTQAEASLMVVPFETNTKAETVTLP DAAQNDDRSKRDGSETGLLSSFPVKVHPSAVESAPEKNASGISVPDAPIILSPPQTHT PDTYNLVWRAGKDGGLPINAYFV